Samples of the format of multiple sequence data



IG/Standard

;PTPK_HUMAN, 132 bases, 82D340FD checksum.
PTPK_HUMAN
QLHPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNR
AKNRYGNIIAYDHSRVILQPVEDDPSSDYINANYID......GYQRPSHY
IATQGPVHETVYDFWRMIWQEQSACIVMVTNL1
;PTPK_MOUSE, 132 bases, 656D494A checksum.
PTPK_MOUSE
QLHPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNR
AKNRYGNIIAYDHSRVILQPVEDDPSSDYINANYIDIWLYRDGYQRPSHY
IATQGPVHETVYDFWRMVWQEQSACIVMVTNL1

GenBank

LOCUS       PTPK_HUMAN       132 bp
DEFINITION  PTPK_HUMAN, 132 bases, 82D340FD checksum.
ORIGIN      
       1  QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR
      51  AKNRYGNIIA YDHSRVILQP VEDDPSSDYI NANYID.... ..GYQRPSHY
     101  IATQGPVHET VYDFWRMIWQ EQSACIVMVT NL
//
LOCUS       PTPK_MOUSE       132 bp
DEFINITION  PTPK_MOUSE, 132 bases, 656D494A checksum.
ORIGIN      
       1  QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR
      51  AKNRYGNIIA YDHSRVILQP VEDDPSSDYI NANYIDIWLY RDGYQRPSHY
     101  IATQGPVHET VYDFWRMVWQ EQSACIVMVT NL
//

NBRF

>P1;PTPK_HUMAN
PTPK_HUMAN, 132 bases, 82D340FD checksum.
 QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR
 AKNRYGNIIA YDHSRVILQP VEDDPSSDYI NANYID.... ..GYQRPSHY
 IATQGPVHET VYDFWRMIWQ EQSACIVMVT NL*

>P1;PTPK_MOUSE
PTPK_MOUSE, 132 bases, 656D494A checksum.
 QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR
 AKNRYGNIIA YDHSRVILQP VEDDPSSDYI NANYIDIWLY RDGYQRPSHY
 IATQGPVHET VYDFWRMVWQ EQSACIVMVT NL*

EMBL

ID   PTPK_HUMAN
DE   PTPK_HUMAN, 132 bases, 82D340FD checksum.
SQ             132 BP
     QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR AKNRYGNIIA
     YDHSRVILQP VEDDPSSDYI NANYID.... ..GYQRPSHY IATQGPVHET VYDFWRMIWQ
     EQSACIVMVT NL
//
ID   PTPK_MOUSE
DE   PTPK_MOUSE, 132 bases, 656D494A checksum.
SQ             132 BP
     QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR AKNRYGNIIA
     YDHSRVILQP VEDDPSSDYI NANYIDIWLY RDGYQRPSHY IATQGPVHET VYDFWRMVWQ
     EQSACIVMVT NL
//

Fasta

>PTPK_HUMAN, 132 bases, 82D340FD checksum.
QLHPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNR
AKNRYGNIIAYDHSRVILQPVEDDPSSDYINANYID......GYQRPSHY
IATQGPVHETVYDFWRMIWQEQSACIVMVTNL
>PTPK_MOUSE, 132 bases, 656D494A checksum.
QLHPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNR
AKNRYGNIIAYDHSRVILQPVEDDPSSDYINANYIDIWLYRDGYQRPSHY
IATQGPVHETVYDFWRMVWQEQSACIVMVTNL

Phylip 3.2


 2 132 YF
PTPK_HUMAN   QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR
             AKNRYGNIIA YDHSRVILQP VEDDPSSDYI NANYID.... ..GYQRPSHY
             IATQGPVHET VYDFWRMIWQ EQSACIVMVT NL
PTPK_MOUSE   QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR
             AKNRYGNIIA YDHSRVILQP VEDDPSSDYI NANYIDIWLY RDGYQRPSHY
             IATQGPVHET VYDFWRMVWQ EQSACIVMVT NL

Phylip (newer version)


 2 132
PTPK_HUMAN   QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR
PTPK_MOUSE   QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR

             AKNRYGNIIA YDHSRVILQP VEDDPSSDYI NANYID.... ..GYQRPSHY
             AKNRYGNIIA YDHSRVILQP VEDDPSSDYI NANYIDIWLY RDGYQRPSHY

             IATQGPVHET VYDFWRMIWQ EQSACIVMVT NL
             IATQGPVHET VYDFWRMVWQ EQSACIVMVT NL

GCG MSF

   MSF:  132  Type: P    Check:  9632   .. 

 Name: PTPK_HUMAN     Len:  132  Check:  1563  Weight:  1.00
 Name: PTPK_MOUSE     Len:  132  Check:  8069  Weight:  1.00

//



PTPK_HUMAN      QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR 
PTPK_MOUSE      QLHPAIRVAD LLQHINLMKT SDSYGFKEEY ESFFEGQSAS WDVAKKDQNR 


PTPK_HUMAN      AKNRYGNIIA YDHSRVILQP VEDDPSSDYI NANYID.... ..GYQRPSHY 
PTPK_MOUSE      AKNRYGNIIA YDHSRVILQP VEDDPSSDYI NANYIDIWLY RDGYQRPSHY 


PTPK_HUMAN      IATQGPVHET VYDFWRMIWQ EQSACIVMVT NL
PTPK_MOUSE      IATQGPVHET VYDFWRMVWQ EQSACIVMVT NL


PAUP NEXUS

#NEXUS
[sample -- data title]

[Name: PTPK_HUMAN        Len:   132  Check: 82D340FD]
[Name: PTPK_MOUSE        Len:   132  Check: 656D494A]


begin data;
 dimensions ntax=2 nchar=132;
 format datatype=protein interleave missing=-;
  matrix
PTPK_HUMA  QLHPAIRVADLLQHINLMKT SDSYGFKEEYESFFEGQSAS WDVAKKDQNRAKNRYGNIIA YDHSRVILQPVEDDPSSDYI NANYID......GYQRPSHY
PTPK_MOUS  QLHPAIRVADLLQHINLMKT SDSYGFKEEYESFFEGQSAS WDVAKKDQNRAKNRYGNIIA YDHSRVILQPVEDDPSSDYI NANYIDIWLYRDGYQRPSHY

PTPK_HUMA  IATQGPVHETVYDFWRMIWQ EQSACIVMVTNL
PTPK_MOUS  IATQGPVHETVYDFWRMVWQ EQSACIVMVTNL

;
  end;

PIR

\\\
ENTRY           PTPK_HUMAN 
TITLE           PTPK_HUMAN, 132 bases, 82D340FD checksum.
SEQUENCE        
                5        10        15        20        25        30
      1  Q L H P A I R V A D L L Q H I N L M K T S D S Y G F K E E Y
     31  E S F F E G Q S A S W D V A K K D Q N R A K N R Y G N I I A
     61  Y D H S R V I L Q P V E D D P S S D Y I N A N Y I D . . . .
     91  . . G Y Q R P S H Y I A T Q G P V H E T V Y D F W R M I W Q
    121  E Q S A C I V M V T N L
///
ENTRY           PTPK_MOUSE 
TITLE           PTPK_MOUSE, 132 bases, 656D494A checksum.
SEQUENCE        
                5        10        15        20        25        30
      1  Q L H P A I R V A D L L Q H I N L M K T S D S Y G F K E E Y
     31  E S F F E G Q S A S W D V A K K D Q N R A K N R Y G N I I A
     61  Y D H S R V I L Q P V E D D P S S D Y I N A N Y I D I W L Y
     91  R D G Y Q R P S H Y I A T Q G P V H E T V Y D F W R M V W Q
    121  E Q S A C I V M V T N L
///

ASN.1

Bioseq-set ::= {
seq-set {
  seq {
    id { local id 1 },
    descr { title "PTPK_HUMAN" },
    inst {
      repr raw, mol aa, length 132, topology linear,
      seq-data
        iupacaa "QLHPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNRAKNRYGNIIAY
DHSRVILQPVEDDPSSDYINANYID......GYQRPSHYIATQGPVHETVYDFWRMIWQEQSACIVMVTNL"
      } } ,
  seq {
    id { local id 2 },
    descr { title "PTPK_MOUSE" },
    inst {
      repr raw, mol aa, length 132, topology linear,
      seq-data
        iupacaa "QLHPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNRAKNRYGNIIAY
DHSRVILQPVEDDPSSDYINANYIDIWLYRDGYQRPSHYIATQGPVHETVYDFWRMVWQEQSACIVMVTNL"
      } } ,
} }



Clustal W


PTPK_HUMAN      QLHPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNRAKNRYGNIIA
PTPK_MOUSE      QLHPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNRAKNRYGNIIA

PTPK_HUMAN      YDHSRVILQPVEDDPSSDYINANYID------GYQRPSHYIATQGPVHETVYDFWRMIWQ
PTPK_MOUSE      YDHSRVILQPVEDDPSSDYINANYIDIWLYRDGYQRPSHYIATQGPVHETVYDFWRMVWQ

PTPK_HUMAN      EQSACIVMVTNL
PTPK_MOUSE      EQSACIVMVTNL


SELEX



# Sample SELEX

PTPK_HUMAN      QLHPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNRAKNRYGNIIA
PTPK_MOUSE      QLHPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNRAKNRYGNIIA

PTPK_HUMAN      YDHSRVILQPVEDDPSSDYINANYID......GYQRPSHYIATQGPVHETVYDFWRMIWQ
PTPK_MOUSE      YDHSRVILQPVEDDPSSDYINANYIDIWLYRDGYQRPSHYIATQGPVHETVYDFWRMVWQ

PTPK_HUMAN      EQSACIVMVTNL
PTPK_MOUSE      EQSACIVMVTNL




Stockholm


# STOCKHOLM 1.0

#=GF SQ 11

rno_305843 MEMSLRPLLSVFVLGLVSTPSTLAQDDPRYTKFLTQHYDAKPK--GRDARYCESMMRRRGLTS----PCK

#=GS rno_305843 AC rno_305843

#=GS rno_305843 DE rno_305843

mmu_11727 MAISPGPLFLIFVLGLVVIPPTLAQDDSRYTKFLTQHHDAKPK--GRDDRYCERMMKRRSLTS----PCK

#=GS mmu_11727 AC mmu_11727

#=GS mmu_11727 DE mmu_11727

ecb_100034041 MAMSLCPLLLVFVLGLGLTPPSLAQDDSRYRQFLTKHYDANPR--GRNDRYCESMMVRRHLTT----PCK

#=GS ecb_100034041 AC ecb_100034041

#=GS ecb_100034041 DE ecb_100034041

ssc_733639 MVILLGPLLLVFMLGLGLAPLSLAKDEDRYTHFLTQHYDAKPK--GRDGRYCESIMKQRGLTR----PCK

#=GS ssc_733639 AC ssc_733639

#=GS ssc_733639 DE ssc_733639

bta_783225 MVMVLSPLFLVFMLGLGLTPLTLAEDDRRYRHFLIQHYDRSPK--GRDNKYCETMMEKRHLTK----PCK

#=GS bta_783225 AC bta_783225

#=GS bta_783225 DE bta_783225

bta_783907 MVMVLSPLFLVFMLDLGLTPQTLAQN-DAYRGFLRKHYDPSPT--GHDDRYCNTMMERRNMTR----PCK

#=GS bta_783907 AC bta_783907

#=GS bta_783907 DE bta_783907

dre_100003397 MEILQSAVIFLLVFSFSFT-VKVPDNESPYEKFLRQHVDP-----DMSVQKCNSEISKRKITAKAGNDCK

#=GS dre_100003397 AC dre_100003397

#=GS dre_100003397 DE dre_100003397

gga_423668 --MAMSSLWWTAILLLALT-VSMCYGVPTYQDFLYKHMDFPKTSFPSNAAYCNVMMVRRGMTAHG--RCK

#=GS gga_423668 AC gga_423668

#=GS gga_423668 DE gga_423668

dre_100101462 -----MKTRQSFIILLLVICASLAVNSQSYNDFKRKHLAPAGMK---EDDCTTLIVTERKIKEKN--QCK

#=GS dre_100101462 AC dre_100101462

#=GS dre_100101462 DE dre_100101462

tr_Q6EUW9 ----MFPKFSFLLIFAVVLSLTHKSLCQDWETFQKKHLTDTVD-------VNCDVEMQKALFN-----CK

#=GS tr_Q6EUW9 AC tr_Q6EUW9

#=GS tr_Q6EUW9 DE tr_Q6EUW9

xla_398124 MLDIMVAVLSSLLTICIILSFSLPSDTQNINAFMEKHIVKEGA-----ETNCNQTIKDRNIRFKN--NCK

#=GS xla_398124 AC xla_398124

#=GS xla_398124 DE xla_398124

//

Stockholm (simple format)


# STOCKHOLM 1.0
HBB_HUMAN   ........VHLTPEEKSAVTALWGKV....NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVL
HBA_HUMAN   .........VLSPADKTNVKAAWGKVGA..HAGEYGAEALERMFLSFPTTKTYFPHF.DLS.....HGSAQVKGHGKKVA
MYG_PHYCA   .........VLSEGEWQLVLHVWAKVEA..DVAGHGQDILIRLFKSHPETLEKFDRFKHLKTEAEMKASEDLKKHGVTVL
GLB5_PETMA  PIVDTGSVAPLSAAEKTKIRSAWAPVYS..TYETSGVDILVKFFTSTPAAQEFFPKFKGLTTADQLKKSADVRWHAERII
HBB_HUMAN   GAFSDGLAHL...D..NLKGTFATLSELHCDKL..HVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANAL
HBA_HUMAN   DALTNAVAHV...D..DMPNALSALSDLHAHKL..RVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVL
MYG_PHYCA   TALGAILKK....K.GHHEAELKPLAQSHATKH..KIPIKYLEFISEAIIHVLHSRHPGDFGADAQGAMNKALELFRKDI
GLB5_PETMA  NAVNDAVASM..DDTEKMSMKLRDLSGKHAKSF..QVDPQYFKVLAAVIADTVAAG.........DAGFEKLMSMICILL
HBB_HUMAN   AHKYH......
HBA_HUMAN   TSKYR......
MYG_PHYCA   AAKYKELGYQG
GLB5_PETMA  RSAY.......
//