Gene YpsIP31758_0510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0510 
Symbol 
ID5387300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp592502 
End bp602554 
Gene Length10053 bp 
Protein Length3350 aa 
Translation table11 
GC content57% 
IMG OID640863481 
Productadhesin/hemagglutinin 
Protein accessionYP_001399503 
Protein GI153948207 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA ACCTGTACCG TATCGTTTTT AATCAAGCGC GCGGCATGTT GATGGTGGTC 
GCTGATATTG CTGCCTCCGG TCGGGCTGCA TCGTCGCCAT CGTCAGGTGT TGGGCATACT
CAGCGCCGCC GGGTCAGTGC CTTATCACCA TTGAGTTTTA GTCTGTTAAT CGCTTTGGGG
GGCATCTCGT TATCCGCTCA GGCCGCAATT GTGGCGGATG GCAGTGCGCC GGGTAATCAG
CAGCCGACCA TTATCAGCAG CGCCAACGGC ACGCCTCAGG TTAATATCCA AACCCCCAGT
AGCGGCGGCG TCTCGCGCAA CGCTTACCGC CAGTTTGATG TCGATAATCG CGGGGTGATC
CTCAATAACG GTCGCGGTGT CAACCAGACC CAGATTGCCG GATTGGTTGA TGGTAATCCG
TGGCTGGCGC GGGGCGAAGC CAGCGTGATC CTCAACGAAG TCAACAGTCG CGACCCGAGT
CAGCTCAACG GCTATATCGA AGTGGCCGGA CGTAAAGCGC AGGTGGTGAT CGCTAACCCC
GCAGGCATTA CCTGCGAGGG CTGTGGTTTT ATCAACGCCA ACCGCGCAAC CCTCACCACC
GGTCAGGCGC AGCTCAATAA CGGCCAACTC ACCGGCTACG ATGTTGAACG GGGTGAAATT
GTTATTCAGG GAAAGGGCCT GGACAGCCGT GGTCAGGATC ATACCGATCT GATCGCCCGT
TCCGTTAAAG TAAATGCGGG CATCTGGGCC AATGAACTGA ATATCACTAC CGGTCGCAAT
CAGGTTGATG CTGCACACCA GAACATCAAT ACCAACGCCG CCGATGGCCG CCCTCGTCCT
GCCGTGGCGG TCGATGTGGC TAATTTGGGG GGGATGTACG CCGGTAAAAT CCGCCTAATC
GGTACTGAAA CCGGTGTTGG CGTGCACAAT GCGGGCGAGA TAGGGGCTTC TGCAGGTGAT
ATCGTGATTA CGGCCGACGG TATGCTGGTG AACCGCGGCC AGATCAGCAG TGCTCAACAA
CTGGCGGTGA ATACCCCCTC AGGCATAGAG AATAGCGGTG TGCTCTATGG GAAGGGCAAT
ACCCAACTGA CCACAGCGGG TAAACTGAGC AACAGTGGCA CGGTCGCGGC GGCGGGTGAC
ACCTTGATCC GCGCGGCAGA GGTTAACAGC AGCCGTAATT CTGTTTTGGG TGCTGGCATT
AAATCCGATA ACAGTGTCAT TACCCGTGGC ACCCTTGATA TTAAAGCCCG TGGGCAGCTA
ACCGCCCAAG GGAAAAATAT TAGCGGCACG GCGCAGACAT TTAATGCGAA CCGTATTGAT
CTCAGCGGTA GCCAAACTCA GAGCGGTGAT CTGACGTTCA CCACCGAAGG TGGCGACATC
GATTTGACAG GGGCTAACCT GTTCGCCAAT CGTCGTCTGT CTGTTTCGAC CCCTTCTTTG
CTACGCACCG ATAAAGCTAA CTTGTTCGCG GAGCAAATCG CGCTCGACGC ACAAGCGCTC
GCCAATGTGG GTGGCGTGAT AACGCAAACT GGGCTGACCG ACTTCAACTT GAATCTACCG
GGTTATATTG ATAACCGTGG TGGCTCTCTC CTCACCCGCG GCAACTTTTT GCTGCAAGCT
GAACACTTGA CCAGTAATAG CCAGAGTTTA CTGGGTGCTG GCATACAGAG TGATGGCAAA
CTGGCTCCGC GTGGTGATCT CAATGTCACC ACGCGGCATG CCTTGATTGC TCAAGGGAAA
ACGCTAGCCG CAGGCACTCT GGCACTCTCC GGCAGCCAGC TTGATCTTAC TGATAGCCTG
ACACAAGCCA AGGATATGCG GTTGACAGCC ACGGAAGGCG ATATTGCGTT AACCGGTGCC
ACGGTGATGG CGGCTAACAC GTTGTTTGCG GATACCCGCC AGATCCTGCG CAGTGATAAG
GCTTACCTCA CCGCTGATCA AATCAATCTC ACCGCCGATT CCCTCTCTAA CGTTGAGGGG
CGGGTCGTCC AAAAAGGCAG CGGTGATTTT AGGCTGGATC TCCCCGGCTA TCTGGACAAT
CGGGGCGGTG TGCTGCTCAC CAAAGGCAAT CTGGCACTGC AAGCCGAACG CCTGACCAGC
AATAGCAAGA GTTTGCTTGG CGCAGGCATA CAAGCCGATG GCAGTAAGGC CAGCAAAGGG
GATTTACAGG CCAATACCAC GCAAGCATTG ATTGCTCAGG GGCAGAATGT CGCTGCGGGC
GCCATGACAT TATCCGGTAG CCGCGTCGAC CTCACTGGCA GCCAGACGCA CGCCAGCAAT
ATCACGATCA CCGCCCGTGA CGGTGATGTT ACGACCCGCG AGGCGATCTT GATCACCCCC
GGCACTCTCT CGATGACTGC GGTAGCCAAC CCTGAACAGA CCCTGAATAA CCGCGGCGGG
AAATTACATG CCGATAATAT CCAACTCAAT CTGGCCAAAC TGGAGAACAG CAACGGGGAA
ATTGCGGCAG CGACCGACCT GTGGCTACGT CTACAAAGCG ATTTCATTCA TCAGGCAGGC
GCGCGTTTAA CGGCAGGGCG TGATCTGCTC TTTAACAGCC GTGGCGCATT GATTAACCAG
TATAAACTGG AAGCCGGGCG GGATATGCAG CTCACCGCGC TCAGTATCCG CAATACCAGT
GCTGATCGTA ATACTAATGC TGATAATAGC AGCCTGCTGG CCGGGCGAGG CTTGTCACTG
AGCACCGATA GCCTGTTTAA CCGTGGGGCT ATCTATACCA CCGGTGTCGG GCAATTTACC
GTCAACGGCA ATACAGAGAA CATCGGTGAA ATTTACACCG AACAGCAACT GACCTTCACC
GCCACCGGCA ATCTGGCTAA CCGTGGCGTG ATGCAAACTC GCGGAGAGAT GCAGCTATCG
GCTCAGGGTG ACTTCAATAA CAGCGGCATG CTGTACAGCG CAGGTGACCA GATGCGGTTG
TCCATCGCCG GTAACCTGAC CAACGAAGGC AAGCTGCACG CCGCCAACGG GGAAATGCGC
CTATTGACCG AGGGTGATCT GGATAACCGC GGTAGCCTTT ATGGCGCAGG GAACAGCGAC
ATCACCACGC AGGGGAATGC CGTTAACACT GGCTCGGTTT ATACCCAGGG CGCGCTACAG
TGGCTGACAA AGGGCAGTGT ACGCAACAGC GCTTCCATCG CCGCACTTGG GGATCTCCAA
CTGCGGGCTA ACGACTTACT TAGCGACAAT CAATCCCTGA TGGCGGCGGG ATTGAAGGCC
GACGGTAGCC GGAGCGACAG CGGTAATCTG GCGGTCAGCA CAGAGCAAGC CTTGATAGCG
CAAGGGCAAA ATATTGCCGC AGGGTCGCTG GCGTTGGCGG GTAGCCAGAT TGATCTCACC
GGCAGCCAGA CTCAGGCCAA CGCCATCAGC CTCACGGCAA AATCGGGTGA TATCACATTA
ACGAGCGCCG TGATTAAAGC CGCCACTCAG CTATTAGTGA CCCAGCTAGC CGCGACACGA
TCATTATTTA TACCGCCATC CTCAACACGA TCATTATCGA CACAATCATC ATCATCAACA
CAAGCATCAT CAACACAAGC ATCGGCCAGT CCCAGCGCCT TGCTGCGTAC CGATAAGGCG
AGTTTGATAG CCGATCAGCT CACGTTCGAT GTGCAAGCGC TCTCCAATCT CGGCGGCGTA
ATCGCGCAAA CGGGGGCCAC CGATTTCAAC CTGAATCTAC CGGGTTATTT AGATAACCGT
GGTGGCACAA TCCTCTCCAA AGGCAACGTG GCCATACAGG CGCAGGGCCT TGATAGCGAC
AGCGGTAGCC TGTTAGGCGC GGGTGTGCAA AGTGATGGCA AGCTGACGAA CGCCGGTGAT
CTGGCGGTGA CGGTCCGTCA GGACTTAATC GCGCACGGGC AAAGCCTTGC CGCCGGTGCC
ATGACGTTAA CCGGCGGCAG GGTTGATCTC ACCGGCAGCC AGACGCAAGC CCGCGGGATA
ACCATCACTG CCAACAAGGG GGATGTCAGC ACCCAGCGCG CCAATATTCT CTCCCTTGGC
TCGCTGGCTA TCAATGCGGG CGCAAATGCA GGGCAAACCC TCAATAACCA AGGCGGCGCA
CTTCAGGCGA ATAACATCGC GTTAAACCTT GGCCAACTGG ATAACCGCAC AGGCAAGATA
GCGGCCAGTC AGGATCTGAC TCTGGGTTTA CAGCGTGACT TCAACATTCT GGCCGACTCT
ACCCTTCAGG CCGGGCGTGA TTTCTCCTTT ACCACCCACG GCGCTTTAAC CAATGACGGG
CAGTTGTTGG CGGGGCGCAA ACTGAGCACC CGTTCGAACA GTTTACTGAA TAACGGCAAT
ATCCGTGCGG TGCAAGCCGA TCTCCGTGCA TCTGGCGCAC TGACTAATCG CGGGGAAATA
TTAACCCGTG GTGGGCTCAG TACTGATGCC AACACCTTGT TTAATAGCGG CACCCTCATC
GGTGCCACCG CCACGCTTAA CGCGCGGGAG CGCATCACTA ACTCGGGTCC TAACGCCCTG
ATCGGTGCAA CGGATAAAAA CGGCACATTA GCGCTGTTGG CACCGGTAAT TGAAAACAGT
GATACCGTCA CGCGCACCGA TACCGCACCG ACCACCACCC TGTTAGGCAT GGGCAAGGTT
ATTCTGGCTG GTGGGCAAGA TAATAGTGGC AATTACTCGT CTGCGGCCCA GGTGCTCAAC
CTTTCCGGTC TGATTGAATC CGGAAATGAT CTGCTGGTTT ACGCCAAAAC GCTGACCAAT
CGCCGTCAGA TTTTAACGGC CACGACAGAC TTTATCGTGG GTGACACCGT GACGGGGGCG
GCTTACTGGA CGGCAGAAAA CCCCGATATT CCGGGGGGGC GTTATACTCA ACCCCCTACC
GGTGGTCCAA TGAACAGTGA TTATATCGGC ACGAACTACA CCTCGACTGT TGCCTATAAC
CGCATCGATC AAATCAGCCC GGAAGCGCAA TTGCTCGCGG GTGGCAATTT GACGCCTCAG
GTGGGCACGC TGGAAAACAA CTGGAGTAAA GTGAGTGCGC AAGGCGTGAT CGATCTTACG
GGGGTCACGC TGCAACAAGA TGACTGGGGG AGCCAACAAC GTCTGGTTGA GCAGACCACC
TCCAGCGGTG AGTATCGCTA CCGGACCTAT AAAGGTAAGT TATGGGGTAT CGCCTGGGGG
CCGGAAATGA AGCTGCGCCC GAACAACCAA TATGCCTCGA GCATCACGGC CAAAACCCTG
ACTGGCAGCG GCACGGTGAT TAACAATACG GTGATAAACA ATGGGGCAGC CCCCGGTGCG
ATTGTCGCCC CACGTGATCG CGACAGCACG GGCAAGAATA TCGCCGTCGA GTTCAACGGG
ATCGCCCTGA CGCTCCCGCG CAGCGGGCTG TATCAGCTCA AGACCGATAA AGGTGACTAT
GCCCCCGGCC CGGAAGCGGC GTTATCTCTC GCGAATATCA GCCCCCCTTC CTCTCTCGAT
GCCACGGGCC AGCGTGGGGT TCCTCCCCCG TCTGACGATC TCAACCGAAC CGGCCTTGTT
ACCCCAGATC GGGCGGTCAG CGGCGGCTAT CTGGTGGAGA CCCATCCGGC GTTTGCCAGC
CTGAATAACT GGAAAGGGTC GGACCTCTAT TTGCAGCAAC TGAGCAGCGA TCCTTCGGTG
ATACACAAAC GGCTGGGGGA TAACGCCTAT GAACAGCGGC TGTTACGGGA TCAGGTGCTG
GCGTTGACGG GGAGAACGGT GGCCAGTGAT TACCGTAGCG AACAGGCGCA GTTCGAGCAG
CTCTTTGCCG CCGGGGTCCA GTACAGCAAA GCGTTCAATT TGGCACCGGG TACGCGCCTC
AGTGCGGAGC AGATGGCAAC CTTAACCGGC AATATCGTGC TGATGGAGAA CCGTGACGTC
GCCGGGCAAA CCGTATTAGT CCCCGTGGTC TATCTGGCGG GGGTTAAACC GGGCGATCTA
CGTGCTAACG GGGCATTGAT TGCGGCGGAG AATATCAGCC TGACCGAGGT ACAGGGGTTC
GCCAATGCGG GGGCGATCAG CGCCACCAAT AACCTGCAAA TCAGCATGGC GAAAGACATC
ACCCTGAACA ACCGTGGCGG CTTGCTTCAG GCGGGTAATC ACCTGCAACT CAGCACGCTG
AACAGCGATA TTGACCTGAC CAGCGCGCGA CTCAATGCCA CGGATCTGCA ACTGGACAGC
GGCCGCGATG TGATCCTGCG TACCGCCAGT GAACAGTACA GCAGCGGTAA CGGCGCGGTG
CAGCGGACGC AAACGATCCT CGGGCCGCTG GCGAGTCTCA ATATCAGTAA TAACGCGGTG
ATCACAGCCC AACGCGATTT TATCCAGCAG GGCGCGGGCA TCAATATTGG CAAGGATCTC
CAGGTGAACA CCGGTGGCGA CTGGCTTCTC AGTACGGTGC AACGTAGCGA CCAGATCAGT
GCGCAGTATG GCGGCGGCAG TGCGACCAGT GGCTCTCTCC GCCATCTGGG CAGCGAGGTG
AAGGTCGGTG GTGCGCTAAG CGCCAACGTA GACAATTTGA CCGCCGTGGG GGCGCGGGTG
AATGCCGGTA CCATCGATGT GCGGGCGCAG AATATCACTC TCAGCGCGGC CACTGACAGC
CTGTCTGTTA CCGGTGGATC CTCAAGCAAG CGCCATACCG CGGCGGTGAA CCTCTATGAT
GAAACGCTTC TCGGCAGCCA GTTGAACGCC ACGGGCGATA TCAATCTGCA GACAGCAAAC
GATATCACCC TCAGTGCCAG CGCGGTGCAA ACGGATGGCG CGTTGAAACT GGCGGCGGGT
GGCGATGTCA CCCTCATCTC CCAGACTGAA CAGCATGACG AGCAGCGCAA TCACACCGGG
ACTAAAAAAG GGCTGGTCTC CAGCACCACC GCCCGCAGCG AAGAGGGCCG GAGCCAGACG
CTGGCGGTGG GTTCGATGCT CTCGGCGGGT TCCATTGATG TCAGTAGCCA AAATATCGCG
GTCGCGGGCA GCAGCGTGGT GGCTGATAAG GATATTCGCC TGCGTGCGCA GGAAAACCTG
ACCGTGAGCA CCGCGCAGCA GAGCGAGAGC GGGTCTCAGC TATTCGAGCA GAAAAAATCC
GGCCTGATGA GCACCGGCGG TATCGGTGTC TTTATCGGGA CTTCCCGGCA GAAAACCACC
GACCAGACCC AAACGGTGAG CCATGTTGGC AGCACGGTGG GCAGCCTGAC GGGCAATGTG
CGTCTCGAGG CGGGCAATCA GTTAACCCTT CACGGCAGCG AAGTGGTGGC GGGTAAAGAC
CTCGCCCTGA CAGGGGCGGA TGTCGCGATC AGCGCCGCAG AAAATAGCCG TTCTCAACAG
TATACTGCCG AGAGTAAACA GCGTGGCCTG ACGGTGGCGC TGTCCGGACC GGTCGGCAGT
GCCGTCAATA CGGCGGTCAC CACCGCCAAA GCGGCCCGAG AAGAAAACAC CGGCCGGCTG
GCGGGATTAC AAGGGGTTAA AGCGGCGCTG TCGGGGGTGC AAGCGGTGCA GGCCGGGCAA
CTGGTGCAGG CCCAGGGAGG CGGTATCACT GAGATGGTGG GCGTCAGTGT GTCGTTAGGC
TCACAAAAAT CGTCCTCGCA GCAACAGCAG GAACAGACCC AGGTGAGCGG TTCGGCCCTG
ACGGCGGGTA ATAACCTGAG CATCAAGGCC ACCGGTGGCG GGAATGCGGC AAACAGCGGC
GATATTCTGA TCGCCGGCAG CCAGCTTAAA GCCGGGGGCG ATACCCGGCT TGATGCGGCG
CGTGACGTGC GGTTACTCGG CGCGGCCAAT AGGCAAAAAA CCGACGGCAG TAACAGCAGC
CGTGGCGGTA GCGTTGGCGT CAGTGTGGGG GGCAGCGGTC TGAGCGTCTT TGCCAATGCC
AACAAGGGGC AGGGCAATGA GCGCGGTGAC GGCACTTTCT GGACGGAAAC CACCGTCGAC
AGTGGCGGAA TGTTCTCGTT GCGCAGCGGT CGCGATACGG CACTGACCGG CGCGCAGGTC
AGCGCTGAAA CGGTCAAGGC CGATGTGGGG CGTAATCTTG CTCTGCAAAG CCAGCAGGAC
CGCGATAATT ATGATGCGAA GCAGAGCCGT GCCAGCGGCG GTATCAGTGT CCCGGTGGCG
GGGGGCGGTG CCGCGGTCAA CCTGAGCATG AGCCGTGACA GGCTATCCAG CCAGTATGAC
TCGGTGCAGG CGCAGACGGG TATTTTTGCC GGTTCTGGCG GTGTTGATAT CCGGGTGGGG
GAGCACACCC AACTGGATGG CGCGGTGATT GCCAGCACGG CGGCAGCCGA TAAAAACACG
CTGGATACCG GCACACTGGG CTTCAGTGAT ATCAAAAATA AAGCGGTATT CACGGTGGAG
CATCAGGGCG GCAGCCTGAG CACCGGTGGC CCGGTGGGGT CAGACCTGCT GAGTAATCTG
AGCGGCATGG TGCTCGCGGG GCTGGGCAAT GGCGGATATG CTGAAGGCAC CACGCAGGCG
GCAGTGAGTG AGGGCACGAT TACCGTTCGC GACACGGAGA ATCAACAGCA GAATGTTGAT
GACCTGAGCC GGGACACCGG GAATGCCAAC GGCAGTATCG GGCCGATTTT TGATAAAGAG
AAAGAGCAGA ACCGGCTGAA AGAAGTGCAG CTGATTGGCG AGATAGGCGG TCAGGCGCTG
GATATTGCCT CCACGCAGGG CAAGATAATT GCCACTCACG CGGCAAACGA CAAGATGAAG
GCGGTGAAGC CGGAAGATAT CGCCGCGGCG GAAAAACAGT GGGAGAAAGC CCATCCGGGC
AAGGCGGCCA CGGCAGAGGA CATCAACCAG CAGATTTACC AGACGGCGTA TAATCAGGCA
TTTAACGCGT CAGGATTTGG GACCGGCGGC CCGGTGCAAC GCGGTATGCA GGCGGCGACA
GCCGCCGTGC AGGGGCTGGC TGGCGGGAAT ATGGGTGCGG CCCTGACGGG TGCCAGTGCG
CCGTATCTGG CGGGGGTGAT TAAGCAAAGT ACAGGCGATA ATCCGGCGGC TAACACAATG
GCACACGCCG TATTGGGCGC GGTGACCGCC TATGCCAGCG GCAACCATGC GCTGGCGGGT
GCGGCTGGCG CGGCCACGGC GGAGTTGATG GCCCCCACGA TTATCAGCGC GCTGGGCTGG
GACAAGAACA CACTCACCGA AGGTCAGAAA CAGGCTGTCA GCGCACTGAG TACATTAGCC
GCCGGGCTGG CCGGTGGCCT GACAGGTGAC AGCACAGCGG ATGCGCTAGC CGGGGGGCAG
GCGGGGAAAA ATGCGGTGGA GAATAACTCT CTGAGTGGAG ATCAAGCCCG CGAGTCTGTT
AAGCAGGTTA CGAGCAACCT GAAAGATCAG GTAAGAGACA AACTGGGTGA AGGCACACTC
TCAGCTATCG TTAACAGTAT AATTAATGCG ACGGCTGACA CAGGTGATGC GATATTAGGT
GGAGCGGATT ACGGCGCTGA TGCGGCGATG GCGCTCACCT CATGCGCCAT GGGAGACAGT
TACTGTACTC AGGCATTAAA CGATCTGGCG GGTAAAAATC AAGCAACGGC AGATACGCTC
AAAGCCCTGA TGAAGAGTGA AACCTGGTCA GCGGTTGCAG GACAGGTGAA AGAAGCGGCT
CAAGGTAACC AGCTTGCTCT GGAAGCCACA GGTGGAATGC TGGCGGGTCT TTTCCTGCCG
GGTAAGAAAT TGCCGGGAAG TAATATAGTT GTCGCTGAAA ATGCTACTAA AGCAATTATT
GATTCGAAGA AGTTTGATTA TCTCTTTGGT AATGCAAAAA GTAGTGGGCA TAACGCCGAT
CGCTCTACCC AATTGGCTCA GACAATGAAC CGTTTGGGGT TAGAAACTAA TGAGAAAGGA
GCAAGTATCC TTACTGAACA CTTAAAACAG GTAGTAAATA CGAAGGGTAA TGTTGTAGAT
ACTTATACTA AAGGAAATCA AGTCTTTGAA GTGAGAGAGT CATTGTTTTT CGGTCCTTCA
GGAAAGGCGG CTAAGTTAGA AACTGCTTTT GAGATCATGC CTGATGGTTC TCGTAGATTT
GTCACAACTA TTCCTAAAGA TGGAAAAAAG TAA
 
Protein sequence
MNKNLYRIVF NQARGMLMVV ADIAASGRAA SSPSSGVGHT QRRRVSALSP LSFSLLIALG 
GISLSAQAAI VADGSAPGNQ QPTIISSANG TPQVNIQTPS SGGVSRNAYR QFDVDNRGVI
LNNGRGVNQT QIAGLVDGNP WLARGEASVI LNEVNSRDPS QLNGYIEVAG RKAQVVIANP
AGITCEGCGF INANRATLTT GQAQLNNGQL TGYDVERGEI VIQGKGLDSR GQDHTDLIAR
SVKVNAGIWA NELNITTGRN QVDAAHQNIN TNAADGRPRP AVAVDVANLG GMYAGKIRLI
GTETGVGVHN AGEIGASAGD IVITADGMLV NRGQISSAQQ LAVNTPSGIE NSGVLYGKGN
TQLTTAGKLS NSGTVAAAGD TLIRAAEVNS SRNSVLGAGI KSDNSVITRG TLDIKARGQL
TAQGKNISGT AQTFNANRID LSGSQTQSGD LTFTTEGGDI DLTGANLFAN RRLSVSTPSL
LRTDKANLFA EQIALDAQAL ANVGGVITQT GLTDFNLNLP GYIDNRGGSL LTRGNFLLQA
EHLTSNSQSL LGAGIQSDGK LAPRGDLNVT TRHALIAQGK TLAAGTLALS GSQLDLTDSL
TQAKDMRLTA TEGDIALTGA TVMAANTLFA DTRQILRSDK AYLTADQINL TADSLSNVEG
RVVQKGSGDF RLDLPGYLDN RGGVLLTKGN LALQAERLTS NSKSLLGAGI QADGSKASKG
DLQANTTQAL IAQGQNVAAG AMTLSGSRVD LTGSQTHASN ITITARDGDV TTREAILITP
GTLSMTAVAN PEQTLNNRGG KLHADNIQLN LAKLENSNGE IAAATDLWLR LQSDFIHQAG
ARLTAGRDLL FNSRGALINQ YKLEAGRDMQ LTALSIRNTS ADRNTNADNS SLLAGRGLSL
STDSLFNRGA IYTTGVGQFT VNGNTENIGE IYTEQQLTFT ATGNLANRGV MQTRGEMQLS
AQGDFNNSGM LYSAGDQMRL SIAGNLTNEG KLHAANGEMR LLTEGDLDNR GSLYGAGNSD
ITTQGNAVNT GSVYTQGALQ WLTKGSVRNS ASIAALGDLQ LRANDLLSDN QSLMAAGLKA
DGSRSDSGNL AVSTEQALIA QGQNIAAGSL ALAGSQIDLT GSQTQANAIS LTAKSGDITL
TSAVIKAATQ LLVTQLAATR SLFIPPSSTR SLSTQSSSST QASSTQASAS PSALLRTDKA
SLIADQLTFD VQALSNLGGV IAQTGATDFN LNLPGYLDNR GGTILSKGNV AIQAQGLDSD
SGSLLGAGVQ SDGKLTNAGD LAVTVRQDLI AHGQSLAAGA MTLTGGRVDL TGSQTQARGI
TITANKGDVS TQRANILSLG SLAINAGANA GQTLNNQGGA LQANNIALNL GQLDNRTGKI
AASQDLTLGL QRDFNILADS TLQAGRDFSF TTHGALTNDG QLLAGRKLST RSNSLLNNGN
IRAVQADLRA SGALTNRGEI LTRGGLSTDA NTLFNSGTLI GATATLNARE RITNSGPNAL
IGATDKNGTL ALLAPVIENS DTVTRTDTAP TTTLLGMGKV ILAGGQDNSG NYSSAAQVLN
LSGLIESGND LLVYAKTLTN RRQILTATTD FIVGDTVTGA AYWTAENPDI PGGRYTQPPT
GGPMNSDYIG TNYTSTVAYN RIDQISPEAQ LLAGGNLTPQ VGTLENNWSK VSAQGVIDLT
GVTLQQDDWG SQQRLVEQTT SSGEYRYRTY KGKLWGIAWG PEMKLRPNNQ YASSITAKTL
TGSGTVINNT VINNGAAPGA IVAPRDRDST GKNIAVEFNG IALTLPRSGL YQLKTDKGDY
APGPEAALSL ANISPPSSLD ATGQRGVPPP SDDLNRTGLV TPDRAVSGGY LVETHPAFAS
LNNWKGSDLY LQQLSSDPSV IHKRLGDNAY EQRLLRDQVL ALTGRTVASD YRSEQAQFEQ
LFAAGVQYSK AFNLAPGTRL SAEQMATLTG NIVLMENRDV AGQTVLVPVV YLAGVKPGDL
RANGALIAAE NISLTEVQGF ANAGAISATN NLQISMAKDI TLNNRGGLLQ AGNHLQLSTL
NSDIDLTSAR LNATDLQLDS GRDVILRTAS EQYSSGNGAV QRTQTILGPL ASLNISNNAV
ITAQRDFIQQ GAGINIGKDL QVNTGGDWLL STVQRSDQIS AQYGGGSATS GSLRHLGSEV
KVGGALSANV DNLTAVGARV NAGTIDVRAQ NITLSAATDS LSVTGGSSSK RHTAAVNLYD
ETLLGSQLNA TGDINLQTAN DITLSASAVQ TDGALKLAAG GDVTLISQTE QHDEQRNHTG
TKKGLVSSTT ARSEEGRSQT LAVGSMLSAG SIDVSSQNIA VAGSSVVADK DIRLRAQENL
TVSTAQQSES GSQLFEQKKS GLMSTGGIGV FIGTSRQKTT DQTQTVSHVG STVGSLTGNV
RLEAGNQLTL HGSEVVAGKD LALTGADVAI SAAENSRSQQ YTAESKQRGL TVALSGPVGS
AVNTAVTTAK AAREENTGRL AGLQGVKAAL SGVQAVQAGQ LVQAQGGGIT EMVGVSVSLG
SQKSSSQQQQ EQTQVSGSAL TAGNNLSIKA TGGGNAANSG DILIAGSQLK AGGDTRLDAA
RDVRLLGAAN RQKTDGSNSS RGGSVGVSVG GSGLSVFANA NKGQGNERGD GTFWTETTVD
SGGMFSLRSG RDTALTGAQV SAETVKADVG RNLALQSQQD RDNYDAKQSR ASGGISVPVA
GGGAAVNLSM SRDRLSSQYD SVQAQTGIFA GSGGVDIRVG EHTQLDGAVI ASTAAADKNT
LDTGTLGFSD IKNKAVFTVE HQGGSLSTGG PVGSDLLSNL SGMVLAGLGN GGYAEGTTQA
AVSEGTITVR DTENQQQNVD DLSRDTGNAN GSIGPIFDKE KEQNRLKEVQ LIGEIGGQAL
DIASTQGKII ATHAANDKMK AVKPEDIAAA EKQWEKAHPG KAATAEDINQ QIYQTAYNQA
FNASGFGTGG PVQRGMQAAT AAVQGLAGGN MGAALTGASA PYLAGVIKQS TGDNPAANTM
AHAVLGAVTA YASGNHALAG AAGAATAELM APTIISALGW DKNTLTEGQK QAVSALSTLA
AGLAGGLTGD STADALAGGQ AGKNAVENNS LSGDQARESV KQVTSNLKDQ VRDKLGEGTL
SAIVNSIINA TADTGDAILG GADYGADAAM ALTSCAMGDS YCTQALNDLA GKNQATADTL
KALMKSETWS AVAGQVKEAA QGNQLALEAT GGMLAGLFLP GKKLPGSNIV VAENATKAII
DSKKFDYLFG NAKSSGHNAD RSTQLAQTMN RLGLETNEKG ASILTEHLKQ VVNTKGNVVD
TYTKGNQVFE VRESLFFGPS GKAAKLETAF EIMPDGSRRF VTTIPKDGKK