Gene Haur_4951 details


Gene Information

Locus tag: Haur_4951
Symbol:
ID: 5736787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6278622
End bp: 6288806
Gene Length: 10185 bp
Protein Length: 3394 aa
Translation table: 11
GC content: 51%
IMG OID: 641282118
Product: hypothetical protein
Protein accession: YP_001547709
Protein GI: 159901462
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
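
The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6278622
END_BP = 6288806


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (10185 bp) and Protein Length (3394 aa) fields.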


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTAGCGG CACTCCTCGT TGGAGTGCTG TTTGTGCTGC AAGGAAGTGG AGCCTTTGCC 
GCTCCAAAGC CTGTGCAGCA AGCCCAAACG TCGGCTGAAC GCCCCATGGC TGCGCCACTT
CTCGGCTCCA ACGATGTGTT GTTGAGCGAT ATGGGCGGGG TTGGCAATGC TGATGGTAGC
ATTGCTGTGA ACCCTGTGGT TGCCTATAAC GTCACCGATA GCCAATATCT TTTGGTGTGG
GAAGGCTTAG AAACCGCCAC TGGCTCATTA AACTTGCATG GTCAATTGAT CGATGCAGTG
ACCGGCCTCG AAATTGGCAC GAATGACTTT TTAATTGCCG ATGACGTTGA TGCGAATGAT
GCTTATGGCA AACCGCAAGT CGTTTGGAAT AGCGTCAACA ACGAATATTT AGTCATTTTT
GAAGGCGATA GCCAAGTTGC CGTCGGTCTT ACCAATGAGC TTGAAGTCTA TGCTCAGCGG
GTTGCCGCCA ACGGCAGCTT GATTGGTACG CCGTTGCGGG TCAGCCAAAT GGGTACTGAT
GGCGTGAATA CCGCCGATGC CTTTGAACCA TCCGTTACCC ATAACGCCAC CAACAATCAA
TATGCGGTGG TTTGGTACGG CGATGATCTG GCTGGCGGTC GCATCGATGG TGAGTTTGAA
GTCTATGTCC AGTTGCTTGG CTTTAGCGGC GGTAACCTGA TTGAGGTTGG CGGCGATGTT
AAAGTCTCCG ATGTTGGCAC AACTGGGAGT GCAACTATTC GCCCAGAAGA CCCCAATATC
GTTTGGAATA GTGCTGCCAA CGAATATTTA GTTGTTTGGC GCTCCGACGA TAGCGGCACC
GACGGCGATT TTGATGTCTA TGGCCAACGC TTAACTGCCG CCCTCGCCGA AGTTGGTGCA
GACGATTTCT TGATCGCCAA TAATGCCAAT CAAGATTCGT TTGATGTGAA TGTTGGCTAC
AATCCAACCA ACAATCTCTA TTTAGTGCTT TGGTCGGGCG ATAACGTGGC CAACTCGGTC
TATAACGTTT ATGGACAGAT CGTCAGCGGG GCAGGAGCGG TAACTGGTGG ACTGTTGACC
CTCTCATCAA CCAACACCGG GGTTTCAACT GACCCAGTTA TTACTTATAA CTCGCGGGAT
AACCAGTTTA TCATCGCGTG GATGGCTCCC AGCGCAGCTG GCAATACCGA GCGCGAAATC
TTCACCCAAA AAATCAATGC TGCAACGGGT GCACGCATCG CCCCCAATGA TGTGCAAGTC
AGCGATATGG GGCCAAATGG TAACGATAAT TTCTTTGCCA ATGGCTTTAT TGGAATTGCC
TACGCTGGCC AGACCCTCAA CCATACCTTA GTTGTTTGGG GTGGGGCTGA CAATCGTGAT
GGTCAAACTA CTGGCGAATC AGAAATTTTT GGCCAATTGA TTACGCCAAT TCTGAATGTA
CGCAAAACCA TCACCAGCAA TATTACCAAC CTCGATGCCT TCGACACGCT AACCTATCAA
ATCGAAGTTG AACATGCCAC GATTGTTGAA GGCGCTGATA CGGTTTCGTT GAGTTTGGCC
GATGCTTTTA ATCTCAACCT GACCGATGAT TTACCAGCCC AACTGAATAG CCCCACGATT
GTCAGTGCGC TTGTTAGTGA TGGGGCAAAC AGCACCAATG TTGCTGGCAA CCTCTCGGTT
GGTAGCGGCG ATCTGGCAAC CTCAACCCCA TTTGGCTTAC GCTACCGCTC CAATGGCAAC
AACAGCGAAA AATTAACCTT GGTGCTGAAC ACCAAAGTTG CCAATACAAG CGTTGCTGGC
GCAATTTTCG GCAATACTGC CAATGCAACG TGGAGCAATA CCAGCTTACT TGGCAGTGTT
ACAGGCTTGA CTGATAGTAG CGCGAATGTG AACGCCACGA TGGCCCGCGC CTTTACCGTC
AGCAAATCGA GCCTTGAAAC TGAGGCGTTG ATTAACCAAG ATGTTACCTA TCATCTTGAC
GTTGGCGTGA TTGAAGGAAC GACCAATAGC TTACAATTTG TCGATACCCT GCCTGCGAAC
ACCAGTTATG TGCCAGGCTC AATTAACGTT AGTAATAGCA ATGGAATGAC GATCAATGGC
TTAATTGCCA ATGTCAGTGG CCAAACCTTA ACGATCAATG CAAGCAGTGT CGTCAATCCA
GGCAATGTTG ATAACGGCGC AACCGCTGAT AGCGATGTAT TCCGTTTATC CTATCAAGTA
AAAGTGCTTG ATGTGCCAGC CAATGTCAGC GGCGTGGTTT TGACCAATAG TGTCAATGCT
TCGGCTAGCC CCGGCCAAGT TGATAATGGC AACACTCATA ACCTGACGAT TGTTGAACCA
TTTTTGGATG TAAGCAAAAC CATCGTTGGC GTATCAACCG CTGTTGATGC TGGCGATACG
GTGCGCTACC AGATTCGGGT CAGCCATACC GCTGCCAGCA CCGCCCAAGC TTATGATGTG
AGTGTAGTTG ATAATTTACC TGCGGTGCTT GGCTCACCAG TGGTTGAATC AGCAACAATC
AGCGACGGTG CAACCAATAC CGACGTGACC AGTAATTTCA CAATCAATGG CTCTGGTCAA
CTTTCAACCA CAACAGCCAC TAATCTTAAT ATCAATACCA ATGGTCCAAA TGATCAATCC
TTGACAATTG TTGTACGCGG CGTTGTCAAT AATACAGTTG CCCCTGGCGC AACGATTGCC
AATACTGCCA ATATTACCTG GCGTAATGCA ACGTCGCTCC AGCGCTCAAA TTATAACGAT
TCGAGCACTG CACCGAATAT TACGGTTCCA GCGCCGTTTA CCGTGACCAA ATCGGTGATT
GCAGGTAGCA CACCTACGAT TGGCAGCCTT GTTACCTATC AATTAACCGT CACAGTCTTA
GAAGGTACAA CCAGCAATAT TCAGCTGGTC GATACCTTAC CTGCTGGGAT GAGTTATGTA
ACTGGTTCAT CCTCACTCAA TGCCAATGGG ATGACCATAG CGACCGTAAC TGTCTCACCT
GTCGGTCAGG TAGTGACCTT TAGTACAGCA AGTGTCGTCA ACCCCGGCAA CTCCGATGCT
CCAAACATTA TGGATACTGA TAGCTTTACA ATTACCTATC AGGCTCGGGT TAACGATGTG
CCTGGCAACG TTGCTGGGAC GGTATTAGCC AATGATGTTG ATGCAACTGC TGATGGCGTT
GCAGCCGATA ACAACAACTC GGTGAGTGTT ACGGTGCGTG AGCCACTCTT GTCGATTGAT
AAGAGCATCA CGACCAGCAC TGCCGGGGTT GATGCTGCGG ATACCGTGCG CTATCGAATT
GAAGTCTTCC CTCAAGCAGC AAGCAATGCC AACGCCTTTG GTCTCAATAT CACCGATGAT
ATGCCGGCTG CCTTGCAAGG TACGGTGATT GAAAGCGCGA CGATTAGCGA TGGCGCAACC
AATACCGATG TTGCCAGTAA CTTCAGCATC AATGGCAGCG GCGATTTGGT CACGCTTACC
CCAGTAGACT TAGCGCTGAA TACCAACGGG CCGAGCGATC AAGTTTTGGT GATTGTAATG
CGCGGCACGG TTCGCAACAC CGTCAACCCA GGTGGCACAA TTGCCAATGC TGCCACTGTG
GTATGGCGCA ATAGCGAAAA CCTGCAACGC GCCAGCTACA CTGCCACCGA TCTTGCGCCA
AGCATCACGA TTCCGGCCAG CTTTAGTGTT ACCAAGACGG TTGCAGCACC AGGCACAAAC
GTTGCCGTCG GCGCAACTGT TACCTATCGC TTAAGTACAA CCGTGATCGA AGGCACGACC
AATAATCTGC AATGGGTTGA TACCTTGCCC GCTGGTATGA GCTATGTACC AGGTTCAGCA
ACCGTTGAAA CCGCCAATGG CATGACTATC CCTAGCCTGA ACGTCTCATT AAGCGGTCAA
GTACTGACTA TCGGTGCTAG CAGTGTAACC AACCCGGGTA ACGTTGATAA TGCTGCTGCT
GCTGATACTG ATAGCTTCAC GATTACCTAT CAGGCTACGG TCAATGATGT TGCTGGCAAT
GTTACTGGCA CGGTGCTAAC CAATGACGTT GATGCAACTG CTAATCCAGG TTTGAGCGAC
AACAATAACT CCGCCAGCAT TACCGTGGTT GAACCACTCT TGGCGATTGA TAAGGCACTC
ACGACCAGCG CGGTTGGAGT TGATGCAGGC GATACCGTGC GCTACCGCAT TGAAGTCTCG
CCACAAGCTA CTAGTGACAG CAATGCCTTT GACCTTAATA TTAGCGATGA TATGCCCGCT
GGCATTATAA ATATGGTGAT TGAAAGTGCC ACAATTAGCG ATGGTGCAAC CAATACCGAT
GTTGCCAGTA GCTTTAGCAT CAACGGCAGC GGCGATTTGG TCACGCTTAC GCCGCCAGAC
TTGTTGCTCA ACACCAATGG CACCAACGAT CAACGCCTAA CGATCGTTGT AGCTGGCCAA
GTACGCAACC AGATTAATCC AGGTGGCTCG ATTGCAAACG CTGCAACCGT GACCTGGCGT
AATAGTGCTG GTGTCCAACG GGCCAGCTAC ACGGCGACCG CTCTTGCGCC AACAATTACC
GTTGCCAACA ACTTTAGCGT CACCAAAACG GTAGTTGCAC CTGGCCCTGA AGTTGGGGCT
GGAGCAACTG TCACCTACCG TTTGAGCACA ACCCTGATCG AAGGTACAAC CGATAATATT
CAGTGGGTTG ATACCTTGCC TGTTGGCATG ACCTATGTAC CAGCTTCGGC AGTGATTGAA
AATGCCAACG GCATGACGGT CAATGGCTTT GCCGCCAATA TCAGCGGCCA AGTACTCACC
ATCAGCACTA GCAGCGTGGT CAATCCAGGC AACGTGGATA ATGCAGCGGT TGCTGATACC
GATAGCTTTA CGATTACCTA TCAGGCAACC GTAGCAGGCA ACGCCACCAG CGGTACATTG
CTTACCAACG ATGTTGATGC GAGTGCCGAC CCAGGCTTGA GCGATACCAA CAATCAAGTT
ACGGTAACGG TGGTTGAGCC AGAACTCAAC ATCAGCAAGA CGATCAATTC AGTTACAACT
GGGATCGACG CTGGTGATGA AGTACGCTAT TTCATCAAGG TGCAGCCAAC CGCAGGTAGC
GGAGCCAACG CCACCAGTGT GATGATTACC GATACCTTGC CCAGCCAATT GACTGCTACC
AGCATTCTCT CGGCAACGAT CAGTGATGGG GCAACTACCA CCAATGTTGC TGGCAACTTC
ACAATTGTTG GTGGCCAATT ACGCACCACT GGCAACCTTG GATTGGATCT CAACACTAAT
GGAGCTAACG ACCAAGTGTT GACCTTGATG GTTCGTGGCT TAGTGGCAGA CTCAACTACG
CCACTCAGCA CGATCAACAA TACTGCCGAC CTTACATGGC GTAACCCGGG CGGTGTCTTC
AATGCCAACT ATAACGATAG TGCCACGACT CCCACGATCA ATGTGGCGAG CACATGGACA
GTTGGTAAGT TTATTGTTCC GCCAATCACC CAAGTCAGCA TCGGTCAAGT CGTAACCTAC
ACCGTTAGCA CGACCGTGAT TGAAGGTACA ACCCTCAATC CAGTCTGGGT TGATACGATT
CCAACTGGTA TGAGCTATGT GCCAGGTTCG GCGCAAATCT CGAATGCCAA CGGCGTGACG
ATCAACGATT TCAGCGTCGC GCTCAGCGGC CAAACCCTGA CGATTAGCGC AACCAGTATC
GTCAACCCCG GCAATACCAA CAATGCTGGC AACGTTGATA TCGACACCTT CTTGATGACC
TATCAAACAG TCGTCGGTAA TGTCGCTGGT GGCACGGTTT TGACCAACGA CCTTGATTCA
AGCGCTAGCC CAAGCTTGGT CGATAACAAT AATCAAGTGT CGGTGACCGT GGTTGAGCCA
ACCTTGAGCG TGGTCAAGAC CATCACTACC GCGACAGGTG GCGTTGACGC TGGAGACACG
GTACGCTACC AAATTCGCGT AGCTCACGTG CCAAGCAGCA ACAATAATGC AACCTCGGTC
TTGCTGACTG ACACCTTGCC ATTACAACTC CAAAACCTGA CGCTGGTTTC AGCGATTGTC
AGCGATGGTG CAACCAGCAC CAATGTTTCT GGTAATTTCT CACTCAGCGG TGGTGTGCTC
CGAGTAACTG GCAATCTCAA TCTTTTGATC GATACCAATG GCTCGAACGA TCAGGTACTG
ACAGTGATTG TTCAAGGCAC AGTGCGCGAT CTGATTACGC CAGACTCGAC AATTGACAAT
TCAGCAACCA CAACTTGGTC GAACGCTAGC GGAGTTAGCC GCCCAGTCTA TACGGCAACG
GGTTCCGCCC CAACCATTAC TGCTCCAAGT GTTTGGAGCG TGACGAAGGC GATTTCGCCA
GCAGTTAGTA CGGTTTCGCC AGGCCAAGTG GTTGAATACA CCTTAACTAC GACGGTACTG
GAAGGCACAA CGACCAACCC TCGCTGGGTC GATACCTTGC CTGTTGGTAT GAGCTATGTT
GCTGGTTCAG CCCAGGTGCT CGATGCCAAT GGCATGACGA TCAATGGCTT TAGCGCTAAC
GTAAGCGGCC AAACATTAAC GATTGCGGCC AGCAGCGTGG TCAACCCAGG CAATGTCAAT
AACGCCGCAG CAACCGATAG CGATAGCTTT GTGATTCGCT ACCGAGCCAT GGTTGATAAC
GACGTAAGCC TTGGCACAAC CTTGACCAAC GACGTTGATG CAACCGCCGA TCCAAGCCTA
AGCGACAACG ACAATTCGGT TAGCGTGACT GTGCGCGAAC CCGACTTGAC GCTAAGTGTT
GCTGAAACCA GCGATGCTCA ACTCGAAGCA GGCGATCCAA TCACCTTTGT CGTAACGGTC
AACAATCCTG CTGGAGCCAA TGCACAGGAT GCGAACACGG TTGCAATTAC TAGCAGCTTT
ACCAGCAACC TCAGCAACTT GAACATTGTG AGCGCCCAAC AAACTGGCGG GGCAACTGGT
GGTAGCTTTA CAATTGTTGG CAACAACCTG CAAACCAATA CCCCAGCAAG CATGCCAATT
GGCTCGCAAA TTGTCTTGAC GATTAGTGCT GAAGTTAACA ACACTGCTGC TCCGGCTAGC
TTGGCTGGCA TGCAAAGCGG CGTAACTTGG CGCAACGCAG CCGATGTAAG CTTGCCACGC
TATAGCAAGA CCGATAGCAT TGATGCGTCG ATTGCTTCGT TCTTTACAGT AACCAAAGCA
ATTGTGCCAG CAGCGAGCAC GGTTTCAGCT GGCCAGACCG TCACCTATAC CCTCGAAGTT
GAGGTGATTG AAGGCAGCAC CAGCAATATC GTGCTAACCG ATACCTTGCC AGTTGGCATG
AGCTATGAAG TTGGCTCAGG AGCAATCTTG AGCAATGGCG GAATCACGCT GAATAACTTT
ACGGTGACCA GCGTAGGCCA AACGGTGATC TTCCGCGTCG ATCCAGCGAC CAATCCTGGT
AGCTACCCAG GCGATGGCCT CGATACCGAT AACTTTAGTT TGGTCTATCG TGCAACCATC
GGCCAAAACG TGCCTTCGGG TACAGCCTTG ACCAACAATG TTGATGCAAC GGCTAGCCCC
AACCTGAGCG ATAACGATAA CTCGGTCACG GTAACCGTGG TTGAGCCTGA CCTCAACATC
AATTTGAGCA AACTTACGGC TGATGGTGGC GTAGATGCCG CCGATAGAAT CCGCCAACAA
CTGGTGATTA GCCATACCAC CAACAGCTTG GCCTCGGCCT ATGATGTTGA TTTGAATGTT
GACTTGCCAG CAAGCTTTGG TACGCCAACC TTGATCTCGG CGATTGTCAG CGATGGTGCA
ACCAGCACCA ACTTTAGTGG TAATTTGGGC TTCGATGCCA ACGGTGATTT GATCAACACT
GGCAACATTA ATCTCTTGAT CAATACCAAT GGCAACAACG ACCAACGACT GACCGTGATT
GTGGAGTACA CGCTGCCCAA TAGCATCAAC CAAGGTGCAA CGGCAAGCAG CGAAGGATCA
ATCACCTGGA AGAATGTTAG CGGTTCGCAA TTACCAAGCT TCACTGCTGA CAGCAATGTG
TTGGATTACA GCGTAGCAAC CAGCTTCGCG CTTGAAAAAG CCGTGGCGAA CACCAGCATT
GCAAGCACAG TTGGCAACCA AGTGACCTTC GGCGAAGTTG TGACCTACGT CTTAACCGCA
ACCGTACTCG AAGGTACAAC CAATAACTTA AGCTTTGTTG ATAGCCTGCC AGTTGGTTTG
ACCTATGTCG CTGGTTCGGC CAGTGTAGCT AACGCCAACG GAATGACTGT CAATGGCTTG
ACCGCTAACC TCAGTGGCCA AACTTTGACA ATTGCTGCCA GCAGCGTGGT CAACCCAGGC
AATGTCAACG ATGCTGATAC GATCGATAGT GATAGCTTTA CGATTACCTA CCAAGCAACC
GTCGATGGCA GCAGTAGCGT GGTACGTGGC GCAAAACTGC TGAATAGCGT CAATGGTAGC
GCCGACCCAA GCTTGACCGA TACGGTCAAT ACTGCCACCG TCGATGTCGC TGAACCAACC
TTGACCATCG TGAAGGCAGT TGATGACAGC ACACCTGACT TTGGTCAAGT GCTCGATTAC
ACCTTGACGA TTGAACACAC CGCCCAAAGC AATTCGATTG CCTATGATCT GCTGGTTGCA
GATAGTTTAC CAACTGGTTT GAGTGCGGTT GCTGGTTCAC TGAGCGTTAT CTCAGGTCCA
GCTCCAACCA GCTTTGTAAT CAATGGCTCA ACGATCGAAG CAACCTTCGC TAGCTTCGCA
ACTGGCTCAA CCACAGTATT AGGCTACCAA GCCCGTGTTG CTTCGCCACC AGCTTTGGCA
ATTGGCGCAA GCTTGGTCAA CGAGGTTGAT CTCACCTGGA CTTCGGCCAG TGGCACGGTC
AACTTAGAAC GCAGCTACAA CGCTAGCGAT GATGTAACCG TCACGCTAAC CAGCGTCGAT
TTGGCAATCA GCAAGCAAGT TACACCAAGT GTTGCAACGC CAGGTACGCC AGTCACCTTT
ACCATCCATT TGACCAACAC TGGCAACCTG CAAACCAGCA ACATCGTCTT GACCGATATT
GTCTCGCCAT TGCTGACCAA TGTGAATGTG AGCAGCCAAG GCCTGACGAT CACCGATACG
GGAGCCAACC CAAGTTACGT TTGGACGGTC AACGATCTTC CAGCGGGCGC AAGCGGTTCG
ATTCAAATTT CGGGGATTAT CAGCCCAAGC TTGGCAAGCG ATACAACCTT AATCAACAAT
GTCAGTGTCA GCGATGATAT TGACCAACTG TTGGCCAACA ATAGTGCTAG CGCGATTGTG
GCCTTGGTTG TGCCACGCAT CAGCTTCAGC CTCAACAATT ACGAGGTGGT CGAAGCAACG
GGCAGCGCAA CCGTAACCGT TCAACTAAAC GCGCCAAATC CAAATAGCGC AGTTCAGGTA
ACCTTTGCCA CCAGCAATGG TACGGCAACG CTCAGCGATT ACACCCCAGT CACCCAAACG
CTGACCATCC CACAAGGTCA AACCAGCGTT CAAGTGATTG TGCCAATCAT TGATGATAGC
ACCTATGAAC TGAACGAAAC CATCTTGTTG AGTTTGACCA ACCCAGCTGG GACAGCCTTC
GGCACACCAA TCACCAGCAC AATCACAATC ATCAACGACG ATCGAGCCTT GGTCTTCATC
CCAATGGCGA CCAAACCAGC TTCGGTTGAC GTAGTGGTTG ATAGCATTCA AGTACTGAAT
GGCACAGTCC GCGTCACAAT TCGCAACACA GGTTCGGAGC CACTGTCGGG CGGCTATTGG
GTTGATGCCT ATATCAATCC ATTCCAAGCG CCAACCGCAG TCAACCAACC ATGGAATAGT
ATTAGTCTCT TCGGGATGGC ATGGGCAGTT CCAACCAATG CACCGGTGCT AGCGCCGAAT
GCAACCTTGA CATTAATCAG CGGTGGCCAA TACTATCGCT CAGATTTGAG CTTCGTGCCA
CCAGGCCAAC TCAGTGCTGG CAGTCGGATC TACGCCCAAG CCGATTCGTT TGGCTTGACC
AACTATGGCG CGATTCTGGA AACTCACGAA ATCAATGGTG GAACCTACAA CAATATTCGC
GGGCCATACA TTGTGCCTGC CAACACCACC TTGGTCTTTG ATGAGCAACG CCCAAGCACC
CCAGAAGATT TGACTGGCGT GCCAGAACGA CCATTACCAC GCTAA
 
Protein sequence
MVAALLVGVL FVLQGSGAFA APKPVQQAQT SAERPMAAPL LGSNDVLLSD MGGVGNADGS 
IAVNPVVAYN VTDSQYLLVW EGLETATGSL NLHGQLIDAV TGLEIGTNDF LIADDVDAND
AYGKPQVVWN SVNNEYLVIF EGDSQVAVGL TNELEVYAQR VAANGSLIGT PLRVSQMGTD
GVNTADAFEP SVTHNATNNQ YAVVWYGDDL AGGRIDGEFE VYVQLLGFSG GNLIEVGGDV
KVSDVGTTGS ATIRPEDPNI VWNSAANEYL VVWRSDDSGT DGDFDVYGQR LTAALAEVGA
DDFLIANNAN QDSFDVNVGY NPTNNLYLVL WSGDNVANSV YNVYGQIVSG AGAVTGGLLT
LSSTNTGVST DPVITYNSRD NQFIIAWMAP SAAGNTEREI FTQKINAATG ARIAPNDVQV
SDMGPNGNDN FFANGFIGIA YAGQTLNHTL VVWGGADNRD GQTTGESEIF GQLITPILNV
RKTITSNITN LDAFDTLTYQ IEVEHATIVE GADTVSLSLA DAFNLNLTDD LPAQLNSPTI
VSALVSDGAN STNVAGNLSV GSGDLATSTP FGLRYRSNGN NSEKLTLVLN TKVANTSVAG
AIFGNTANAT WSNTSLLGSV TGLTDSSANV NATMARAFTV SKSSLETEAL INQDVTYHLD
VGVIEGTTNS LQFVDTLPAN TSYVPGSINV SNSNGMTING LIANVSGQTL TINASSVVNP
GNVDNGATAD SDVFRLSYQV KVLDVPANVS GVVLTNSVNA SASPGQVDNG NTHNLTIVEP
FLDVSKTIVG VSTAVDAGDT VRYQIRVSHT AASTAQAYDV SVVDNLPAVL GSPVVESATI
SDGATNTDVT SNFTINGSGQ LSTTTATNLN INTNGPNDQS LTIVVRGVVN NTVAPGATIA
NTANITWRNA TSLQRSNYND SSTAPNITVP APFTVTKSVI AGSTPTIGSL VTYQLTVTVL
EGTTSNIQLV DTLPAGMSYV TGSSSLNANG MTIATVTVSP VGQVVTFSTA SVVNPGNSDA
PNIMDTDSFT ITYQARVNDV PGNVAGTVLA NDVDATADGV AADNNNSVSV TVREPLLSID
KSITTSTAGV DAADTVRYRI EVFPQAASNA NAFGLNITDD MPAALQGTVI ESATISDGAT
NTDVASNFSI NGSGDLVTLT PVDLALNTNG PSDQVLVIVM RGTVRNTVNP GGTIANAATV
VWRNSENLQR ASYTATDLAP SITIPASFSV TKTVAAPGTN VAVGATVTYR LSTTVIEGTT
NNLQWVDTLP AGMSYVPGSA TVETANGMTI PSLNVSLSGQ VLTIGASSVT NPGNVDNAAA
ADTDSFTITY QATVNDVAGN VTGTVLTNDV DATANPGLSD NNNSASITVV EPLLAIDKAL
TTSAVGVDAG DTVRYRIEVS PQATSDSNAF DLNISDDMPA GIINMVIESA TISDGATNTD
VASSFSINGS GDLVTLTPPD LLLNTNGTND QRLTIVVAGQ VRNQINPGGS IANAATVTWR
NSAGVQRASY TATALAPTIT VANNFSVTKT VVAPGPEVGA GATVTYRLST TLIEGTTDNI
QWVDTLPVGM TYVPASAVIE NANGMTVNGF AANISGQVLT ISTSSVVNPG NVDNAAVADT
DSFTITYQAT VAGNATSGTL LTNDVDASAD PGLSDTNNQV TVTVVEPELN ISKTINSVTT
GIDAGDEVRY FIKVQPTAGS GANATSVMIT DTLPSQLTAT SILSATISDG ATTTNVAGNF
TIVGGQLRTT GNLGLDLNTN GANDQVLTLM VRGLVADSTT PLSTINNTAD LTWRNPGGVF
NANYNDSATT PTINVASTWT VGKFIVPPIT QVSIGQVVTY TVSTTVIEGT TLNPVWVDTI
PTGMSYVPGS AQISNANGVT INDFSVALSG QTLTISATSI VNPGNTNNAG NVDIDTFLMT
YQTVVGNVAG GTVLTNDLDS SASPSLVDNN NQVSVTVVEP TLSVVKTITT ATGGVDAGDT
VRYQIRVAHV PSSNNNATSV LLTDTLPLQL QNLTLVSAIV SDGATSTNVS GNFSLSGGVL
RVTGNLNLLI DTNGSNDQVL TVIVQGTVRD LITPDSTIDN SATTTWSNAS GVSRPVYTAT
GSAPTITAPS VWSVTKAISP AVSTVSPGQV VEYTLTTTVL EGTTTNPRWV DTLPVGMSYV
AGSAQVLDAN GMTINGFSAN VSGQTLTIAA SSVVNPGNVN NAAATDSDSF VIRYRAMVDN
DVSLGTTLTN DVDATADPSL SDNDNSVSVT VREPDLTLSV AETSDAQLEA GDPITFVVTV
NNPAGANAQD ANTVAITSSF TSNLSNLNIV SAQQTGGATG GSFTIVGNNL QTNTPASMPI
GSQIVLTISA EVNNTAAPAS LAGMQSGVTW RNAADVSLPR YSKTDSIDAS IASFFTVTKA
IVPAASTVSA GQTVTYTLEV EVIEGSTSNI VLTDTLPVGM SYEVGSGAIL SNGGITLNNF
TVTSVGQTVI FRVDPATNPG SYPGDGLDTD NFSLVYRATI GQNVPSGTAL TNNVDATASP
NLSDNDNSVT VTVVEPDLNI NLSKLTADGG VDAADRIRQQ LVISHTTNSL ASAYDVDLNV
DLPASFGTPT LISAIVSDGA TSTNFSGNLG FDANGDLINT GNINLLINTN GNNDQRLTVI
VEYTLPNSIN QGATASSEGS ITWKNVSGSQ LPSFTADSNV LDYSVATSFA LEKAVANTSI
ASTVGNQVTF GEVVTYVLTA TVLEGTTNNL SFVDSLPVGL TYVAGSASVA NANGMTVNGL
TANLSGQTLT IAASSVVNPG NVNDADTIDS DSFTITYQAT VDGSSSVVRG AKLLNSVNGS
ADPSLTDTVN TATVDVAEPT LTIVKAVDDS TPDFGQVLDY TLTIEHTAQS NSIAYDLLVA
DSLPTGLSAV AGSLSVISGP APTSFVINGS TIEATFASFA TGSTTVLGYQ ARVASPPALA
IGASLVNEVD LTWTSASGTV NLERSYNASD DVTVTLTSVD LAISKQVTPS VATPGTPVTF
TIHLTNTGNL QTSNIVLTDI VSPLLTNVNV SSQGLTITDT GANPSYVWTV NDLPAGASGS
IQISGIISPS LASDTTLINN VSVSDDIDQL LANNSASAIV ALVVPRISFS LNNYEVVEAT
GSATVTVQLN APNPNSAVQV TFATSNGTAT LSDYTPVTQT LTIPQGQTSV QVIVPIIDDS
TYELNETILL SLTNPAGTAF GTPITSTITI INDDRALVFI PMATKPASVD VVVDSIQVLN
GTVRVTIRNT GSEPLSGGYW VDAYINPFQA PTAVNQPWNS ISLFGMAWAV PTNAPVLAPN
ATLTLISGGQ YYRSDLSFVP PGQLSAGSRI YAQADSFGLT NYGAILETHE INGGTYNNIR
GPYIVPANTT LVFDEQRPST PEDLTGVPER PLPR
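
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example rather than the database's own method. It assumes the amino acid assignments of translation table 11, that an alternative initiation codon such as the GTG at the start of this gene is read as methionine, and that whitespace in the sequence is only formatting. `translate_cds` is a hypothetical helper, and `START_CODONS` lists only a common subset of the initiation codons allowed by table 11.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons accepted under table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons:
        return ""
    if codons[0] not in START_CODONS:
        raise ValueError(f"unexpected start codon: {codons[0]}")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "GTGGTAGCGG CACTCCTCGT TGGAGTGCTG TTTGTGCTGC AAGGAAGTGG AGCCTTTGCC"
print(translate_cds(first_line))  # MVAALLVGVLFVLQGSGAFA
```

The printed residues match the first 20 residues of the protein sequence listed above.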