Gene HS_0790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0790 
Symbol 
ID4240281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp847130 
End bp858892 
Gene Length11763 bp 
Protein Length3920 aa 
Translation table11 
GC content40% 
IMG OID638104344 
Productlarge adhesin 
Protein accessionYP_719000 
Protein GI113460933 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA TCTTCAAAAC CAAATACGAT GTAACAACTG GACAAACCAA AGTGGTGTCT 
GAATTAGCGA ATAACCGTCA GGTGGCGAGC CGTGTTGAGG CAGCGGGAAG TCAGCCGAAG
TGCGGTGTGT TTTTGGATAA TTTTTTAGGG GTGTTTAAAC TTGCACCGCT GGCGTTAGCG
TTGTCGGTGG CGTTGCCGAA TGTGGGGTAT ACGGCTAATG TATGGATTGA GTTTGAAAAT
GTTAGAAAAG AAGCTGTAAA TTTAAATGAA GGAACTGGAA TTTGGAACGA TAGAAGGGAT
ATGAATGATC CTAAAAATAG AGAAGCAACT ATACTATCGA GTGGAATGAA TAGAACAGGG
GCTGACACGC AGTTGAGAAA TAAAGATTTT TATAAAACAG TTGTTATTGG TTCTCGAGCT
GTAGGGGGAG GTGGTGGAAC TACCTCTATA GGATATGGAA CTATTGTTGG GAAAAATAAT
TCTGCTGTTT CAATAGGGGA ATCACACCAA GGAACAGCTG TTGGTTATCG TTCATTTGCA
CAGGGAAATG AATCTACCGC ATTGGGGAAT GATGCGGTGG CATGGGGAGA ATCAGCAATA
TCTATCGGGT CGGATAATAT TGGAAAAGGT GTAACTAAAT ATACAAAAAA AGGGTTAGCA
TATGAGGTGT GGGGACTATT TAGAAAAGCA GGAAAGAATT TCAATTATAC AAGTGAATAT
TCGGCAATAG ATAGTGGTAA TTTAAATGTG ACTCTAGAAG ACTATCAACA CTACCTAAAC
ACTAATGAAA TTAGTACCGA AAAATATTTT TACAAAACCC ACAACTGGGC ATATGGCGAC
TCATCTATTG CAATAGGTAG CCGCAACGTT GCTTATGGGC AAGCCGCAGT GGCGATAGGT
ACAGCCTCTG TAGCACAAGG GGACTATTCA ACCGCCTTTG GTATTGGAAC CTATGCAAAA
GGAAACTCAG CAGTAGCTGT TGGTAATGAA ACCTATGTGT ATGCTAATAA TTCAATTGGT
GTCGGTAATG AAGTGCAAGC GATTAACGAT GGCTCGATGG TGTATGGCTA TCAATCTTAT
GCGGGAGGTA GCGGAGCAGT CGCCATCGGT AAAAGAGCAT TAGCTAATGT TGCACCATCA
GACCATTTTA CACAAACAGT TGAAGGTTTT TCGGATAACT GGTATGAGGG TAATAGTACA
ATTCATGCAC TTGGTAAATT AGATGATCCT AGGAAACATG GTAAAAACAA AGGATTAGAT
GACTATTTTC TGCCTAAAAC ACAAAGACAA CAAGGAACAG AAGAAGACAA GGCAGAAAGT
AAAAATAGCG GAGCAGTTGC GATTGGTTAT TATGTATATG CTTTAGGCGA GAACTCAATT
GCATTAGGAA GACAAGCGTA TTCTAAGGGT GACCGTTCGA TTGCCATCGG TCCTTATGCG
TATGGTGCTA AGGAAAAAAC AGCCGCACTA GGTTATGGAT CCAAGGCTAT AGGAGAACAA
TCAATGGCGT TAGGATCGCT GTCTCGTGCA GAGGGGCAAA ACAGTATCGC TATTGGGGTC
AATAGTGCGG TAAAAAATGA AACTAACTCT ATAAAAAGAA ACGGTCAAAA CACGATAGCC
ATCGGTAATG AAACCGAAGC GACGATGGAT AATTCCGTGG CATTGGGTTA TAAATCCACG
ACGAAATATT TTTATAAAGA TGATACGGAT AAACATACCG CAACATTATT AGAAGGAAAA
GACGCCATTA GTTTGCCATC CTATGCACCA GAGGGGACGA GCTATAAGTT ATCAACTGAT
GCAGCAGCAG GTATTGTATC CGTAGGTTGG AAAAAAAATA GCAATGAACT TGGTTTAAGA
CGGATTGTCG GGGTGGCACC GGGTGCGTTG GACTCTGATG TAGCAACCGT GGGGCAGTTA
AAAGCCTTGT ATTACGTTAA AAAAGAAGGG GTAGTCACTT ACTATACCAA AGAAGCTGAC
GACAAACTGA CTAAGCTCAC CAAAGAGGAT AATAAGTTTT ACAAAGTCAA TACCAAAGAT
GGTACGCCTT ATAAAGCACT TGGTGAAGTT AAGGCAGAAA ATGTTTTTGT CGGTCCAAAA
GGGGCGAATG AAACCACCAA AGAAGAAACG ATTCAGCGGA AAAAATACTC GCTTGGCGAT
ATGGGAAATA AAATTAAATT CGCTCATATT TTAGATGGCA ATATTGAAAC AGGATCAGAT
CAAGCGATTA CGGGCAATCA GCTTAATCAG CTTGGTAGCA GTATCTTAGG GTTGACAGTT
AAAACCAATG ATAAAACACA ATTCGATAAA GTTTTATTTG AAGCCGTTGA ATATATTGAC
ACTAGTCATC AAGCTGGAAA AAGAAATACT TTTAAAGATG CACTAACAGA TACGATTAAT
GCGGTTAATA AAGGCTATAA GTTTAGTGAT GGTAGTTCAA ATACTAATAA GGGTCCATAT
TATTTAGGAG CAACGATTGA AATTAAAGCC GGAGAAATAG ACAGAACATA TAAATCTAAC
AACTTAAAAA CAAAACTTGA TTCAAACAAG AATGATAAAG CGGTATTCAC CATCGGCTTA
AGCGATACCC CAGAATTTAC GAGTGTAAAA GTTACAGCTG CACCTACAGA GAACAATCAC
GCTGTCAATA AGGCATACGT GGATGAAAAA CTTCAAAACG TTTCGACGAA TTTACATTAT
TTATCGGTAA AAGGGACGGA TAGCAAGAAA GGACCTGATT CAAATTACAA CAACGACGGT
GCAAAAGCAA GCAATTCAGT AGCCATAGGT GTCGGTGCTA AAGTAGAAGC TCCAACTGAT
ACACAAAATT ATATAGATAA TGCCCAACCA AACGCAGAAG GTGGGGTTGC TATAGGATAT
AATGCTCAAT CTAAAGCCAA AAATGCTATC GTTATAGGCA CAAATGTAAG TGTCGATATA
CCAAATTCTT TTGTATTGGG GTCAAATAAT ATAGTTGATC AAAATTCAAA AGGCACAAAA
AAACACTTAA AAAATAAATA TGATGCAAAA GGTGAAAGAG ACGCTGTAGT TGTTATCGGT
AGTGGAACAA TACTAAAAAA TTCAAAAAGC TCTATAGCAA TAGGTGCGGT TAATATGGGA
AATGGTACGA GTATTTCAGA CAATACACCC ATAAAAGGAA ATTACCTAGA AAATGCAAGA
TGGACAGCAG TAATTGGCAA TAAAAATAGA GTTTATAATG GAACCGATAT GGTTGTGTTA
GGAAATAATA TCCAAGTAAA TACTAACAAA TCAGATTTCA ATGAATCAAC CAATACTAAT
GACAATCTAG TCATAATGGG AAATAAAGCA ATAGCAGCAA ACGCAGCAGG CAGTGTAGTT
ATCGGTAATG AAGCTAAAGC ATTAAATGGA GATGAAAAAA ATCATGTTTA TCAAAAAGTA
GAAAATGTAG TAAGTATAGG CAAGGAGGCA ACTACTAAAG CTTCAGGTGC TATAGCAATA
GGAGAAGAAG CAACTGTAGA ACAAGATGCA GGCGAATCAA TTGCATTAGG TAAAGGCTCC
AAAGCAAAGA ATAAAGAAGA AGCTAAAAAA GACACTAATG TAATGATTTC TGAAAATTCT
ACAAACACAA AAGTTAAATT TAAATGGACA GGTGGTGTTA GTTCTAATAG TGGAAACGAA
AAATCAATTT TAAGCATAGG CGATACAAAT AAAGAGCGTA TCATCAAGCA CGTCGCCCCA
GGTGCAGTAA CTAATAATTC AACCGACGCT ATCAACGGAA GCCAACTTTA CGCCGTCGCT
GATGAATTTT CTAAATTAGC GGTGAATGTA TTAGGTGCGG AAGTTGATAC TGGTTCTGAT
AGAACAGGAT TTAAGAAATC GACATTTGAC GTGGCTAAAT ATCAGGGAAG TACAAATACA
CCAACACAAA AAGAAATGAC CTTTAAAGAC GCAATTGGTC AAAATACAAC AGCGATTAAT
AAAGGCTTTA TTTTTGGCGT TGGCGAAGGA AGTGGTGAAA AAGGAACGCA TTATCTAGGC
GATAAATTAA TCATCAAAGC GGGAGCTGTG GATAAACCAT CTACTACTCA AGATGGAGGA
TACGTTCCTG ATAATATTAA AACAGCGTAT CTATCAAGTA CAAAAGAAAT AGTAATAGGC
ATAAAAGAAT CCCCTACTTT TAAAAATGTA TTAATTACAG AAGAAATTCC TGAGAATACG
TCAGCTGATC CTAAGAAAAA TACTTATGAT AACTATGCTG TCAACAAAAA ATATTTAGAC
AAAAGATTAG AAAAAGTCGC CGCTAATTTC ACGGTTAAGG GGGATAGTAA CGGAACAGAT
GGAAAAGGTT ATACCTTAGA TAAAGATAAT AATGAATTAA CAATTGCAGG AGATAGTAAA
AATATTGAAA CTAAAGTTGA TAAGGACAGT AAAAAAGTAA GTATTACATT AAAAGATGCT
TTAACAGGAA TTACTTCAAT TGCTAATGAT GATACAAAAA TAGAGCTAAA AAATAATGGT
GGAAAATCTA TAATCTTCAA AACTGGAGCT AGTGGTAATG ATGTAACACT AAGTGATGGT
AAATTAAGTG GTGTGTCTGA AATTGGAAAA GATGAAAATG CAAAAATTAC TTTCAACAGC
AACGGTCAAA AAGAAATAGA CTTTAAAGCA GGGAGTACAA CCTATAAATT TAAAGAAACT
GGATTAGATT TAGCTAGCAA ACCAATCACA AATCTAGCAA GTGGATTAGA TCAAAATGGT
AGTGGTGGTA CTAATGTAAG ACAAGGCTTA GACGAGTTAT TAAAACTTAT CGGGACAAAC
GGACAACCAA GCGGATCCTC AAATAGCGAT AAATTAAACA AAGCCATCAA TGCAGGCGAT
TTGCTTCATG TCGCTCAAGG CTTGGTGGAT AAAGGCTTGA AGTTTAAAGT CGATAACGGC
ACAAGTACGA GCGGTTCTAC TACTGAAACA ACCAAAAAAC TTGGCGACAC GTTGACTTTT
AAAGGCGATG GTAAATACCT CACAACAAAA CTGAACGACA CCAATGGTGA GATTTCCTTT
AACCTCAGCG TGGCGGAAAG CATTAGTGAT GCTTCAACTC AAAACACGAC CAACAACAAA
CTCGTGACTG AAAATGCGGT GAAAAATTTT GTGACCAACA AACTTAATAA CCTGAGCAGT
ACGTTGCAGT TAGAGGGGGA TAATACCAAA AATGATAAAG ATCCGGCTGA TCCGATTGGA
AAAGTGGAGT TGAAAACCCA GAAGCTGAAA TTGACTGGGG AAACGAACGA GATTGTCACA
GAGGTTACCA AAGACAATCC AAGCGTGAAA ATCAAATTGG CTCAAAAAGT CAAAAATAAG
CTAGAGATTA TTAATGTTGG TGAGAATACT AACGACGATA ATTCTTTTGC CTTAGGTCAA
AACTCAACAC TGGAAGCGAA AAAACTTGCA CCGAGTGAAG CTACGCCAAA TGTGGGCAGT
AACGATGTAA CGATCACTTG GAACACAGCA GGGGCTTCAA AAGATAAGAA TGATTTGAGA
GAAGTCGTCA GCGTAGGTAG TGCAGACAAA GAACGCATTA TTACGCATGT CGCCGCAGGT
GCGGTTCAAT CAGGTTCAAC CGATGCTATC AATGGCGGGC AACTCCACAG CGTGATTGAT
GTATTTGGTA AATTAGGACT TGACGTCCTC GGAGCGGAAA AAGCCGATAC TGGCGATGGC
TTTAAGAAAT CAAAATTTGA TGTGGTGAAG ACTAATGGTA ATGCGACCGA TTCAAATCCA
GAGAAATCTG AGAAAACCTT TAAAAAAGCC ATTGAAGATA ATATTGCAGC TATCAACAAA
GGCTTGAAAT TTGCTAGCGA TAACGACGGA GAGAAACAAC TCTATCTCGG CTCAACCTTG
AACATTAAAG GGGCGAAAGG CGAGGCTTCA AGTGCGGGTT CAAATTCAGC TTCAAGTGAT
ACAACAAATA ACCACCAAAA TATCTTCACT AAAGCCAGCG ATACGGGGTT AGAAATTGCA
CTGAATGAAG CGTTACAAGG CATTAGCTCG ATTAGCGGTA AAAAAGGTAC TGACGGAAGT
GCGGTAGCGA AAATTGACTT TACGAGTGGC GGTTCAACCA GCCCAACAGT CAAAATCACC
GCAGATGACG GAGAATTTAC CTTCGGTAAA GACGGCTTGA ACTTAAACAG CAAACAAATC
ACAGGTATCG CAAGCGGTTT AGGCTTGAAA GATAGTGCTG ATGGAAATAG CGGCGGAAGT
GCTGGGACTA GCAATTCTGA CACTGAGATT ATCAACAAAG TCCTCTCGGG CAATCCTGAT
AAAGATAATA ACAACGGCAA CAAAATCGCC AACAACGCCA TCAATGTGAA AGACCTGTCC
GAAGTCGCTA AAGCCCTTGT GAAAAAAGGA CTATCCTTTG AGGGGAATGG TGGCAATACG
GATAAAGTTA CTCGCAAACT CGGTGAGACC TTGAAAATTG TGGGTAAAGG TGATCAGGCT
AAAAATATCA CTGTAGCGGA TAACAATATT AAAGTGTCGA AGAAAGCGGA AACTAATGGA
AGCACAACCG ACACATTAGA AATTGGTTTG TCTGATACTT TAACAGCCAT TAAATCTATT
GCCAATGGCG AGAATGCGAA AATTACTTTA AGCGGAAGTA ATGGAAGCAA GGACAAAATT
ACCTTCAAAG CAGGTAGTTC AGAGGTTACC CTAGCCGATG GTAAATTTAG CGGCGTGTCT
GAAATTAATA AAGAATCAGG CAAAGCTGCG TTGAAACTTG AAGCCGACAA AGCGACCTTA
GAAAGTGCTA GCGGTAATTC CAAAGTGGAA TTAAAAGATA AAGCAGTAAC AATTACAGCT
AAAGAGGATA AAGGATCGTT AAAACTCACC GAAAACACTG CAACCTTAGA AAGCACGAAA
GACGGCTCAA ACGTTACTTT AGACAGTACT AGTGCAACCT TAAGTGCGGG CAATGGTAAA
GGAAGCATTA AAGTTGCTAC TGGCTCAGGT GATGGTGCTA ACAAAATCGA GTTAAGTCCG
GAAAATGGCT CTGCTGTTAC CTTAGCGAAA GACGGCACAA ACGGTGTCAA AGCCACGGGT
TTATCTACGG TTGGGTTAGA TGATAAGAAC GCTTTAGTGT TTAAAAATGA TACGATTGGC
ACGGCAGAGT TAAAAGTGGG TGGTGCGACG TTAAAATTTA CGCCTACTGG AAATGGAACT
GGCGGAACTG CTCAAACTGT GAAAATCTCT AATGTGGCGG TTGGTAAGAT AGAAAGTAGT
TCATCAGAAG CTATTACAGG CAGACAGCTT CATGATTTAG CTACTCATTT AGGGGTTAGT
GTTAAAGATG ACAGCGGTAA GAAAATTGCA TTTGAACAAC CAACCTTTGC AGTGATCAAA
GGCGGTACTA AAGATGCAAT GGGTGGTACG ACTACGGCTA CAGGTCCTAC AACCTTTAAA
GACGCCATTA ATCAGTTAAT CACCGCCGTG AATGGTGGCT TGACATTCAA GGGGAATGAT
AATGGTAGCA CCCCAAGTTC AACCACATTA CAATTAGGCG GTACGTTAAC CATTGATAGC
TCGCCTGTTA GCAGTACGAC TGAAAAAGAC ATCACTGTAT CATTAACGCC ATCTACCAAT
GGCGATGCTA AAGATGCCGG CACACTCACG CTTAAATTAA ATAAAGCTGA TAAAGTGGAC
GAAAACGACG AAAAAGTCGT GACTTCTAAG GCTGTAGCTA CAAAACTACA GGAATATACT
CGCAAAGATA CATTGATAGA GGATTTAGAA GAGTATTATC TAGAAATAGA CGGTGCAAAC
GTTAAAGATA AAGCGATATT TGGAGAAAAC GTCGGTATTG AGAAAATCAA TCTTGAAGAA
GATGAAACAG AGGGTACAAG CGAATTAGTA CAAGCCAAAG CCCTAGTCGA CTACCTAAAA
GGCACAGGGG AGAAATCCGT TAAACTTTCT GATTCCGCAA AAACACAAGC GATAGGCGAG
GGTTCTATTT CTATTGGGTA TAATGCCAAC TCTCAAAATG AAGGCTCAAT AGCATTGGGC
TATAATTCGT CAGCGAAAAA TAAAGGAGCG ATTTCTATCG GTCAGGATTC AACCGTCTTG
GGAACAAGTT CTATCGCCGT GGGTAAAGAA AATGATGTCA AAGGTAATTT CTCTTTTGTA
TTAGGTGAAG GCAATACGCT TGATAAAGAA CAAACCTATG TCATCGGTTC AGATAATGAA
ATCAGCGGTG CTAAGAATAT TGCTATAGGA TTGGGCAATA CAGTGGGCGG CAATGAAAAC
ATCGTATTAG GCTCTCATGT AGACCTCAAG GATGATGTAG AAGGTGCCAT TGTATTAGGC
GATAAGTCAA TAGGCGTATC TAATGCTGTA TCGGTAGGGA ATGCGTTAAC AAAAAGAAGG
ATTGTATTTG TCGATACCCC ACAAGGTGAA TACGATGCCG TTAATAAGAA ATATGTGGAC
GACTTGACGT TATCGTATAA AGCGAATAGC ACAGAACCTG CGAAATCTAT CAACCTCAAA
ACAGGAGCGT TGGACTTCGT GAATTCGGAA AATATTTCTG TTGCTGTGGA AGCTGATGGA
AAAATTACGC ATACGTTGAA TAATGAGCTT AAAGAGATTG GAAGTATTAG TGCTACCAAA
GATGAAAAAG GTGCAAAAAT CACTTTATCA AAACCTGATA CAAGCCCTGA TACAAGTAAA
ACATATGAAA AATATGTAAG TTTTAATGAG GCTAAAATAA AAGATATATT AGATGGTACA
ATATCAAGTG ATTCAAAAGA CGCCGTAACA GGTAAACAAC TCGCTGATTT AGCGAAGCAA
TTAGGCGTTG ATGTTAATAC TTCAGATAAA ACGAAATTTA CTGCTCCAAT CTTTGAATAT
CTGTCTAATG TGGACGGCTC ATTTAACAAG AACCCAACCA CATTGAAAGG TGCTATTGAC
GAAGCAAGAG CGAAGTTAAA CGAGGGCTTA AAGTTCGGTG GTGATATTCC AAGTGCGGGT
ACAAATACCA ACAATACCCA CTATCTCGGC TCGACTATTA ATATTGTTCG TTTGGGTACG
CAGACAGGAG GTGCGGTAGC TCCAACATCA ACCAGTGGTT ACAGTGGTAG CAACTTAATC
ACGCAATACA CTTATGATAA GGGCAACGCC AAAATCGAAA TCGGGTTTAA AGACGCACCG
GAATTTAGAA AAGTTACCTT ATCTAAACAA ACATATGGTG ATAGTAAAAT TGGCAACGAA
GACGTAATCA CCAAGTCTTA CCTTGAACAA GCGTTAAACA GCTTTAAGTT TAATGTGGCG
TACGATAATA AAACGGTACA AATCGGTCGT GGCGATACGT TGAAGTTTGA AAATGGCTTG
AATATTCAAG GTAACTTGAA GCAAGAGGGG GCAACACAGC CAAGTGCGGT AACTTCTACC
CCAGCTCCGA CCGCTCCAGA TGTGGCAAGT GGTTCAGAAA GTTCTGCTCC AGCAAGCACT
TCATCAGGTA GTGAAAATGG CGGTGCAGGT ACTACGGTCG TTTCAAATGG TGCAAGTAGT
GATAGTGCAG ATACAGCGGT CGCTTCAAGC TCTCCAACTG CGGGAACTTC AAGCACACCA
ACGACCACGC CAAGCGATAC GAGTACACCA ACCACTACTA CCCCAACAAC AGCTGTAGTA
ACCATTGGTA CAAAAAATGA TTTAACTGAT ATCACCTCGA TTTCTTCCAA ATCTAAAGAT
GGAAGTACGG GCGGAAGTGG TACAGGTTCA AGTGCGGGCG AAAATGATGC AGACAATGGT
ACAACTGGAG AAGTGACGAA ACTTACCTTA GATCCAACTA AGGGGGCAAC TTTCCAAGTG
GGCAAGCAAG GTTCTCAAGT CACTATTAAT GACGATGGTA TTTCCCTCAC TCCGAAAGGG
GCGAATGGAA CGAATAATCC ATCTGCTAGC ACACCATCAA TTGTGATTAA ACCAGCTGAT
TCAACTAGTT CAGCGACTCC AGCTGATCCA AGCTTGCCTG CTGTTGATAA CGGACCATCT
ATTACTTTCT CTACAAAAGA CGAGAATGGT AAGAAAATCG GTTTCGGCAC GATCAAAGGT
CTTGCGGACA TTAAGCCAAA TGAAACGGAT GGAAGCATTG CAGCGAATAA AAACTATGTT
GATAGCAAAA TAAAAGAAGT GACAGCTGGT CAACCTTTTG AATATGCTAC GATAAAGGAT
AACGCTAAAG TTGTGAGAGG GGTGGACGGT AAACTGTATA AAGAGGCTGA TCTGAATAAG
TATTATTACG ATGAGAAAAC TGCTTCATAT AAACCAAGGG ATCCGAAAAA TTCTAGCATG
ACTGAATTAA AAGCATTAAA AAATGATGAG GTTATCGTAA ACTTAATGCC AAAAGGAGAT
GATAAATCTC CGATAGCAAT TGGAAATGTA AAAAGTATTT TAGATGCAGA GGTTTCAACA
AACCAAGATA AAGCAGCAAA AGCAATTGAA GAGCTTATTA AAGAACAGGG AACGTTATCA
ACTAAGAAAG ATAATGTTGC GACAGGTTCT GATATAGCCG CTTTGGCGAA AGCAGGTCTG
AATTTTGGTG TGAGTACAGG AGAAGATATT CATAGAAATT TAGGCGAAAA AATATCTATT
GTAGGTAGAG ATCTGAAAGC AGAAATTTTA AAAGAAATGA ACATTTCTAA TGGAAAAGTT
GATGATGATA AAAAAGCAGA ATATGAAGCT AAATTGGCGG TAGCTAAGAA AGCAATTGCT
GATGGATTTA GCACTAAGAA CGTGGTTACT ATTGCCAATC AAAATTCTGT TGTGATCAAT
ATTGCAGATA AACCTGAGTT TAAAGCTATT TCTATCAAAG ATGATACAGC ACCGGGCATG
TCTTTAGATT TAAATCCAAA TGCGATTAGT ATGGAAGACG ATGATGGTAA TAACACAGTC
ATGGATGCTG GCGGTATGGT CGTTACTGAT AATAACGGAA GTACCGAAGT AAATGCCAAC
AATATTAGTT TGACGGATTC CGAGGACAAA GAAAGCAATT TAGAAGCGGG TAAATTGGTA
TTATCCCATA CAGACAAAGA GCTTGTTGTT GAGGTGGGTG AAAAAGGTGG CAAAATAACC
GGCTTGGAAG TTCGTCATCC AACAGATAGC GATTATGGTA CCGATACAAC TCGTGCAGCA
ACGGAAAGTG CGGTGAAGAA AAACCGAGAT GATGTTGAGG ACGGTGTAAT TGGTCCGATG
GTTTATACTG ATAACAATGG TAATCGCTTA GTTAAAGTAG AAGGTAAATA TTACCTCGTT
TCAGATGTTG AAAATGGTAA AGCCAAGGAA AAAGCCAATG CGGTGGCAAA TCCAGCTCTG
TCGTTAGTGA ATGCTGATAG AAATGGAGGA GATACAAAAG CCCCTGTTGT GTTAAATAAT
GTTGCAGCGG GAGAACTTTC TGAAAATAGT CGTCAAGCGG TAAATGGGGC TCAGTTACAT
CAAACTAATG TGCATGTTCA AAACAATAGC TTGCGTATTA ATAACTTGCA AAACCAAGTC
AATATCTTAA ACAAAGATAT GCGAGCAGGT GTGGCTCAAG CACTTGCACA GGCTAACTTG
CCGGTCAATA TATTACCGGG TAAGAGTACG TTAAGTTTGG CGACCGGTAA CTATATGGGT
ACGCAAGCCT TTGCGGTAGG TTATTCCAGA GTATCTGATA ATGGTAAACT TAGCGTCAAA
TTCAGCTTAG GACATGGTGA TAAGAAAACC TCTGTGGGTG CAGGTATCGG TTATAGCTGG
TAA
 
Protein sequence
MNKIFKTKYD VTTGQTKVVS ELANNRQVAS RVEAAGSQPK CGVFLDNFLG VFKLAPLALA 
LSVALPNVGY TANVWIEFEN VRKEAVNLNE GTGIWNDRRD MNDPKNREAT ILSSGMNRTG
ADTQLRNKDF YKTVVIGSRA VGGGGGTTSI GYGTIVGKNN SAVSIGESHQ GTAVGYRSFA
QGNESTALGN DAVAWGESAI SIGSDNIGKG VTKYTKKGLA YEVWGLFRKA GKNFNYTSEY
SAIDSGNLNV TLEDYQHYLN TNEISTEKYF YKTHNWAYGD SSIAIGSRNV AYGQAAVAIG
TASVAQGDYS TAFGIGTYAK GNSAVAVGNE TYVYANNSIG VGNEVQAIND GSMVYGYQSY
AGGSGAVAIG KRALANVAPS DHFTQTVEGF SDNWYEGNST IHALGKLDDP RKHGKNKGLD
DYFLPKTQRQ QGTEEDKAES KNSGAVAIGY YVYALGENSI ALGRQAYSKG DRSIAIGPYA
YGAKEKTAAL GYGSKAIGEQ SMALGSLSRA EGQNSIAIGV NSAVKNETNS IKRNGQNTIA
IGNETEATMD NSVALGYKST TKYFYKDDTD KHTATLLEGK DAISLPSYAP EGTSYKLSTD
AAAGIVSVGW KKNSNELGLR RIVGVAPGAL DSDVATVGQL KALYYVKKEG VVTYYTKEAD
DKLTKLTKED NKFYKVNTKD GTPYKALGEV KAENVFVGPK GANETTKEET IQRKKYSLGD
MGNKIKFAHI LDGNIETGSD QAITGNQLNQ LGSSILGLTV KTNDKTQFDK VLFEAVEYID
TSHQAGKRNT FKDALTDTIN AVNKGYKFSD GSSNTNKGPY YLGATIEIKA GEIDRTYKSN
NLKTKLDSNK NDKAVFTIGL SDTPEFTSVK VTAAPTENNH AVNKAYVDEK LQNVSTNLHY
LSVKGTDSKK GPDSNYNNDG AKASNSVAIG VGAKVEAPTD TQNYIDNAQP NAEGGVAIGY
NAQSKAKNAI VIGTNVSVDI PNSFVLGSNN IVDQNSKGTK KHLKNKYDAK GERDAVVVIG
SGTILKNSKS SIAIGAVNMG NGTSISDNTP IKGNYLENAR WTAVIGNKNR VYNGTDMVVL
GNNIQVNTNK SDFNESTNTN DNLVIMGNKA IAANAAGSVV IGNEAKALNG DEKNHVYQKV
ENVVSIGKEA TTKASGAIAI GEEATVEQDA GESIALGKGS KAKNKEEAKK DTNVMISENS
TNTKVKFKWT GGVSSNSGNE KSILSIGDTN KERIIKHVAP GAVTNNSTDA INGSQLYAVA
DEFSKLAVNV LGAEVDTGSD RTGFKKSTFD VAKYQGSTNT PTQKEMTFKD AIGQNTTAIN
KGFIFGVGEG SGEKGTHYLG DKLIIKAGAV DKPSTTQDGG YVPDNIKTAY LSSTKEIVIG
IKESPTFKNV LITEEIPENT SADPKKNTYD NYAVNKKYLD KRLEKVAANF TVKGDSNGTD
GKGYTLDKDN NELTIAGDSK NIETKVDKDS KKVSITLKDA LTGITSIAND DTKIELKNNG
GKSIIFKTGA SGNDVTLSDG KLSGVSEIGK DENAKITFNS NGQKEIDFKA GSTTYKFKET
GLDLASKPIT NLASGLDQNG SGGTNVRQGL DELLKLIGTN GQPSGSSNSD KLNKAINAGD
LLHVAQGLVD KGLKFKVDNG TSTSGSTTET TKKLGDTLTF KGDGKYLTTK LNDTNGEISF
NLSVAESISD ASTQNTTNNK LVTENAVKNF VTNKLNNLSS TLQLEGDNTK NDKDPADPIG
KVELKTQKLK LTGETNEIVT EVTKDNPSVK IKLAQKVKNK LEIINVGENT NDDNSFALGQ
NSTLEAKKLA PSEATPNVGS NDVTITWNTA GASKDKNDLR EVVSVGSADK ERIITHVAAG
AVQSGSTDAI NGGQLHSVID VFGKLGLDVL GAEKADTGDG FKKSKFDVVK TNGNATDSNP
EKSEKTFKKA IEDNIAAINK GLKFASDNDG EKQLYLGSTL NIKGAKGEAS SAGSNSASSD
TTNNHQNIFT KASDTGLEIA LNEALQGISS ISGKKGTDGS AVAKIDFTSG GSTSPTVKIT
ADDGEFTFGK DGLNLNSKQI TGIASGLGLK DSADGNSGGS AGTSNSDTEI INKVLSGNPD
KDNNNGNKIA NNAINVKDLS EVAKALVKKG LSFEGNGGNT DKVTRKLGET LKIVGKGDQA
KNITVADNNI KVSKKAETNG STTDTLEIGL SDTLTAIKSI ANGENAKITL SGSNGSKDKI
TFKAGSSEVT LADGKFSGVS EINKESGKAA LKLEADKATL ESASGNSKVE LKDKAVTITA
KEDKGSLKLT ENTATLESTK DGSNVTLDST SATLSAGNGK GSIKVATGSG DGANKIELSP
ENGSAVTLAK DGTNGVKATG LSTVGLDDKN ALVFKNDTIG TAELKVGGAT LKFTPTGNGT
GGTAQTVKIS NVAVGKIESS SSEAITGRQL HDLATHLGVS VKDDSGKKIA FEQPTFAVIK
GGTKDAMGGT TTATGPTTFK DAINQLITAV NGGLTFKGND NGSTPSSTTL QLGGTLTIDS
SPVSSTTEKD ITVSLTPSTN GDAKDAGTLT LKLNKADKVD ENDEKVVTSK AVATKLQEYT
RKDTLIEDLE EYYLEIDGAN VKDKAIFGEN VGIEKINLEE DETEGTSELV QAKALVDYLK
GTGEKSVKLS DSAKTQAIGE GSISIGYNAN SQNEGSIALG YNSSAKNKGA ISIGQDSTVL
GTSSIAVGKE NDVKGNFSFV LGEGNTLDKE QTYVIGSDNE ISGAKNIAIG LGNTVGGNEN
IVLGSHVDLK DDVEGAIVLG DKSIGVSNAV SVGNALTKRR IVFVDTPQGE YDAVNKKYVD
DLTLSYKANS TEPAKSINLK TGALDFVNSE NISVAVEADG KITHTLNNEL KEIGSISATK
DEKGAKITLS KPDTSPDTSK TYEKYVSFNE AKIKDILDGT ISSDSKDAVT GKQLADLAKQ
LGVDVNTSDK TKFTAPIFEY LSNVDGSFNK NPTTLKGAID EARAKLNEGL KFGGDIPSAG
TNTNNTHYLG STINIVRLGT QTGGAVAPTS TSGYSGSNLI TQYTYDKGNA KIEIGFKDAP
EFRKVTLSKQ TYGDSKIGNE DVITKSYLEQ ALNSFKFNVA YDNKTVQIGR GDTLKFENGL
NIQGNLKQEG ATQPSAVTST PAPTAPDVAS GSESSAPAST SSGSENGGAG TTVVSNGASS
DSADTAVASS SPTAGTSSTP TTTPSDTSTP TTTTPTTAVV TIGTKNDLTD ITSISSKSKD
GSTGGSGTGS SAGENDADNG TTGEVTKLTL DPTKGATFQV GKQGSQVTIN DDGISLTPKG
ANGTNNPSAS TPSIVIKPAD STSSATPADP SLPAVDNGPS ITFSTKDENG KKIGFGTIKG
LADIKPNETD GSIAANKNYV DSKIKEVTAG QPFEYATIKD NAKVVRGVDG KLYKEADLNK
YYYDEKTASY KPRDPKNSSM TELKALKNDE VIVNLMPKGD DKSPIAIGNV KSILDAEVST
NQDKAAKAIE ELIKEQGTLS TKKDNVATGS DIAALAKAGL NFGVSTGEDI HRNLGEKISI
VGRDLKAEIL KEMNISNGKV DDDKKAEYEA KLAVAKKAIA DGFSTKNVVT IANQNSVVIN
IADKPEFKAI SIKDDTAPGM SLDLNPNAIS MEDDDGNNTV MDAGGMVVTD NNGSTEVNAN
NISLTDSEDK ESNLEAGKLV LSHTDKELVV EVGEKGGKIT GLEVRHPTDS DYGTDTTRAA
TESAVKKNRD DVEDGVIGPM VYTDNNGNRL VKVEGKYYLV SDVENGKAKE KANAVANPAL
SLVNADRNGG DTKAPVVLNN VAAGELSENS RQAVNGAQLH QTNVHVQNNS LRINNLQNQV
NILNKDMRAG VAQALAQANL PVNILPGKST LSLATGNYMG TQAFAVGYSR VSDNGKLSVK
FSLGHGDKKT SVGAGIGYSW