Gene HS_1234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1234 
Symbol 
ID4240745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1400065 
End bp1413645 
Gene Length13581 bp 
Protein Length4526 aa 
Translation table11 
GC content38% 
IMG OID638104807 
Productlarge adhesin 
Protein accessionYP_719446 
Protein GI113461377 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA TCTTCAAAAC CAAATACGAT GTAACAACTG GACAAACGAA AGTTGTGTCT 
GAATTAGCGA ATAACCGTCA AGTGGCGAGC CGTGTCGAGG GGGCGGGAAG TCAGCCGAAG
TGCGGTGTGT TTTTCGGCGG TATGTTAGGG GCGTTTAAGG TCCTGCCGTT GGCGTTGGTG
ATTGCGGGGA TTTTGGGGGT GAATAACCTG AGTTTTGCTA TTGATTATAT AGAAGTTCAA
GAGACAAAGG TAGGTCCTGA TAATTGGTAT TTAAATTCCA ACGTAGGGGA CAACTCGGTA
CACTTATATG GGTGGAAGTA TAAGCACGGA AATGATGGGA GATATAAAAA TTTTACAGGA
ACAGTTTTAA TAGGGGCTGC TGCTTCAATC GGGGCATCAA ATGCGACTGC TATAGGATAT
CAAGCAAATG CTAAAGGTGA TTCTACGATT AGTATCGGAA AGGAGGCACA GTCATTAGGG
ACTCAATCGG TAGCATTGGG AAATAGAGCC AATGCAAAAG GCGAACAATC ATTGGCACTT
GGGGCGGACT CTAATGCTAC AGGTTACGCT TCTATCGCAC TTGGCGGTGA TGATTTGGGA
GATAATGCTA GTACATATAA ATATGCTAGA CCTCTTTCTC AAGAGGTATG GAACTTGTAT
CGCAGTAATC TGAGCGATTT TCATAATACA GAGAATTATG CTACTACAAA AAATGCTAAT
CCTACTTATG CACAATATTT GCAAGATTCT AGTAATACAT ATTCACAAAA CTGGGCGAAA
GGAAAAGGGG CAATATCTAT TGGTTCGAGA ACTATTGCCT ATGGAAATGG TTCAACATCG
ATAGGGACAT TGTCAATTGC TAAAGGAGAC TATTCAACTG CAATGGGTGC AGGAACTTTA
GCGTTAGGAA ACTCTTCAAT AGCATTAGGT AATGAAGCGT ATGTATATGC TGTGAAATCT
GTTGGTATTG GGAATGAAGT ACAAGCACTC TCTGACGGAT CGATGGTTTA TGGCTTGGAA
TCCTATGCTG GAGGAACTGG CTCGATAGCA ATTGGTACAA GAGCCTTATC AAATGTTAAA
ATGAAAACTA ATGATGTTGA TGGCGTATAC TTAGCAACAC AATATACAGG AGAAAGTAAA
AAGAATAAAC GCCAGATAAC TGCCGAATTA GGACAGTTAG ATCAGAAAAA GGAAAATCTA
TATCAACCAG TAACAGACAA ACAAGAGGGA TCAGGAGAAT TTAAAGCACA GACAAATAAT
ACTGGTGCAA TTGCAATAGG TTATTATGTC ATAGCCAGCG GAGAAAACTC GGTAGCATTA
GGTCGTCAGG CTTATGCTGA GGGAAATAGA GGTATTGCAA TAGGTCCTTA CGCTTATTCA
AAAGGAAGTC AATCATTCGC ATTAGGCTAT GGTGCTAAAG CTTTAAAGGA TGACACTTTT
GCCATAGGAA GCTATTCAAG AGTTGATGGT GAGAACTCTA TCGCACTTGG TATTGAGGCT
AAAGTTTTAA ATAATAGTGG GGATAACAAC CTCAACGGAG AAAACTCACT GGCTTTAGGG
AATAACACCG AAGTTACAAT GAAAAACTCG GTGGCTTTAG GCTATAAATC CACGACAAAG
TATTATTACA AGGTAGATGG CGATAAAAAA GAACTGTCTG TAAGTGATTC TGATAAAGCT
AATAAAGCAA TCGATGTGCC AGCCTATATT CCTAAAGGGA CAAGTTATAA CATTACTACA
GATGCAAACG ACGGGGTTAT TTCTTTTGGT GGTTGGGATA AAGGTAGGGG CAAAGTTGGG
CTTAGAAGAT TAGTAAATGT TGCAGCAGGA GCATTGGATT CTGACGTAGC AACTGTAGGA
CAATTAAGGG CTTTGGAATA TGCAAAAAAA GAAGGCGTAG TGGCTTATTA TACGAAGCAA
GGAAAGCAAA TCTATAAAGT AGTTAAGGGT GATGATGGTA GTTTCTATAA AGCAAACACA
ACAAACGGAA CGCCTTTTGA TAAAAATAAA ATTGATAAGA AAGATGTTTT TGCAGGACCG
AAAGGTGCTA ATGAGAAAAT AACAACTCAA TCAGGTAGTG ATAAAGCATT TGCTGATCTT
GGCGAAAAAA TCAAATTTGC TCATATATTA GATGGAGAGA TTACTAGCGG TTCCGATCAG
GCGATTACGG GGAATCAGCT TAAAAATGTG GGTGATATTT TAGGGATAAC TGTAAACACT
AATAATACAA AATTTGATAA TCCTAGTTTT ACAAAAGTTA AATATAATGG AGCAATTGGT
AATAATAATC ATACAACATT TAAAAGTGCC ATTGATGAGA TTATCATAGC TATTAATAAA
GGACTTAACT TTAAAGCCGG CAATACTACA GAAGCTAAAC AACTGGGAGA TACATTAGAA
TTTATAGGTG ATAGTTATAT TACGCCAACT ATTTCTAATA AAAAAATTAA ATATTCTGTT
CAAGCCACAA CAACACTTAA CGAAACAAAT AATTTAATTA CATCAAAAGC TGTTAAAGAT
TATGTTGATC CTAAATTTAC GCATTATGTA TCTATAAAAG GGACAGGTAG TAGCGATGGA
AATTATGGAA ATAATGGAGC TATTGGAAAA AATTCTATTG CAATAGGGGT TGGTGCTAGT
ACAGATAATA GTGCTAGTGG TGGTATAGCG ATAGGAAATA ATGCACAATC TAAAGCAAAA
AATGCGGTTG TAATAGGGAC AAATGTTAGT ATAGATGTAC CTAATTCATT TGTATTGGGT
TCTGATAATA TAGTTACACA AACCGGTAAG CAAGATAGGA AAGAAAGAGC CGCTGTTGTT
GTAATAGGTA GTGGTACGAC ACTAACAGAA TCAAAGAGTG CAATTGCTAT AGGTGCGGTA
AATGCTGACG GTGGAACGAA AATTGAATAT GCAGCATGGA CAGCGTCTAT CGGGAATAAG
AACAAAATCA AAAACGGTAC GGATATTATT GCTTTAGGGA ATAATATTGA TATAGGTTAT
CAGTATGATA ATAAAAATAA TGTCAGTAAA AATGATAGTG ATAATAGAAA GGAAAATACA
GAAGTCATTG CCATTGGTAA TAGTGCTAAT GCTAACAAGG CAAGCGGGAG TGTATTAATC
GGTGCAAAAA CTACAGCGAA ATCTAATGCT ACACAAGCCG TTATTATAGG TTATGAAGCA
ACAGCGAAAG AAAATGCAAC ACAAACAGTA GTTATAGGTA AAAGTGCTGA ATCAAGTGCA
GCTGGTGCTG TGGTTATCGG AGAAGGTGCA ACAGCAACCG TTGCAAATAG CGTGGCAATA
GGTAAAGGAT CAAAAACAAC GGGGAATACT TCTGCAAATG GATATGATCC ATCAACTAAG
ACAGCTTATG TAGGAAGTGG TAATGCAAAT ACTTGGAAGC CAAATAGTGG AGTATTCTCT
GTTGGTGATG GTTCAAATAC TACTAGAAGA ATAACAGGAG TGGCAGCAGG GTCTGATGAT
ACAGATGCGG TAAACGTGGC ACAACTTAAA AAAGTTGTAA GTGGTGCGGC TACTTTAAAA
TATAAAGCAA ATGGAAGTAA TGAACAAAGT ATTGACCTTA CAAGTAAAGG GTTAAATTTC
AAGGATGGAA CTTATACAAC TGCTACTGTT GAAGCTGATG GTGTGGTTAA ATTTGATTTA
AACAGAACTA CTTCAGAAAA AATAAATAAT GCATTATCAA AAGCAGATGC AGCTACTCAA
TATGCAAAAA TAGATGGTTC TAATATAACA GGAAATGAAG ATAAGTGGAG AGAAAAATTA
GACGTATATA AAAAGGATGA GGCTGATTCT AAAATTAATG AAGTTAAAAC AGCATTAAAT
GGACAAATAA ATGGAAAAGT TGATACAACT ACTTTTGAGG CAGCTAAAAC ACAGTTAGAA
ACTAAAATAG ATGGAAAAGC TGACAAAACT TTAGGAAATA TTGATCAAAC TGGTCAAAAT
AAAATTAAAG GTCTTGCTAT AGCAGCTGTT GAAGTAAAAG GAAAAGATGG TCAAATAACA
GTTGAGTCTA AAAAAGATGC AGACGGTAAT AAAAACACGT TTACTGTGTC ATTAAATGAA
AATATTAAAA CAAAAATAGA GAGTATAGGA ACTGGAAAAG TAGAGGCTAG TGATAACAAT
ACTGTAACAG GGGGGAAAGT ATATACTGCT ATTGAGGGTG CTAAATCAGA ATTAAATGCT
GAAATAGCTA AGAAATTAGA TAAGAGTATT TTTGATACAG CAAAATCAGA ATTAGAAGAT
AAAATAGATC AAAAAGTTGA TACAAGTACT TTTAATGCTG CTACATTTGG ATTAATGGGT
AATGACAGTC AAGCAGTTAC TAAAACATTA AATAACACTA TAAAAATTGA AGGAAGCGAA
AGTGTTGAAT CTAATAAGAA AAATATTTAT GTATCTAAAA ACACTGCAGG TGATGGTTTA
GAAATTAAAC TTGGAGAAAC TTTAACGGGT ATTACATCAG TTGGAAAAGA TGATAAGGCT
AAAATCAGTT TTGGCGGTGA TAATGCTAAA AATGAGATTA CATATACTGT AGGAGATACT
ACAGCAACTG CAACATTCAA ATTTAGTAAA GATGGAATAG ATTTAGGAAG TAAGAAAATA
ACTAATGTTG CAAGTGGAAT TGGAGAGATT GCGTCAGCAA CTAATGATGG AACAGAAACT
AATCTTGATA AAGTATTAAA AGGTTCTCCT GAAGATACAT ATAAATCAAA CGCTGCAAAT
GTTGAAGATC TAGCGAAAGT TTCTAAAGCT ATAATTGATA AAGGACTTTC TTTTGAAAGT
GATAATGGTA AAGTTACTAG AAAAGTAGGG GAGACTTTAA AAATAGTTGG AGAAAAAGCA
ACTGATACTT CAACTTCTGG AAGTACCACA ACTACGATAA CGACTGCACC TGGTAATATC
AAGGTTACTG CTAAGAAAAG TGATACTGCT GGACATGCAA ATGATACACT AGAAATAGGA
TTGTCTAAAG ATTTGACTGG AATTGAGAGT ATTACAAAAG GAGCTAATAA AGCTAAATTA
ACATTAGGAG AGAATACTGC AAGTCTTGAA AGTGCAACAA ATAAATCAAA AATTGAACTA
AAAGATGATG GAATTAGTTT AACAACAAAT AAAGATAAAA CAATTATAGT TAAAGATAAT
GAATTATCTG GAGTAAATAA GATAAGCACT GGAAGTGATA CTAATGCAAA TTCTATTGAT
TTAGCAAATA GTTCTAATGT TGTAATAACA AGTGGTGGTA AAGCATTAAC TATCGCTAAA
GATGGAGCAA AAGACGGAAT ATCATTAACA GGACTAGCAG ATAGAAAAGT TGATGACACA
GGTTATGGAA CAAATGGTTC TGCTGGTAGA GCAGCAACAG AATCAGCATT ATTAGACTTA
AAAACAAAAG GATTAACTTT TGAGGTAAAT GAAAAAAATA CAGATTCTAA GCCAAAAACT
TTAAAAAGAG AATTAGGAGA AACATTAAAA ATAACTGGTA AAGATGGTGA TGTTACAGAC
TTTGATAGTA AATATTCATT AGAAAATGTT GCGACAAAAA TTGATGAAAA TAATAAGGCT
ATAAGAATAG GGTTATTAAA AACTCCAAGA TTTGATGCTT TAGAATTAGG AACAGATGAG
ACTAAAAAAA TATCGCTAAC ACCAGAGTTT GCTAATGGAA ATGAATTAAA ATTAACATTG
ACTGGAACAG GAGCAGGAAA TAATACTAAG GTTAAAATTT CAGGAGTTGC AAATGGTACG
GATGATAATG ATGCTATTAA TAAATCGCAA TTAGATTCAG TGGGAAATGC GACACTTACA
TTTAGTGATG GGACTTCTAC AAATGATTTT GTTAGAAAGA ATAGTGATAG TAAAAAAGTT
GTAATTAGTT CAGGGTCAAA TGTTACAGTA ACTTTAGATA AAACTAAAGA TAATAATAAT
ACAGGACAAT TTACTGTATC ACTAAATAAA GACTTAACAG ATATTAACTC AATAACTCTA
AAAAGTAATG GCGGTACAGA TAACACTAAT AATAAAACAG GAAAAATTAC TGTTGATACT
ACTGGAGATG TAAAAGTTCA ACATGGAGAT GGAACGGCTT CTAAAATAGT TGTTGAAAGT
GATTTTAATG AGTTAAAAAC TAGTGAAGAA ATAACTGTTA CTGGTGGCGG AAAAGTATTG
GGTGCAGAAA CAACTCTTAG TCTAAACGAT AACTCAATCG CTGGAACGAA ATTGAAAACA
GGGACAATTA GTGAGGATAG ATTAGATTCT GCATTGACAA CAAAATTAAA TAGAGAATTT
AAAGTTAAGG TTGATACAAC TACTAGCGAA AACCTTATAG GAAATACTTT AGAATTTGCA
AAAGATTCTA ATTTAACTGT TGTGTTAGAT AGTGATAACA AGAAAATCAC TTATGGATTA
AGCCCAACAT TAACCGGCAT TACTTCCATA GAATCTGCTG CTGGTGGGAA TGGTGCAAAA
TCTAAAATCA CCTTAAATGC TGATCATATA GTAGCAGATA AGGATATATA TGTAGGCTCA
AAAGGAAATG ATGAGAGTAA TAAACTAGTT AAGAAATCAG AACTAACAAG TGAAATCTCT
ACCATCACAA AGAATATTAC AAATAATATT AATACGTTAT CTGAGAATAA ATTAACATTT
AAAGCTGGTA ATACATCATT TGAAAGAGCT AATAATACGG ATAAAAATAT CGTATTTAAA
GGTGAAGGCA ATGTAAATAT TAAATTAGCT ACTGATGATA GTAAAAACAC AGGTACATTT
ACAATAAGTG TTAATGAAAC AAATGTAATT GATGAAAAAG CTGGAACAAA TAAAAATGAT
ACAGAATCAG AAAAACATGC AGATAGATTA ACAACAGAAA AAGCCGTTGT TGATTATGTG
AAAGCTAGAA CATTCAAATT AGCAGGAAAT GATTCTTTTG CAGTATCAAG TCTACTTGAT
GGAACGATTA ATATTAAGGG AGATGTAAGT ACAACATCAG ATCAAGGTAA TATTTATGTT
TCTAAAGGTA CTAGTGATAG TGAGTTAAAA ATACAATTAG GTAAAAACTT AACAGGTATT
GAAAGTATTA AAAAATCTGA AAATGGTGCT AAATTAACAT TAGAAGATAA TAAGCTTGAG
TTGAGTCCAG AAAAAGATGT TAAGGTAACA TTAGAAAAAG ATGGACAAAA AACAACAGCT
AAGGCTACAG GGTTGTCAAC TATTGGAAAT GATGGTGACA ATGCTTTAGT ATTTAATACA
GGTAAGAGTG ATAAGACACA AGCAACCCTA AAAGTTGCCG GAGAAGATTT AACATTTACA
AAATTGGGCA ACAATATTCA AATTTCAAAC CTTGCAAGCG GTTTAGGGTT GAAAGATAGT
GCTGATGGAA ATAACGACGG AAATGCCGGA GATAAAGCTA TTGCCAATGT TCTTGCTGGC
AATCCTAATG GTATGGAAAC TAACGCCATC AATGTGAAAG ACCTGTCCGA AGTCGCTAAA
GCCCTTGTGA AAAAAGGACT ATCCTTTGAG GGGAATGGTG GCAATACGGA TAAAGTTACT
CGCAAACTCG GTGAGACCTT GAAAATTGTG GGTGAAACAT CAGCAACCAC TCCAACAGAT
AGCACGCAAC CAGCAACAGA AACCGCAGCG GATAATATTA TCGTAGCGAA GAAAAAGGAT
AGCGAGGATA CCTTAGAAGT TAAATTGTCT AAAAACTTAA ACGGTATTGC CTCTATTACC
AATGGCGATA AAGCAAAAAT TGCTTTAGAT GATACTAGTG CCACCTTAAG TGCGGGTAAA
GATAAAGGAA GCATTAAAGT TGCTACTGGC TCAGGTGATG ATGCTAACAA AATCGAGTTA
AGTCCTGACA ATGGCTCTAA AGTTACCTTG GCAAAAGATG GCACGAATGG CGTAAAAGCC
ACGGGGTTAT CTACGGTTGG GTTAGATGGT GATAACGCTT TAGTGTTTAC AAATGGTGCT
GGCAACAAGG CAGAGTTAAA AGTTGGCGGA AGTGCATTGA CATTTACTAA AGCCACTACC
GGAAATACCG TGAAAATCTC CAATGTGGAA GATGGAAAAA TTGATACTAG TTCAACCGAA
GCCATTACGG GTAAACAACT TCATGATTTA GCAAGTAAAT TAGGGGTTGA AGTTGAGAGT
GAAACTGGAT TTAAACAACC ATCATTTACT GCAATAAAAG GCGGTACAAG TTCAACTGGT
GATACAAGTA GTGGCACGAC TACGGCTAAA GGTCCTATAA CCTTTAAAGG GGCGATTGAT
GATTTAATCA CCGCAGTTAA TGGTGGATTG ACCTTTGAAG GCAATGTTAG CACCTCAAGT
TCAACCACAT TACAATTGGG CGGTACGTTG ACCATTGATA GCTCAGAAAG TAAAACAAAA
TTAGAGGACT CTAAAGAAAC TAAAGAGAAA GACATCACCA CAAAACTAGA ATCATTGAAT
GGTAGCGACT CTAAAGCAGG TAAACTCACG CTGACTTTAA ATAAAGCAAC ATCTGTCGAT
GAAAATGATG AAAGGGTTGT TACATCTAGT GCGGTAGCAA ATAAATTAAA AAACTATACG
ACAACAGAAA CATTAGAAGA TGACTTTTTA AGAGTTACTG GAGAAAATAT AAATGGTCAT
CAAAAAGAGT TTGGCAAAAA TGTCGGACTT GAAGAAGTAA AACTTGAAGA AGATGAAACA
GAAGGTACAA GCGAATTAGT ACAAGCCAAA GCCCTAGTCG ACTACCTAAA AGGCACAGGG
GAGAAATCCG TTAAACTTTC TGATTCCTCA AAAACACAAG CGATAGGCGA AGGCTCTATT
TCTATCGGTC ATAATGCCGT GTCAAGAAAT GAGGGATCCA TTGCCATGGG TTATAACTCT
GAGGCGTCTA ATAGCGGTGC AATCTCTATC GGTCAGGGTT CAACTGTATT GGGAACGAGT
TCTGTCGCCG TGGGTAAAGA AAATAATATC AAAGGTAATT TCTCTTTTGT GTTAGGTGAA
GGCAATACAC TTGATAAAGA ACAAACCTAT GTCATCGGCT CGGATAATAA AATCAGCGGT
GCTAAGAATA TTGCTATAGG ACTAGGCAAT ACAGTTGGCG GCAATGAAAA CATCGTATTA
GGCTCTCATG TAGACCTCAA AGATGATGTA GAAGGTGCGA TTGTATTAGG CGATAAGTCA
ATAGGCGTAT CTAATGCTGT ATCGGTAGGG AATGCGTTAA CAAAAAGAAG GATTGTATTT
GTTGATACCC CACAAGGTGA ATACGATGCC GTCAATAAGA AATATGTTGA TGATTTGACA
TTGACATATA AATCGAATAA CGATGATAAA ACGAAGAAAT CTATCAACCT CAAAACAGAG
GCGTTGGACT TCGTGAAGTC GGAAAATATT TCTGTTTCTG TAGAAGCTGA TGGAAAAATT
ACGCATACGC TTAATAATAA TCTTATAAGT ATAGTAAGTA TTTCAGGTGG TAAAAACGGA
AGTGAAGATG CTGCGAAAAT TACGTTATCC TCAAAACCTG AAACAGAAGA AGAGGCTAAA
GATTATGTTA AAACAGTCAC TATTAACGAT GCCAAGTTAA AAGGTCTGTT AGACGGTGAA
ATTGCCGAGA GTTCAAAAGA AGCCGTAACA GGTAAACAAC TCGCTGATTT GACTAAGCAA
TTAGGCGTTG ATGTTAATAC TTCAGATAAA ACGAAATTTA ACGAGCTAAT TTTTGAATAT
CTGTCTAAGG TAGACGGCTC ATTCAAAGAG AAGAACCCAA CCACATTGAA AGAGGCTATT
GACGAAACAA GAGCGAAGTT AAACGAGGGC TTAAAGTTCG GCGGTGATAT TCCAAGTGCG
GGTACAAATA CCAACAATAC CCACTATCTC GGTTCGACTA TTAATATTAT TCGTTTGGGT
ATGCCGACAG AGACAGATGA GGTAGCTCCA ACATCAACCA GTGGTTACAG TGGTAGCAAC
TTAATCACGC AATACACTTA TGATAAGGGC AACGCCAAAA TCGAAATCGG GTTTAAAGAC
GCACCGGAAT TTAGAAAAGT TACCTTATCT AAACAAACAT ATGGTGATAG TAAAATTGGC
AACGAAGACG TAATCACCAA GTCTTACCTT GAACAAGCGT TAAACAGCTT TAAGTTTAAT
GTGGCGTACG ATAATAAAAC GGTACAAATC GGTCGTGGCG ATACGTTGAA GTTTGAAAAT
GGCTTGAATA TTCAAGGTAA CTTGAAGCAA GAGGGGGCAA CACAGCCAAG TGCGGTAACT
TCAACTACAG CACCAACCAC AGTAAACACC CCATCAGGTA GTGAAAATGG TGGTGCAGGC
AATATGGCTT CGTCAGGTAA TACATCTCCA GCCGTGGCAA GTAGTGGAAG TGATGCAGGT
GATACAGGTG CAACTGTCGG AAGCAGTACA ACTCCGACAA CCGCACCGAT AAATAATGGC
AGTACAGGTT CAGACAGTGC GGTTACTTCA AGCACGACTT CGTCAACTGG TGCAGGTACA
GATGGAGGAA GTGGTACAAG CAGTACAGCT TCGTCAGGAG CTACTCCAAG TACGACAACA
GCCGGCACAG CAAGCAGTAC ACCAAGCACA ACATCGAACA CTACAGCGGT GGTGACGATT
GGTACGACTG AGGACTTGAC AGGTTTGAAA TCTGCTGAGT TTAACGGCGA TAACGGTAAC
ACGACTAACA TTACCGGTAA TGAAATTGTG TTGAAAGATC AGTCGGGTAA TGCTCATACA
CAAACGGCGA CATCGCAAAT CATCACGGAT AATACAGATC CTGATACGGA AAAAGCGGTT
GTGACAACGG CTGAGGGCGT GACAATGTCT GTGGTCAGCG ATGATCAGGT TTTGGTCAAT
GATCAGACGG CGGAAGCGAA TATTTTAAGC AATGGCAAGA ACACGACAGA AGTGAAAGCA
GGTGAAATTG CTATTAAGGA TAAAGTGGGT CAAAATGTCG TGAGCTTGAA AGTAGCAGAG
GGTGAAGACG GTCAAGATGG CAAGGGAGCT ACGCTGGCGT TTGCCAAAGG CACTGACAGT
AAATCAGGCA CAGGCACGAT CAAAGGGCTT GCGGACATTA AGCCAGATGA AACAGATGGA
AGCCTTGCGG TGAATAAAAA CTATGTGGAT AAACTGGATA AAGTGGCGGT GAAATATGAT
GATCCAACTA AGTCATCCAT CACTTTAGGC GGTAAAAGTG CTAAGAACTA TAGACCTGTA
GTAATTGATA ACCTCAAATC CGGTCTGGGT ATTGATGATA TTAAAGACGG TGGTATTGCT
TCGGTTGCAC AAGGTAAACA AGGTGAGTTG GTGAAAGACC TCGTGGCAGG TAAACTTGAT
ACCACAAAAG ACGCTAGCGG TAAAGCAAAA GACAATCTGC ATAAAGCCGT GAATTTGGCG
GATTTAAAAG CGATTGCACA AGCTGGATTA AACTTTGCCG GTAATGACGG TCGGGATATC
CACAAAAACC TCAGTGAAAC ACTGGCGATT GTGGGACAAG GCTTGGATAA AGACCAAACT
ACTGCCTTCA AAGGCACAAA CGGCAATATT GCGGTGAAAG CAGATAATGG TAAATTGTCC
ATTTCACTCA ATGAAGCCTT GACCGGCTTG AAATCTGCCG AGTTTAACGG CGAAAACGGT
AACACGACTA ACATCACCGG CAATGCGATT GCGTTGAAAG ACGACAAAGG CACTGCCAAC
ATGACAGGTA GTGAAATCGC CTTAGAAGAT ACAGATGGAG CTTCGAATAC ACAAACAGCG
AAATCGCATA CCTTGCAAAA TGGAGCGAAT ATTACAGACG TTAAGGCTGG CTTGATTACA
GTCACAGAGA ATCCGGATAC TGATACAGAA AAATCAGTGA TGATAAGCGC AGAAGGGATG
ACTACAGCTG TCGTGACCGA CGATAAAGTC TTGGTCAATG ACCAAACGGC GGAAGCGAAT
ATTTTAAGCA ATGGCAAGCA CACGACAGAG GTAAAAGCCG GCGAGGTTGC TATTAAGGAC
AAAGCTGGTA AAGATGTCGT GAGCTTGAAA GTCGCTAAGG GTGAAGACGG TCAAGATGGC
AAGGGAGCTA CGCTTGCGTT TGCTAAAGGT GCTGATGATA AAGGCACAGG CACGATTACC
GGCTTGAAAG ATTTAGATGC CACTGCTGAC GGTTCAAGTG CGGCGAATAA AAACTATGTG
GATGAAAAAG TGTCAGATTT GGACAGCAAT CGACCGTTTG ACTTCTATAT AAAAGAAGGA
AACAGCTATA CCAAAGTTAT TAAAGGACGA GATGGTAAGT TCTATGATCC TAAAGATTTA
GAAGGGGCGA AGTATGATGG TTCAAAATAC GTCACCAACG AAGGAACAAC AGTTGACACT
TCTTTATCGG CAGAGGACAA GGTGATTATT CGTGCCGAGC CGACAACCGC ACCTATCGGG
ATTAGCAACG TGGCGAGTGG TTTGGATATT GATGCCGAAA AAGTGAAACA GGCAGAGAAA
GAGGTGAAAG CCAAACGGTC TGAAGCTGAA CGTAAAGCGA CCATGTTAAA AGCGAAAGCT
GCTCTTGTAG AGCAGAAAGA GGCAGAAATT ACCGCACTTG AACAGGAAAT TGAAAACTTG
TCAGGTGATG AAAAAACGCA AAAAGAAGCA GAGTTGAAGG CAATTGAAGC CGAGTTATCT
CAATTCAATG ACGAATTGGC AACGGCAACA AAAGACTTGA AAACCGCCAA TGATGCGTTG
AAAATCGCCA ATAATACGCT GACAAAGTTG ACAGAGGATA AAATTGGCAA CTTAGTAAAA
GGTGAAAATA TTAACCCTAC AAACGGGGCT AATATTGGTG ATTTACAGGC AGTAGCAAGA
GCGGGCTTAA ATTTTGAGGG CAATGACGGC GTGCCTGTCC ATAAAAATTT AGGCGAGAAA
TTGACCATCA AAGGCGAGGG AACATTTAAC AGCAACCTCA CCGCCGCCGG CAATATCAAA
GTGGAAATGG CACAAGATGG CAAAGGCTTA GAAGTCAAAC TGTCTGACCA GTTGAAAAAC
ATGACTTCGT TTGAAACTCG TGAAGTGAAC GGTAAGAAAG CCCGCTTGGA TAGCAACGGT
TTGAGTGTGG AAAATACAAA CACAAAAGAA CGCTCTCATT TAAGCGAAAA CCGTTTAGCG
TTCTTTAAAG ATGGAGCGTT AGGATTGAAT TTAGATGGCA AAGATCGAGC CTTAAAAGTG
GGTGAAAAAG CGATTATTAG CATCAACGGC AAAAATGAGG CTTTAGTTGA AGATCTCAAT
GCCTCTAGTT CAGGTCAGGC AATTGCGAAT AAAAACTATG TGGACGCAAA AAATAATGAA
TTGCGTACAC AGCTTCATAG TGTTAATCGT GAATCACGTT CAGGAATTGC TGGTGCAAAT
GCGGCAGCTG CGTTGCCAAT GATTGCGATG CCGGGTAAAT CAGCACTTGC TGTCTCTGCG
GGGGCTTATA AGGGGCAAAG TGCGGTGGCT TTGGGCTACT CTCGTATGAG CGATAACGGT
AAGATTATGT TGAAACTACA CGGCAACAGC ACCTCAACCG GCGATTTTGG TGGCGGTGTC
GGTATTGGCT GGGCATGGTA A
 
Protein sequence
MNKIFKTKYD VTTGQTKVVS ELANNRQVAS RVEGAGSQPK CGVFFGGMLG AFKVLPLALV 
IAGILGVNNL SFAIDYIEVQ ETKVGPDNWY LNSNVGDNSV HLYGWKYKHG NDGRYKNFTG
TVLIGAAASI GASNATAIGY QANAKGDSTI SIGKEAQSLG TQSVALGNRA NAKGEQSLAL
GADSNATGYA SIALGGDDLG DNASTYKYAR PLSQEVWNLY RSNLSDFHNT ENYATTKNAN
PTYAQYLQDS SNTYSQNWAK GKGAISIGSR TIAYGNGSTS IGTLSIAKGD YSTAMGAGTL
ALGNSSIALG NEAYVYAVKS VGIGNEVQAL SDGSMVYGLE SYAGGTGSIA IGTRALSNVK
MKTNDVDGVY LATQYTGESK KNKRQITAEL GQLDQKKENL YQPVTDKQEG SGEFKAQTNN
TGAIAIGYYV IASGENSVAL GRQAYAEGNR GIAIGPYAYS KGSQSFALGY GAKALKDDTF
AIGSYSRVDG ENSIALGIEA KVLNNSGDNN LNGENSLALG NNTEVTMKNS VALGYKSTTK
YYYKVDGDKK ELSVSDSDKA NKAIDVPAYI PKGTSYNITT DANDGVISFG GWDKGRGKVG
LRRLVNVAAG ALDSDVATVG QLRALEYAKK EGVVAYYTKQ GKQIYKVVKG DDGSFYKANT
TNGTPFDKNK IDKKDVFAGP KGANEKITTQ SGSDKAFADL GEKIKFAHIL DGEITSGSDQ
AITGNQLKNV GDILGITVNT NNTKFDNPSF TKVKYNGAIG NNNHTTFKSA IDEIIIAINK
GLNFKAGNTT EAKQLGDTLE FIGDSYITPT ISNKKIKYSV QATTTLNETN NLITSKAVKD
YVDPKFTHYV SIKGTGSSDG NYGNNGAIGK NSIAIGVGAS TDNSASGGIA IGNNAQSKAK
NAVVIGTNVS IDVPNSFVLG SDNIVTQTGK QDRKERAAVV VIGSGTTLTE SKSAIAIGAV
NADGGTKIEY AAWTASIGNK NKIKNGTDII ALGNNIDIGY QYDNKNNVSK NDSDNRKENT
EVIAIGNSAN ANKASGSVLI GAKTTAKSNA TQAVIIGYEA TAKENATQTV VIGKSAESSA
AGAVVIGEGA TATVANSVAI GKGSKTTGNT SANGYDPSTK TAYVGSGNAN TWKPNSGVFS
VGDGSNTTRR ITGVAAGSDD TDAVNVAQLK KVVSGAATLK YKANGSNEQS IDLTSKGLNF
KDGTYTTATV EADGVVKFDL NRTTSEKINN ALSKADAATQ YAKIDGSNIT GNEDKWREKL
DVYKKDEADS KINEVKTALN GQINGKVDTT TFEAAKTQLE TKIDGKADKT LGNIDQTGQN
KIKGLAIAAV EVKGKDGQIT VESKKDADGN KNTFTVSLNE NIKTKIESIG TGKVEASDNN
TVTGGKVYTA IEGAKSELNA EIAKKLDKSI FDTAKSELED KIDQKVDTST FNAATFGLMG
NDSQAVTKTL NNTIKIEGSE SVESNKKNIY VSKNTAGDGL EIKLGETLTG ITSVGKDDKA
KISFGGDNAK NEITYTVGDT TATATFKFSK DGIDLGSKKI TNVASGIGEI ASATNDGTET
NLDKVLKGSP EDTYKSNAAN VEDLAKVSKA IIDKGLSFES DNGKVTRKVG ETLKIVGEKA
TDTSTSGSTT TTITTAPGNI KVTAKKSDTA GHANDTLEIG LSKDLTGIES ITKGANKAKL
TLGENTASLE SATNKSKIEL KDDGISLTTN KDKTIIVKDN ELSGVNKIST GSDTNANSID
LANSSNVVIT SGGKALTIAK DGAKDGISLT GLADRKVDDT GYGTNGSAGR AATESALLDL
KTKGLTFEVN EKNTDSKPKT LKRELGETLK ITGKDGDVTD FDSKYSLENV ATKIDENNKA
IRIGLLKTPR FDALELGTDE TKKISLTPEF ANGNELKLTL TGTGAGNNTK VKISGVANGT
DDNDAINKSQ LDSVGNATLT FSDGTSTNDF VRKNSDSKKV VISSGSNVTV TLDKTKDNNN
TGQFTVSLNK DLTDINSITL KSNGGTDNTN NKTGKITVDT TGDVKVQHGD GTASKIVVES
DFNELKTSEE ITVTGGGKVL GAETTLSLND NSIAGTKLKT GTISEDRLDS ALTTKLNREF
KVKVDTTTSE NLIGNTLEFA KDSNLTVVLD SDNKKITYGL SPTLTGITSI ESAAGGNGAK
SKITLNADHI VADKDIYVGS KGNDESNKLV KKSELTSEIS TITKNITNNI NTLSENKLTF
KAGNTSFERA NNTDKNIVFK GEGNVNIKLA TDDSKNTGTF TISVNETNVI DEKAGTNKND
TESEKHADRL TTEKAVVDYV KARTFKLAGN DSFAVSSLLD GTINIKGDVS TTSDQGNIYV
SKGTSDSELK IQLGKNLTGI ESIKKSENGA KLTLEDNKLE LSPEKDVKVT LEKDGQKTTA
KATGLSTIGN DGDNALVFNT GKSDKTQATL KVAGEDLTFT KLGNNIQISN LASGLGLKDS
ADGNNDGNAG DKAIANVLAG NPNGMETNAI NVKDLSEVAK ALVKKGLSFE GNGGNTDKVT
RKLGETLKIV GETSATTPTD STQPATETAA DNIIVAKKKD SEDTLEVKLS KNLNGIASIT
NGDKAKIALD DTSATLSAGK DKGSIKVATG SGDDANKIEL SPDNGSKVTL AKDGTNGVKA
TGLSTVGLDG DNALVFTNGA GNKAELKVGG SALTFTKATT GNTVKISNVE DGKIDTSSTE
AITGKQLHDL ASKLGVEVES ETGFKQPSFT AIKGGTSSTG DTSSGTTTAK GPITFKGAID
DLITAVNGGL TFEGNVSTSS STTLQLGGTL TIDSSESKTK LEDSKETKEK DITTKLESLN
GSDSKAGKLT LTLNKATSVD ENDERVVTSS AVANKLKNYT TTETLEDDFL RVTGENINGH
QKEFGKNVGL EEVKLEEDET EGTSELVQAK ALVDYLKGTG EKSVKLSDSS KTQAIGEGSI
SIGHNAVSRN EGSIAMGYNS EASNSGAISI GQGSTVLGTS SVAVGKENNI KGNFSFVLGE
GNTLDKEQTY VIGSDNKISG AKNIAIGLGN TVGGNENIVL GSHVDLKDDV EGAIVLGDKS
IGVSNAVSVG NALTKRRIVF VDTPQGEYDA VNKKYVDDLT LTYKSNNDDK TKKSINLKTE
ALDFVKSENI SVSVEADGKI THTLNNNLIS IVSISGGKNG SEDAAKITLS SKPETEEEAK
DYVKTVTIND AKLKGLLDGE IAESSKEAVT GKQLADLTKQ LGVDVNTSDK TKFNELIFEY
LSKVDGSFKE KNPTTLKEAI DETRAKLNEG LKFGGDIPSA GTNTNNTHYL GSTINIIRLG
MPTETDEVAP TSTSGYSGSN LITQYTYDKG NAKIEIGFKD APEFRKVTLS KQTYGDSKIG
NEDVITKSYL EQALNSFKFN VAYDNKTVQI GRGDTLKFEN GLNIQGNLKQ EGATQPSAVT
STTAPTTVNT PSGSENGGAG NMASSGNTSP AVASSGSDAG DTGATVGSST TPTTAPINNG
STGSDSAVTS STTSSTGAGT DGGSGTSSTA SSGATPSTTT AGTASSTPST TSNTTAVVTI
GTTEDLTGLK SAEFNGDNGN TTNITGNEIV LKDQSGNAHT QTATSQIITD NTDPDTEKAV
VTTAEGVTMS VVSDDQVLVN DQTAEANILS NGKNTTEVKA GEIAIKDKVG QNVVSLKVAE
GEDGQDGKGA TLAFAKGTDS KSGTGTIKGL ADIKPDETDG SLAVNKNYVD KLDKVAVKYD
DPTKSSITLG GKSAKNYRPV VIDNLKSGLG IDDIKDGGIA SVAQGKQGEL VKDLVAGKLD
TTKDASGKAK DNLHKAVNLA DLKAIAQAGL NFAGNDGRDI HKNLSETLAI VGQGLDKDQT
TAFKGTNGNI AVKADNGKLS ISLNEALTGL KSAEFNGENG NTTNITGNAI ALKDDKGTAN
MTGSEIALED TDGASNTQTA KSHTLQNGAN ITDVKAGLIT VTENPDTDTE KSVMISAEGM
TTAVVTDDKV LVNDQTAEAN ILSNGKHTTE VKAGEVAIKD KAGKDVVSLK VAKGEDGQDG
KGATLAFAKG ADDKGTGTIT GLKDLDATAD GSSAANKNYV DEKVSDLDSN RPFDFYIKEG
NSYTKVIKGR DGKFYDPKDL EGAKYDGSKY VTNEGTTVDT SLSAEDKVII RAEPTTAPIG
ISNVASGLDI DAEKVKQAEK EVKAKRSEAE RKATMLKAKA ALVEQKEAEI TALEQEIENL
SGDEKTQKEA ELKAIEAELS QFNDELATAT KDLKTANDAL KIANNTLTKL TEDKIGNLVK
GENINPTNGA NIGDLQAVAR AGLNFEGNDG VPVHKNLGEK LTIKGEGTFN SNLTAAGNIK
VEMAQDGKGL EVKLSDQLKN MTSFETREVN GKKARLDSNG LSVENTNTKE RSHLSENRLA
FFKDGALGLN LDGKDRALKV GEKAIISING KNEALVEDLN ASSSGQAIAN KNYVDAKNNE
LRTQLHSVNR ESRSGIAGAN AAAALPMIAM PGKSALAVSA GAYKGQSAVA LGYSRMSDNG
KIMLKLHGNS TSTGDFGGGV GIGWAW