Gene HS_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1044 
Symbol 
ID4240542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1149951 
End bp1153046 
Gene Length3096 bp 
Protein Length1031 aa 
Translation table11 
GC content36% 
IMG OID638104605 
Producthypothetical protein 
Protein accessionYP_719256 
Protein GI113461187 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase
[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCCCTA GATTAAAAAA CATTCCTCAA CTTGATCCTT TAGTTTCTTC CTATTTGGAA 
AAATTAAAAA CGCAACATTT TGAAGGTGAT ATAGCGACAA ATTATGCAGA ACGTTTAAGT
CTAGCAACCG ATAACAGTGT TTATCAACAA CTTCCACAAG CTATTTTATT TCCTAAAACA
ACAAATGATG TGGTATGTCT TACAAAGCTA GCTCAACAAA AAAATTTCCA ATCTCTTACC
TTTACGCCAA GAGGTGGCGG TACAGGAACC AATGGTCAAG CAATTAACCA TAATATCATC
GTTGATCTTT CTCGTTATAT GACAAACATC TTGGAACTCA ATGTGGAGCA ACGGTGGGTT
CGTGTACAAG CCGGTGTTGT TAAAGATCAG CTCAATCAAT TTTTAAAACC CTACGGGTTG
TTTTTTTCAC CGGAACTTTC TACCAGTAAT CGAGCTACTA TTGGCGGAAT GATTAATACT
GATGCTTCCG GACAAGGCTC TCTAAAATAT GGTAAAACTT CAGATCATGT ACTAGCACTA
AAATCTGTCT TAATGAATGG AGAAATTTTA GAAACAAGTG CGGTCAAATC TGATGAATTT
TTACAAAATA TTCAGCATTT ATCATCAACA GGACAAAAAC TTCATCAAGA AATCTTTCAA
CGCTGTCAAC AAAAACGCTC ACAAATTCTC ACTGATTTGC CTCAATTAAA CCGTTTTTTA
ACCGGCTATG ATTTAAAAAA TGTATTTACC GAAGATCAAA GCGAATTTAA TCTTAGTCGT
ATTTTAACCG GATCTGAAGG TTCATTAGCT TTTATTTGCG AAGCGGTTTT AGACTTAACA
CCTATTCCTC AATACCGCAC TTTAATCAAC ATCAAATATA GCTCTTTCGA TGCAGCATTA
CGCAATGCAC CTTTCATGCT GGCGGTTGAA GCATTATCCG TTGAAACCAT TGATAGTAAA
GTACTGAATC TAGCCAAACA AGATATTATT TGGCATTCTG TACATGAATT ACTTACAGAA
GAAAAAGAAA ACCCAATTTT AGGTTTAAAT ATCGTTGAAT ATGCAGGAAA CGACCTCACC
TTAATTCAAA AACAAGTATC TCACTTGTGT CAACTGTTAG ACGATAAAAT AGCTCAACAA
CAAGACAATA TTATCGGTTA TCAAGTTTGC TCAGATCTTC CTTCCATCGA ACGAATTTAT
GCTATGCGAA AAAAAGCGGT GGGCTTATTA GGTAATAGCA AAGGCAATAA AAAACCTATT
CCTTTTGTGG AAGACAGTTG CGTACCGCCT GAAAATCTAG CCAACTATAT TAGTGAGTTT
CGTCAATTAT TAGATAAACA TCACCTAGAT TACGGTATGT TTGGACATGT CGATGCAGGT
GTTCTACATG TACGCCCCGC TTTAGATTTA TGTAATAAAG AACAAGTTTT GCTATTCAAA
ACTATTTCTG ATCAAGTAGC AGATTTAACA AAAAAATATG GTGGCTTAAT TTGGGGAGAA
CATGGAAAAG GAATGCGTTC ACAATACGGT GAAAAATTCT TTACTCCGGA ATTATGGCAA
GAATTACGCT ACATTAAATT TTTATTCGAT CCGAATAATC GTTTAAATCC GGGTAAAATT
TGTACCGCAC TTTATTCTGA ACAAGAACTC TACTCCATTT TGTCGCCAAT GCGGGCAGAT
CAAGATCGTC AAATTCCCAT TCAAATGCGA GAGGAATTTT CCGGTGCGAT GAATTGTAAT
GGTAACGGAT TATGCTTTAA CTTTGATGTT CACAGTGCCA TGTGTCCGTC TATGAAAGTC
AGCAAAAACC GGCTATTTTC GCCCAAAGGG CGTGCAGCAA TAATTCGTGA ATGGTTGCGT
TTGATGGCAA ATGAAAATAT CTCGCCTGAA CAGCTGAATT TTCGTAAAGT CGAAGTGAAA
TTAACAGATC TTGTCAAAAA AATTCGCAAT ACGGTTGCCC AAAAACAAGG GGAATACGAT
TTTTCTCACG AAGTTAAAGA GGCTATGAAT ACTTGTTTAG CCTGTAAGGC TTGTGCTACG
CAATGTCCAA TTAAAATTGA TGTGCCCAGC TTTCGTGCCA AGTTTTTCTA TTTTTACCAT
AACCGCTATT TACGCCCATT GAAAGATTAT GTGGTTGCAA ATGTAGAAAT GATGGCACCT
TTAATGGCAA AAGCACCGAA ATTTTTCAAC TTTTTTACAA CGGCTAAACT CACTCAATCT
TTGGCTGAAA ATCTGCTAGG GATGACTGAT TTACCCTCAT TGTCTGTTCC CTCTTTGCAA
CAACAGTTAG TTGAAATAAA TTATCAAGGT TATTCATTGG AGCAACTGGA AAATCTAAGT
GCGGTAGAAA AACAAAATAT TTTATTGATT GTTCAAGATC CGTTTACCTC TTTTTACGAT
GCAAAAGTCG TCGCTGATTT TGTAGCACTC TGTCAAAAAT TAGGTTACAA AGCAATTGTT
CTGCCTTTTA AACCTAATGG TAAAGCAATG CACATAAAGG GATTCCTAGC ACGTTTTGCC
AAAACTGCAA AAAATCAAGC GGACTTCCTC AATAAGATAA GTAAACTCGG TTTATCTCTG
GTCGGTGTTG ATCCTGCCAT TGTTCTTTCT TATCGTGATG AATATAAAGA AATTCTCGGT
GATGAAAGAG GAGATTTTAA TGTTATCACT GCTCATGAGT GGTTGAAACA AGAATTAAGT
TCAGGGAAAC TTGAGCATAA ACTTACACAA ATTATGCAAA AAAATAACCG CACTTTTAAT
AAAGAAAGTC AACAAAAATG GTATTTATTT CCACATTGTA CGGAAAGTAC TACACTACCG
AACAGTGCAA AAGAATGGCA GCAAATTTTT TCAGCTTTTG GACAAGAATT ACAAACAAAA
AATGTTGGTT GTTGCGGTAT GGCGGGAACG TTTGGGCATG AAATTCAACA CCTAGAGATG
TCAAAAGAAA TTTACCATTT ATCCTGGGCT AAAAAATTAC AAGGAAAAAA TCCTGATTAT
TGTTTAGCTA CGGGGTATTC TTGTCGCAGT CAAGTTAAAC GTATGCTCCA TTGGCAACCT
AAGCATCCTA TTCAAGCCCT ATTATCAATT ATTTAA
 
Protein sequence
MLPRLKNIPQ LDPLVSSYLE KLKTQHFEGD IATNYAERLS LATDNSVYQQ LPQAILFPKT 
TNDVVCLTKL AQQKNFQSLT FTPRGGGTGT NGQAINHNII VDLSRYMTNI LELNVEQRWV
RVQAGVVKDQ LNQFLKPYGL FFSPELSTSN RATIGGMINT DASGQGSLKY GKTSDHVLAL
KSVLMNGEIL ETSAVKSDEF LQNIQHLSST GQKLHQEIFQ RCQQKRSQIL TDLPQLNRFL
TGYDLKNVFT EDQSEFNLSR ILTGSEGSLA FICEAVLDLT PIPQYRTLIN IKYSSFDAAL
RNAPFMLAVE ALSVETIDSK VLNLAKQDII WHSVHELLTE EKENPILGLN IVEYAGNDLT
LIQKQVSHLC QLLDDKIAQQ QDNIIGYQVC SDLPSIERIY AMRKKAVGLL GNSKGNKKPI
PFVEDSCVPP ENLANYISEF RQLLDKHHLD YGMFGHVDAG VLHVRPALDL CNKEQVLLFK
TISDQVADLT KKYGGLIWGE HGKGMRSQYG EKFFTPELWQ ELRYIKFLFD PNNRLNPGKI
CTALYSEQEL YSILSPMRAD QDRQIPIQMR EEFSGAMNCN GNGLCFNFDV HSAMCPSMKV
SKNRLFSPKG RAAIIREWLR LMANENISPE QLNFRKVEVK LTDLVKKIRN TVAQKQGEYD
FSHEVKEAMN TCLACKACAT QCPIKIDVPS FRAKFFYFYH NRYLRPLKDY VVANVEMMAP
LMAKAPKFFN FFTTAKLTQS LAENLLGMTD LPSLSVPSLQ QQLVEINYQG YSLEQLENLS
AVEKQNILLI VQDPFTSFYD AKVVADFVAL CQKLGYKAIV LPFKPNGKAM HIKGFLARFA
KTAKNQADFL NKISKLGLSL VGVDPAIVLS YRDEYKEILG DERGDFNVIT AHEWLKQELS
SGKLEHKLTQ IMQKNNRTFN KESQQKWYLF PHCTESTTLP NSAKEWQQIF SAFGQELQTK
NVGCCGMAGT FGHEIQHLEM SKEIYHLSWA KKLQGKNPDY CLATGYSCRS QVKRMLHWQP
KHPIQALLSI I