Gene HS_0205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0205 
Symbolimp 
ID4239720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp191691 
End bp194036 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content36% 
IMG OID638103741 
Productorganic solvent tolerance protein 
Protein accessionYP_718412 
Protein GI113460351 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.420377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA ATTATTACAC TGTACTTTCT CTGTCAATTT TGACCGCACT TTATAGCACG 
TCCAGTCAAG CGAACTTACA ACAGCAATGT TTGATCGGTG TTCCTCATTT TCAAGGTGAA
ATCGTTCAAG GCGATCCCAA TGAATTGCCG GTTTATATTG AAGCGGATCA CGCTAAAATG
AATCAATCGA CACATGCTCA ATACGAAGGA AATGTTAATG TTAAACAGGG CAACCGTCAT
TTAACAGCGG GAATGATTGA AATTGAGCAA CACGGAAAAG ATAATGCGAA ACGTTATGCG
TATGCTAAAA ATGGGTTTGA CTACAAAGAT AATTTAATTC AGCTCAATGG TGATAATGCC
AAAATTCACC TTGATAGCAA AGATGCCAAT ATCCAAGATG CAGATTATCA ATTGGTTGGA
CGACAAGGGC GGGGAACTGC TGATGAAGTT GAACTTCGTG AACATTATCG AGTGATGAAA
AATGCAACCT TCACATCTTG TTTGCCTAAT AGCGACGCTT GGTCAATTGA GGCTAAGGAA
ATGCGTCAAC ATATTCAAGA AGAATATGCG GAAATGTGGC ATGCTCGTTT TAAAGTATCC
GGTATCCCTA TTTTCTACAC GCCTTACCTA CAATTACCTA TCGGTGATCG CCGTCGATCC
GGATTACTTA TTCCCAAAGC AGGTATCTCA ACTCGGCATG GTTATTGGTA TGCACAACCG
TTTTATTGGA ATATAGCACC AAACTTTGAT GCGACATTTA CCCCTAAATA TATGTCTCAT
CGAGGTTGGC AATTAAATGC AGAAACTCGC TATCTGACTC GTATCGGTGA GGGAAAATTT
GTCGTTGAAT ACTTAAAAAC AGATCGTCAT TCTGACTCTT TAAATACGGC TCGTTCACGT
CATCTCTTTT ATTGGGGACA TAATTCTCAT TTTCTAAAAG ATTGGCGTTT AAATGTAAAT
TATACAAAAG TAAGCGATAA ACATTATTTC AATGATTTTG AGTCTGAATA TGGAAACAGT
ACAGACGGAT ATGTAGATCA ACAGGCGAGC ATTTCTTACT ATCAACCGAA TTACAACCTT
TCTATTTCAG CGAAACAATT CCAAATTTTC GATAAAGTAG ATATTGGACC TTATCGTGCG
TTGCCACAAA TTGATTTTAA TTATTATCGC AACGAAATTG CTAATGGCTT AGTTGACTTT
AGTTTATTTT CACAAGTAGT TCGTTTTGAT AATGACAGTG CGTTAATGCC AACTGCTTGG
CGTTTCCATA TAGAACCGAG TTTGACTTTT CCACTTTCCA ATCGTTACGG CAGTTTAAAT
ATTGAAACTA AACTTTACGC AACACGCTAT CTACAAAAAC GAGGTAAAGG AGAAAATGCA
GAAGAGATTA AAAAAACGGT TAATCGTGTT TTACCACAAA TCAAGCTGGA TTTTCAAACG
GTCTTAGCAA ATAGACAAAG TTTCATTGAG GGTTATACCC AAACTCTTGA GCCAAAATTT
CAATATTTGT ACCGCCCTTA TAAAGATCAG TCGGATATTG GTCTAAAACA ACAAAATAAT
GATTACTTAG GTTTCGGTTA CGACTCAACT TTATTACAAC AGGATTATTT TTCTTTGTTT
CGAGATCGCC GTTATAGCGG TTTAGATCGC ATAGTTTCAG CAAATCAAAT TACTCTTGGT
GGAACGACCA GATTTTATGA TAAAAATGCA AATGAACGCT TTAACTTATC TATTGGACAA
ATTTATTACC TTAAAGACTC TCGCACAGAT AATAATCCAC AAAATATGGC TCAAGGCAGA
TCTTCTTCCT GGTCTTTAGA AAGTAACTGG CGTATCAATA GCAAATGGAA TTGGCGTGGA
AGTTATCAAT ATGACACACA TTTAAACCAA ACATCTTTGG CAAATACCGT CTTAGAATAC
AATTCGGAGA AAAATAACTT AATCCAACTC AGTTATCGAT ATGTTAACCA GTCTTATATC
GATCAAAATT TAATTGGTAA AAATACTTAT GGACAAAGTA TAAAACAACT TGGTATGACA
ACAGCTTGGG AGCTAACTGA TCATTGGACA CTGGTTGGTC GCTATTATCA AGATCTCGCA
TTGAAAAAGC CGGTTGAACA ATATTTGGGA ATACAATATA ACTCTTGTTG CTGGTCTATA
GGTGTTGGAG CAAGACGTTA TGTAACCAAT AGAGCAAATC AACGCAATGA TGAAGTGCTT
TATGATAATA GCTTAAGCCT CACTTTTGAG TTACGTGGAT TATCTCCTTC GGATCATAAA
AATAATATAG ATGAAATGCT GAAAAAAGGA AAACTGCCTT ATATTAAAGC CTTTAGCCTA
TACTAA
 
Protein sequence
MKKNYYTVLS LSILTALYST SSQANLQQQC LIGVPHFQGE IVQGDPNELP VYIEADHAKM 
NQSTHAQYEG NVNVKQGNRH LTAGMIEIEQ HGKDNAKRYA YAKNGFDYKD NLIQLNGDNA
KIHLDSKDAN IQDADYQLVG RQGRGTADEV ELREHYRVMK NATFTSCLPN SDAWSIEAKE
MRQHIQEEYA EMWHARFKVS GIPIFYTPYL QLPIGDRRRS GLLIPKAGIS TRHGYWYAQP
FYWNIAPNFD ATFTPKYMSH RGWQLNAETR YLTRIGEGKF VVEYLKTDRH SDSLNTARSR
HLFYWGHNSH FLKDWRLNVN YTKVSDKHYF NDFESEYGNS TDGYVDQQAS ISYYQPNYNL
SISAKQFQIF DKVDIGPYRA LPQIDFNYYR NEIANGLVDF SLFSQVVRFD NDSALMPTAW
RFHIEPSLTF PLSNRYGSLN IETKLYATRY LQKRGKGENA EEIKKTVNRV LPQIKLDFQT
VLANRQSFIE GYTQTLEPKF QYLYRPYKDQ SDIGLKQQNN DYLGFGYDST LLQQDYFSLF
RDRRYSGLDR IVSANQITLG GTTRFYDKNA NERFNLSIGQ IYYLKDSRTD NNPQNMAQGR
SSSWSLESNW RINSKWNWRG SYQYDTHLNQ TSLANTVLEY NSEKNNLIQL SYRYVNQSYI
DQNLIGKNTY GQSIKQLGMT TAWELTDHWT LVGRYYQDLA LKKPVEQYLG IQYNSCCWSI
GVGARRYVTN RANQRNDEVL YDNSLSLTFE LRGLSPSDHK NNIDEMLKKG KLPYIKAFSL
Y