Gene HS_0678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0678 
SymbolsrlE 
ID4240166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp723482 
End bp724477 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content41% 
IMG OID638104230 
ProductPTS system, glucitol/sorbitol-specific IIBC component 
Protein accessionYP_718890 
Protein GI113460823 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3732] Phosphotransferase system sorbitol-specific component IIBC 
TIGRFAM ID[TIGR00825] PTS system, glucitol/sorbitol-specific, IIBC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000916157 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATA AAGCAGTTTA TATTGAAAAA GGTAACGGTG GCTGGGGTGG TCCATTAACA 
TTACCTATTA TTGAAGGCAA AAAAATACTT TATATGACCG GAGGCACTCG TCCTGCTATT
GTGAATAAAT TGGTGGAATT AACTGGCTGG GAAGCAGTAG ATGGTTTTAA AGAAGGCGAA
CCGCCAGCAG AAGAAATTGG AATTGCAATT GTTGACTGTG GTGGTGTATT ACGTTGTGGG
CTTTACCCAA AACGTCGCAT TCCAACTATT AATATTCACG CAACCGGCAA ATCAGGACCT
TTAGCAGAAT TTATTGTTGA AGATATTTAT GTTTCTGCCG TAAAAGAAAA TAATATTTCG
TTAATTGATG TGGATGCCGC AGCCCTTTCT GTCATATCTG AAGAAAAAAT TGCTTCATCA
ACTACCGCAA AAGAGTATGA TTCAAGTAAA AAAATCACCG AACAAAGTAA TGGTTTATTG
GCAAAAATAG GTATGGGAAT GGGTTCTGTC GCCGCACTCT TTTTCCAAGC CGGACGTGAT
ACTATTGATA CAGTCCTGAA AACCATTTTA CCGTTTATGG CATTCGTTTC CGCCTTGATT
GGAATCATTA TTGCTTCCGG TTTGGGGGAT TTAATTGCTC ATGGTTTAAC ACCGCTTGCA
AATAGCCCAC TCGGTTTAGT TACTCTTGCG TTAATTTGTT CTTTCCCACT ACTCTCTCCA
TTCTTAGGTC CCGGAGCTGT TATTGCTCAA GTTATCGGTG TATTAGTAGG CACACAAATC
GGTTTAGGGA ACATTCCACC ACATTTAGCC CTGCCGGCAC TTTTTGCCAT CAATGCTCAA
GCCGCTTGTG ATTTCATTCC CGTGGGGCTT TCAATGGCAG AAGCGAAACA AGATACTGTG
CGAGTGGGCG TACCTTCTGT ATTAGTCAGT CGTTTTTTGA CTGGTGCACC GACAGTATTA
ATTGCATGGG CGGTATCTGC ATTCATTTAT CAGTAA
 
Protein sequence
MSNKAVYIEK GNGGWGGPLT LPIIEGKKIL YMTGGTRPAI VNKLVELTGW EAVDGFKEGE 
PPAEEIGIAI VDCGGVLRCG LYPKRRIPTI NIHATGKSGP LAEFIVEDIY VSAVKENNIS
LIDVDAAALS VISEEKIASS TTAKEYDSSK KITEQSNGLL AKIGMGMGSV AALFFQAGRD
TIDTVLKTIL PFMAFVSALI GIIIASGLGD LIAHGLTPLA NSPLGLVTLA LICSFPLLSP
FLGPGAVIAQ VIGVLVGTQI GLGNIPPHLA LPALFAINAQ AACDFIPVGL SMAEAKQDTV
RVGVPSVLVS RFLTGAPTVL IAWAVSAFIY Q