Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0678 |
Symbol | srlE |
ID | 4240166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 723482 |
End bp | 724477 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638104230 |
Product | PTS system, glucitol/sorbitol-specific IIBC component |
Protein accession | YP_718890 |
Protein GI | 113460823 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3732] Phosphotransferase system sorbitol-specific component IIBC |
TIGRFAM ID | [TIGR00825] PTS system, glucitol/sorbitol-specific, IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000916157 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATA AAGCAGTTTA TATTGAAAAA GGTAACGGTG GCTGGGGTGG TCCATTAACA TTACCTATTA TTGAAGGCAA AAAAATACTT TATATGACCG GAGGCACTCG TCCTGCTATT GTGAATAAAT TGGTGGAATT AACTGGCTGG GAAGCAGTAG ATGGTTTTAA AGAAGGCGAA CCGCCAGCAG AAGAAATTGG AATTGCAATT GTTGACTGTG GTGGTGTATT ACGTTGTGGG CTTTACCCAA AACGTCGCAT TCCAACTATT AATATTCACG CAACCGGCAA ATCAGGACCT TTAGCAGAAT TTATTGTTGA AGATATTTAT GTTTCTGCCG TAAAAGAAAA TAATATTTCG TTAATTGATG TGGATGCCGC AGCCCTTTCT GTCATATCTG AAGAAAAAAT TGCTTCATCA ACTACCGCAA AAGAGTATGA TTCAAGTAAA AAAATCACCG AACAAAGTAA TGGTTTATTG GCAAAAATAG GTATGGGAAT GGGTTCTGTC GCCGCACTCT TTTTCCAAGC CGGACGTGAT ACTATTGATA CAGTCCTGAA AACCATTTTA CCGTTTATGG CATTCGTTTC CGCCTTGATT GGAATCATTA TTGCTTCCGG TTTGGGGGAT TTAATTGCTC ATGGTTTAAC ACCGCTTGCA AATAGCCCAC TCGGTTTAGT TACTCTTGCG TTAATTTGTT CTTTCCCACT ACTCTCTCCA TTCTTAGGTC CCGGAGCTGT TATTGCTCAA GTTATCGGTG TATTAGTAGG CACACAAATC GGTTTAGGGA ACATTCCACC ACATTTAGCC CTGCCGGCAC TTTTTGCCAT CAATGCTCAA GCCGCTTGTG ATTTCATTCC CGTGGGGCTT TCAATGGCAG AAGCGAAACA AGATACTGTG CGAGTGGGCG TACCTTCTGT ATTAGTCAGT CGTTTTTTGA CTGGTGCACC GACAGTATTA ATTGCATGGG CGGTATCTGC ATTCATTTAT CAGTAA
|
Protein sequence | MSNKAVYIEK GNGGWGGPLT LPIIEGKKIL YMTGGTRPAI VNKLVELTGW EAVDGFKEGE PPAEEIGIAI VDCGGVLRCG LYPKRRIPTI NIHATGKSGP LAEFIVEDIY VSAVKENNIS LIDVDAAALS VISEEKIASS TTAKEYDSSK KITEQSNGLL AKIGMGMGSV AALFFQAGRD TIDTVLKTIL PFMAFVSALI GIIIASGLGD LIAHGLTPLA NSPLGLVTLA LICSFPLLSP FLGPGAVIAQ VIGVLVGTQI GLGNIPPHLA LPALFAINAQ AACDFIPVGL SMAEAKQDTV RVGVPSVLVS RFLTGAPTVL IAWAVSAFIY Q
|
| |