Gene Information |
Locus tag | EcSMS35_2826 |
Symbol | srlE |
ID | 6145871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2901618 |
End bp | 2902577 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617695 |
Product | PTS system, glucitol/sorbitol-specific, IIB component |
Protein accession | YP_001744850 |
Protein GI | 170681652 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3732] Phosphotransferase system sorbitol-specific component IIBC |
TIGRFAM ID | [TIGR00825] PTS system, glucitol/sorbitol-specific, IIBC component |
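The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though one consistent with the listed gene length) and that the final codon is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2901618
end_bp = 2902577
protein_length_aa = 319

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
coding_codons = gene_length_bp // 3 - 1

print(gene_length_bp)                      # 960
print(coding_codons == protein_length_aa)  # True
```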
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.204454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0000775819 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCGTA TTCGGATCGA AAAAGGAACG GGTGGCTGGG GCGGCCCGCT TGAGCTGGAA GCCAAACCGG GAAAAAAAAT CGTCTATATC ACCGCCGGTA CCCGGCCTGC GATTGTTGAC AAACTGGCAC AGCTTACTGG CTGGCAGGCT ATTGACGGAT TTAAAGAAGG TGAACCCGCG GAGGCGGAAA TTGGTGTCGC GGTAATCGAC TGTGGCGGCA CATTACGCTG TGGCATCTAT CCGAAACGGC GTATTCCCAC CATTAATATC CACTCGACGG GCAAGTCCGG CCCGCTGGCG CAGTACATTG TGGAAGATAT TTATGTCTCT GGCGTAAAAG AAGAAAACAT CACTGTAGTG GGTGATGCGA CACCACAACC CTCTTCCGTG GGCCGTGACT ATGACACCAG CAAGAAAATC ACCGAACAAA GCGATGGTTT ACTGGCGAAG GTGGGAATGG GTATGGGTTC TGCCGTTGCC GTGTTGTTTC AATCTGGTCG TGACACCATC GACACTGTAT TAAAAACCAT TCTGCCGTTT ATGGCGTTCG TCTCGGCGCT TATCGGCATC ATTATGGCTT CTGGCCTTGG TGACTGGATT GCCCACGGTC TTGCTCCGCT GGCGAGCCAT CCACTGGGTC TGGTCATGTT GGCGCTCATC TGCTCCTTCC CGCTGCTTTC ACCTTTCCTC GGCCCAGGCG CAGTTATCGC GCAGGTTATC GGCGTATTGA TTGGTGTGCA GATTGGTCTC GGCAATATTC CGCCGCATCT GGCTTTACCT GCACTGTTTG CCATCAACGC GCAGGCGGCC TGCGACTTCA TCCCGGTCGG TTTGTCGCTG GCGGAAGCTC GTCAGGACAC GGTTCGCGTC GGTGTCCCTT CTGTTCTGGT GAGCCGCTTT TTAACCGGCG CGCCAACTGT ACTGATCGCC TGGTTTGTCT CCGGTTTTAT CTATCAATAG
|
Protein sequence | MTRIRIEKGT GGWGGPLELE AKPGKKIVYI TAGTRPAIVD KLAQLTGWQA IDGFKEGEPA EAEIGVAVID CGGTLRCGIY PKRRIPTINI HSTGKSGPLA QYIVEDIYVS GVKEENITVV GDATPQPSSV GRDYDTSKKI TEQSDGLLAK VGMGMGSAVA VLFQSGRDTI DTVLKTILPF MAFVSALIGI IMASGLGDWI AHGLAPLASH PLGLVMLALI CSFPLLSPFL GPGAVIAQVI GVLIGVQIGL GNIPPHLALP ALFAINAQAA CDFIPVGLSL AEARQDTVRV GVPSVLVSRF LTGAPTVLIA WFVSGFIYQ
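As a spot check that the gene and protein sequences agree, the sketch below translates the first 30 bases and the final codon of the gene sequence. It assumes the sense-codon assignments of translation table 11 match the standard genetic code (the tables differ in their permitted start codons), and it builds the codon table by hand rather than relying on any library. The helper name `translate` is hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Assumption: translation
# table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored."""
    dna = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

# First three 10-base groups and the last group, copied from the record.
head = "ATGACGCGTA TTCGGATCGA AAAAGGAACG"
tail = "CTATCAATAG"

print(translate(head))       # MTRIRIEKGT
print(translate(tail[-3:]))  # *
```

The first output matches the opening ten residues of the listed protein sequence, and the final codon TAG is a stop, which is consistent with the protein being one residue shorter than the codon count.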