Gene B21_02518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02518 
SymbolsrlE 
ID8115150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2665566 
End bp2666525 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content55% 
IMG OID644848718 
Producthypothetical protein 
Protein accessionYP_003000291 
Protein GI251785987 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3732] Phosphotransferase system sorbitol-specific component IIBC 
TIGRFAM ID[TIGR00825] PTS system, glucitol/sorbitol-specific, IIBC component 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.660273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCATA TTCGGATCGA AAAAGGAACG GGGGGCTGGG GCGGCCCGCT TGAGCTGGAA 
ACCACGCCGG GCAAAAAAAT CGTCTATATC ACCGCCGGTA CCCGGCCTGC GATTGTCGAC
AAACTGGCAC AGCTTACTGG CTGGCAGGCT ATTGACGGAT TTAAAGAAGG TGAACCCGCG
GAGGCGGAAA TTGGTGTCGC GGTAATCGAC TGTGGCGGCA CATTACGCTG CGGCATTTAT
CCGAAACGGC GTATTCCCAC CATTAATATC CACTCGACGG GCAAGTCCGG TCCACTGGCG
CAGTACATTG TGGAAGATAT TTATGTCTCT GGCGTAAAAG AAGAAAACAT CACTGTAGTG
GGTGATGCGA CACCACAACC CTCTTCCGTG GGCCGTGACT ATGACACCAG CAAGAAAATC
ACCGAACAAA GCGATGGTTT ACTGGCGAAG GTGGGAATGG GTATGGGTTC TGCCGTTGCC
GTGTTGTTTC AATCTGGTCG TGACACCATC GACACTGTAT TAAAAACCAT TCTTCCGTTT
ATGGCATTCG TCTCGGCGCT CATTGGCATC ATTATGGCTT CTGGCCTTGG TGACTGGATT
GCCCACGGCC TTGCTCCGCT GGCGAGCCAT CCACTGGGTC TGGTCATGCT GGCGCTCATC
TGCTCCTTCC CGCTGCTTTC ACCTTTCCTC GGCCCAGGCG CAGTTATCGC ACAGGTTATC
GGCGTATTGA TTGGCGTGCA GATTGGTCTC GGCAATATTC CGCCGCATCT GGCTTTACCG
GCACTGTTTG CCATCAACGC GCAGGCGGCC TGCGACTTCA TCCCGGTCGG TTTGTCGCTG
GCGGAAGCCC GTCAGGACAC GGTTCGCGTC GGTGTCCCTT CTGTACTGGT GAGCCGCTTT
TTAACCGGCG CACCGACTGT ACTGATCGCC TGGTTTGTCT CCGGTTTTAT CTATCAATAG
 
Protein sequence
MTHIRIEKGT GGWGGPLELE TTPGKKIVYI TAGTRPAIVD KLAQLTGWQA IDGFKEGEPA 
EAEIGVAVID CGGTLRCGIY PKRRIPTINI HSTGKSGPLA QYIVEDIYVS GVKEENITVV
GDATPQPSSV GRDYDTSKKI TEQSDGLLAK VGMGMGSAVA VLFQSGRDTI DTVLKTILPF
MAFVSALIGI IMASGLGDWI AHGLAPLASH PLGLVMLALI CSFPLLSPFL GPGAVIAQVI
GVLIGVQIGL GNIPPHLALP ALFAINAQAA CDFIPVGLSL AEARQDTVRV GVPSVLVSRF
LTGAPTVLIA WFVSGFIYQ