Gene EcolC_2500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2500 
Symbol 
ID6066847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2750605 
End bp2752038 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content53% 
IMG OID641601906 
ProductPTS system glucose-specific transporter subunits IIBC 
Protein accessionYP_001725458 
Protein GI170020504 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR02002] PTS system, glucose-specific IIBC component
[TIGR00826] PTS system, glucose-like IIB component
[TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.139043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.87594e-08 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTTTAAGA ATGCATTTGC TAACCTGCAA AAGGTCGGTA AATCGCTGAT GCTGCCGGTA 
TCCGTACTGC CTATCGCAGG TATTCTGCTG GGCGTCGGTT CCGCGAATTT CAGCTGGCTG
CCCGCCGTTG TATCGCATGT TATGGCAGAA GCAGGCGGTT CCGTCTTTGC AAACATGCCA
CTGATTTTTG CGATCGGTGT CGCCCTCGGC TTTACCAATA ACGATGGCGT ATCCGCGCTG
GCCGCAGTTG TTGCCTATGG CATCATGGTT AAAACCATGG CCGTGGTTGC GCCACTGGTA
CTGCATTTAC CTGCTGAAGA AATCGCCTCT AAACACCTGG CGGATACTGG CGTACTCGGA
GGGATTATCT CCGGTGCGAT CGCAGCGTAC ATGTTTAACC GTTTCTACCG TATTAAGCTG
CCTGAGTATC TTGGCTTCTT TGCCGGTAAA CGCTTTGTGC CGATCATTTC TGGCCTGGCT
GCCATCTTTA CTGGCGTTGT GCTGTCCTTC ATTTGGCCGC CGATTGGTTC TGCAATCCAG
ACCTTCTCTC AGTGGGCTGC TTACCAGAAC CCGGTAGTTG CGTTTGGCAT TTACGGTTTC
ATCGAACGTT GCCTGGTACC GTTTGGTCTG CACCACATCT GGAACGTACC TTTCCAGATG
CAGATTGGTG AATACACCAA CGCAGCAGGT CAGGTTTTCC ACGGCGACAT TCCGCGTTAT
ATGGCGGGTG ACCCGACTGC GGGTAAACTG TCTGGTGGCT TCCTGTTCAA AATGTACGGT
CTGCCAGCTG CCGCAATTGC TATCTGGCAC TCTGCTAAAC CAGAAAACCG CGCGAAAGTG
GGCGGTATTA TGATCTCCGC GGCGCTGACC TCGTTCCTGA CCGGTATCAC CGAGCCGATC
GAGTTCTCCT TCATGTTCGT TGCGCCGATC CTGTACATCA TCCACGCGAT TCTGGCAGGC
CTGGCATTCC CAATCTGTAT TCTTCTGGGG ATGCGTGACG GTACGTCGTT CTCGCACGGT
CTGATCGACT TCATCGTTCT GTCTGGTAAC AGCAGCAAAC TGTGGCTGTT CCCGATCGTC
GGTATCGGTT ATGCGATTGT TTACTACACC ATCTTCCGCG TGCTGATTAA AGCACTGGAT
CTGAAAACGC CGGGTCGTGA AGACGCGACT GAAGATGCAA AAGCGACAGG TACCAGCGAA
ATGGCACCGG CTCTGGTTGC TGCATTTGGT GGTAAAGAAA ACATTACTAA CCTCGACGCA
TGTATTACCC GTCTGCGCGT CAGCGTTGCT GATGTGTCTA AAGTGGATCA GGCCGGCCTG
AAGAAACTGG GCGCAGCGGG CGTAGTGGTT GCTGGTTCTG GTGTTCAGGC GATTTTCGGT
ACTAAATCCG ACAACCTGAA AACCGAGATG GATGAGTACA TCCGTAACCA CTAA
 
Protein sequence
MFKNAFANLQ KVGKSLMLPV SVLPIAGILL GVGSANFSWL PAVVSHVMAE AGGSVFANMP 
LIFAIGVALG FTNNDGVSAL AAVVAYGIMV KTMAVVAPLV LHLPAEEIAS KHLADTGVLG
GIISGAIAAY MFNRFYRIKL PEYLGFFAGK RFVPIISGLA AIFTGVVLSF IWPPIGSAIQ
TFSQWAAYQN PVVAFGIYGF IERCLVPFGL HHIWNVPFQM QIGEYTNAAG QVFHGDIPRY
MAGDPTAGKL SGGFLFKMYG LPAAAIAIWH SAKPENRAKV GGIMISAALT SFLTGITEPI
EFSFMFVAPI LYIIHAILAG LAFPICILLG MRDGTSFSHG LIDFIVLSGN SSKLWLFPIV
GIGYAIVYYT IFRVLIKALD LKTPGREDAT EDAKATGTSE MAPALVAAFG GKENITNLDA
CITRLRVSVA DVSKVDQAGL KKLGAAGVVV AGSGVQAIFG TKSDNLKTEM DEYIRNH