Gene EcolC_4119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4119 
Symbol 
ID6066001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4544375 
End bp4545445 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content52% 
IMG OID641603541 
Productputative fructose-specific phosphotransferase system protein FrvX 
Protein accessionYP_001727044 
Protein GI170022090 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTG AGTTACTGCA ACAGTTGTGC GAAGCCAGCG CCGTCAGCGG CGATGAACAG 
GAAGTTCGCG ACATTCTGAT AAACACGCTG GAACCTTGCG TGAATGAAAT CACCTTCGAT
GGTCTGGGCA GCTTTGTTGC CCGTAAGGGG AATAAAGGTC CAAAAGTTGC CGTTGTCGGA
CATATGGATG AAGTCGGCTT TATGGTCACC CACATCGACG AGAGCGGTTT TCTGCGTTTT
ACCACCATTG GCGGCTGGTG GAATCAGTCG ATGCTCAACC ACCGGGTAAC CATACGCACA
CACAAGGGAG TGAAAATCCC TGGTGTGATT GGTTCCGTCG CGCCTCATGC GTTAACGGAA
AAGCAAAAGC AACAACCGCT GTCATTTGAT GAGATGTTCA TTGATATTGG CGCGAACAGT
CGCGAAGAGG TGGAAAAGCG CGGCGTGGAA ATTGGTAATT TTATTAGCCC GGAAGCCAAT
TTTGCCTGCT GGGGCGAAGA TAAAGTGGTC GGCAAGGCGT TGGATAACCG CATCGGCTGC
GCAATGATGG CTGAACTATT GCAGACGGTG AATAATCCCG AAATTACGCT GTATGGCGTT
GGCAGTGTGG AAGAAGAAGT TGGGCTACGC GGGGCGCAAA CCTCGGCGGA ACACATTAAA
CCGGACGTCG TGATCGTGTT GGATACCGCC GTAGCGGGCG ATGTTCCGGG CATTGATAAC
ATTAAATACC CGCTGAAACT GGGCCAGGGG CCGGGGCTGA TGCTGTTTGA CAAGCGCTAC
TTCCCCAACC AGAAACTGGT AGCAGCGTTA AAAAGCTGTG CCGCACATAA CGATTTACCG
CTGCAATTTT CCACCATGAA AACCGGTGCG ACGGATGGCG GGCGCTACAA CGTGATGGGC
GGCGGGCATC CGGTTGTCGC GCTGTGTCTG CCAACTCGTT ATCTGCACGC CAACAGCGGT
ATGATTTCAA AAGCCGATTA CGAAGCTCTG CTCACGCTGA TACGGGGTTT TCTGACGACC
TTAACTGCGG AGAAAGTCAA CGCGTTTAGC CAGTTCCGTC AGGTGGATTA A
 
Protein sequence
MNIELLQQLC EASAVSGDEQ EVRDILINTL EPCVNEITFD GLGSFVARKG NKGPKVAVVG 
HMDEVGFMVT HIDESGFLRF TTIGGWWNQS MLNHRVTIRT HKGVKIPGVI GSVAPHALTE
KQKQQPLSFD EMFIDIGANS REEVEKRGVE IGNFISPEAN FACWGEDKVV GKALDNRIGC
AMMAELLQTV NNPEITLYGV GSVEEEVGLR GAQTSAEHIK PDVVIVLDTA VAGDVPGIDN
IKYPLKLGQG PGLMLFDKRY FPNQKLVAAL KSCAAHNDLP LQFSTMKTGA TDGGRYNVMG
GGHPVVALCL PTRYLHANSG MISKADYEAL LTLIRGFLTT LTAEKVNAFS QFRQVD