Gene EcSMS35_4288 details

Gene Information

Locus tag: EcSMS35_4288
Symbol:
ID: 6147274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4391040
End bp: 4392110
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 52%
IMG OID: 641619109
Product: putative fructose-specific phosphotransferase system protein FrvX
Protein accession: YP_001746233
Protein GI: 170683436
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
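
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only; it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The variable names are chosen here and are not part of the record.

```python
# Values copied from the Gene Information fields of the record.
start_bp = 4391040
end_bp = 4392110
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
expected_aa = span // 3 - 1

print(span, span % 3, expected_aa)
```

Under these assumptions the span comes to 1071 bp, which is a whole number of codons, and the expected protein length is 356 aa, in agreement with the listed values.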


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.427406
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATTG AGTTGCTGCA ACAGTTGTGC GAAGCCAGCG CCGTCAGCGG CGATGAACAG 
GAAGTTCGCG ACATTCTGAT AAACACGCTG GAACCTTGCG TTAATGAGAT CACCTTTGAT
GGTCTGGGCA GCTTTGTTGC CCGTAAGGGG AATAAAGGGC CAAAAGTTGC TGTTGTCGGG
CATATGGATG AAGTCGGCTT TATGGTCACC CACATCGACA AGAGCGGTTT TCTGCGTTTT
ACCACCATTG GCGGCTGGTG GAATCAGTCG ATGCTCAACC ACCGGGTAAC CATACGGACA
CACAAGGGAG TGAAAATCCC TGGTGTAATT GGCTCCGTCG CCCCACATGC GTTAACCGAA
AAGCAAAAGC AACAACCGCT GTCATTTGAT GAGATGTTCA TTGATATTGG CGCGAACAGT
CGCGAAGAGG TGGAAAAACG CGGCGTGGAA ATTGGCAATT TTATTAGCCC GGAAGCCAAT
TTTGCCTGCT GGGGCGAAGA TAAAGTGGTC GGCAAGGCGT TGGATAACCG CATCGGCTGC
GCAATGATGG CCGAGCTACT ACAGACAGTA AATAACCCCG AAATTACGCT TTATGGCGTT
GGCAGTGTGG AAGAAGAAGT TGGGCTACGC GGGGCACAAA CCTCGGCTGA ACACATTAAA
CCGGATGTGG TGATCGTGCT GGATACCGCC GTCGCGGGCG ATGTTCCGGG CATTGATAAC
ATTAAATACC CGCTGAAACT GGGCCAGGGG CCGGGGCTGA TGCTGTTTGA CAAGCGCTAC
TTCCCCAACC AGAAACTGGT GGCGGCGTTA AAAAACTGTG CCGCACATAA CGATTTACCG
CTGCAATGTT CCACCATGAA AACCGGAGCG ACGGATGGCG GGCGCTACAA CGTGATGGGC
GGCGGGCGTC CAGTTGTCGC GCTGTGTCTG CCAACGCGTT ATCTGCACGC CAATAGCGGT
ATGATTTCAA AAGCCGATTA CGATGCTCTG CTCACGTTGA TAAGGGATTT TCTGACCACC
TTAACCGCGG AGAAAGTCAA CGCGTTTAGC CAGTTCCGTC AGGTGGATTA A
 
Protein sequence
MNIELLQQLC EASAVSGDEQ EVRDILINTL EPCVNEITFD GLGSFVARKG NKGPKVAVVG 
HMDEVGFMVT HIDKSGFLRF TTIGGWWNQS MLNHRVTIRT HKGVKIPGVI GSVAPHALTE
KQKQQPLSFD EMFIDIGANS REEVEKRGVE IGNFISPEAN FACWGEDKVV GKALDNRIGC
AMMAELLQTV NNPEITLYGV GSVEEEVGLR GAQTSAEHIK PDVVIVLDTA VAGDVPGIDN
IKYPLKLGQG PGLMLFDKRY FPNQKLVAAL KNCAAHNDLP LQCSTMKTGA TDGGRYNVMG
GGRPVVALCL PTRYLHANSG MISKADYDAL LTLIRDFLTT LTAEKVNAFS QFRQVD
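
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one possible way to strip that grouping, translate the DNA, and compute GC content, using only the Python standard library. The helper names (`clean_sequence`, `translate`, `gc_percent`) are hypothetical. The codon table relies on the fact that NCBI translation table 11 assigns the same amino acids as the standard code. Only the first line of the gene sequence is used here, to keep the example short.

```python
from itertools import product

# NCBI translation table 11 uses the same amino acid assignments as the
# standard code; codons are enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and first two blocks of the protein
# sequence, copied from the record.
first_line = "ATGAACATTG AGTTGCTGCA ACAGTTGTGC GAAGCCAGCG CCGTCAGCGG CGATGAACAG"
protein_start = "MNIELLQQLC EASAVSGDEQ"

dna = clean_sequence(first_line)
print(len(dna))
print(translate(dna))
print(translate(dna) == clean_sequence(protein_start))
```

The first 60 bases translate to MNIELLQQLCEASAVSGDEQ, which matches the start of the listed protein sequence. Passing the full cleaned gene sequence to `gc_percent` would give a figure to compare against the GC content reported in the record.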