Gene EcE24377A_4428 details

Gene Information

Locus tag: EcE24377A_4428
Symbol:
ID: 5586970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4417737
End bp: 4418807
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 52%
IMG OID: 640928043
Product: putative fructose-specific phosphotransferase system protein FrvX
Protein accession: YP_001465387
Protein GI: 157157925
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
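
The length fields in this table are consistent with the coordinates: the gene length equals End bp minus Start bp plus one, and the protein length is one residue per codon minus the stop codon. The following is a minimal Python sketch of that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which is stated on this page; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(4417737, 4418807)
print(length_bp)                  # 1071
print(protein_length(length_bp))  # 356
```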


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACATTG AGTTACTGCA ACAGTTGTGC GAAGCCAGCG CCGTCAGCGG CGATGAACAG 
GAAGTTCGCG ACATCCTGAT AAACACGCTG GAACCTTGCG TTAATGAAAT CACCTTTGAT
GGTCTGGGCA GCTTTGTTGC CCGTAAGGGG AATAAAGGTC CAAAAGTTGC CGTTGTCGGG
CATATGGATG AAGTCGGCTT TATGGTCACC CACATCGACG AGAGCGGTTT TCTGCGCTTC
ACCACCATTG GCGGCTGGTG GAATCAGTCG ATGCTCAACC ACCGGGTAAC CATACGCACA
CACAAGGGAT TTAAAATCCC TGGTGTGATT GGTTCCGTCG CGCCTCATGC GTTAACGGAA
AAGCAAAAGC AACAACCGCT GTCATTTGAT GAGATGTTCA TTGATATTGG CGCGAACAGT
CGCGAAGAAG CGGAAAAGCG CGGCGTTGAA ATTGGCGATT TTATTAGCCC GGAAGCCAAT
TTTGCCTGCT GGGGCGAAGA TAAAGTAGTC GGAAAGGCGC TGGATAACCG CATCGGCTGC
GCGATGATGG CGGAGCTACT ACAGACAGTA AATAACCCAG AAATTACGCT GTACGGCGTT
GGCAGTGTGG AAGAAGAAGT TGGGCTACGC GGGGCACAAA CCTCGGCGGA ACACATTAAA
CCGGATGTGG TGATTGTGCT GGATACCGCT GTCGCAGGTG ATGTTCCGGG CATTGATAAC
ATTAAATACC CGCTGAAACT GGGCCAGGGG CCGGGGCTGA TGCTGTTTGA CAAGCGCTAC
TTCCCCAACC AGAAACTGGT AGCAGTGTTA AAAAACTGTG CCGCACATAA CGATTTACCG
CTGCAATTTT CCACCATGAA AACCGGAGCG ACGGATGGCG GGCGCTACAA CGTGATGGGC
GGCGGGCGTC CGGTTGTCGC GCTGTGTCTG CCAACTCGTT ATCTGCACGC TAACAGCGGC
ATGATTTCAA AAGCCGATTA CGATGCTCTG CTCACGCTGA TACGGGATTT TCTGACGACC
TTAACTGCGG AGAAAGTCAA CGCGTTTAGC CAGTTCCGTC AGGTGGATTA A
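
The GC content reported for this gene can be checked against the sequence above with a short calculation. The sketch below is one possible way to do it, assuming GC content means the percentage of G and C bases over the full gene sequence (the page does not define it). The function name `gc_percent` is hypothetical, and whitespace is stripped because the sequence is printed in 10-base blocks.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First block of the gene sequence above: 2 G + 1 C out of 10 bases.
print(gc_percent("ATGAACATTG"))  # 30.0
```

Passing the whole gene sequence as one string gives a value that can be compared with the reported, presumably rounded, figure.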
 
Protein sequence
MNIELLQQLC EASAVSGDEQ EVRDILINTL EPCVNEITFD GLGSFVARKG NKGPKVAVVG 
HMDEVGFMVT HIDESGFLRF TTIGGWWNQS MLNHRVTIRT HKGFKIPGVI GSVAPHALTE
KQKQQPLSFD EMFIDIGANS REEAEKRGVE IGDFISPEAN FACWGEDKVV GKALDNRIGC
AMMAELLQTV NNPEITLYGV GSVEEEVGLR GAQTSAEHIK PDVVIVLDTA VAGDVPGIDN
IKYPLKLGQG PGLMLFDKRY FPNQKLVAVL KNCAAHNDLP LQFSTMKTGA TDGGRYNVMG
GGRPVVALCL PTRYLHANSG MISKADYDAL LTLIRDFLTT LTAEKVNAFS QFRQVD
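
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following Python sketch shows how that correspondence could be checked. It uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in which codons may act as start codons, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative, and the sketch stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    bases = "".join(cds.split()).upper()  # drop the 10-base block spacing
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(bases), 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed on this page.
print(translate("ATGAACATTG AGTTACTGCA ACAGTTGTGC"))  # MNIELLQQLC
# Last 21 bases, including the TAA stop codon.
print(translate("CAGTTCCGTC AGGTGGATTA A"))  # QFRQVD
```

Both outputs match the first and last residues of the protein sequence listed above.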