Gene EcHS_A4126 details

Gene Information

Locus tag: EcHS_A4126
Symbol:
ID: 5593413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4118048
End bp: 4119118
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 53%
IMG OID: 640923229
Product: putative fructose-specific phosphotransferase system protein FrvX
Protein accession: YP_001460688
Protein GI: 157163370
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
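
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration, not part of the record: the function names `gene_length` and `protein_length` are hypothetical, coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to end in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4118048, 4119118)
print(length_bp)                  # 1071
print(protein_length(length_bp))  # 356
```

Both results agree with the Gene Length and Protein Length fields. The calculation does not depend on the strand, which is left blank in this record.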


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACATTG AGTTACTGCA ACAGTTGTGC GAAGCCAGCG CCGTCAGCGG CGATGAACAG 
GAAGTTCGCG ACATTCTGAT AAACACGCTG GAACCTTGCG TGAATGAAAT CACCTTCGAT
GGTCTGGGCA GCTTTGTTGC CCGTAAGGGG AATAAAGGTC CAAAAGTTGC CGTTGTCGGA
CATATGGATG AAGTCGGCTT TATGGTCACC CACATCGACG AGAGCGGTTT TCTGCGTTTT
ACCACCATTG GCGGCTGGTG GAATCAGTCG ATGCTCAACC ACCGGGTAAC CATACGCACA
CACAAGGGAG TGAAAATCCC TGGTGTGATT GGTTCCGTCG CGCCTCATGC GTTAACGGAA
AAGCAAAAGC AACAACCGCT GTCATTTGAT GAGATGTTCA TTGATATTGG CGCGAACAGT
CGCGAAGAGG TGGAAAAGCG CGGCGTGGAA ATTGGTAATT TTATTAGCCC GGAAGCCAAT
TTTGCCTGCT GGGGCGAAGA TAAAGTGGTC GGCAAGGCGT TGGATAACCG CATCGGCTGC
GCAATGATGG CTGAACTATT GCAGACGGTG AATAATCCCG AAATTACGCT GTATGGCGTT
GGCAGTGTGG AAGAAGAAGT TGGGCTACGC GGGGCGCAAA CCTCGGCGGA ACACATTAAA
CCGGACGTCG TGATCGTGTT GGATACCGCC GTAGCGGGCG ATGTTCCGGG CATTGATAAC
ATTAAATACC CGCTGAAACT GGGCCAGGGG CCGGGGCTGA TGCTGTTTGA CAAGCGCTAC
TTCCCCAACC AGAAACTGGT AGCAGCGTTA AAAAGCTGTG CCGCACATAA CGATTTACCG
CTGCAATTTT CCACCATGAA AACCGGTGCG ACGGATGGCG GGCGCTACAA CGTGATGGGC
GGCGGGCGTC CGGTTGTCGC GCTGTGTCTG CCAACTCGTT ATCTGCACGC CAACAGCGGT
ATGATTTCAA AAGCCGATTA CGAAGCTCTG CTCACGCTGA TACGGGGTTT TCTGACGACC
TTAACTGCGG AGAAAGTCAA CGCGTTTAGC CAGTTCCGTC AGGTGGATTA A
 
Protein sequence
MNIELLQQLC EASAVSGDEQ EVRDILINTL EPCVNEITFD GLGSFVARKG NKGPKVAVVG 
HMDEVGFMVT HIDESGFLRF TTIGGWWNQS MLNHRVTIRT HKGVKIPGVI GSVAPHALTE
KQKQQPLSFD EMFIDIGANS REEVEKRGVE IGNFISPEAN FACWGEDKVV GKALDNRIGC
AMMAELLQTV NNPEITLYGV GSVEEEVGLR GAQTSAEHIK PDVVIVLDTA VAGDVPGIDN
IKYPLKLGQG PGLMLFDKRY FPNQKLVAAL KSCAAHNDLP LQFSTMKTGA TDGGRYNVMG
GGRPVVALCL PTRYLHANSG MISKADYEAL LTLIRGFLTT LTAEKVNAFS QFRQVD
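
The gene and protein sequences above can be checked against each other with a short script. The following is a minimal sketch, not a tool used by the database: the helper names `clean`, `translate` and `gc_percent` are hypothetical, and the amino acid assignments are those of NCBI translation table 11, which are the same as in the standard code (the tables differ only in permitted start codons). Only the first three and last two 10-base blocks of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Amino acid assignments follow NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
head = clean("ATGAACATTG AGTTACTGCA ACAGTTGTGC")
print(translate(head))  # MNIELLQQLC

# Last 12 bases of the gene sequence, ending in the TAA stop codon.
tail = clean("AGGTGGATTA A")[-12:]
print(translate(clean("CAGGTGGATTAA")))  # QVD
```

The two printed fragments match the start (`MNIELLQQLC`) and end (`QVD`) of the protein sequence. Calling `gc_percent` on the full cleaned gene sequence gives a value that can be compared with the GC content field.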