Gene EcolC_3238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3238 
Symbol 
ID6066795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3548591 
End bp3549586 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content55% 
IMG OID641602653 
Productfructokinase 
Protein accessionYP_001726187 
Protein GI170021233 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.67247 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0186988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAACGA ATTGTCGGCG GCCTTGCATT GCCAATCCGG TTGTCCGTCT CTACGCTATT 
GATATTGAAA AAAATAAGGA GAGTACCGTG CGTATAGGTA TCGATTTAGG CGGCACCAAA
ACTGAAGTGA TTGCACTGGG CGATGCAGGG GAGCAGTTGT ACCGCCATCG TCTGCCCACG
CCGCGTGATG ATTACCGGCA GACTATTGAA ACGATCGCCA CGTTGGTTGA TATGGCGGAG
CAGGCGACGG GGCAGCGCGG AACGGTAGGT ATGGGCATTC CTGGCTCAAT TTCGCCTTAC
ACCGGTGTGG TGAAGAATGC CAATTCAACC TGGCTCAACG GTCAGCCATT CGATAAAGAC
TTAAGCGCGA GGTTGCAGCG GGAAGTGCGG CTGGCAAATG ACGCTAACTG TCTGGCGGTT
TCAGAAGCAG TAGATGGCGC GGCAGCGGGA GCGCAGACGG TATTTGCCGT GATTATCGGC
ACGGGATGCG GCGCGGGCGT GGCATTCAAT GGGCGGGCGC ATATCGGCGG CAATGGCACG
GCAGGTGAGT GGGGACACAA TCCGCTACCG TGGATGGACG AAGACGAACT GCGTTATCGC
GAGGAAGTCC CTTGTTATTG CGGTAAACAA GGTTGTATTG AAACCTTTAT TTCGGGCACG
GGATTCGCGA TGGATTATCG TCGTTTGAGC GGACATGCGC TGAAAGGCAG TGAAATTATC
CGCCTGGTTG AAGAAAGCGA TCCGGTAGCG GAACTGGCAT TGCGTCGCTA CGAGCTGCGG
CTGGCAAAAT CGCTGGCACA TGTCGTGAAT ATTCTCGATC CGGATGTGAT TGTCCTGGGG
GGCGGGATGA GCAATGTAGA CCGTTTATAT CAAACGGTTG GGCAGTTGAT TAAACAATTT
GTCTTCGGCG GCGAATGTGA AACGCCGGTG CGTAAGGCGA AGCACGGTGA TTCCAGCGGC
GTACGCGGCG CTGCGTGGTT ATGGCCACAA GAGTAA
 
Protein sequence
MITNCRRPCI ANPVVRLYAI DIEKNKESTV RIGIDLGGTK TEVIALGDAG EQLYRHRLPT 
PRDDYRQTIE TIATLVDMAE QATGQRGTVG MGIPGSISPY TGVVKNANST WLNGQPFDKD
LSARLQREVR LANDANCLAV SEAVDGAAAG AQTVFAVIIG TGCGAGVAFN GRAHIGGNGT
AGEWGHNPLP WMDEDELRYR EEVPCYCGKQ GCIETFISGT GFAMDYRRLS GHALKGSEII
RLVEESDPVA ELALRRYELR LAKSLAHVVN ILDPDVIVLG GGMSNVDRLY QTVGQLIKQF
VFGGECETPV RKAKHGDSSG VRGAAWLWPQ E