Gene EcolC_0937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0937 
Symbol 
ID6068454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1020025 
End bp1021503 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content48% 
IMG OID641600345 
Productcarbohydrate kinase FGGY 
Protein accessionYP_001723933 
Protein GI170018979 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.101541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAAA AATACATCAT AGGGATTGAT GGCGGAAGTC AGAGCACAAA AGTGGTGATG 
TACGATCTGG AAGGTAATGT GGTTTGCGAA GGTAAAGGCT TATTACAGCC GATGCACACG
CCAGATGCCG ATACTGCAGA ACATCCTGAC GACGATTTAT GGGCATCATT ATGTTTTGCC
GGTCACGATT TGATGAGTCA GTTTGCCGGG AATAAAGAAG ATATTGTCGG TATTGGTCTG
GGATCCATCC GTTGCTGCCG TGCGTTATTG AAAGCCGATG GCACGCCAGC TGCGCCGTTG
ATTAGCTGGC AGGATGCACG CGTTACACGC CCTTACGAAC ATACGAATCC TGACGTGGCG
TATGTCACCT CTTTTTCGGG TTATCTGACG CATCGCTTAA CCGGCGAGTT TAAAGACAAT
ATCGCCAACT ATTTTGGTCA GTGGCCGGTG GATTATAAGA GCTGGGCATG GAGCGAAGAT
GCTGCGGTAA TGGATAAGTT TAATATCCCC CGTCATATGC TGTTTGATGT GCAAATGCCT
GGCACCGTCC TCGGACATAT CACACCACAA GCCGCACTGG CGACACATTT CCCGGCAGGA
CTGCCGGTTG TTTGTACCAC CAGTGATAAA CCGGTAGAAG CTCTGGGGGC TGGATTACTG
GATGATGAAA CTGCGGTAAT TTCTCTTGGC ACTTACATCG CATTGATGAT GAACGGCAAA
GCACTGCCGA AAGATCCGGT GGCGTACTGG CCGATTATGT CTTCTATTCC GCAAACATTG
CTGTATGAAG GTTACGGTAT TCGCAAAGGT ATGTGGACGG TGAGCTGGCT GCGTGACATG
TTAGGCGAGT CGTTAATTCA GGATGCCAGG GCGCAGGATC TTTCACCGGA AGATTTACTC
AACAAGAAAG CCTCTAGTGT GCCACCTGGC TGTAATGGGC TGATGACGGT GCTGGACTGG
CTGACCAATC CGTGGGAACC GTACAAACGC GGGATTATGA TCGGCTTTGA TTCCAGCATG
GATTACGCAT GGATATATCG TTCGATATTG GAAAGTGTGG CGCTAACGCT GAAGAACAAT
TACGACAATA TGTGTAATGA AATGAATCAC TTTGCGAAGC ATGTGATCAT TACTGGCGGC
GGTTCGAACA GCGATCTGTT TATGCAAATT TTTGCCGACG TGTTCAACCT TCCGGCACGC
CGTAACGCCA TTAACGGTTG TGCAAGTCTG GGAGCAGCGA TTAATACAGC GGTAGGTCTG
GGGCTATACC CGGATTACGC AACGGCTGTT GATAAGATGG TTCGCGTGAA AGATATCTTT
ATACCGATTG AGAGCAATGC CAAACGCTAC GACGCGATGA ATAAAGGCAT TTTCAAAGAC
CTAACCAAAC ATACTGATGT GATCCTGAAA AAATCGTATG AAGTGATGCA TGGGGAATTG
GGGAATGTGG ATTCGATCCA GAGCTGGTCG AATGCGTAA
 
Protein sequence
MSKKYIIGID GGSQSTKVVM YDLEGNVVCE GKGLLQPMHT PDADTAEHPD DDLWASLCFA 
GHDLMSQFAG NKEDIVGIGL GSIRCCRALL KADGTPAAPL ISWQDARVTR PYEHTNPDVA
YVTSFSGYLT HRLTGEFKDN IANYFGQWPV DYKSWAWSED AAVMDKFNIP RHMLFDVQMP
GTVLGHITPQ AALATHFPAG LPVVCTTSDK PVEALGAGLL DDETAVISLG TYIALMMNGK
ALPKDPVAYW PIMSSIPQTL LYEGYGIRKG MWTVSWLRDM LGESLIQDAR AQDLSPEDLL
NKKASSVPPG CNGLMTVLDW LTNPWEPYKR GIMIGFDSSM DYAWIYRSIL ESVALTLKNN
YDNMCNEMNH FAKHVIITGG GSNSDLFMQI FADVFNLPAR RNAINGCASL GAAINTAVGL
GLYPDYATAV DKMVRVKDIF IPIESNAKRY DAMNKGIFKD LTKHTDVILK KSYEVMHGEL
GNVDSIQSWS NA