Gene EcolC_3942 details


Gene Information

Locus tag: EcolC_3942
Symbol:
ID: 6064442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4326162
End bp: 4327091
Gene Length: 930 bp
Protein Length: 309 aa
Translation table: 11
GC content: 55%
IMG OID: 641603355
Product: D-allose kinase
Protein accession: YP_001726870
Protein GI: 170021916
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
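
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4326162, 4327091)
length_aa = protein_length(length_bp)
```

With the values in this record, `length_bp` is 930 and `length_aa` is 309, matching the Gene Length and Protein Length fields.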


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAC AGCATAACGT CGTAGCGGGC GTGGATATGG GGGCAACGCA TATCCGCTTT 
TGTCTGCGGA CAGCAGAAGG TGAAACGCTA CACTGCGAAA AAAAGCGGAC CGCAGAAGTC
ATTGCTCCCG GCCTGGTGTC GGGTATCGGC GAAATGATTG ACGAGCAACT CAGGCGCTTT
AACGCTCGCT GTCATGGTCT GGTGATGGGA TTTCCGGCGC TGGTCAGTAA AGATAAACGC
ACCATTATTT CTACGCCTAA CCTGCCGTTA ACAGCGGCGG ATTTATATGA TCTCGCCGAT
AAGCTCGAAA ATACGCTGAA TTGTCCGGTT GAGTTTTCCC GCGACGTTAA CCTGCAACTC
TCCTGGGACG TAGTAGAAAA CCGCCTTACG CAACAACTGG TTCTGGCGGC CTATCTCGGT
ACGGGGATGG GGTTCGCAGT GTGGATGAAC GGTGCGCCGT GGACGGGTGC ACACGGTGTG
GCAGGCGAAC TGGGTCATAT CCCCCTGGGA GATATGACCC AACACTGCGC GTGTGGCAAT
CCTGGGTGCC TGGAAACCAA TTGCTCTGGA ATGGCGCTAA GACGCTGGTA CGAACAACAG
CCCCGAAATT ACCCATTGCG CGATCTTTTC GTCCATGCGG AAAACGCCCC TTTCGTCCAG
AGTCTGCTTG AAAACGCGGC ACGGGCCATT GCCACCAGCA TTAATCTGTT CGATCCCGAT
GCGGTGATCC TGGGCGGTGG CGTGATGGAT ATGCCCGCCT TCCCACGCGA GACTCTCGTT
GCCATGACCC AAAAGAACCT GCGCCGTCCA CTGCCGCATC AGGTCGTGCG CTTTATTGCC
GCCTCATCTT CTGACTTTAA TGGCGCTCAG GGTGCAGCAA TATTGGCGCA TCAACGTTTT
TTGCCACAGT TCTGTGCTAA AGCCCCATGA
 
Protein sequence
MQKQHNVVAG VDMGATHIRF CLRTAEGETL HCEKKRTAEV IAPGLVSGIG EMIDEQLRRF 
NARCHGLVMG FPALVSKDKR TIISTPNLPL TAADLYDLAD KLENTLNCPV EFSRDVNLQL
SWDVVENRLT QQLVLAAYLG TGMGFAVWMN GAPWTGAHGV AGELGHIPLG DMTQHCACGN
PGCLETNCSG MALRRWYEQQ PRNYPLRDLF VHAENAPFVQ SLLENAARAI ATSINLFDPD
AVILGGGVMD MPAFPRETLV AMTQKNLRRP LPHQVVRFIA ASSSDFNGAQ GAAILAHQRF
LPQFCAKAP
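
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the tool that generated this record: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares for sense and stop codons (the tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
first_residues = translate("ATGCAAAAAC AGCATAACGT CGTAGCGGGC")
```

Here `first_residues` is `MQKQHNVVAG`, which matches the first ten residues of the protein sequence.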