Gene EcolC_0909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0909 
Symbol 
ID6068653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp981819 
End bp983237 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content51% 
IMG OID641600316 
ProductL-fuculokinase 
Protein accessionYP_001723905 
Protein GI170018951 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID[TIGR02628] L-fuculokinase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAG AAGTTATCCT GGTACTCGAC TGTGGCGCGA CCAATGTCAG GGCCATCGCG 
GTTAATCGGC AGGGCAAAAT TGTTGCCCGC GCCTCAACGC CTAATGCCAG CGATATCGCG
ATGGAAAACA ACACCTGGCA CCAGTGGTCT TTAGACGCCA TTTTGCAACG CTTTGCTGAT
TGCTGTCGGC AAATCAATAG TGAACTGACT GAATGCCACA TCCGCGGTAT CGCCGTCACC
ACCTTTGGTG TGGATGGCGC TCTGGTAGAT AAGCAAGGCA ATCTGCTCTA TCCGATTATT
AGCTGGAAAT GTCCGCGAAC AGCAGCGGTT ATGGACAATA TTGAACGGTT AATCTCCGCA
CAGCGGTTGC AGGCTATTTC TGGCGTCGGA GCCTTTAGTT TCAATACGTT ATATAAGTTG
GTGTGGTTGA AAGAAAATCA TCCACAACTG CTGGAACGCG CGCACGCCTG GCTCTTTATT
TCGTCGCTGA TTAACCACCG TTTAACCGGC GAATTCACTA CTGATATCAC GATGGCCGGA
ACCAGCCAGA TGCTGGATAT CCAGCAACGC GATTTCAGTC CGCAAATTTT ACAAGCCACC
GGTATTCCAC GCCGACTCTT CCCTCGTCTG GTGGAAGCGG GTGAACAGAT TGGTACGCTA
CAGAACAGCG CCGCAGCAAT GCTCGGCTTA CCCGTTGGCA TACCGGTGAT TTCCGCAGGT
CACGATACCC AGTTCGCCCT TTTTGGCGCT GGTGCTGAAC AAAATGAACC CGTGCTCTCT
TCCGGTACAT GGGAAATTTT AATGGTTCGC AGCGCCCAGG TTGATACTTC GCTGTTAAGT
CAGTACGCCG GTTCCACCTG CGAACTGGAT AGCCAGGCAG GGTTGTATAA CCCAGGTATG
CAATGGCTGG CATCCGGCGT GCTGGAATGG GTGAGAAAAC TGTTCTGGAC GGCTGAAACA
CCCTGGCAAA TGTTGATTGA AGAGGCTCGT CTGATCGCGC CTGGCGCGGA TGGCGTAAAA
ATGCAGTGTG ATTTATTGTC GTGTCAGAAC GCTGGCTGGC AAGGAGTGAC GCTTAATACC
ACGCGGGGGC ATTTCTATCG CGCGGCGCTG GAAGGGTTAA CCGCGCAATT ACAGCGCAAT
CTACAGATGC TGGAAAAAAT CGGGCACTTT AAAGCCTCTG AATTATTGTT AGTCGGCGGA
GGAAGTCGCA ACACATTGTG GAATCAGATT AAAGCCAATA TGCTTGATAT TCCGGTAAAA
GTTCTCGACG ACGCCGAAAC GACCGTCGCA GGAGCTGCGC TGTTCGGTTG GTATGGCATA
GGGGAATTTA ACAGCCCGGA AGAAGCCCGC GCGCAGATTC ATTATCAGTA CCGTTATTTC
TACCCGCAAA CTGAACCTGA ATTTATAGAG GAAGTGTGA
 
Protein sequence
MKQEVILVLD CGATNVRAIA VNRQGKIVAR ASTPNASDIA MENNTWHQWS LDAILQRFAD 
CCRQINSELT ECHIRGIAVT TFGVDGALVD KQGNLLYPII SWKCPRTAAV MDNIERLISA
QRLQAISGVG AFSFNTLYKL VWLKENHPQL LERAHAWLFI SSLINHRLTG EFTTDITMAG
TSQMLDIQQR DFSPQILQAT GIPRRLFPRL VEAGEQIGTL QNSAAAMLGL PVGIPVISAG
HDTQFALFGA GAEQNEPVLS SGTWEILMVR SAQVDTSLLS QYAGSTCELD SQAGLYNPGM
QWLASGVLEW VRKLFWTAET PWQMLIEEAR LIAPGADGVK MQCDLLSCQN AGWQGVTLNT
TRGHFYRAAL EGLTAQLQRN LQMLEKIGHF KASELLLVGG GSRNTLWNQI KANMLDIPVK
VLDDAETTVA GAALFGWYGI GEFNSPEEAR AQIHYQYRYF YPQTEPEFIE EV