Gene EcSMS35_2944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2944 
SymbolfucK 
ID6147088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3020767 
End bp3022185 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content51% 
IMG OID641617813 
ProductL-fuculokinase 
Protein accessionYP_001744968 
Protein GI170682163 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID[TIGR02628] L-fuculokinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.124367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAG AAGTTATCCT GGTACTCGAC TGTGGCGCGA CCAATGTCAG GGCCATCGCG 
GTTAATCGGC AGGGAAAAAT TGTTGCCCGC GCCTCAACGC CTAATGCCAG CGATATCGCG
ATGGAAAACA ACACCTGGCA CCAGTGGTCT TTAGACGCCA TTTTGCAACG CTTTGCTGAT
TGCTGTCGGC AAATCAATAG TGAACTGACT GATTGCCACA TCCGCGGTAT CGCCGTCACC
ACCTTTGGTG TGGATGGCGC TCTGGTAGAT AAGCAAGGCA ATCTGCTCTA TCCGATCATT
AGCTGGAAAT GTCCGCGAAC AGCCGCGGTA ATGGACAATA TTGAACGGTT GATCTCCGCA
CAGCAGTTGC AGGCTATTTC TGGAGTCGGA GCCTTTAGTT TCAATACGTT ATATAAGTTG
GTGTGGTTGA AAGAAAATCA TCCACAACTG CTGGAACGCG CGCACGCCTG GCTCTTTATT
TCGTCGCTGA TTAACCACCG TTTAACCGGC GAATTCACTA CTGATATTAC GATGGCCGGA
ACCAGCCAGA TGCTGGATAT CCAGCAACGC GATTTCAGTC CGCAAATTTT ACAAGCCACC
GGTATTCCAC GCCGACTCTT CCCTCGTCTG GTGGAAGCGG GTGAACAGAT TGGTACGCTA
CAGAACAGCG CCGCAGCAAT GCTCGGCTTA CCCGTTGGCA TACCGGTGAT TTCCGCAGGT
CACGATACCC AGTTCGCCCT TTTTGGCGCT GGTGCCGAAC AAAATGAACC CGTGCTCTCT
TCCGGTACAT GGGAAATTTT AATGGTCCGC AGCGCCCAGG TTGATACTTC GCTGTTAAGT
CAGTACGCCG GTTCTACCTG CGAACTGGAT AGCCAGGCAG GGTTGTATAA CCCAGGTATG
CAATGGCTGG CATCCGGCGT GCTGGAATGG GTGAGAAAAC TGTTCTGGAC GGCTGAAACT
CCCTGGCAAA TGTTGATTGA AGAAGCTCGT CTGATCGCTC CTGGCGCAGA TGGAGTGAAA
ATGCAGTGTG ATTTATTGTC GTGTCAGAAC GCTGGCTGGC AAGGAGTGAC GCTTAATACC
ACGCGGGGGC ATTTCTATCG CGCGGCGCTG GAAGGGTTAA CCACGCAATT ACAGCGCAAT
CTACAGATGC TGGAAAAAAT CGGGCACTTT AAAGCCTCTG AATTATTGTT AGTTGGCGGA
GGAAGTCGCA ACACATTGTG GAATCAGATT AAAGCCAATA TGCTTGATAT TCCGGTAAAA
GTTCTCGACG ACGCAGAAAC GACCGTCGCA GGAGCTGCGC TGTTCGGTTG GTATGGCGTA
GGGGAATTTA ACAGCCCGGA AGAAGCCCGC GCGCAGATTC ATTATCAGTA CCGTTATTTC
TACCCGCAAA CTGAACCTGA ATTTATAGAG GAAGTGTGA
 
Protein sequence
MKQEVILVLD CGATNVRAIA VNRQGKIVAR ASTPNASDIA MENNTWHQWS LDAILQRFAD 
CCRQINSELT DCHIRGIAVT TFGVDGALVD KQGNLLYPII SWKCPRTAAV MDNIERLISA
QQLQAISGVG AFSFNTLYKL VWLKENHPQL LERAHAWLFI SSLINHRLTG EFTTDITMAG
TSQMLDIQQR DFSPQILQAT GIPRRLFPRL VEAGEQIGTL QNSAAAMLGL PVGIPVISAG
HDTQFALFGA GAEQNEPVLS SGTWEILMVR SAQVDTSLLS QYAGSTCELD SQAGLYNPGM
QWLASGVLEW VRKLFWTAET PWQMLIEEAR LIAPGADGVK MQCDLLSCQN AGWQGVTLNT
TRGHFYRAAL EGLTTQLQRN LQMLEKIGHF KASELLLVGG GSRNTLWNQI KANMLDIPVK
VLDDAETTVA GAALFGWYGV GEFNSPEEAR AQIHYQYRYF YPQTEPEFIE EV