Gene EcHS_A2947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2947 
SymbolfucK 
ID5594172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2955677 
End bp2957095 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content51% 
IMG OID640922065 
ProductL-fuculokinase 
Protein accessionYP_001459575 
Protein GI157162257 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID[TIGR02628] L-fuculokinase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.0145106 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAG AAGTTATCCT GGTACTCGAC TGTGGCGCGA CCAATGTCAG GGCCATCGCG 
GTTAATCGGC AGGGAAAAAT TGTTGCCCGC GCCTCAACGC CTAATGCCAG CGATATCGCG
ATGGAAAACA ACACCTGGCA CCAGTGGTCT TTAGACGCCA TTTTGCAACG CTTTGCTGAT
TGCTGTCGGC AAATCAATAG TGAACTAACT GAATGCCACA TCCGCGGTAT CGCCGTCACC
ACCTTTGGTG TGGATGGCGC TCTGGTAGAT AAGCAAGGCA ATCTGCTCTA TCCGATCATT
AGCTGGAAAT GTCCGCGAAC AGCAGCGGTT ATGGACAATA TTGAACAGTT GATCTCCGCA
CAGCGGTTGC AGGCTATTTC TGGCGTCGGA GCCTTTAGTT TCAATACGTT ATATAAGTTG
GTGTGGTTGA AAGAAAATCA TCCACAACTG CTGGAACGCG CGCACGCCTG GCTCTTTATT
TCGTCGCTGA TTAACCACCG TTTAACCGGC GAATTCACTA CTGATATCAC GATGGCCGGA
ACCAGCCAGA TGCTGGATAT CCAGCAACGC GATTTCAGTC CGCAAATTTT ACAAGCCACC
GGTATTCCAC GCCGACTCTT CCCTCGTCTG GTGGAAGCGG GTGAACAGAT TGGTACGCTA
CAGAACAGCG CCGCAGCAAT GCTCGGCTTA CCCGTTGGCA TACCGGTGAT TTCCGCAGGT
CACGATACCC AGTTCGCCCT TTTTGGCGCT GGTGCCGACC AAAATGAACC CGTGCTCTCT
TCCGGTACAT GGGAAATTTT AATGGTCCGT AGCGCCCAGG TTGATACTTC GCTGTTAAGT
CAGTACGCCG GTTCCACCTG CGAACTGGAT AGCCAGGCAG GGTTGTATAA CCCAGGTATG
CAATGGCTGG CATCCGGCGT GCTGGAATGG GTGAGAAAAC TGTTCTGGAC GGCTGAAACA
CCCTGGCAAA TGTTGATTGA AGAGGCTCGT CTGATCGCGC CTGGCGCGGA TGGCGTAAAA
ATGCAGTGTG ATTTATTGTC GTGTCAGAAC GCTGGCTGGC AAGGAGTGAC GCTTAATACC
ACGCGGGGGC ATTTCTATCG CGCGGCGCTG GAAGGGTTAA CCGCGCAATT ACAGCGCAAT
CTACAGATGC TGGAAAAAAT CGGGCACTTT AAAGCCTCTG AATTATTGTT AGTCGGCGGA
GGAAGTCGCA ACACATTGTG GAATCAGATT AAAGCCAATA TGCTTGATAT TCCGGTAAAA
GTTCTCGACG ACGCCGAAAC GACCGTCGCA GGAGCTGCGC TGTTCGGTTG GTATGGCATA
GGGGAATTTA ACAGCCCGGA AGAAGCCCGC GCGCAGATTC ATTATCAGTA CCGTTATTTC
TACCCGCAAA CTGAACCTGA ATTTATAGAG GAAGTGTGA
 
Protein sequence
MKQEVILVLD CGATNVRAIA VNRQGKIVAR ASTPNASDIA MENNTWHQWS LDAILQRFAD 
CCRQINSELT ECHIRGIAVT TFGVDGALVD KQGNLLYPII SWKCPRTAAV MDNIEQLISA
QRLQAISGVG AFSFNTLYKL VWLKENHPQL LERAHAWLFI SSLINHRLTG EFTTDITMAG
TSQMLDIQQR DFSPQILQAT GIPRRLFPRL VEAGEQIGTL QNSAAAMLGL PVGIPVISAG
HDTQFALFGA GADQNEPVLS SGTWEILMVR SAQVDTSLLS QYAGSTCELD SQAGLYNPGM
QWLASGVLEW VRKLFWTAET PWQMLIEEAR LIAPGADGVK MQCDLLSCQN AGWQGVTLNT
TRGHFYRAAL EGLTAQLQRN LQMLEKIGHF KASELLLVGG GSRNTLWNQI KANMLDIPVK
VLDDAETTVA GAALFGWYGI GEFNSPEEAR AQIHYQYRYF YPQTEPEFIE EV