Gene EcolC_4212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4212 
Symbol 
ID6067760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4653708 
End bp4654838 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content55% 
IMG OID641603644 
ProductTDP-4-oxo-6-deoxy-D-glucose transaminase 
Protein accessionYP_001727136 
Protein GI170022182 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0399] Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 
TIGRFAM ID[TIGR02379] TDP-4-keto-6-deoxy-D-glucose transaminase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCAT TTAACGCACC GCCGGTGGTG GGAACCGAAC TCGACTATAT GCAGTCGGCA 
ATGGGTAGCG GCAAACTGTG TGGCGATGGC GGTTTTACCC GTCGCTGCCA GCAGTGGCTG
GAGCAACGTT TTGGCAGCGC CAAAGTGTTA CTGACGCCGT CCTGCACCGC TTCGCTGGAG
ATGGCGGCGC TGCTGCTCGA TATCCAGCCT GGCGATGAAG TGATCATGCC GAGCTACACC
TTTGTCTCCA CCGCCAATGC CTTTGTGCTG CGTGGCGCAA AAATCGTTTT TGTGGATGTT
CGCCCGGACA CCATGAACAT CGACGAAACG CTGATTGAAG CGGCGATCAC CGACAAAACG
CGCGTTATCG TGCCGGTCCA TTACGCGGGT GTGGCCTGCG AAATGGACAC CATTATGGCG
TTGGCGAAAA AGCATAATCT TTTTGTGGTG GAAGATGCCG CTCAGGGCGT GATGTCCACT
TACAAAGGGC GTGCACTGGG AACCATTGGT CATATTGGCT GCTTTAGCTT CCATGAAACC
AAAAACTACA CGGCGGGTGG TGAAGGCGGC GCGACGCTGA TTAACGATAA AGCGTTAATC
GAACGAGCCG AGATCATCCG TGAAAAGGGC ACTAACCGCA GCCAGTTCTT CCGTGGTCAG
GTCGATAAAT ATACCTGGCG CGATATTGGC TCCAGCTATT TGATGTCCGA TCTGCAAGCT
GCGTACCTGT GGGCGCAACT GGAAGCAGCG GATCGTATCA ACCAGCAACG TCTGGCGCTG
TGGCAAAACT ACTACGATGC GTTAGCGCCT CTGGCGAAAG CCGGGCGTAT CGAGCTGCCG
TCGATTCCCG ATGGCTGCGT GCAGAACGCG CATATGTTCT ACATTAAACT GCGGGATATT
GATGACCGGA GCGCGTTGAT TAACTTTCTG AAAGAAGCGG AAATCATGGC GGTGTTTCAT
TACATTCCGC TGCACGGTTG CCCTGCGGGG GAACACTTTG GTGAGTTCCA CGGTGAAGAT
CGCTACACCA CCAAAGAGAG CGAGCGCCTG CTGCGCCTGC CGCTGTTCTA CAACCTGTCG
CCCGTCAATC AGCGTACGGT AATTGCGACT TTGTTGAACT ACTTCTCCTG A
 
Protein sequence
MIPFNAPPVV GTELDYMQSA MGSGKLCGDG GFTRRCQQWL EQRFGSAKVL LTPSCTASLE 
MAALLLDIQP GDEVIMPSYT FVSTANAFVL RGAKIVFVDV RPDTMNIDET LIEAAITDKT
RVIVPVHYAG VACEMDTIMA LAKKHNLFVV EDAAQGVMST YKGRALGTIG HIGCFSFHET
KNYTAGGEGG ATLINDKALI ERAEIIREKG TNRSQFFRGQ VDKYTWRDIG SSYLMSDLQA
AYLWAQLEAA DRINQQRLAL WQNYYDALAP LAKAGRIELP SIPDGCVQNA HMFYIKLRDI
DDRSALINFL KEAEIMAVFH YIPLHGCPAG EHFGEFHGED RYTTKESERL LRLPLFYNLS
PVNQRTVIAT LLNYFS