Gene EcSMS35_4155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4155 
SymbolwecE 
ID6145995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4254793 
End bp4255923 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content55% 
IMG OID641618978 
ProductTDP-4-oxo-6-deoxy-D-glucose transaminase 
Protein accessionYP_001746110 
Protein GI170681601 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0399] Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 
TIGRFAM ID[TIGR02379] TDP-4-keto-6-deoxy-D-glucose transaminase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCAT TTAACGCACC GCCGGTGGTG GGAACCGAAC TCGACTATAT GCAGTCGGCA 
ATGGGTAGCG GCAAACTGTG CGGCGATGGT GGTTTTACCC GTCGCTGCCA GCAGTGGCTG
GAGCAACGTT TTGGCAGCGC CAAAGTGTTA CTGACGCCAT CCTGCACCGC TTCGCTGGAG
ATGGCGGCGC TGCTGCTCGA TATCCAGCCT GGCGATGAAG TGATCATGCC GAGCTACACC
TTTGTCTCCA CCGCCAATGC CTTTGTGCTG CGTGGCGCAA AAATCGTTTT TGTGGATGTT
CGCCCGGACA CCATGAACAT CGACGAAACG CTGATTGAAG CGGCGATCAC CGACAAAACG
CGCGTTATCG TGCCGGTTCA TTACGCGGGT GTGGCCTGCG AAATGGACAC TATTATGGCG
TTGGCGAAGA AGCATAATCT ATTTGTGGTG GAAGATGCCG CTCAGGGCGT GATGTCCACT
TACAAAGGGC GTGCACTGGG AACCATTGGT CATATTGGCT GCTTTAGCTT CCATGAAACC
AAAAACTACA CGGCGGGCGG TGAAGGCGGC GCGACGCTGA TTAACGATAA AGCGTTGATC
GAACGAGCCG AGATCATCCG TGAAAAAGGC ACAAACCGCA GCCAGTTCTT CCGTGGTCAG
GTCGATAAAT ATACCTGGCG CGATATTGGC TCCAGCTATT TGATGTCCGA TCTGCAAGCA
GCGTACCTTT GGGCGCAACT GGAAGCCGCT GAACGTATCA ATCAGCAGCG TCTGGCGCTG
TGGCAAAACT ACTACGATGC GTTAGCGCCT CTGGCGAAAG CCGGGCGTAT CGAGCTGCCG
TCGATTCCCG ATGGCTGCGT GCAGAACGCG CATATGTTCT ACATTAAACT GCGGGATATT
GATGACCGGA GCGCGTTGAT TAACTTTCTG AAAGAAGCGG AAATCATGGC GGTGTTCCAT
TACATTCCGC TGCACGGTTG CCCTGCGGGG GAGCGCTTTG GTGAGTTCCA CGGTGAAGAT
CGCTACACCA CCAAAGAGAG CGAGCGCCTG CTGCGCCTGC CGTTGTTCTA CAACCTGTCG
CCCGTCAATC AGCGTACGGT AATTGCGACC TTGTTGAACT ACTTCTCCTG A
 
Protein sequence
MIPFNAPPVV GTELDYMQSA MGSGKLCGDG GFTRRCQQWL EQRFGSAKVL LTPSCTASLE 
MAALLLDIQP GDEVIMPSYT FVSTANAFVL RGAKIVFVDV RPDTMNIDET LIEAAITDKT
RVIVPVHYAG VACEMDTIMA LAKKHNLFVV EDAAQGVMST YKGRALGTIG HIGCFSFHET
KNYTAGGEGG ATLINDKALI ERAEIIREKG TNRSQFFRGQ VDKYTWRDIG SSYLMSDLQA
AYLWAQLEAA ERINQQRLAL WQNYYDALAP LAKAGRIELP SIPDGCVQNA HMFYIKLRDI
DDRSALINFL KEAEIMAVFH YIPLHGCPAG ERFGEFHGED RYTTKESERL LRLPLFYNLS
PVNQRTVIAT LLNYFS