Gene EcSMS35_4372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4372 
SymbolmenA 
ID6143401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4458288 
End bp4459226 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content53% 
IMG OID641619193 
Product1,4-dihydroxy-2-naphthoate octaprenyltransferase 
Protein accessionYP_001746317 
Protein GI170682345 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1575] 1,4-dihydroxy-2-naphthoate octaprenyltransferase 
TIGRFAM ID[TIGR00751] 1,4-dihydroxy-2-naphthoate octaprenyltransferase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.560485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGCGTA TTATGACTGA ACAACAAATT AGCCGAACTC AGGCGTGGCT GGAAAGTTTA 
CGACCTAAAA CCCTCCCCCT CGCCTTTGCT GCAATTATCG TCGGGACGGC GCTGGCATGG
TGGCAAGGTC ACTTCGATCC ACTGGTCGCC CTGCTGGCAT TGATTACCGC CGGGCTATTA
CAGATCCTTT CTAACCTCGC CAATGATTAC GGCGATGCGG TAAAAGGCAG CGATAAACCT
GACCGCATTG GGCCGCTACG CGGCATGCAA AAAGGGGTCA TTACCCAGCA AGAGATGAAA
CGGGCGCTCA TTATTACCGT TGTGCTCATC TGTCTTTCCG GGCTGGCACT GGTTGCAGTG
GCGTGCCATA CGCTGGCCGA TTTTGTCGGT TTCCTGATTC TTGGCGGGTT GTCGATCATT
GCCGCTATCA CCTACACCGT GGGCAATCGT CCTTATGGTT ATATCGGTTT GGGTGATATT
TCCGTACTGG TTTTCTTTGG CTGGTTGAGC GTCATGGGGA GCTGGTATTT ACAGGCTCAT
ACATTGATTC CGGCACTGAT CCTTCCGGCA ACCGCATGCG GCCTGCTGGC AACGGCAGTA
CTGAACATTA ATAACCTGCG TGATATCAAT AGCGACCGCG AAAACGGCAA AAACACGCTG
GTGGTGCGCT TAGGTGCAGT AAACGCGCGT CGTTATCATG CCTGCCTGCT GATGGGCTCT
CTGGTGTGTC TGGCGCTGTT TAATCTCTTT TCGCTGCATA GCCTGTGGGG CTGGCTGTTC
CTGCTGGCGG CACCGTTGCT GGTGAAGCAA GCTCGTTATG TGATGCGCGA AATGGACCCG
GTGGCGATGC GACCAATGCT GGAACGCACT GTCAAAGGAG CGTTACTGAC TAACCTGCTG
TTTGTTTTAG GGATATTCCT AAGCCAGTGG GCAGCATAA
 
Protein sequence
MARIMTEQQI SRTQAWLESL RPKTLPLAFA AIIVGTALAW WQGHFDPLVA LLALITAGLL 
QILSNLANDY GDAVKGSDKP DRIGPLRGMQ KGVITQQEMK RALIITVVLI CLSGLALVAV
ACHTLADFVG FLILGGLSII AAITYTVGNR PYGYIGLGDI SVLVFFGWLS VMGSWYLQAH
TLIPALILPA TACGLLATAV LNINNLRDIN SDRENGKNTL VVRLGAVNAR RYHACLLMGS
LVCLALFNLF SLHSLWGWLF LLAAPLLVKQ ARYVMREMDP VAMRPMLERT VKGALLTNLL
FVLGIFLSQW AA