Gene EcSMS35_2420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2420 
SymbolmenF 
ID6142620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2468727 
End bp2470022 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content53% 
IMG OID641617293 
Productisochorismate synthase, menaquinone-specific 
Protein accessionYP_001744465 
Protein GI170680455 
COG category[H] Coenzyme transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1169] Isochorismate synthase 
TIGRFAM ID[TIGR00543] isochorismate synthases 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.37861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAATCAC TTACTACGGC GCTGGAAAAT CTACTGCGCC ATTTGTCGCA AGAGATTCCG 
GCGACACCCG GCATTCGGGT TATCGATATT CCTTTCCCTC TCAAAGACGC TTTTGATGCC
TTGAACTGGC TGGCCAGTCA GCAGGTGTAT CCGCAATTCT ACTGGCAACA ACGTAATGGT
GATGAAGAAG CTGCCGTCCT GGGTGCGATA ACCCGTTTTA CGTCGTTGGA CCAGGCACAA
CGTTTTCTTC GCCAGCACCC GGAACACGCC GACTTACGCA TCTGGGGGTT GAATGCGTTT
GACCCGTCGC AGGGCAATTT ACTTTTACCA CGCCTCGAAT GGCGACGCTG TGGCGGTAAA
GCCACGCTGC GGCTGACGCT ATTCAGCGAA AGATCCCTTC AGCATGATGC GATTAAGGCA
AAAGAATTTA TCGCCACACT GGTGAGTATT AAACCCTTGC CTGGGTTACA TTTAACCACC
ACCCGAGAAC AACACTGGCC GGACAAAACG GGCTGGACGC AATTAATCGA ACTGGCAACG
AAAACCATCG CCGAAGGCGA GCTCGACAAA GTGGTGCTTG CTCGGGCAAC TGACCTGCAT
TTCGCAAGTC CGGTCAATGC GGCGGCGATG ATGGCTGCCA GTCGTCGACT GAATCTGAAT
TGCTACCATT TTTACATGGC CTTTGATGGC GAAAATGCTT TTCTTGGCTC TTCACCGGAA
CGATTATGGT GGCGGCGTGA CAAAGCGCTG CGTACTGAAG CGCTGGCGGG AACGGTAGCA
AATCATCCTG ATGATAAGCA GGCGCAGCGG TTGGGTGAGT GGCTGATGGC GGATGATAAA
AACCAGCGCG AGAACATGTT GGTGGTGGAG GATATCTGCC AACGATTGCA GGCCGATACC
CAGACGTTGG ATGTTTTACC GCCGCAGGTA CTGCGTCTGC GTAAAGTGCA GCATCTTCGC
CGCTGTATCT GGACTGCACT CAACAAAGCG GATGATGTGA TCTGTTTACA TCAGTTGCAG
CCAACGGCAG CAGTTGCTGG CTTACCGCGC GATCTGGCGC GACAGTTTAT CGCCCGTCAC
GAACCGTTCA CCCGAGAATG GTACGCCGGT TCTGCGGGCT ATCTCTCATT ACAACAAAGC
GAATTCTGCG TTTCCCTGCG CTCAGCAAAA ATTAGCGGCA ATGTCGTGCG TCTATATGCT
GGCGCGGGCA TTGTCCGTGG TTCCGACCCC GAGCAAGAGT GGCAGGAAAT CGACAACAAA
GCGGCAGGGC TGCGTACTTT ATTACAAATG GAATAG
 
Protein sequence
MQSLTTALEN LLRHLSQEIP ATPGIRVIDI PFPLKDAFDA LNWLASQQVY PQFYWQQRNG 
DEEAAVLGAI TRFTSLDQAQ RFLRQHPEHA DLRIWGLNAF DPSQGNLLLP RLEWRRCGGK
ATLRLTLFSE RSLQHDAIKA KEFIATLVSI KPLPGLHLTT TREQHWPDKT GWTQLIELAT
KTIAEGELDK VVLARATDLH FASPVNAAAM MAASRRLNLN CYHFYMAFDG ENAFLGSSPE
RLWWRRDKAL RTEALAGTVA NHPDDKQAQR LGEWLMADDK NQRENMLVVE DICQRLQADT
QTLDVLPPQV LRLRKVQHLR RCIWTALNKA DDVICLHQLQ PTAAVAGLPR DLARQFIARH
EPFTREWYAG SAGYLSLQQS EFCVSLRSAK ISGNVVRLYA GAGIVRGSDP EQEWQEIDNK
AAGLRTLLQM E