Gene Mmar10_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0100 
Symbol 
ID4283843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp102845 
End bp104188 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content67% 
IMG OID638139563 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_755334 
Protein GI114568654 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.858934 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCAT CTTTGAAGCG CCTTTCCGGC GCCATGCGGG CCAGGCCCGC GCCTGCCCTG 
TCCGGAACCA TCAAGGCACC GGGCGACAAG TCAATATCCC ATCGCGCCTT TATTCTGGGC
GGGCTGGCAA AAGGCGTGAC CGAAGTCACC GGCCTGCTCG AAAGCGATGA TGTGATCAAT
TCCGGGCGCG CCGCCGCAGC GCTAGGCGCC AAGGTCGAGC ATCTCGGCCC GGGACACTGG
CGGATCGACG GGTGCGGCGG TCAATGGACC ACCCCGTCCG CGCCGCTGGA CTTCGGCAAT
GCCGGAACCG GTGTCCGCCT GATGATGGGC GCCGTGGCCG GGACCGGGAC CAGCGCCGAC
TTCATCGGCG ATGAAAGCCT GTCCTCCAGG CCGATGCGTC GCGTGACCGA TCCGCTCGGC
GAAATGGGCG CCCGTTTCAC CACGACCGGG GGACGCCTGC CGGCTCATCT GGACGGCGGC
CCGCTGGCCG GGATTCACTA CACGCCGCCC ATTGCCTCGG CACAGGTGAA GTCCGCCGTC
CTGCTGGCGG CACTCGGTGC GACCGGAACC ACGGTGGTTC ATGAACCGCA GATCACCCGT
GACCACACGG AAACCATGCT CAGGGCCTTC GGTGTCACGC TGACGGTCGA ACGAGACGGC
GCTGCGGCGA CCGTCACCCT GACCGGGCCG CAAACGCTCA TCGCCTGCCC GGTCGACGTG
CCGGGAGACC CGTCCTCCTC GGCCTTCGCG ATCGTGGCGG CACTGATCAG TCCAGGCTCC
GACATCACCC TGGAAGGCGT GATGGACAAT CCGGCCCGGA CCGGCCTGAT CGAGACGCTG
AAGGAAATGG GCGCCGACAT CACCCTCACA CCGGGACCGG ACATGGCCGG CGAGAAAACC
ATGCATATCC ACGTGAAACA CAGCCAGCTG CACGGCATCA CGGTGCCGGC AACGCGGGCA
CCGTCGATGA TCGACGAATA TCCGGTCCTG TGTGTCGCGG CGGCCTATGC GGACGGCATC
ACCCACATGC CGGGCCTCGA GGAGTTGCGC GCCAAGGAGA GTGATCGTCT CGCGGGGAGT
GCAGCGATGC TGCGCGCCAA TGGCGTTCCG GTCGAGGAAG GCGAGGACAG CCTCGCCGTC
ACCGGCATGG GCATTGGCGG CGTGCCCGGC GGTGGCCGCA CCGTGACTCA CCACGACCAT
CGCCTCGCCA TGAGCGGCCT GGTGATCGGG CTGGGTGCCA AGGCCGCCTC CTCCGTCGAT
GACATTGCCA TGATCGCCAC CTCCTACCCC GATTTCTTCG ACCATATCGC CACGCTCGGC
GGCCGCCTGG AGCCCCTCAC ATGA
 
Protein sequence
MTPSLKRLSG AMRARPAPAL SGTIKAPGDK SISHRAFILG GLAKGVTEVT GLLESDDVIN 
SGRAAAALGA KVEHLGPGHW RIDGCGGQWT TPSAPLDFGN AGTGVRLMMG AVAGTGTSAD
FIGDESLSSR PMRRVTDPLG EMGARFTTTG GRLPAHLDGG PLAGIHYTPP IASAQVKSAV
LLAALGATGT TVVHEPQITR DHTETMLRAF GVTLTVERDG AAATVTLTGP QTLIACPVDV
PGDPSSSAFA IVAALISPGS DITLEGVMDN PARTGLIETL KEMGADITLT PGPDMAGEKT
MHIHVKHSQL HGITVPATRA PSMIDEYPVL CVAAAYADGI THMPGLEELR AKESDRLAGS
AAMLRANGVP VEEGEDSLAV TGMGIGGVPG GGRTVTHHDH RLAMSGLVIG LGAKAASSVD
DIAMIATSYP DFFDHIATLG GRLEPLT