Gene Mmar10_1341 details


Gene Information

Locus tag: Mmar10_1341
Symbol:
ID: 4284661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1471878
End bp: 1473032
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 61%
IMG OID: 638140821
Product: prephenate dehydratase
Protein accession: YP_756571
Protein GI: 114569891
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase; [COG1605] Chorismate mutase
TIGRFAM ID:
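
The start, end, and length fields above agree with one another if the coordinates are read as 1-based and inclusive, which is an assumption here and not something the record states. A minimal sketch of that arithmetic follows; the function names are illustrative and are not part of the record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
length = gene_length(1471878, 1473032)
print(length, protein_length(length))  # prints: 1155 384
```

The result matches the reported Gene Length (1155 bp) and Protein Length (384 aa): 1155 bases are 385 codons, one of which is the stop codon.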


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.409344
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.126516
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACGG ACATGACGCG GGATGAAATC CGCTCGGCTA TCAGCGATAT CGATAAGGAA 
TTGATCGATG CCATCGCCCG TCGCTCCGGA CTGGTGGAAG AAATCCTCAA GGCCAAGGCG
CGGACCGGCA GTCCGGTCAG GGACCGGGAG CGCGAGCGCG ACGTGCTCCG GGCCGCCGTC
CAGCAGGGCA GCGAGAAAGG CGTTCCGGCT GAACTGGTCG AGACGCTGTT TCATGCGCTG
TTCGAGGCGT CGGTCCGCCG TCAGCGCCAG CAATTCGACT CCATGCGCAA TGACGAGCTC
AATGAGGCCA CGGTCGCCTA TCTGGGCGGT CCCGGCAGCT ACAGCCACAT TGCCGCGCAA
AAGGTCTTCC AGCGTCGCAA TGCGACGGTG GTGCCATCGC CGAAACGGGA CTTTGTCTCG
ATTTTCCGTG CGGTCGAGAA TGCCGAAGTC GATTATGGCG TGATCCCGAT CGAGAACACC
ACGACCGGGT CGATCAACGA GGTCTACGAC ATCTTGATCA ATTCCCACAC GCAGATCATT
GGTGAGTTCC TGCTGCGGGT CGATCATTGC CTGGTCGGTC GGGCGAGCGG GCAGGGGCGC
GTGCGCCGTG TGTTCGGCCA TCCTCAGGCG CTGGCCCAGT GCCGGCGCTA TATCAGCTCG
CATCCGGAGC TTGAAACCCA CATGGCGGCC TCGACCACGC GCGCGCTGGA GCGGCTGTTG
GAGGACGATG ATACGGCGGT GGCCGTCGCC GGCGAGGATG CGGCCCGCCT GTTCGGCATG
GATATTCTGG AGCGCAATGT CGGTGACCAC GAGCAGAATA TCACGCGGTT CATCGTCATC
GGACGCAAGT CCAAACTCCC GACGCGCGAG GTCGAGTGCA AGACGTCGAT GATGTTCACC
ACACGCGACA CGCCGGGATC GCTGGTCAAT GCGTTGATCG GTTTCCGTGA TAACGGGATC
AATCTGGTGA AGCTGGAATC GCGGCCGATT GCCGGCAATC CCTGGGAAGA AATGTTCATC
ATGGATGTCG AAGGGCATCT GGAGGACAGC AAGATCCGCG AGTCCATGTC GGTGCTTGAG
GAGCATACCC GCGAGATCAA ACTGCTCGGC TGTTATGCAA TGGACGCCAT CGACAAGGTG
TCGGTCGCCG AATGA
 
Protein sequence
MATDMTRDEI RSAISDIDKE LIDAIARRSG LVEEILKAKA RTGSPVRDRE RERDVLRAAV 
QQGSEKGVPA ELVETLFHAL FEASVRRQRQ QFDSMRNDEL NEATVAYLGG PGSYSHIAAQ
KVFQRRNATV VPSPKRDFVS IFRAVENAEV DYGVIPIENT TTGSINEVYD ILINSHTQII
GEFLLRVDHC LVGRASGQGR VRRVFGHPQA LAQCRRYISS HPELETHMAA STTRALERLL
EDDDTAVAVA GEDAARLFGM DILERNVGDH EQNITRFIVI GRKSKLPTRE VECKTSMMFT
TRDTPGSLVN ALIGFRDNGI NLVKLESRPI AGNPWEEMFI MDVEGHLEDS KIRESMSVLE
EHTREIKLLG CYAMDAIDKV SVAE
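
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical, and the sample uses only the first and last lines of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence above, used as a short sample.
first_line = "ATGGCAACGG ACATGACGCG GGATGAAATC CGCTCGGCTA TCAGCGATAT CGATAAGGAA"
last_line = "TCGGTCGCCG AATGA"

sample = clean_sequence(first_line)
print(len(sample), round(gc_fraction(sample), 2))

# Check the start codon and the stop codon at the two ends of the gene.
print(sample.startswith("ATG"), clean_sequence(last_line).endswith("TGA"))
```

Passing the complete gene sequence through `clean_sequence` should give a length equal to the reported Gene Length if the copy is intact, and `gc_fraction` can then be compared against the reported GC content.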