Gene Mmar10_0291 details


Gene Information

Locus tag: Mmar10_0291
Symbol:
ID: 4284033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 343941
End bp: 345665
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 66%
IMG OID: 638139754
Product: dihydroxyacid dehydratase
Protein accession: YP_755522
Protein GI: 114568842
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
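
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The Python sketch below assumes that the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the record does not state either convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 343941
end_bp = 345665

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1725, matching "Gene Length"
print(protein_length)   # 574, matching "Protein Length"
```

Under these assumptions the computed values agree with the 1725 bp and 574 aa listed in the record.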


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.20791
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCACG ACAGACGCAA ATCTTCAGAC GCCATCACAG CCGGCGCGGC CCGCGCCCCG 
GCCCGGGCCA TGTTGCGCGC GACCGGCATG ACGGATGGTG ATTTCGACAA GCCGATGATC
GGGGTGATCA ACACCTGGAC CACGGTCACG CCCTGCAACA TGCACCTGGC GGACCTGGCC
GCGCCAGTCC GCGAAGCCGT TCGTGAAGCC GGTGGCCATC CGGTCGACTT CAACACGATC
GTCGTGTCCG ACGGGATCTC GATGGGTACC GAAGGCATGC GTGCCTCGCT GATCTCGCGC
GAGGTGATCA CCGACTCGAT TGAACTGGCG ACCCGGGGTC ACAGCCTGGA TGGCGTCGTT
ATCCTCGTCG GTTGTGACAA GACCATTCCG GCGGCCGCGA TGGCGCTGGC CCGCATGGAT
GTGCCCGGTT GCATCCTCTA CGGCGGCACG ATCATGCCGG GCAAGCTGGG CGATCAGGCC
CTGTCCATCC AGGACGTGTT CGAAGCTGTA GGCGCCCATG CCGCGGGGAC CCTGGATGAT
GCAGGCCTGG ACAAGGTCGA AAAAGCGGCT TGCCCGGGCG CCGGCGCCTG TGGCGGCCAG
TTTACCGCCA ATACGATGGC CATGATCCTG ACCATGCTGG GTCTGTCTCC GATGGGCGTG
AACGACATTC CCGCCCCGCA CCCCGACAAG CCGGAGGCGG CGGCTCGCTG CGGCCGACTG
GCTGTCGAGC TGGCGAAATC CGGCACCACC CCGCGCCGTT TCATCACCGA AGCCTCCCTT
CGCAATGCCG TTGTCGGCGC TTCCGCCTCG GGCGGGTCGA CCAATGCCGT CCTGCACGTC
GCCGCAATCG CCGCCGAAGC CGGCATACCG TTCGACATTG CCGAGTTCGA CCGTATCTCC
AGCGAAGCCC CCGTCATCAC CGACCTGAAG CCGGGCGGGC GTTTTCTGGC CCACCACATG
TTCCTGGCCG GCGGCTCCCG CCTGTTCGGT CAGCGCCTGA TCGAGGGCGG CTTGCTGGCT
GATACCCCGA CTGTCTCCGG CAAGAGCCTG CATGAAGAGT GCGCCAGCGC CGAAGAGTCA
CTCAATCAAC GTGTCATCCA ATCCGTTGCC AACCCGGTCA AACCGGATGG TGGTTTCCGC
GTCCTGACCG GCGACCTGGC CCCCGAAGGC GCAGTCCTGA AACTGTCGGG TCATGCCCGC
AGCGAATTTT CCGGACCGGC CCGGGTGTTC GAATGCGAGG AAGACGCCTT CGCCGCCGTG
GAGGCCAACT CGGTCAAGGC TGGTGACATC ATCATCATCC GCAATGAGGG CCCCAAGGGC
GGCCCCGGAA TGCGGGAAAT GCTGGGCGTG ACCGCCGCCC TGGTCGGTCA GGGACTGGCC
GGCGATGTCG CCCTGATCAC CGATGGCCGC TTTTCCGGCG CCTCCAAGGG CTTCGTGATC
GGTCATGTCA GCCCCGAGGC CGCCGATGGC GGTCCGATCG GACGGGTCCG AAATGGAGAC
AGGGTGCGCA TTGATGTCGC CGCCCGCCGG ATCGATGTCG ATGCCGACCT GTCTGCCCGT
CCGCAATCCT CGTCCGGCCG GCCCGCCCCG ACCGGTGTTT TCGCCAAATA TGCCGCGCTG
GTCTCTTCGG CCTCGCGCGG CGCCACCACC ATCATCGCGC CGACTGCCAC CAAGGCGTCG
CCGCAAACAA CCGACATGCA ATCCAAGCAG GAGATGCCCG CATGA
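
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before the sequence can be measured. The following Python sketch shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input. The record lists a GC content of 66% for the full gene; how the database rounds that figure is not stated.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied as printed (six blocks of ten bases).
first_row = "ATGACCCACG ACAGACGCAA ATCTTCAGAC GCCATCACAG CCGGCGCGGC CCGCGCCCCG"

bases = clean_sequence(first_row)
print(len(bases))                    # 60
print(round(gc_fraction(bases), 2))  # GC fraction of this row only
```

Pasting the whole gene sequence into `clean_sequence` should yield a string whose length can be compared with the gene length given in the record.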
 
Protein sequence
MTHDRRKSSD AITAGAARAP ARAMLRATGM TDGDFDKPMI GVINTWTTVT PCNMHLADLA 
APVREAVREA GGHPVDFNTI VVSDGISMGT EGMRASLISR EVITDSIELA TRGHSLDGVV
ILVGCDKTIP AAAMALARMD VPGCILYGGT IMPGKLGDQA LSIQDVFEAV GAHAAGTLDD
AGLDKVEKAA CPGAGACGGQ FTANTMAMIL TMLGLSPMGV NDIPAPHPDK PEAAARCGRL
AVELAKSGTT PRRFITEASL RNAVVGASAS GGSTNAVLHV AAIAAEAGIP FDIAEFDRIS
SEAPVITDLK PGGRFLAHHM FLAGGSRLFG QRLIEGGLLA DTPTVSGKSL HEECASAEES
LNQRVIQSVA NPVKPDGGFR VLTGDLAPEG AVLKLSGHAR SEFSGPARVF ECEEDAFAAV
EANSVKAGDI IIIRNEGPKG GPGMREMLGV TAALVGQGLA GDVALITDGR FSGASKGFVI
GHVSPEAADG GPIGRVRNGD RVRIDVAARR IDVDADLSAR PQSSSGRPAP TGVFAKYAAL
VSSASRGATT IIAPTATKAS PQTTDMQSKQ EMPA
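
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The record specifies translation table 11, which assigns the same amino acids to codons as the standard genetic code and differs from it in the permitted start codons. The Python sketch below builds that codon table with the standard library only; the function name `translate` and the choice to stop at the first stop codon are illustrative assumptions, not part of the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied as printed.
first_row = "ATGACCCACG ACAGACGCAA ATCTTCAGAC GCCATCACAG CCGGCGCGGC CCGCGCCCCG"
last_row = "CCGCAAACAA CCGACATGCA ATCCAAGCAG GAGATGCCCG CATGA"

print(translate(first_row))  # MTHDRRKSSDAITAGAARAP
print(translate(last_row))   # PQTTDMQSKQEMPA
```

The two outputs match the beginning and the end of the protein sequence listed above. The last row can be translated on its own because the rows before it hold a multiple of three bases.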