Gene Mmar10_2274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2274 
Symbol 
ID4286740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2478558 
End bp2479661 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content67% 
IMG OID638141776 
Product3-dehydroquinate synthase 
Protein accessionYP_757504 
Protein GI114570824 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.492898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGACA TTCTGCGGGT CACCGTCGGC TTGGGAGACC GGGCCTATGA CGTGCTGGTC 
GGCGCGGGCG CCCTTTCCGC AGCCGGTCCG GAACTGGTCG CCCATTTCCC GCGCGGCCGG
GCCATCCTGG TGACCGACCG GCATGTCGCC GACCTGCATC TCGACGCCGT GACCGCACAG
CTGAGCAGGC TGGGCCTGCG TGTCGAGCCG GTCATTATCG CCCCCGGCGA AACATCCAAG
AGCTGGGCCG GGCTGGAACA GGTCGTCGAC GCGCTTCTCG ATCGCAATAT CGAGCGATCC
GAAGCCGTGA TCGCGCTGGG TGGCGGTGTC ATTGGCGACC TGACCGGATT TGCGGCGGCG
GTGACCAAGC GCGGGGTCAA CTTCATCCAG ATTCCCACCA CCCTGCTCGC CCAGGTCGAC
AGCTCGGTTG GCGGCAAGAC CGGTATCAAT ACAACCCACG GCAAGAATTT CGCCGGTAGT
TTCCACCAGC CAAAGCTGGT GATCGCCGAT CGCGATCTGC TGGCAACGCT CCCGGACCGC
GAGCGCCGTG CCGGCTATGC GGAAATCGTC AAGGCCGCGC TGATCGGTGA TGCCCCGCTG
TTCGCGCAGC TGGAAGCCGC TGGCGCTGGC GTGCTGGACG GTGCCGACCT GGATCAGGCT
GTTGCAGCGG CGGTCGCCTT CAAGGCCCGG ATCGTCGCCG AGGACGAACG CGAAACAGGC
GTCCGCGCCC TGCTCAATCT GGGCCATACT TTCGGCCATG CCTTTGAAGC CGATGCGCCC
AAGGATGTGA TCCGGCATGG CGAGGCGGTC GCGGTCGGCA CGGCGCTGGC CTTTGCCTAT
TCCGCCCATC GCGGCGATTG CAGCGCCGAC CACGCGGCAC GCGTCGCGGC CCATTTGCGC
GCGGTCGGGC TGCCGGCCAG TCCCGCCGAA CTTGCGCACA GCGACTGGAA TGCCGCCAGC
CTCGTTTCCC GGATGCGCGA CGACAAGAAG AACCGCGACG GCCGCATCAC CCTCATCCTC
GCCCGCGCCA TCGGCGCAGC ATTCATTGAC CCGGCGGCCG ACGAAGCCGA CCTTCTCGCC
TTTATGGAGA CCCAGCTCTC ATGA
 
Protein sequence
MSDILRVTVG LGDRAYDVLV GAGALSAAGP ELVAHFPRGR AILVTDRHVA DLHLDAVTAQ 
LSRLGLRVEP VIIAPGETSK SWAGLEQVVD ALLDRNIERS EAVIALGGGV IGDLTGFAAA
VTKRGVNFIQ IPTTLLAQVD SSVGGKTGIN TTHGKNFAGS FHQPKLVIAD RDLLATLPDR
ERRAGYAEIV KAALIGDAPL FAQLEAAGAG VLDGADLDQA VAAAVAFKAR IVAEDERETG
VRALLNLGHT FGHAFEADAP KDVIRHGEAV AVGTALAFAY SAHRGDCSAD HAARVAAHLR
AVGLPASPAE LAHSDWNAAS LVSRMRDDKK NRDGRITLIL ARAIGAAFID PAADEADLLA
FMETQLS