Gene Mmar10_0974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0974 
Symbol 
ID4284952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1071803 
End bp1073173 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content65% 
IMG OID638140443 
Product3-deoxy-D-arabinoheptulosonate-7-phosphate synthase 
Protein accessionYP_756205 
Protein GI114569525 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTGGT CCCCCCAATC CTGGCGCTCC AAACCCGTCA GTCAGATGCC GAACTACAAG 
GATGCCGCCA AGCTGGATGC CGCGATTGCC GAGCTTTCGG CGCGTCCGGC TCTGGTCTTT
GCCGGCGAGG CACGACGCCT GCGGCGCCAG CTGGCCGATG TGACGGCGGG CAAGGCCTTC
CTGTTGCAGG GCGGTGATTG TGCCGAGAGC TTCAAGGAGT TCTCGACCGA GGGGGTGCGC
GACACCTTCC GTGTGCTGCT GCAGATGGCG GTGGTGATGA CCTTTGCCGC CTCCAAGCCG
ATCGTGAAGG TCGGCCGGAT CGCGGGCCAG TTCGCCAAGC CCCGCTCGGC GGACATGGAG
ACCATTGATG GCGTGTCATT GCCCAGCTAT CGCGGTGACA GCGTCAATGG ACCCGAATTC
ACGCCGGAAG CGCGTGAGCC CGATCCGCAG CGCTTGATCC GCGCTTATGA CCAGTCAGCC
TCGACACTGA ACCTGCTGCG CGCCTTTGCG TCAGGCGGTT ATGCCGACCT GCACAATGTC
CACCAGTGGA CCCAGGACTT CGTCAGCGAC AGCCCGGCGG CGGAACGCTA TGCCGAGACC
GCGGCGCGCA TCTCCGAAGC CCTGGCCTTC ATGAAGGCCT GCGGCATCGG TCGCGACAGT
GCGCCGTCGC TGGAAGCGGT CGACTTCTTC ACCAGTCACG AAGCGCTGCA TCTGCCCTTC
GAGGAAGCGT TGACCCGCCG CGACCCCAAT ACCGGCCAGT GGTATGCCAC CTCGGCGCAC
ATGATCTGGA CCGGCGAGCG GACCCGTCAG CTCGACGGGG CCCATGTCGA GTATGCGCGG
GGCATCGCCA ATCCCGTCGG CGTCAAATGC GGCCCGACCA TGCAGCCGGA CGACCTGCTG
CCGCTGATCG ACGCGCTGAA CCCGGACAAT GAGGCCGGGC GCCTCGTCCT GATCGTGCGC
ATGGGCGCCG ATAATGTGGT CAAGAACTTG CCCAAGCTCG CCGCCGCCGT GACCAAGGCC
GGCCGCAAGG TGGTCTGGTC GTCCGACCCG ATGCACGGCA ACACCCACAA AACCTCAAAT
GGCTACAAGA CCCGCGACTT TGACCGCATC CTGTCCGAAC TCGAGGGCTT CATGGATGTA
CTCTATGCCG AGGGGGCCTA TCCCGGCGGT GTGCATTTCG AGATGACCGG TCGCGATGTG
ACCGAGTGCG TCGGCGGCGC CAAGACGGTC ACCGAGGCTG ATCTGGCGGC GCGCTATCAC
ACCCATTGCG ATCCGCGCCT GAATGCCGAC CAGGCGCTCG ACATGGCCTT CCGCATTGCC
GAGAGCCTGA AGCGGGTCCG CAACAACAAC TCGGCTGCCA ACGCGGCCTG A
 
Protein sequence
MTWSPQSWRS KPVSQMPNYK DAAKLDAAIA ELSARPALVF AGEARRLRRQ LADVTAGKAF 
LLQGGDCAES FKEFSTEGVR DTFRVLLQMA VVMTFAASKP IVKVGRIAGQ FAKPRSADME
TIDGVSLPSY RGDSVNGPEF TPEAREPDPQ RLIRAYDQSA STLNLLRAFA SGGYADLHNV
HQWTQDFVSD SPAAERYAET AARISEALAF MKACGIGRDS APSLEAVDFF TSHEALHLPF
EEALTRRDPN TGQWYATSAH MIWTGERTRQ LDGAHVEYAR GIANPVGVKC GPTMQPDDLL
PLIDALNPDN EAGRLVLIVR MGADNVVKNL PKLAAAVTKA GRKVVWSSDP MHGNTHKTSN
GYKTRDFDRI LSELEGFMDV LYAEGAYPGG VHFEMTGRDV TECVGGAKTV TEADLAARYH
THCDPRLNAD QALDMAFRIA ESLKRVRNNN SAANAA