Gene Mmar10_2208 details


Gene Information

Locus tag: Mmar10_2208
Symbol:
ID: 4284977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 2409334
End bp: 2410914
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 64%
IMG OID: 638141710
Product: glycine dehydrogenase subunit 2
Protein accession: YP_757438
Protein GI: 114570758
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain
TIGRFAM ID:
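
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; both assumptions are consistent with the listed values but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2409334, 2410914)
print(length, protein_length_aa(length))  # prints "1581 526"
```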


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.283964
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.12448
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGATGA ACACCCAAGG ACGCCCCACC CGCATGGCCG AGAACATCGC CCCCGATAAT 
CTGGCAACCG GCGGCTATGC CGACACCATT TCCGGCAGCC GTGGTCTCGA TCAGGCCGAA
CCGTTGATCT TCGAGCGCGG TGGCATGGAT CGCTGCGGCG TTGACCTGCC TGAGCCCAAG
GGGCTCAAGA CACGTCTGGG CGGTCTCGAG CGCAAGGATG CGATCGGCCT GCCCGGTCTC
GCCGAGCCGG AAACCATGCG CCACTATGTG CGCCTGTCGC GGAAGAATTA CGCCATCGAT
CTGGGCCTTT ACCCGCTTGG CTCGTGCACG ATGAAGCACA ATCCGCGTCT CAACGAGAAG
GTCGCGCGGA TGCCGGGCTT TGCCGATGTG CACCCGCTGC AGCCGGCCTC GACGGTGCAG
GGCGCCTATC AGGTGATGGG CGAGCTGGCC CATTGGCTGA TGACGCTGAC CAACATGCCC
GCCGTCGCCC TGTCGCCGAA GGCCGGTGCC CATGGCGAGT TCTGCGGCAT GATGGCGATC
CGCGCCAAGC TGGATGCCGA TGGCCAGACC GGCCGTCGCC GCATTCTCGT TCCGGAAAGC
GCCCACGGTA CCAATCCCGC AACCGCCGTA CAGTGTGGTT TTACCGTTGA CGAGATCCCG
GCGGACAAGA CCGGTCGTGT CGACATGGAG GCCTTCAAGG CCAAGCTCGG CGAAGATGTC
GCCGGTATCA TGCTGACCAA TCCCAACACG TGTGGCCTGT TCGAGCGGGA CATTCGCGAG
ATTGCCGATT TGATCCACGA GGCCGGCGGG TATTTCTACT GCGATGGTGC GAACTTCAAC
GCCATTGTCG GCCGGGTCCG CCCGGGTGAT CTCGGCGTCG ACGCCATGCA TATCAATCTG
CACAAGACTT TCTCCACCCC GCATGGCGGT GGTGGCCCGG GTTCCGGCCC GACCGTGCTT
TCCGAAGCGC TGGCCCCCTT CGCTCCGGTG CCTTACGTCG TCGCGACGGA GAAGGATGGC
TGGGACCTGG TCGAGCACAT GGACGAGGCC GAGGGCACGC CCTTTGGCCG CATGGCGGCC
TTCCACGGCC AGATGGGCAT GTTCACCCGC GCCCTGACCT ACATGATGAG CCATGGCTCG
GATGGTCTCA AACAGGTCGC CGAGGATGCT GTCCTCAATG CCAATTACAT CCAGGCCCGC
CTCAGCCATG TCATGACCGT GGCCTTCGAG GGCACCTGCA TGCATGAGGC CCTGTTTGAC
GATCGCTTCC TCAAGGATAC GGGCGTCACC ACGCTCGACT TCGCCAAGGC GATGATCGAT
GAGGGCTATC ATCCGATGAC CATGTATTTC CCGCTGGTCG TCCATGGTGC CATGTTGATC
GAGCCGACCG AGACCGAGTC GATGAGCGGT CTGGATCGCT TCATCGAAGT GCTCGATGCG
CTCGCGACCG CGGCCAAGGC CGGTGACACG GGCCGCTTCC TGGCGGCTCC GGTCCACGCC
CCGACCAAGC GTCTCGACGA AACCCGCGCT GCCCGCCAGC CGGTCCTGCG CTGGACGCCG
TCGGCAGACG CGGCTGAATG A
 
Protein sequence
MSMNTQGRPT RMAENIAPDN LATGGYADTI SGSRGLDQAE PLIFERGGMD RCGVDLPEPK 
GLKTRLGGLE RKDAIGLPGL AEPETMRHYV RLSRKNYAID LGLYPLGSCT MKHNPRLNEK
VARMPGFADV HPLQPASTVQ GAYQVMGELA HWLMTLTNMP AVALSPKAGA HGEFCGMMAI
RAKLDADGQT GRRRILVPES AHGTNPATAV QCGFTVDEIP ADKTGRVDME AFKAKLGEDV
AGIMLTNPNT CGLFERDIRE IADLIHEAGG YFYCDGANFN AIVGRVRPGD LGVDAMHINL
HKTFSTPHGG GGPGSGPTVL SEALAPFAPV PYVVATEKDG WDLVEHMDEA EGTPFGRMAA
FHGQMGMFTR ALTYMMSHGS DGLKQVAEDA VLNANYIQAR LSHVMTVAFE GTCMHEALFD
DRFLKDTGVT TLDFAKAMID EGYHPMTMYF PLVVHGAMLI EPTETESMSG LDRFIEVLDA
LATAAKAGDT GRFLAAPVHA PTKRLDETRA ARQPVLRWTP SADAAE
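
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is one way to do that in plain Python, not the method used by the database. It builds the codon map from the compact NCBI-style layout (bases in T, C, A, G order), strips the spacing used in the 10-base blocks above, and stops at the first stop codon. It does not treat alternative start codons specially, which is harmless here because the gene starts with ATG. The helper names `translate_cds` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Amino-acid assignments for codons enumerated in T, C, A, G order
# (third base varies fastest). Table 11 shares these assignments with
# the standard code; it differs only in which start codons are allowed.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the whitespace used for 10-base blocks and uppercase the sequence."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, first 30 bases.
print(translate_cds("ATGTCGATGA ACACCCAAGG ACGCCCCACC"))  # prints "MSMNTQGRPT"
```

Passing the full gene sequence to `translate_cds` should yield the listed protein sequence with the 10-residue spacing removed, and `gc_fraction` gives the value behind the GC content field.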