Gene Mmar10_1914 details

Gene Information

Locus tag: Mmar10_1914
Symbol:
ID: 4286432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 2094437
End bp: 2095711
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 62%
IMG OID: 638141414
Product: putative deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_757144
Protein GI: 114570464
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
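
The coordinate and length fields above can be cross-checked against each other with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon, neither of which the record states explicitly. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 2094437
end_bp = 2095711
gene_length_bp = 1275
protein_length_aa = 424

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
# Assumption: the last codon is a stop codon and contributes no residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_residues = codons - 1

print(span_bp, codons, remainder, expected_residues)  # 1275 425 0 424
```

Under those assumptions the span matches the stated gene length, and 425 codons minus one stop codon gives the stated 424 aa.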


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTCGG CCGTACGGCC TTTGGATTGT GTCGTCTTGG TCTTATATGG ACCTCCCCCC 
GCGCGGATAA AAGGCCCTGC TGACATGACC ATTGCCGCGA CCGAGCGTAC CCGTGCCCCC
TATGCCTGCC ATCCAGCCCA ATCACGCGGG CGCCGTTTTC CGCAGGCTGA CAGCGCCATG
CGCAATGCAT TCCAGCGCGA TCGTGACCGA GTCATACACT CGGCGGCCTT TCGCCGGCTG
AAGGGAAAGA CCCAGGTATT TGTGGCCCAT GAGGGCGATC TCTATCGCAC CCGTCTGACC
CATTCCCTGG AAGTGTCACA GATCGCCCGC ACGCTGGCGC GGGCCTTGCG CGGCGATGAG
GACCTGGCCG AGGCGCTCGC TCTCGCCCAT GATCTGGGGC ATCCGCCGTT CGGGCATGAG
GGCGAGCGTG AGCTGGCCCT GAAGATGAAG GATTTCGGCG GCTTTGATCA CAACGCCCAG
ACCCTGCGCG CGATCACCAA GCTGGAAGTG CGCTATCCCG AGTTCGATGG CCTCAACCTG
ACCTGGGAAA CCCTGGAAGG TGTCGTCAAG CATAATGGCC CCTTGCTCGG GCCGGGGCAG
ACGGAAGCCG ATCTGCCCTG GGCTTTCACT GACTATGAGG GCTGGCGAGA CCTCGAATTC
GAAACCCATG CCGGCCTCGA GGCCCAGATC GCGGCGCTGG CAGATGATAT CGCCTACAAT
AATCACGACA TTGATGACGG GTTGAGTTCC GGCCTGCTGG AAATCGAGCC TCTGCTCGAG
CTGCCGCTGG TCGGCGATAT ATTCCGCCGG GTCCGGGAGC GGTGGCCGGA CAAGCCGCAA
ACCATCATTA TCCATGAAGC GGTGCGCGAA CTGATCGGCG TCATGGTGGC GGATGTCCTC
GCAGAATCCG GCAAACGGCT TGATCGGGCC CGTCCCGACA GCGCCCAGGC CCTGCGTGAG
CTGGATCACC CGGTGGTGGC TTTTTCCGAG GAAATGGTGC TGCATCTCGC CGCGCTGCGC
CGCCATCTCT TTGCCCACAT GTATCGGCAC TACAAGGTCA ACCGGATGAT GAGCCAGGCG
CGCCGGGTGA CCGGCGAACT GTTTGACCTG TATCTGGCCG ATCCGGGTGT CTTGCCCAGC
GATGTGCAGG CAGGCATGAC AGGTGCCGGT ACCGCGCAGA CAGCGCGCGC GGTTTGCGAC
TATATCGCCG GCATGACGGA TCGCTTTGCA GTGGAAGAGC ACAGACGGCT TTTCACCGTG
CAGGGGTATT TCTAG
 
Protein sequence
MLSAVRPLDC VVLVLYGPPP ARIKGPADMT IAATERTRAP YACHPAQSRG RRFPQADSAM 
RNAFQRDRDR VIHSAAFRRL KGKTQVFVAH EGDLYRTRLT HSLEVSQIAR TLARALRGDE
DLAEALALAH DLGHPPFGHE GERELALKMK DFGGFDHNAQ TLRAITKLEV RYPEFDGLNL
TWETLEGVVK HNGPLLGPGQ TEADLPWAFT DYEGWRDLEF ETHAGLEAQI AALADDIAYN
NHDIDDGLSS GLLEIEPLLE LPLVGDIFRR VRERWPDKPQ TIIIHEAVRE LIGVMVADVL
AESGKRLDRA RPDSAQALRE LDHPVVAFSE EMVLHLAALR RHLFAHMYRH YKVNRMMSQA
RRVTGELFDL YLADPGVLPS DVQAGMTGAG TAQTARAVCD YIAGMTDRFA VEEHRRLFTV
QGYF
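
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling, of how one might strip the spacing, compute a GC fraction, and translate a CDS. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and the final 15 bases of the gene sequence.
head = clean("ATGCTGTCGG CCGTACGGCC TTTGGATTGT")
tail = clean("CAGGGGTATT TCTAG")
print(translate(head))  # MLSAVRPLDC
print(translate(tail))  # QGYF
```

Both fragments agree with the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would be one way to compare against the stated GC content.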