Gene Mmar10_1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1970 
Symbol 
ID4284678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2151468 
End bp2152406 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content67% 
IMG OID638141470 
Productspore coat polysaccharide biosynthesis protein glycosyltransferase-like protein 
Protein accessionYP_757200 
Protein GI114570520 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3980] Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.750902 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGCC CCTTCAAAAT CGCCATTCGG GCGGATGGCT CGAAGCTGAT TGGCCTCGGA 
CACGTCATGC GCTGCGGCGC GCTGGCCAAT GCGCTGGCTG AGATCGGTGC CGGAATTGCC
TGGTTGACGA CAACGCCTCA GCATTTGCCC GCCGGCCTGT CGCATGCGGT CGAGCCCGTG
CAACTGGACA ATGACGAACA GCTGGCCGAC GCGCTGACCG CCCGCAACAT CCACCATCTT
GTCGCCGACT GGCACCGCAC CGACCCGCAG CGTGTCCACA GCCTGAGGGC AAACGGCGTG
CACGTCAGCC TGGTCGGCAA TTTCCTGCAG GATGCAGTCC CTGACCTGCA TATCAGGCAA
GGCTTCCTGC CGGGCATGTC GCCATCCGGG GCGCCAACCT TGAGTGGCCC GAAATACCTG
TTGCTTCCCG CCTCATGCGA GGCGCTGCCG CCACGCCCCG TCGCGGCGAC AGCCCGGCGG
GTCCTGCTGT CGCTGGGCGG CACTGACAGC CCGCTTCTGG CACGCATCCG GGACCGCCTG
GCGCAAGGCT TTCCGGCAAT CGAGGTCGAT GGGCGCGGCC CCGTCGGCAA TGGTCCGATC
CCGCCTCTGA CCGAAGCCAT GCGAAGAGCC GATATCGGAA TTCTGGCTGG CGGAACGAGC
TTGCACGAGG CGGCAGCGAC CGGCCTGCCG AGCCTGTGTC TGCCTATCGC CGCCAATCAG
TTCGAGCGGG CCGGTCATTT TGAAAGCGCG GGCCTCGGCA TCAGTCTGGA TCCGGCAGAC
CCCGGTTTCG ACCAGCAGTT CGACACGGCA CTGGCCAGGC TTGTTTCAGA TCAAGCCGGA
CGACAGGACA TGGCCCGAAC CGGCCAGGCC CTGGTGGACG GCGGCGGAGC CCGACGCGTC
GCCACCCACC TGGCCGCCCT CATCACCGCC GGGACCTGA
 
Protein sequence
MARPFKIAIR ADGSKLIGLG HVMRCGALAN ALAEIGAGIA WLTTTPQHLP AGLSHAVEPV 
QLDNDEQLAD ALTARNIHHL VADWHRTDPQ RVHSLRANGV HVSLVGNFLQ DAVPDLHIRQ
GFLPGMSPSG APTLSGPKYL LLPASCEALP PRPVAATARR VLLSLGGTDS PLLARIRDRL
AQGFPAIEVD GRGPVGNGPI PPLTEAMRRA DIGILAGGTS LHEAAATGLP SLCLPIAANQ
FERAGHFESA GLGISLDPAD PGFDQQFDTA LARLVSDQAG RQDMARTGQA LVDGGGARRV
ATHLAALITA GT