Gene Mmar10_1681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1681 
Symbol 
ID4283862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1847626 
End bp1849203 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content65% 
IMG OID638141169 
Productsugar transferase 
Protein accessionYP_756911 
Protein GI114570231 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.895795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0521671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCGA GACCGATGGG GACTGCCGTG GATTATTCCA ACCAGCCAAG CCAGACCGAG 
CACGCGCCCC GCAAAGCGTC GGGGCCGGCC AATGACATGC CCGACACGCT GCGCCCGGCC
AATCCGGACA CAGCCAGCCT GACGCGCGCC CATCTGAAAG CAGACGCGCG CGCCCCCCGC
AACCGGGTCA ATCGAAAGGC GCTCGGTCAC ATCTGGCAGA GCATCGACAT ATTCGCCGTG
CTGCTCCTGA CCACGACCGG CGCCTATGCG CTGTCAGGGG GGGACGTCAT CGCCGTGGCC
GCCGGGGAGT TGTTTCCCCT TCTCGCCTTC GCGGCCCTTT GCCCGGCTTT CACCCTGTTG
ATGGGGTTGT ACAAGGTGGA AGCGCGAGAA AGCGCCGCCT TTCGCATGTT GCGCGCCGTC
ATTGCGACCG CGCTGACCGG CAGCGCGATT ACTGCATTGT CCCTCATTAC AGCGCCCGAC
ATGGCACCGC AGATCGCCAC ATTCGCCCTC ACGGCCGTGG GTGCACTGAC CCTCCTGCAT
GTCATCTATG CCGGCTTTGT CCAGCACTGG GCCCGGTCGG GCCGGCTGGC CCGCAATGTT
GTTCTCGTCG GAGCAACGGC CAATGCCAGC AAGCTGATCA AGGCCAATGC CGGGTCTGGG
ACCGTCAATG TTGTCGGCAT TTTCGACGAC CGCGCCGCCC GCAGCCCCCA GGCACTGGCT
GGAGCGCCCT ATCTGGGAAC AACCGACGAC CTTTTGAGCT GGTCGCTGCT GCACGAGGTC
GACCGCATCA TCCTGACGGT AACACCGAAG GCAGAAGACC GTGTCCGCTT GCTGCTGGGC
AAGCTGCGCG CCCTGCCGCA CACGGTCTGT CTCCTGCTTG ACCTCGACAG TTTCGATCCG
GCCGAAACCA CGCTCGATGA TATTATCGGC GTCCAGGCGG CGCGGATGAG CGGCGTCGAG
GAACGCTTCG GACATGATCT GGCCAAGCGA ACACAGGATA TCGTCCTGGC CCTCGGATTG
AGCCTCGTCG CCCTGCCCGT CATGGCACTG ATCGCCCTGG CGGTCCGTCT GGGCAGCCCC
GGTCCGGTCC TGTTCCGCCA GGTCCGTGAA GGCTTCAACG GCCGCCCGAT CAAGGTCCTG
AAATTCCGGA CCATGCGTCA CGACCCGGCG TCGGCCGCCA AGCCGATGCG TCAGGTCGAA
CTCGATGATC CGCGCGTGAC ACGAATCGGC GGTTTCCTGC GCAAGACAAG CCTCGATGAG
CTGCCGCAAT TGTGGAATGT CCTGGTCGGT GAGATGTCGC TGGTCGGTCC GCGCCCACAC
GCGCCGGGCA TGCGGACAGG TGGCACCGAA ACAGCCAAGC TGGTTGCCGA ATATGCCCAT
CGCCATCGCG TCAAGCCGGG CATCACAGGC TGGGCCCAGA TCAACGGATC GCGCGGCCCC
TTGCACTCAC CGGAAGCCGC CCGCGAGCGG GTGGCCTATG ACGTCGCCTA TATCGCCAAG
GCCAATTTCT GGTTCGATCT GTGGATCATG GCCCGCACCC TGCCGGCCCT GCTCGGTGAC
AAGGCCAATA TTCGCTGA
 
Protein sequence
MSARPMGTAV DYSNQPSQTE HAPRKASGPA NDMPDTLRPA NPDTASLTRA HLKADARAPR 
NRVNRKALGH IWQSIDIFAV LLLTTTGAYA LSGGDVIAVA AGELFPLLAF AALCPAFTLL
MGLYKVEARE SAAFRMLRAV IATALTGSAI TALSLITAPD MAPQIATFAL TAVGALTLLH
VIYAGFVQHW ARSGRLARNV VLVGATANAS KLIKANAGSG TVNVVGIFDD RAARSPQALA
GAPYLGTTDD LLSWSLLHEV DRIILTVTPK AEDRVRLLLG KLRALPHTVC LLLDLDSFDP
AETTLDDIIG VQAARMSGVE ERFGHDLAKR TQDIVLALGL SLVALPVMAL IALAVRLGSP
GPVLFRQVRE GFNGRPIKVL KFRTMRHDPA SAAKPMRQVE LDDPRVTRIG GFLRKTSLDE
LPQLWNVLVG EMSLVGPRPH APGMRTGGTE TAKLVAEYAH RHRVKPGITG WAQINGSRGP
LHSPEAARER VAYDVAYIAK ANFWFDLWIM ARTLPALLGD KANIR