Gene Mmar10_1688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1688 
Symbol 
ID4285691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1855701 
End bp1856966 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content65% 
IMG OID638141176 
ProductO-antigen polymerase 
Protein accessionYP_756918 
Protein GI114570238 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0910283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.076767 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCG CCGCGCACCA TGATCCGGCT TTCCTCGCTG CCTGGCGCCG CGCACCGGCA 
CTACCTTTCA CCGGACTGGT TGAAGGTGGG CTGACGCTTC TTTGCCTCTT CCTGTTCTCG
CAAGGCCTGA TCGGCCCGCT CTTCGCCGAT CCCGCTGACC CGGACAGCTC AGTTGTGCTG
CGCCTCATCT GGCTGCCAGT CTATGCCATC ACGCTGGCGC TCGCGGTGAC GCGGCCGGGG
GCCGTGATGC GGACGCTGAC AGGCAATTGG TTGATGGTGG CGCTGGTCCT GTTGACTGCG
GTCTCCGTCA TCTGGTCGAT CGCGCCAGAG ACGACCCTGC GCCGTTCCTT CGCCCTGATC
ATGACCACCC TGTTCGGCTT CTGGATGGCC GCGCGCTGGT CCTGGCGCGG CCTGATCCTT
CTGACTGCGA CAACCTTTGT CGGCCTCGCT GTTGTGTCGA CACTGATGGC GCTGGCCATG
CCGTCCCTGG GTGTTGACCA CGAAGTCCAT GCCGGCGCCT GGAAGGGCGT GTGGTGGGAG
AAGAATACGC TGGGCGCCAT GATGGCCTGG GGGGCGGTGG CCTGTTTTGC GGCGCTGCAT
GTCGATCCCC GACGGCGCTG GATCTGGATG GGCGGGGCGA TCCTGTGTTG CGCCCTGGTC
CTGCTGTCGA CCTCCAAGAC CGCGCTTCTC GCGCTGCTAC TCGGGATTGG CGGCGCTGTC
GGGATCGCGC TGTGCCGGCG CGGCTTCGGC TTTGCCAGCC TGATGCTGTT TCTCGGCCTG
ACCGGCGCGG TTGGTGGTGC CCTGATCCTG TTAATAGCAC CCGTTGAGTT CCTGGAGCTC
CTTGGTCGCG ATGCCACGTT GACCGGCCGG ACGGATATCT GGGCCATCCT GTCGCGCCAG
GCCGCGGAAG TTCCGTGGAC GGGCTATGGC TATATGGCCT TCTGGGCCGA TGAGACAGGG
CCGGTTTACT GGGTCAGGCA GGGCACGGAC TGGCCGGTCC CGACTGCCCA TAATGGCTGG
ATCGAGACGG CGCTGGCGAT CGGCCTGCCG GGTGTCGTGC TGTTGGGTCT GGTCTATGGG
CGCGCAGTAA TGCGGTCCCT TGGACGTTTG TTCCATGGGC CGGAAACCTA TTGGACACTG
ACTTTTCTGG CCATGCTGGG CCTGGTCAGC ATCTCTGAAT CCAACTTCCT TCAGCAGAAT
TCGATCGGCT GGGTCCTGCT CGTCGCGACG GCTGCCAAGT TGGCAGACCG ACGGGCCGGC
GACTAG
 
Protein sequence
MSLAAHHDPA FLAAWRRAPA LPFTGLVEGG LTLLCLFLFS QGLIGPLFAD PADPDSSVVL 
RLIWLPVYAI TLALAVTRPG AVMRTLTGNW LMVALVLLTA VSVIWSIAPE TTLRRSFALI
MTTLFGFWMA ARWSWRGLIL LTATTFVGLA VVSTLMALAM PSLGVDHEVH AGAWKGVWWE
KNTLGAMMAW GAVACFAALH VDPRRRWIWM GGAILCCALV LLSTSKTALL ALLLGIGGAV
GIALCRRGFG FASLMLFLGL TGAVGGALIL LIAPVEFLEL LGRDATLTGR TDIWAILSRQ
AAEVPWTGYG YMAFWADETG PVYWVRQGTD WPVPTAHNGW IETALAIGLP GVVLLGLVYG
RAVMRSLGRL FHGPETYWTL TFLAMLGLVS ISESNFLQQN SIGWVLLVAT AAKLADRRAG
D