Gene Mmar10_1105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1105 
Symbol 
ID4284279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1207504 
End bp1209402 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content64% 
IMG OID638140583 
Productglycosyl transferase family protein 
Protein accessionYP_756336 
Protein GI114569656 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.293938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.766721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGCA AGGATCGTCA GGACAGATCC GGACCGGGCA AATCCGCCCG GCCCGCTGAT 
TTCTTCCGGT CCGTCGACAG CGCGCCGAAG GGTATCAGCG TCAAGCGCGA CCGGTCGGTC
ACCGACGGAT TCCTGATCAC CTTCCTGGCG CTGACCATTC TCGCCCTGTC GGCCGCGGCC
TGGGCCAATG CCCGCTGGCC GGGCAGCGTT CCGACCGACC TCATTTTCTT CACCGCCGAC
CGCTTCAACG CCGTCCTGCC AATCCGCATC TTCCTGGTGA TCTTCTACAT CAGCTATGCC
GCCTATGCGC ACGGCCCGGT GCTGGCGCGG CTGCGGCTCG GCTTCAGCTT CCTGATCAAG
CTGGCGGGCA TAGCGGCCAT AGTCGACGGT GCGGCCATGC TGTCCTGGCA GCAGGCCGAC
GCAGTCTGGC CGGTCCACGT CCAGCAAATC CTGGTCGGAC TGTCCGGACT GGCGATCTTC
CCGCACACCG TGCTCAACCA GGCGCGACTG CCGGGCCCGT GCGGGTCACC GGTGCGCCGC
CGCGGTCGCT ATCACGAATA CTGGTTGATC AGCGTGGCTG CGCTGACGTC GGCGATCGGC
GCGGTCGTGG CCCTGACACT GTATGGCGGC GAAGTGGAGC AATTGCGCAA TCTGGCGCTG
TTGGGCGGCA TGGGGCCGGG CGTTTTCCTC GCCCAGCAAT TCTTCACCGT GCAACTGGCC
TCGCTCGGCT GGATGCGCAA CACGCTGTCA CGGCGCCGGT CCTTCTCGCC GCCAGTGGCC
GTCCTGATCC CCGCCCACAA TGAATCCCAT CTCATCGGCC AGACGATTGA CGCCATTGAC
GAGGCCGCCG CCCATTATGA CGGGCCGGTG CGCATCCTGC TGATGGAGAA CTGCTCCAGC
GACGACACGG CCGACGTGGC CCGTCAGGCA ATCGCCAAAT GCCGTTGTGC CCGGGGCGAG
GTGATCGAGA GCATCATTCC CGGCAAGGCC AAGGCGCTCA ATCACGGTCT TGAGCTGATC
AGCGAGGACT ATGTGGTCCG CATCGATGCC GATACCCAGA TCGACCCGCA ATCCCTGCGG
CTGGCGATGC GCCATTTTGC CAACGAGACC GTCGGCACGG TCGGCGGCCT GCCACTGCCA
CTCAAGCGCA CGGGGCTGCT GGACAAGTTC CGCACCATCG AAGTGCTCAA CCGCCACGGC
TATTTCCAGG TCGCGCTGGG TGCCTTCAAC GGCATTCTGG GCATTCCGGG CATGTTCTGC
ATCTATCGGC GTGACGTGCT GATGGAGGCC GGCGGCATTG TCGAGGAAAT GAATGGCGAG
GACACCGACA TTGTCCTGCG CATGACCAAT CTGGGCTATC GCGCCGTTTC CGATCCGCGG
GTCAAATTCC GTACCGAAGT GCCCGACAGC ATGGAATTCC TGCGCGAGCA GCGGACCCGC
TGGTTCCGCA GCCTCTATCA TGTCACCGCG CACAATCGCG AAATGCTGTT CCAGGGCAAT
CTGATCACCG GGGCGGTGGT GCTGCCCTTC ACGCTGATGA ATGGCGCGCG CCGGGCGATG
ATGGCGCCGC TGGCGATCTA TGGTGTCGTG CTGTTCTTCA TGTTCGGAGG GATTTACGAC
CACCCGCATC TGACCACGGT GCTGGCGGTC ATGTTCGGCA TGCCCTTCAT CATGGCCTGC
GCGGTGGTCG CCTTCTGGCG CCGCCCGGAC CTGATCCTCT ACATGCCGGC CTATATGGCC
TTCCGGCTGC TGCGCTCCTA CTACACGCTG GGCGCAACGC TGACGCTGGT CTATCCGGGC
TCGTCAAAGG ATCGTCGTTT CGATCCGCCC AAGCCGCTGG CCGTGCCGGC GGAAGAGCGG
GTCGGGCTGG TCGAGAAGCC ACCGATCATC GCCGAGTAG
 
Protein sequence
MAGKDRQDRS GPGKSARPAD FFRSVDSAPK GISVKRDRSV TDGFLITFLA LTILALSAAA 
WANARWPGSV PTDLIFFTAD RFNAVLPIRI FLVIFYISYA AYAHGPVLAR LRLGFSFLIK
LAGIAAIVDG AAMLSWQQAD AVWPVHVQQI LVGLSGLAIF PHTVLNQARL PGPCGSPVRR
RGRYHEYWLI SVAALTSAIG AVVALTLYGG EVEQLRNLAL LGGMGPGVFL AQQFFTVQLA
SLGWMRNTLS RRRSFSPPVA VLIPAHNESH LIGQTIDAID EAAAHYDGPV RILLMENCSS
DDTADVARQA IAKCRCARGE VIESIIPGKA KALNHGLELI SEDYVVRIDA DTQIDPQSLR
LAMRHFANET VGTVGGLPLP LKRTGLLDKF RTIEVLNRHG YFQVALGAFN GILGIPGMFC
IYRRDVLMEA GGIVEEMNGE DTDIVLRMTN LGYRAVSDPR VKFRTEVPDS MEFLREQRTR
WFRSLYHVTA HNREMLFQGN LITGAVVLPF TLMNGARRAM MAPLAIYGVV LFFMFGGIYD
HPHLTTVLAV MFGMPFIMAC AVVAFWRRPD LILYMPAYMA FRLLRSYYTL GATLTLVYPG
SSKDRRFDPP KPLAVPAEER VGLVEKPPII AE