Gene Mmar10_0535 details

Gene Information

Locus tag: Mmar10_0535
Symbol:
ID: 4285816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 622533
End bp: 625751
Gene Length: 3219 bp
Protein Length: 1072 aa
Translation table: 11
GC content: 67%
IMG OID: 638140000
Product: peptidoglycan binding domain-containing protein
Protein accession: YP_755766
Protein GI: 114569086
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
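
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues encoded by a CDS, not counting the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(622533, 625751)
print(length)                     # 3219
print(protein_length_aa(length))  # 1072
```

Both results match the reported Gene Length (3219 bp) and Protein Length (1072 aa).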


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCTA TGGGTAACCC TTGGAGCGTG AAAGGGATCG ATCCCCGCGC CCGGGCGAGA 
GCCAAGACGG CCGCGCGAAA GGAGGGCATG ACCCTCGGCG AATGGCTCAA TCGCGTCATC
CTGGAAGACA ACGACCCGTC CTCGCCGCAA TGGGATGACG CCCTGGAGGC CTTCCCCGGC
TTTGGTGGCC AGTCCGACCT GAACGAGGAC GAAGACCGCC TGTTGCGGGC CATGGTCAAT
CGCCTGTCCG AGCGCGTCGA CAGCAGCGAG CAGATGTCGG CCCGCACGCT CGGCGGTATC
GACAAGGCGA TCACCCAGCT CGCCGAAAAG ATCACCAAGA CCGGCGAACG CACCAACGCA
CAGCTGGAAA CGGCGCGCGA CAGCATTGCC CGGGTCAAGA AAGCCCAGGA CGAATTGGGC
GACCGTGTGC GCTCGATCGA GACAACCAGC GGAAGCGGCG CCTCTCCCGA TACGGTGAAG
GCGGTCGAGA CCACCATCAT GAAGCTGGCC CGCCGGCTCT ATGAACACGA GAACGACACG
GCGGCCCGCG TCCACGAAGT CGGCGACGAC ACCAAGCGCA TGGCCGAGAG CCTGGAAAGC
CGCCTCGCCC GGCTGGAAAA CCGCGCCGAG GATTTTGCCG ACATCAACAA GCGGCGCGAG
GACCGGACGT CCGAGACCTT GAACGGACTG CATACCTCGA CCGAGATGCT GAAGGCCCGC
ATCGAGGGCT CCGAGCGCAT CACCAATGAT GCGGCTCGCC TTCTGGAGAC ATCCTTTACC
CGGCTGGACG ACCGTCTGCG TCAGCTGGAG ACCCGCAATT CCTCCGATAC CGTCGAACTG
GAACGCCGCT TCGAGCGCCT GACCGATGAT GTCGCCCGCA CCATTGCGGA AACCCGCAAC
CAGGTCAGCC AGGCGCTCGA CAAGGCTGCA GCCGAGCCGC GCGTCGACCG GCTTGAGGAC
GCGCTCGCCA AGGCCCTCGA CCGGATTGAT ACCGCCGAAC GCCGCCAGGG CGACAACATG
TCCCGCCTCG GCGAAGAGAT CACCAAGCTC GCCGGCGCCA TCGACCGCCG CTTGACCGAG
AGCGAGCGCC GCTCGAGCGC TGCCCTGCGC GAAACCCAGT CCGAGCAGAA ACTCGACCGC
CGCCTGGACG AAGTCCGTCA GGAACAAAAG GACTCCATGA AGCGCATGGG CGAAGAGGTC
ACCCGCCTCG GCCGTGCCCT GGGCGAACGC ATCGTCAAGT CCGAAGAACG CGCCAGCGCC
GTCGTCGAGA CAGCCACCGA CCGTATGGCC CAGATGATGG ACCGGCTGGA GCAATCCGGC
CGCGAAGACA ATCTGGATGA CCGCCTGCGC CAGTCCGAAG AGCGCACGGC GCAACGGATT
GAAGAGGCCC TGACCGGCGT CAAGGACCGG ATGAGCGCCG TGAAGGCCGA AACCGAGCAG
GCCTTGTCAC CGGTCCAGCG CGCCATGTCG GCCCTCGCCG ACCGCCTCGA AGCCATTGAA
AGCCGCGGCA AGCCGGACAG CACACCTGCC GAAACCGACA CGCCAACAGA GGCGGCTGCC
AGGCCGGCCC CATCGGACCC GGATATCGAT TTCGACACGC CGCTCTCGCC CCCCCCACAA
GCGGAAACGC CGTCGGGTGG CTTTGAGATT GACGGCGATG ATCCCTTTCT TGCCGAGGCC
GCTCCCGCTG CACCGCTCGC CAGCAAGCCG GCTGCCCCTG TGGCTGCTGT GGCGCCGATC
CTGCGTGAGC AGGCCGCACC GCGACCAGCA CCCGCTCAAG CGCCAACACC GACCCAGGCC
CGCGGCCCCG TACACCCTCC CGCCCAAGCG GACCGTCCCC AGCGCATGGG CGCGACCGCC
GACGCCGATT TCCTCGCGGC GGCCCGCGAG CGCACCCGTG CCGGCATTCC CGGACAGTTC
ACCGAGAGCG CCGTGCACCG CACCCCGCAA TCCGGCCTTG GCCGGACCCT GCTTTACGCC
TTGCCGGTCG TCGCCCTGGT CATGTTGTCC GGTGCCGGCG CCCTGTTGAT CTGGGAAGCC
TGGCAGGGCG ATGGCGAGCG CGTCGCTGCG GAAGCCGCTG CCGAACGCAG TTTCATCGCT
CAGGTTGAAG CCGACCTGTC CGGATCGGGA ACAGCGGGCG CCGCCAGCGC GCAGCCTGCT
GGCGACCCGG TCGATGCCGC GTCCGACACT CCGGCAGAAA CCCTTACGGC TGAAACCCGC
ACGGCTGACG CGTCACCACC CGCACAGACA AGCCGCGCGC CGGCCAACCA GGTTGCCGAC
GCCAGTGGCA TGAGCTTGCC GGCAGCAGCC GCAACGCCTC AGACCGAGCC AGCTGCAGCA
CAAGCGCGCG CACCTGAAGC CGCGCCCCGC ACTCCTGCGC CGGACACCGC GCCGCCCTCG
ACCGGCCCTG CCCGCATGAC GCTGGAGAGC GCCGCCGCCG AAGGAAATCC TGTGGCGCGT
TATCAGCTCG GCGTGCGCGC CCTTGATGCC GGCGACGCCG CCACGGCAGC CATCCTGCTC
CGGCGCGCTG CCGAACAGGG TGTGCCCGCT GCACAATACC GCTTTGGCAA ATTGCTGGAG
ACGGGCGAAG GCGTCGAAAT CAATCTGGAA GACGCCCGCC GCTGGACCGA GCGCGCCGCC
AATGCCGGTC ATCGCCGGGC CATGCACAAT CTCGGTGTGA TGTACTATTA TGGCAGTGGC
GCCGCCCAGA ACATGGAGAC TGCGGCGCGT TGGTTCCAGG AGGCTGCCCT GCTCGGTCTG
CGCGATTCCC AGTTCAATCT GGCGCTGCTC TACGAGACCG GCGACGGCGT TCCGCTCAGC
CTGCCTGATG CCTTTGCCTG GTTCTCGATC GCCGCCAGTG ACAGCGATCC GACCGCCGGC
GAACGCGCCG CAACGCTGGC CGAGATGATT GAGCCGGCTG CCCTTGAAGC GGCACGATCG
ACTGCTGCGG GCTTTAATCC GCGTCCAATC GACGCCGAAG CAAACGGTAT CTATTCCGAC
CTGCCCTGGG AGCGTGTTGC GACCACGGAC ATGGCGACAG TTCGCCGGGC CCAGGGTTTT
CTATCTGTAC TGGGCTATGG CCCGGGTCCG ATTGACGGAG AAATCGGTGG CCGCACCCGC
GACGCCATCA TGCAATTCGA GGCAGACCAG GGTCTGCCCC GCACCGGACG TGTCGATGCT
GTTCTTGTTG AACGGCTCGA GCGCGCCGCT GCGGGTTGA
 
Protein sequence
MSSMGNPWSV KGIDPRARAR AKTAARKEGM TLGEWLNRVI LEDNDPSSPQ WDDALEAFPG 
FGGQSDLNED EDRLLRAMVN RLSERVDSSE QMSARTLGGI DKAITQLAEK ITKTGERTNA
QLETARDSIA RVKKAQDELG DRVRSIETTS GSGASPDTVK AVETTIMKLA RRLYEHENDT
AARVHEVGDD TKRMAESLES RLARLENRAE DFADINKRRE DRTSETLNGL HTSTEMLKAR
IEGSERITND AARLLETSFT RLDDRLRQLE TRNSSDTVEL ERRFERLTDD VARTIAETRN
QVSQALDKAA AEPRVDRLED ALAKALDRID TAERRQGDNM SRLGEEITKL AGAIDRRLTE
SERRSSAALR ETQSEQKLDR RLDEVRQEQK DSMKRMGEEV TRLGRALGER IVKSEERASA
VVETATDRMA QMMDRLEQSG REDNLDDRLR QSEERTAQRI EEALTGVKDR MSAVKAETEQ
ALSPVQRAMS ALADRLEAIE SRGKPDSTPA ETDTPTEAAA RPAPSDPDID FDTPLSPPPQ
AETPSGGFEI DGDDPFLAEA APAAPLASKP AAPVAAVAPI LREQAAPRPA PAQAPTPTQA
RGPVHPPAQA DRPQRMGATA DADFLAAARE RTRAGIPGQF TESAVHRTPQ SGLGRTLLYA
LPVVALVMLS GAGALLIWEA WQGDGERVAA EAAAERSFIA QVEADLSGSG TAGAASAQPA
GDPVDAASDT PAETLTAETR TADASPPAQT SRAPANQVAD ASGMSLPAAA ATPQTEPAAA
QARAPEAAPR TPAPDTAPPS TGPARMTLES AAAEGNPVAR YQLGVRALDA GDAATAAILL
RRAAEQGVPA AQYRFGKLLE TGEGVEINLE DARRWTERAA NAGHRRAMHN LGVMYYYGSG
AAQNMETAAR WFQEAALLGL RDSQFNLALL YETGDGVPLS LPDAFAWFSI AASDSDPTAG
ERAATLAEMI EPAALEAARS TAAGFNPRPI DAEANGIYSD LPWERVATTD MATVRRAQGF
LSVLGYGPGP IDGEIGGRTR DAIMQFEADQ GLPRTGRVDA VLVERLERAA AG
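
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how that could be done, together with a GC-content calculation and a basic sanity check on the coding sequence. All function names are hypothetical, and the start-codon check accepts only ATG, which is a simplification because translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, the sequence starts with ATG
    (simplification: alternative starts are ignored) and ends in a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Usage on the first two blocks of the gene sequence shown above.
fragment = clean_sequence("ATGTCGTCTA TGGGTAACCC")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 50.0
```

Applied to the full gene sequence, `gc_percent` can be compared with the GC content field of the record, and `looks_like_complete_cds` checks that the sequence begins with ATG and ends with the TGA stop codon visible in the last block.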