Gene Mmar10_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1000 
Symbol 
ID4285298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1094529 
End bp1096064 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content64% 
IMG OID638140470 
Productdeoxyribodipyrimidine photolyase-related protein 
Protein accessionYP_756231 
Protein GI114569551 
COG category[R] General function prediction only 
COG ID[COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.760695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGAAT GTGTGATGAC CGACCTGGTA TTGGTGCTGG GCGACCAGCT GAGCCCCGAC 
CTCGCGAGCC TGCGCCAGAC CGCGCCGGGC GAGGCCGATA TCCTGATGGT CGAGGCGATC
AGCGAAACAG GCTATGTCTG GCATCACCGC AAGAAGATCG CCCTGGTCCT CTCCGCCATG
CGCCATTTCG CGGCCGAACT GGAAGCGACG GGACGGCGGG TGGTCTATCA CCGGCTCGAC
CAGGCCAGGC CCTGCACCAG TTTCACCGAC GCGGTCGAGC GGGCGCTGGC GGAGGGGGCG
TACAAGCGCA TCATCCTGAC CGAATCGGGC GAATGGCGGC TGCGCAAGGA GTTCGAGGGC
TGGCGGGACC GTTTCGATGT GCCTGTCGTG ATCGTTGAGG ACGACCGCTT CATCTGCAGC
CATGACCGCT TCCACCGCTG GGCCGAGGGG CGCAAGCAAT TGCGGATGGA GTATTTCTAC
CGCGAGATGC GCCGCGAGAC CGACCTGCTC ATGGACGGCG ACAAGCCGGC CGGCGGGCGC
TGGAATTATG ACGCCGACAA CCGCAAGCCA GCCGATGCCG ACTTGTTCAT GCCGCGCCCG
CACGCCGTCG AGCCGGACGC GGTGACACAG TCGGTGATCG CCCTTGTAGC ATCACGCTTT
CCTGACGGGT TCGGCGATAT CGAGCCGTTT CGCTTCGCCG TGACGCGGGT CGGGGCCGAG
GCGGCGCTGG ATCATTTCAT CGACACCGCC CTCGCCGATT TCGGCCACTA TCAGGACGCC
ATGCTGGCCG GCGAAGCCTT CCTCTATCAC TCGCTGCTGT CGGCCTACAT CAATATCGGC
CTGCTCGACC CGCTTGCCGT CTGTCATCGT GTCGAAGCGG CGTGGCGCGC CGGCAGGGCG
CCGCTGAACG CGGCCGAAGG CTTCATCCGC CAGATCATCG GCTGGCGCGA ATATGTGCGC
GGCATCTATT GGCGCGAAGG ACCGGACTAT GTCCGCCGCA ACGCCCTGAA GGCGACACGC
CCCCTGCCCG AATTCTACTG GACCGGCGAG ACCGACATGG CCTGCCTCGC CGCCACGATC
GACCAGACCC GGCGCGAGGC TTATGCCCAC CATATCCAGC GCCTGATGGT GACCGGCACC
TTTGCCATGA TCGCCGGGAT TGATCCGCAC GCCGTGCACG AATGGTATCT CGCCGTCTAT
ATCGATGCGT TTGAATGGGT CGAGGCGCCC AATGTGATCG GCATGTCGCA ATTCGCCGAT
GGCGGCTTGC TGGCCTCCAA GCCCTACGCC GCCAGCGGCG CCTATATCGA CCGCATGTCG
GACTACTGTT CCGGCTGTCG TTTCAATGTG AAGGACAAGA CCGGGCCAGA CAGCTGCCCG
TTCAATTCAC TCTATTGGGA CTTCCTCGAC CGCAATGCCG ATACCTTGCG CGGCAATCCG
CGGCTCGGAC CGGTCTATCG CAACTGGGAC CGGATGAGCG ACGACAAACG CCAGGCCTAT
CGCGACCGGG CCCGCGACGT GCTCGACACA CTCTAG
 
Protein sequence
MKECVMTDLV LVLGDQLSPD LASLRQTAPG EADILMVEAI SETGYVWHHR KKIALVLSAM 
RHFAAELEAT GRRVVYHRLD QARPCTSFTD AVERALAEGA YKRIILTESG EWRLRKEFEG
WRDRFDVPVV IVEDDRFICS HDRFHRWAEG RKQLRMEYFY REMRRETDLL MDGDKPAGGR
WNYDADNRKP ADADLFMPRP HAVEPDAVTQ SVIALVASRF PDGFGDIEPF RFAVTRVGAE
AALDHFIDTA LADFGHYQDA MLAGEAFLYH SLLSAYINIG LLDPLAVCHR VEAAWRAGRA
PLNAAEGFIR QIIGWREYVR GIYWREGPDY VRRNALKATR PLPEFYWTGE TDMACLAATI
DQTRREAYAH HIQRLMVTGT FAMIAGIDPH AVHEWYLAVY IDAFEWVEAP NVIGMSQFAD
GGLLASKPYA ASGAYIDRMS DYCSGCRFNV KDKTGPDSCP FNSLYWDFLD RNADTLRGNP
RLGPVYRNWD RMSDDKRQAY RDRARDVLDT L