Gene Rmar_1037 details

Gene Information

Locus tag: Rmar_1037
Symbol:
ID: 8567678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodothermus marinus DSM 4252
Kingdom: Bacteria
Replicon accession: NC_013501
Strand:
Start bp: 1185077
End bp: 1186570
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: peptidase M20
Protein accession: YP_003290318
Protein GI: 268316599
COG category:
COG ID:
TIGRFAM ID:
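
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1185077, 1186570)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

Under these assumptions the results agree with the Gene Length (1494 bp) and Protein Length (497 aa) fields listed in the record.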


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGATGA TGCGCAAGCC GCTGCTGAAA GTCCTGATCG GAGCACTCGG GCTGTTGCTC 
GTGCTGATCG TCGTGCTGCT GGTGCGGGCC TGGCGGGTTG GGCAGCAGGT TGAATCGACC
GAAAATCTGG AGCCGCTGCA GCTCACGCTC GATGCGGAGG CGCTGGCGCA GCGGCTGGCC
GGTGCACTCC GGTTTCCCAC CGTATCCAAT CAGGATCCGG CGCGCATCGA CAGCAGTGCG
TTTCGGGCAC TGCACACCTA CCTGAAAGAA AATTTTCCGC AGGTACACGC CCATCTCCGT
CGGGAGATCA TCGGTGGGCT GAGCCTGCTT TACACCTGGC CGGGACAGGA CACGACGCTG
CCGGCTGTGG TCTTCATGGG GCATCAGGAC GTGGTGCCGA TTGCCACGCC GGAAGCCTGG
ACACACCCGC CGTTCGGCGG CGTGGTGGCC GACGGGTTCG TCTGGGGACG TGGAGCGCTG
GACGACAAGA TCGGCGTGCT GGGCGTGCTG GAGGCCGTCG AGCACCTGCT GGCCGACGGA
TTCCGGCCCG TGCGAACGGT CTATCTGGCC TTCGGGCACG ACGAAGAAGT GGGCGGGCGG
CACGGCGCCC GGCAAATCGC CGAGCGGCTG GCGGCGCGAG GCGTCCGGCT GATCGCCGTC
GTGGACGAAG GCGGCTTCGT GGTGGACGGC GTCATTCCGG GCATGACGCG GCCGGTGGCG
CTGGTGGGCG TGGCCGAGAA GGGCTACGTG AGTCTGGAGC TGACGGCCAC GGCGCCGGGT
GGACATTCCT CGACGCCGCC CACGCAGACG GCCATCGGGA CGCTCAGCCG GGCCATCGTG
ACGCTGGAGG ACAACCCCTT TCCGGCACGA CTCGACGGAC CCACCCGGGG ACTGCTGGAA
CGGCTGGCGC CTTACGTCAC CTTCGGACCG CGCGTGGTGC TGGCCAACCT GTGGCTTTTC
GGACCGGTGG TGAAATGGAT GCTGGCCCGC TCGCCGGCCG GCAACGCCAG CCTGCGCACG
ACGACCGCGC CGACCATCTT CGAGGCGGGC GTCAAAGAGA ACGTACTGCC GACGCAGGCC
CGGGCCGTGG TAAACTTCCG GATCTACCCG GGCGAAACGG CCGAAAGCGT GGAGCAGCGC
GTGCGGACAC TGCTCGAAGA CCTGCCGTTG CAGGTGCGCC GGCTCGAAGA GACGGTCACC
GACCCGTCGC CGGTCTCCGA TTTCGAGGGC GAGGCGTTCC GGCGGGTGGT GGCCGCCATC
CGACAGGCAC GGGCCGACGC GCCGCCCGTT GTGGCGCCCT ATCTGGTGCC GGGCGCCACA
GACGCCCGCT ACTTCACGGC ACTGAGCCCG AACGTGTATC GGTTCATCGG CGCGCAGATC
ACGCCCGAAC TGCTCGCCAC CATCCACGGG GTGGACGAAC GCGTTGCGGT GGACGAATAC
GTGCAGGCCG TCCGCACCTA CTACGCGTTG ATCCGCGCGC TGAGCGGCCC CTGA
 
Protein sequence
MRMMRKPLLK VLIGALGLLL VLIVVLLVRA WRVGQQVEST ENLEPLQLTL DAEALAQRLA 
GALRFPTVSN QDPARIDSSA FRALHTYLKE NFPQVHAHLR REIIGGLSLL YTWPGQDTTL
PAVVFMGHQD VVPIATPEAW THPPFGGVVA DGFVWGRGAL DDKIGVLGVL EAVEHLLADG
FRPVRTVYLA FGHDEEVGGR HGARQIAERL AARGVRLIAV VDEGGFVVDG VIPGMTRPVA
LVGVAEKGYV SLELTATAPG GHSSTPPTQT AIGTLSRAIV TLEDNPFPAR LDGPTRGLLE
RLAPYVTFGP RVVLANLWLF GPVVKWMLAR SPAGNASLRT TTAPTIFEAG VKENVLPTQA
RAVVNFRIYP GETAESVEQR VRTLLEDLPL QVRRLEETVT DPSPVSDFEG EAFRRVVAAI
RQARADAPPV VAPYLVPGAT DARYFTALSP NVYRFIGAQI TPELLATIHG VDERVAVDEY
VQAVRTYYAL IRALSGP