Gene Rmar_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1823 
Symbol 
ID8568475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2131147 
End bp2132817 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content64% 
IMG OID 
Productcarboxyl-terminal protease 
Protein accessionYP_003291094 
Protein GI268317375 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0958668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAT CACTGCGTTA CACCCTCCCG GCCATTCTGC TGCTGGCGCT GGGCATTCTG 
CTGGGCTGGA ATCTGCAACA GGCCGTTTCC GACACCGACA CGCTGGCCAG CCTGCGCAAG
CTCGAAGAAG CCTTTCTGAC GATCACGCAG CGCTACGTCG ATCCGGTCGA GCCCGAACCG
CTGGCCGAGG AGGCCATCCG GTCCATGCTC CAGGAGCTGG ACCCCCACTC CGTGTACATC
ACCGCCGAGG AAATGAAGGA ACTCCGGGAA AGCTACCAGG GCTCCTTCGG CGGGATCGGG
ATCTGGTTCG AGGTGGTGGA CGACACGGCC CGCGTGGTGG CCACCATCAG CGGCGGGCCC
AGCGAGGCGG TCGGACTCCA ACCCGGCGAT CGGATCATCA AAATCGAAGA CTCCAGCGCC
GTGGGCCTTT CCTCGACGGA AATTCAGAAG CGGCTTAAAG GTCCGGAAGG CACCAAAGTC
CGGGTAACCA TTCGCCGGCT GGGCGTCCGC GAGCCCCTGG AGTTTACGAT CACGCGCGAC
CGCATTCCGC TCTACACGGT CGATGCCGCC TACATGCTCG ACGAGCGGAC CGGCTACATC
CGCATCAGCC GCTTTGCCAT GACCACCTAC GATGAATTCC TGGAGCACCT AGACCGCCTC
AAGCGCCAGG GCATGGAGCG GCTGGTGCTG GACCTGCGCG GCAATCCGGG CGGCATCATG
GAAGCGGCCG TGGAGCTGGT CGATGAACTG TTGCCCGAAG GCTACACGAT CGTCTACACG
CGCGGGCGCG TCGCTCAGGC GGAAATGACC CGTCGCTCCA CCTCGGGCGG CCGCTTCGAG
ACGCAGCCGG TCATCGTACT GGTCGATCGC AATTCGGCCT CGGCCAGCGA GATCGTGGCC
GGCGCGCTGC AGGACAACGA CCGGGCCCTG ATCGTGGGGC TTCGCACCTT CGGGAAAGGG
CTGGTGCAGA ACCAGTTTCC GCTCTCCGAC GGCAGCGTCA TCCAGCTGAC GGTCGCCCGC
TACTACACGC CCTCGGGTCG CCTGATTCAG ACGCCCTACC ACGGCGGTGA CCTGGAGGAC
TACTACCGGG AAAAGTTCGC CGACTACGAA ACGGCCGTCT TCCATCCGGA GGATTACATC
AACGAGATCC CGGACTCGCT GAAGTTCAAG ACGGTGCACG GCCGCACGGT CTTCGGCGGC
GGTGGCATTC TGCCCGATGT GATCGTTCCG CCCGACACGA ACTCGATCCT GCTGGAAGTC
AGCCGTCGCA ACCTGCCCTC CACCTTCGTC CGCACCTGGT TCAATCAGCA TGAACAGGCC
ATCCGCGCGC AGTGGAACAA CCGGAAGGAC GCCTTTCTGG CCTCGTTCGA AGTGGACGAC
ACGCTGTGGC AGGCCTTCCT GGACTACGCC CGGGAGCAGG GCCTCTTTGC GGCCGATTCT
GCCGCGACGC CTCGCTTCAC GGTCGCACAG GCCGAAGCGC ACCGGCACGA ACTGAGCACG
CTGCTGCAAG CCTATCTGGC CTGGCAACTG TTCGGCCGTG AGGCGTCAAT CCCGCTGTTC
AACGAAATCG ATCCCGTACT GCACGAAGCG CTCAAGCACT GGGACCGGGC CGAGGCGCTG
GCCGCCTATT TCGCCCCGAA AGCGGGCGAC ACGGTACGCA AAGGGCGTTA G
 
Protein sequence
MKKSLRYTLP AILLLALGIL LGWNLQQAVS DTDTLASLRK LEEAFLTITQ RYVDPVEPEP 
LAEEAIRSML QELDPHSVYI TAEEMKELRE SYQGSFGGIG IWFEVVDDTA RVVATISGGP
SEAVGLQPGD RIIKIEDSSA VGLSSTEIQK RLKGPEGTKV RVTIRRLGVR EPLEFTITRD
RIPLYTVDAA YMLDERTGYI RISRFAMTTY DEFLEHLDRL KRQGMERLVL DLRGNPGGIM
EAAVELVDEL LPEGYTIVYT RGRVAQAEMT RRSTSGGRFE TQPVIVLVDR NSASASEIVA
GALQDNDRAL IVGLRTFGKG LVQNQFPLSD GSVIQLTVAR YYTPSGRLIQ TPYHGGDLED
YYREKFADYE TAVFHPEDYI NEIPDSLKFK TVHGRTVFGG GGILPDVIVP PDTNSILLEV
SRRNLPSTFV RTWFNQHEQA IRAQWNNRKD AFLASFEVDD TLWQAFLDYA REQGLFAADS
AATPRFTVAQ AEAHRHELST LLQAYLAWQL FGREASIPLF NEIDPVLHEA LKHWDRAEAL
AAYFAPKAGD TVRKGR