Gene Rmar_2310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_2310 
Symbol 
ID8568975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2675760 
End bp2678963 
Gene Length3204 bp 
Protein Length1067 aa 
Translation table11 
GC content66% 
IMG OID 
Productpeptidase S41 
Protein accessionYP_003291577 
Protein GI268317858 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAGCC TGCTCGTTCT CCTGCTCGCG CTGCTGACGT TCAATCCTCG ACCGTTTCCT 
CGCTACCCTG CCATTGATCC ATCCGGACAA CAGATCGTCT TTTCCTATCA GGGTGATCTG
TGGCTGGTCC CGGTCACCGG CGGCATGGCC CAGCGCCTGA CCGTCCACCC GGCCTATGAG
GCCTACCCGC GCTGGAGTCC GGACGGCCGC CGCATTGCCT TCACCAGCGA TCGCTACGGC
CACGACGACC TGTTCGTCAT GGAACTGTTC GGCAGCCCCC CGCGCCGCCT CACCTACCAT
TCGGCCGACG ACATCCTGAC CGACTGGACA CGCGACGGCC GTCTGCTTTT CCAGACGCGA
CGCCTGTTCG TCCAGGTCGA GCGTGAGGCC GAAATCCACG CCATCCCCGA CACCGGCGGC
ACGCCGGTGC GCATTCTCGA CGCCACGGGC TTTCAGGCCA CGCTCTCGCC GGACGGCCGC
TTTCTGGCCT TCGAGCGCGG CTCCAACGCC ACCTGGCGCC AGGGCTACCG CGGTTCGGCC
GACCGCGACC TCTGGCTCTT CGATCTGCAG AACCACACTT TCCGTCGCCT GACCGACTTC
GACGGCAACG ATTACCTGCC GGCCTGGGCC GGTCCGCGTA CGCTCCTGTT CATCAGCGAA
CGCGACGGCA CCTACAACCT GTACCGCCTG CCTCTGAACG AAGACGGCAC GCCGCAGGGC
GCTCCGGAGC AACTCACGCA CTTTGAGGGG GACGGTGTCC GCTACTTCAC GGTCAGCGCC
GACGGCCGGA CCATTGCCTT CGAGCGCCAG ACCGACCTCT ACGTGCTCAC GCTTCCCGAG
GGCACCCCGC GCCGCCTGGA GATTCAGATT CCCTACGACG AGCGCTTCGA TCCGGTGGAA
CGCCGCACCT TCACGTCCGA GGCCACCGAA TATGCGCTCT CACCCGACGG CCGGTACGTC
GCCTTTGTCG TCCGCGGCGA GCTGTTCCTG CGCCGTAACG ATCCCGACGA CAACCGCACG
GTGCGCCTGA CACGCCACCC GTGGCGCGAC CGGGAGCCCG CCTGGCTCAA TGACTCGACG
CTCGTGTTCG TCTCGGACCG CGCCGGCCAG TACGATCTGT ACCTGCTGCG GGCGGCCGAC
GCGGGGACTT CCGACCTGTT CGAAAGCCTC ACGCACACCG TCGTGCGCCT GACCGACACG
CCGGAGGACG AGCGCGAACC GGTCGTTGCG CCGGACGGTC GCCATCTCGT GTTCCGCCGG
GGGCGCGGCA CACTGCTGCT GGCCCGCATC GAAGGCGATC GCCTTCGGAT CACGCGCACA
CTGCTCGACG GCTGGGCCAC GCCTGAAGAC GTTGCCTGGA GTCCCGACAG CCGCTGGATC
GCCTACAGCC TGCCCGACCT TGACTTCAAC ACGGAGGTCT ACATTCAGCC CATCGACGGA
AGCCATCCGC CCATCAACGT CAGCCAGCAT CCCAGGCCCG ACACGCACCC GGTCTGGAGT
CCCGACGGCT CCAAGCTTGC CTTTCTGTCG CCGCGCAGCT CCGGCGACGT GGACGTGTGG
TTCGCCTGGC TGCGCCGGGC CGACTGGGAG CGCACCGAAG AGGAATGGGA GGCCCTCGAA
AAGCAATCCG ACCGTAAGCG CCGCGATACG CTCACCGGTC CCATTCAGAT CGACCTGGAA
CGCATCCACG AGCGGCTGCG CCGCGTGACC GCCCTGCCCG GGAACGAGGC CGAACTGGCC
GTCTCGAAAG ACGGCGAAAC CTTCTACTTC GTGGCCAACC GGGGCGGCCG CACGCAGGAC
TACGAGGCCG AAGTGGACCT CTACCGCATC CGCTGGGACG GCTCCGAACT GAAGCGCCTC
ACCGAGAACG ACACGGATCC CCGCCAGGTG CGGCTCAGCC GCGATGGCAA GTACCTGTTC
TTCCTGCGGC CTTCGGGTCA GCTCGTGCGG CTCAACCCGG AGAACGGCCG CCAGGAAACG
CTGCGCTTTG AAGCGCGCAT GGAAATCGAC TACCGCGAGG AGCGTCGCCA GATCTTCGAG
GAAGCCTGGC GCACGCTGGC GCAGGGCTTC TACGACCCGC AGTTCCACGG CGTCGACTGG
CGAACGCTCC ACGACAAATA CCTGCCCTGG GCGCTACAGG CTTCGACGAA CCGGGACTTC
CGCGACGTCT TCTCCTGGAT GCTCGGCGAG CTAAACGCCA GCCACCTGGG CATTTCCGGT
CCCGATCGGG CCGAGACGCA GCGCGAACGT ACCGGCCTGC TGGGCGTGGA GGTCGAGCCC
GTGCCGGGCG GCGTACGCAT CCGGCATGTG GTGCCACGTT CGCCGGCAGA CCGCGAGGAA
AGCCGCCTGC ATGTGGGCGA AGTAATTACG GCCGTCGACG GCACACCCGT AGCCGAAGTG
GACAACTTCT ACCGCCTGCT GGTCGATAAA GTGGACGCGC GTGTGCGGCT GACGGTGCGT
GCGCCGGACG GCCGGACCCG CACCGTGATC ATTCGTCCGG TCGGCTCCCT GAACGAAGCG
CTCTACGAAG AATGGGTGGC CACACGGCGT GCGCTGACCG AACGCTACAG CAACGGCCGC
CTGGGCTACA TCCACGTGCA GGGCATGAAC TGGCCCAGCT TCGAGCGCTT CGAGCGCGAA
CTGGTGGCCA GCGGCCAGGG CAAAGAAGGG TTGATCATCG ACGTGCGCTA TAACGGCGGC
GGCTGGACGA CCGACTACCT GCTGACGGTG CTGACCGTTC GTCGCCACGC CTATACGATT
CCGCGTGGGG CCGCCGAGCG GCTGGATCTG CCCGATCGCC GCGCCTTTCG CGCCCATTAC
CCCTTCGGCG AGCGGCTGCC CTTTGCGGCC TGGACGAAAC CCGTGGCCGC CCTCTGCAAC
CAGAACAGTT TTTCCAACGC CGAGATCTTT TCGCACGCCT TTAAGAACCT GGGGCTGGGG
CCGCTGATAG GCGTGCCCAC CTTCGGCGCA GTCATCTCGA CCGGCGGTGT GGGACTCATC
GACGGATCGT TCGTTCGCCT GCCGTTCCGC GGCTGGTTCG TTTACGCCGA CGACACGAAC
ATGGAGAACG GTCCGGCCGT ACCCGACATC ATCGTCGAGG AAGCACCCGA CAGCAAAGCG
CGGGGAGAGG ATCCCCAGCT CCGGGCGGCC GTCGAAGCAC TGCTGGCCCG TATCGACGCC
CGAAACACCG AGGAAACCCA CTAA
 
Protein sequence
MGSLLVLLLA LLTFNPRPFP RYPAIDPSGQ QIVFSYQGDL WLVPVTGGMA QRLTVHPAYE 
AYPRWSPDGR RIAFTSDRYG HDDLFVMELF GSPPRRLTYH SADDILTDWT RDGRLLFQTR
RLFVQVEREA EIHAIPDTGG TPVRILDATG FQATLSPDGR FLAFERGSNA TWRQGYRGSA
DRDLWLFDLQ NHTFRRLTDF DGNDYLPAWA GPRTLLFISE RDGTYNLYRL PLNEDGTPQG
APEQLTHFEG DGVRYFTVSA DGRTIAFERQ TDLYVLTLPE GTPRRLEIQI PYDERFDPVE
RRTFTSEATE YALSPDGRYV AFVVRGELFL RRNDPDDNRT VRLTRHPWRD REPAWLNDST
LVFVSDRAGQ YDLYLLRAAD AGTSDLFESL THTVVRLTDT PEDEREPVVA PDGRHLVFRR
GRGTLLLARI EGDRLRITRT LLDGWATPED VAWSPDSRWI AYSLPDLDFN TEVYIQPIDG
SHPPINVSQH PRPDTHPVWS PDGSKLAFLS PRSSGDVDVW FAWLRRADWE RTEEEWEALE
KQSDRKRRDT LTGPIQIDLE RIHERLRRVT ALPGNEAELA VSKDGETFYF VANRGGRTQD
YEAEVDLYRI RWDGSELKRL TENDTDPRQV RLSRDGKYLF FLRPSGQLVR LNPENGRQET
LRFEARMEID YREERRQIFE EAWRTLAQGF YDPQFHGVDW RTLHDKYLPW ALQASTNRDF
RDVFSWMLGE LNASHLGISG PDRAETQRER TGLLGVEVEP VPGGVRIRHV VPRSPADREE
SRLHVGEVIT AVDGTPVAEV DNFYRLLVDK VDARVRLTVR APDGRTRTVI IRPVGSLNEA
LYEEWVATRR ALTERYSNGR LGYIHVQGMN WPSFERFERE LVASGQGKEG LIIDVRYNGG
GWTTDYLLTV LTVRRHAYTI PRGAAERLDL PDRRAFRAHY PFGERLPFAA WTKPVAALCN
QNSFSNAEIF SHAFKNLGLG PLIGVPTFGA VISTGGVGLI DGSFVRLPFR GWFVYADDTN
MENGPAVPDI IVEEAPDSKA RGEDPQLRAA VEALLARIDA RNTEETH