Gene Rmar_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1971 
Symbol 
ID8568628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2298481 
End bp2299665 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content68% 
IMG OID 
ProductPeptidase M23 
Protein accessionYP_003291242 
Protein GI268317523 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.324427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCGTC TGCTGCTCTG CCTGCTGTTG ATGATGAGCG CAGGGACGGC CCTGGCCCAG 
CAGGACCGTA CCGAGATCGA ACGCCGCCTG CAGGCGCTCC GCGAGCAGAT TCGTCAGGAA
GAAGCCCGTC TGGCCGAAAC GGCCGAGGCC GAACAGGCCA CGCTGCAGAC GCTCGAAAGC
ATCGAACGCC AGATCGCCAT CCGTCGCGAG CTGATCCGGA GCTACCGGGA GCGGCTGGAA
GAGCTGGCCC GCACGATCGA CTCGCTGCAG CAGGCCGCCC GGGCGCTCAG CCAAGAGATC
GAAAAGCTGA AAGCGCAGTA TCGCCGCCGG GCGCTGCACG CCTACAAATA CGGCCGCATG
CACGAGCTGG CCCTGCTGCT CTCGGCGCAG TCCATCAACC AGATGCTCAT CCGTGCCCGC
TACCTGAGCC GCTTTGCACG GCAACGACAG GCCAAGCTCG AAGCCATTCA GCAGGCGACG
GCCGCTCTGG AAGCCCGTCG CCAGGAGCTG CTGGCCGCCC GCCAGGAAAC CGAGCAGTTG
CTGCAGGAGG CCGAGGCCGA GCGGCAACGC CTGGCGCGTC TGGAGCGCGA GCGCCGCCGC
GTGATCGAAG CGCTCCGCGC CCAGCGCGTC TCGCTGGAGC AATCGCTGGC CCAGAAACGC
CAGGCCGCCC GCGAGCTGGA GTCGCGCATC CAGGCGTTGC TCGCAGCCGA ACGGGAGCGG
CAACGCGCCC GCGAAGCGGC CGATCCGTCG GCCGCTGTGG CTTTTGCCGA GCTGACCGGT
TCGTTCGAGC AGAACCGCGG GCGGCTGCCC TGGCCGGCCG AAGGCGCCGT CGTCGAACCC
TTCGGCGAAG TGGTCAACCC CGTCTATGGC ACGCGCACGC CCAATCCCGG CATCCTGATC
GCCACCGCCC CCCAGGCCGA GGTGCGGGCC GTCTTCGACG GCCGCGTGAT CGCCATCGAC
GCCATGCCGG AGTACGGCAC CTACATCCTC ATCCAGCACG GCGAATACCA GACGTTCTAC
AGCAACCTGT CGCTTGTGTA CGTGTCGATC GGCCAGGAAG TACGGGCCGG ACAGGTCATC
GGCCGGGCCG GCACCGACGC CGAACCCAAA CGCGCCGGCG TGTTCTTCTC GCTCTTCCGG
GGTGGCCAAG TGCTCAATCC CATGCCCTGG CTTCGTCCAC GCTGA
 
Protein sequence
MRRLLLCLLL MMSAGTALAQ QDRTEIERRL QALREQIRQE EARLAETAEA EQATLQTLES 
IERQIAIRRE LIRSYRERLE ELARTIDSLQ QAARALSQEI EKLKAQYRRR ALHAYKYGRM
HELALLLSAQ SINQMLIRAR YLSRFARQRQ AKLEAIQQAT AALEARRQEL LAARQETEQL
LQEAEAERQR LARLERERRR VIEALRAQRV SLEQSLAQKR QAARELESRI QALLAAERER
QRAREAADPS AAVAFAELTG SFEQNRGRLP WPAEGAVVEP FGEVVNPVYG TRTPNPGILI
ATAPQAEVRA VFDGRVIAID AMPEYGTYIL IQHGEYQTFY SNLSLVYVSI GQEVRAGQVI
GRAGTDAEPK RAGVFFSLFR GGQVLNPMPW LRPR