Gene Rmar_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1722 
Symbol 
ID8568374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp1996510 
End bp1998192 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content64% 
IMG OID 
Productcarboxyl-terminal protease 
Protein accessionYP_003290994 
Protein GI268317275 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000358148 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGAA GCTGGCCTTA CGGCCTGGCC CTGTTGCTGG TGGGCGTTGT GCTGGGATTT 
CAGATCGGTG CCGTCGTTTC CGGAGACCCG GCCCGACGGG CGCTTCGTAA GCTGCAGGAA
GCCTTTCTGA TCGTGCAGCA GCGCTACGTG GATCCCGTCG ATTCGGCCCG ACTGACGGAA
AGCGCCCTCG AAGGAATGCT GTCGCGGCTG GATCCGCATT CCGTTTACAT CCCGGCCGAC
GAGATGCGTC GGGTGCAGGA AAGCTTCGAA GGGGCCTTCG AGGGCATTGG CATTGCCTAC
GAGCTGTTGC CGGGACCGAA CGGGCGGGAT ACGATCGCTG TGCAGAGCGT CATTCCCGGC
GGGCCCAGCG AAAAGGCCGG GTTGCTGGCC GGCGATCGGA TTGTGGCGAT CAACGACTCA
AGCGCCATCG GCTTTACGCA CGAGCAGGTG CAGCGGACGC TCAAAGGGCC GCGTGGGACG
CAGGTGCGCG TGACGGTGCG GCGTCCGGGC GTGCCCGAGC TGCTGGAATT CACGATCACG
CGCGATCGCA TTCCGCTCTA TACGGTCGAT GCCGCCTACA TGCTGGACGA GCGGACCGGC
TATCTCAAGC TCAATCGGTT TGCGCGCACG ACCTACCGGG AATTTGCGCA GGCGCTGCGA
CAGCTCCGGC AGCAGGGCAT GGAGCGGCTG GTGCTGGACC TGCGCGACAA CAGCGGGGGC
TATCTGGAAG TGGCCGTGCA GGTGGCCGAC GAGCTGCTGG GCGGCCGTCA ACTCATTGTG
CGTCAGGAAG GAAGGCGTCC GGAGTTTCGG GCCGCCTGGC ATTCGCACCC CGGCGGGCTC
TTTGAGACCG GCCCGCTGAT CGTGCTGGTC AACGAAAACA CGGCCTCGGC CAGCGAGATC
GTGGCGGGCG CATTGCAGGA TCACGATCGG GCGCTGATCG TGGGACGCCG CACATTCGGC
AAGGGGCTGG TGCAGCAGCA GATCACCCTG GCCGATGGCA GTGCCCTGCG GCTGACCGTG
GCCCGCTTCT ATACGCCCTC GGGGCGGCTG ATTCAGACAC CCTACCGGCG AGGCGATCGG
CAGGACTACT ATGCCGAGCA CTGGCGGCGG GCAGTGCGCG ATGTGACCCG TCCTGTTGAA
GAGATTCTGG CCGAAGTGCC CGACTCGCTG CGCTACTACA CCGATGGAGG GCGGATCGTC
TTCGGTGGGG GCGGCATTCT GCCGGATTAT CTCGTGCCGC CCGACACGCT TTCGCCGCTG
GTGCAGGCCG TGCTTCGGCG CAATCTGGAT CAACGTTTCG TCTGGCGCTG GTTCGATCGG
CATGGGACCG AACTTCGTCG TCAGTGGAGG GGCCAACAAG AAACTTTTGT GAAAGACTAC
TGGCCGGATT CAGCCCTGCA GCGGGCTTTT CGTGAGTTTT TGGAAGAAAA TGGGGTTCGC
TTTCATGCAG AAATGGAAGA ACCGGCCGCG CTGCGCTTTT CCGAAGACCG CTGGCGGGCT
GACTGGCCGG TGCTGGGCAC GTTGTTGAAA GCCCAGCTGG CCGTACGACT GTTCGGCCCG
AGGGCACGCT ATCCGGTCTA TCAGGCCGTG GATGCGATAT TGCAGGAGGC GCTGCGGCTG
TGGAAGCCGG CCGAGGAGCT GGCGCAGCGT TACCGAGAAC GGTTACAGAA AGATCGGGAT
TGA
 
Protein sequence
MRRSWPYGLA LLLVGVVLGF QIGAVVSGDP ARRALRKLQE AFLIVQQRYV DPVDSARLTE 
SALEGMLSRL DPHSVYIPAD EMRRVQESFE GAFEGIGIAY ELLPGPNGRD TIAVQSVIPG
GPSEKAGLLA GDRIVAINDS SAIGFTHEQV QRTLKGPRGT QVRVTVRRPG VPELLEFTIT
RDRIPLYTVD AAYMLDERTG YLKLNRFART TYREFAQALR QLRQQGMERL VLDLRDNSGG
YLEVAVQVAD ELLGGRQLIV RQEGRRPEFR AAWHSHPGGL FETGPLIVLV NENTASASEI
VAGALQDHDR ALIVGRRTFG KGLVQQQITL ADGSALRLTV ARFYTPSGRL IQTPYRRGDR
QDYYAEHWRR AVRDVTRPVE EILAEVPDSL RYYTDGGRIV FGGGGILPDY LVPPDTLSPL
VQAVLRRNLD QRFVWRWFDR HGTELRRQWR GQQETFVKDY WPDSALQRAF REFLEENGVR
FHAEMEEPAA LRFSEDRWRA DWPVLGTLLK AQLAVRLFGP RARYPVYQAV DAILQEALRL
WKPAEELAQR YRERLQKDRD