Gene Information |
Locus tag | Rmar_1823 |
Symbol | |
ID | 8568475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 2131147 |
End bp | 2132817 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | carboxyl-terminal protease |
Protein accession | YP_003291094 |
Protein GI | 268317375 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
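The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2131147
end_bp = 2132817

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which is assumed not to be counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the result agrees with the Gene Length (1671 bp) and Protein Length (556 aa) fields.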
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0958668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAT CACTGCGTTA CACCCTCCCG GCCATTCTGC TGCTGGCGCT GGGCATTCTG CTGGGCTGGA ATCTGCAACA GGCCGTTTCC GACACCGACA CGCTGGCCAG CCTGCGCAAG CTCGAAGAAG CCTTTCTGAC GATCACGCAG CGCTACGTCG ATCCGGTCGA GCCCGAACCG CTGGCCGAGG AGGCCATCCG GTCCATGCTC CAGGAGCTGG ACCCCCACTC CGTGTACATC ACCGCCGAGG AAATGAAGGA ACTCCGGGAA AGCTACCAGG GCTCCTTCGG CGGGATCGGG ATCTGGTTCG AGGTGGTGGA CGACACGGCC CGCGTGGTGG CCACCATCAG CGGCGGGCCC AGCGAGGCGG TCGGACTCCA ACCCGGCGAT CGGATCATCA AAATCGAAGA CTCCAGCGCC GTGGGCCTTT CCTCGACGGA AATTCAGAAG CGGCTTAAAG GTCCGGAAGG CACCAAAGTC CGGGTAACCA TTCGCCGGCT GGGCGTCCGC GAGCCCCTGG AGTTTACGAT CACGCGCGAC CGCATTCCGC TCTACACGGT CGATGCCGCC TACATGCTCG ACGAGCGGAC CGGCTACATC CGCATCAGCC GCTTTGCCAT GACCACCTAC GATGAATTCC TGGAGCACCT AGACCGCCTC AAGCGCCAGG GCATGGAGCG GCTGGTGCTG GACCTGCGCG GCAATCCGGG CGGCATCATG GAAGCGGCCG TGGAGCTGGT CGATGAACTG TTGCCCGAAG GCTACACGAT CGTCTACACG CGCGGGCGCG TCGCTCAGGC GGAAATGACC CGTCGCTCCA CCTCGGGCGG CCGCTTCGAG ACGCAGCCGG TCATCGTACT GGTCGATCGC AATTCGGCCT CGGCCAGCGA GATCGTGGCC GGCGCGCTGC AGGACAACGA CCGGGCCCTG ATCGTGGGGC TTCGCACCTT CGGGAAAGGG CTGGTGCAGA ACCAGTTTCC GCTCTCCGAC GGCAGCGTCA TCCAGCTGAC GGTCGCCCGC TACTACACGC CCTCGGGTCG CCTGATTCAG ACGCCCTACC ACGGCGGTGA CCTGGAGGAC TACTACCGGG AAAAGTTCGC CGACTACGAA ACGGCCGTCT TCCATCCGGA GGATTACATC AACGAGATCC CGGACTCGCT GAAGTTCAAG ACGGTGCACG GCCGCACGGT CTTCGGCGGC GGTGGCATTC TGCCCGATGT GATCGTTCCG CCCGACACGA ACTCGATCCT GCTGGAAGTC AGCCGTCGCA ACCTGCCCTC CACCTTCGTC CGCACCTGGT TCAATCAGCA TGAACAGGCC ATCCGCGCGC AGTGGAACAA CCGGAAGGAC GCCTTTCTGG CCTCGTTCGA AGTGGACGAC ACGCTGTGGC AGGCCTTCCT GGACTACGCC CGGGAGCAGG GCCTCTTTGC GGCCGATTCT GCCGCGACGC CTCGCTTCAC GGTCGCACAG GCCGAAGCGC ACCGGCACGA ACTGAGCACG CTGCTGCAAG CCTATCTGGC CTGGCAACTG TTCGGCCGTG AGGCGTCAAT CCCGCTGTTC AACGAAATCG ATCCCGTACT GCACGAAGCG CTCAAGCACT GGGACCGGGC CGAGGCGCTG GCCGCCTATT TCGCCCCGAA AGCGGGCGAC ACGGTACGCA AAGGGCGTTA G
|
Protein sequence | MKKSLRYTLP AILLLALGIL LGWNLQQAVS DTDTLASLRK LEEAFLTITQ RYVDPVEPEP LAEEAIRSML QELDPHSVYI TAEEMKELRE SYQGSFGGIG IWFEVVDDTA RVVATISGGP SEAVGLQPGD RIIKIEDSSA VGLSSTEIQK RLKGPEGTKV RVTIRRLGVR EPLEFTITRD RIPLYTVDAA YMLDERTGYI RISRFAMTTY DEFLEHLDRL KRQGMERLVL DLRGNPGGIM EAAVELVDEL LPEGYTIVYT RGRVAQAEMT RRSTSGGRFE TQPVIVLVDR NSASASEIVA GALQDNDRAL IVGLRTFGKG LVQNQFPLSD GSVIQLTVAR YYTPSGRLIQ TPYHGGDLED YYREKFADYE TAVFHPEDYI NEIPDSLKFK TVHGRTVFGG GGILPDVIVP PDTNSILLEV SRRNLPSTFV RTWFNQHEQA IRAQWNNRKD AFLASFEVDD TLWQAFLDYA REQGLFAADS AATPRFTVAQ AEAHRHELST LLQAYLAWQL FGREASIPLF NEIDPVLHEA LKHWDRAEAL AAYFAPKAGD TVRKGR
|
| |
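The sequences above are shown in space-separated blocks of ten characters. The sketch below is one possible way to work with them: it strips the spacing, computes a GC fraction, and translates a coding sequence up to the first stop codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 is understood to share (the two differ in permitted start codons, not in how codons are decoded). Only the first 30 bases of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order ('*' marks a stop).
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0 until the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence shown above.
prefix = clean("ATGAAGAAAT CACTGCGTTA CACCCTCCCG")
print(translate(prefix))
print(gc_fraction(prefix))
```

Pasting the full Gene sequence value into `clean` in place of the prefix would allow the translation to be compared with the Protein sequence field and the GC fraction with the GC content field.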