Gene Information |
Locus tag | Rmar_2042 |
Symbol | |
ID | 8568699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 2373000 |
End bp | 2374070 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | signal peptidase I |
Protein accession | YP_003291311 |
Protein GI | 268317592 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGGGC GGGAACGTAA GGAGCTGGCG GAGCAGGCCG GAGACGTGCC TGTGCAGTCG GGCCGGTGGC GGCAGCGCAT GCGCGACTGG CTGACGGCCC TGGGACTGGC CGTAGCGCTG GCGTTGCTCA TCCGGATTTT TGCGCTGGAA GCCTATCGCA TCCCGTCCCC TTCGATGGAG CAGACGCTGC TGGTGGGCGA CTTCGTGCTG GTTTCCAAGC TGCACTACGG ACCCCGGATG CCCATGTCGC TGGGGTTGCC GTTCACGGCG TGGTACGTGC CGGGCGTCGC GTTGCCCTAC CTGCGGCTTC CGGGCTTTAC AGAGATCCGG CGGGGCGATG TGATCGTGTT CAACTATCCC GTCGAGACCG GTCCGATCGA TCGCAAAACG CATTACATCA AGCGGGTGGT GGGGCTCCCG GGCGATACGC TCTGGATTCG GGACAAAGTG GTTTACGTCA ACGGCCGACC ATTTCCCGAT CCCGACCTGG TACAGCAACG CTGGATGCTT CAACTGCGGC CGGGCGCGCG GCTTTCGCCG GATTCACTGC GGGCTCTCGG CGCCCGTAAC GTTTCGCGCT CGGCCTACCG TGCCGGTCTG TTATTTTTCG ATGCCACGGT GGCAGTGGCC CGCCACATTG CCCACCTGAG CGCGGTCGAT ACGTTGCGGC CGTATTCGGC CGCGGCCTTG CTCCAGGGAG CGGCCGCCCG CAAAGCCCGC CGGGAAGAAG ACCGAGGACC GATCTACATT CCCGGACGTG GTGATACGCT CTACCTGACG CCCCGCACCT GGCCGTTCTA TCGCGAACTG CTGATACGCT TCGAAGGGCA CCAGATCTAT CCCCGGCCCG ATGGCACGTT CCTGATCGAC GGACGTCCCG GCCGTTTCTG CGTGATCCGC CAGGACTACT ACTACGTCAT GGGCGACAAC CGGGACAATT CACTCGACAG CCGGGCCTGG GGCCTCGTGC CGGCCGATCA CGTGGTAGGC AAGGCCCTGC TGGTGTATCT GTCCTGGGAT CCCGAGCAGC ACCGCATCCG GTGGAATCGG CTGTTTCGTC CTGTACGCTG A
|
Protein sequence | MVGRERKELA EQAGDVPVQS GRWRQRMRDW LTALGLAVAL ALLIRIFALE AYRIPSPSME QTLLVGDFVL VSKLHYGPRM PMSLGLPFTA WYVPGVALPY LRLPGFTEIR RGDVIVFNYP VETGPIDRKT HYIKRVVGLP GDTLWIRDKV VYVNGRPFPD PDLVQQRWML QLRPGARLSP DSLRALGARN VSRSAYRAGL LFFDATVAVA RHIAHLSAVD TLRPYSAAAL LQGAAARKAR REEDRGPIYI PGRGDTLYLT PRTWPFYREL LIRFEGHQIY PRPDGTFLID GRPGRFCVIR QDYYYVMGDN RDNSLDSRAW GLVPADHVVG KALLVYLSWD PEQHRIRWNR LFRPVR
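
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record, and it rests on two assumptions: that Start bp and End bp are 1-based, inclusive positions, and that the gene length counts the terminal stop codon (the gene sequence above ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 2373000
END_BP = 2374070
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length, and protein length are consistent")
```

Under these assumptions, 2374070 - 2373000 + 1 gives 1071 bp, and 1071 / 3 - 1 gives 356 aa, matching the Gene Length and Protein Length fields.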