Gene Information |
Locus tag | Rmar_0038 |
Symbol | |
ID | 8566662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 37270 |
End bp | 38448 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Cupin 4 family protein |
Protein accession | YP_003289335 |
Protein GI | 268315616 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTTC CCGAGACAAT CCTGGGTGGC CTGGCGCCTG AAGAATTCCT GGCGAACTAC TGGCAGAAGC GACCGCTTTT GATCCGGCAG GCGCTGCCGG GCTTTCGGTC GCCCATCACG CCCGAGGAGC TGGCCGGGCT GGCCTGTGAG GAAGGGGTGA CGGCCCGGCT GATTCTGGAA AAAGGCGGGG CCTATCCCTG GGAGGTGCGC TACGGGCCCT TCGAGCCTGA GGATTTTGTC GCGCTGCCGC CCACGCACTG GACGCTGCTG GTGCAGGAGG TCGATCGGCT GGTGCCGGAG GTGGCCGCGC TGCTCGAAAC GGTACGCTTC GTTCCCAACT GGCGTCTGGA CGACATCATG GTCAGCTACG CGCCCGAGGG CGGAACGGTC GGGGCGCATA TCGACAACTA CGACGTGTTT CTGGTGCAGG CCTGGGGGCG TCGTCGCTGG CAGATCAACC ATCGGCCTGT CGAGCGTGAG GAACTGGTGC CGGGACTGGA GGTGCGTCTG CTGGCCCACT TCGAGCCTGA TGCCGAATGG ATTCTGGAGC CCGGCGACGT GCTCTACCTG CCGCCCCGCA TCCCGCACTA CGGCGTGGCG CTGGAGGACT GCATGACGTT CTCGATCGGC TTTCGGGCGC CCGATCAGGC CGAACTGGCC GAAGCCATGC CCCGCATGGC TGCCTGGCTG GACGGCGGAC GGCGTTATGC CGATCCCGAT CTGACGCCCG CCGATGAACC CGGCGAGATC ACACCGGAAG CGCTCGATCA GATTCAGGCG TTGCTCCGGG CGCTGATCGA CGACCGGGAA CGGCTGGCCC GCTGGTTCGG CTGCATCATC ACCGAGCCGC GGCGGGGGCT GCCGCCGGAG CCGCCCGGGC GGCCGCTTTC CGCAAAGCAG CTCCATCGAC GCCTGCAGCA GGGAGCGACG CTTCGGCGCA ACGCGATCCC GGAGCTGGCC TACGTGCGCC ACGCGGACGG ATCGGCCACG TTGTTCGCTT CGGGCGAGGC CTACGAACTG TCGCCCGAAC TGGCCGACGT GGCTCCGCTG CTGACCGGTC GCCGACCGCT GACGGCCGAG ACGCTCCGCC CCTGGCTCGA GCGGGACGAC TTTCTGGAAC TCCTGCAGAC GCTCATCCAT TCCGGCATCC TGTCGCTGAT ACCAGCCCGC AAACGCTGA
|
Protein sequence | MQLPETILGG LAPEEFLANY WQKRPLLIRQ ALPGFRSPIT PEELAGLACE EGVTARLILE KGGAYPWEVR YGPFEPEDFV ALPPTHWTLL VQEVDRLVPE VAALLETVRF VPNWRLDDIM VSYAPEGGTV GAHIDNYDVF LVQAWGRRRW QINHRPVERE ELVPGLEVRL LAHFEPDAEW ILEPGDVLYL PPRIPHYGVA LEDCMTFSIG FRAPDQAELA EAMPRMAAWL DGGRRYADPD LTPADEPGEI TPEALDQIQA LLRALIDDRE RLARWFGCII TEPRRGLPPE PPGRPLSAKQ LHRRLQQGAT LRRNAIPELA YVRHADGSAT LFASGEAYEL SPELADVAPL LTGRRPLTAE TLRPWLERDD FLELLQTLIH SGILSLIPAR KR
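
The coordinate, length, and GC fields in this record are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; neither is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """Percent G+C; whitespace between the 10-base display blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(37270, 38448)
print(length)                  # 1179, matching the Gene Length field
print(protein_length(length))  # 392, matching the Protein Length field
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the GC content field.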