Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_0687 |
Symbol | |
ID | 8567324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 787456 |
End bp | 790656 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003289973 |
Protein GI | 268316254 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000368549 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAACG ACTGGTACAA TGTCTGGCGG AACGGCTGGA AGCGATTCGG CTTCTGGTTG GTGCATGTCT GGCTGCTCAC AGCAAGCGCT TCGATAGCCC AAGCGCAGAC GGGAGCGGTT ACGGGACGCG TGACAGATGC CGACACAGGC GAACCGCTTC CCGGTGTGAA TGTGGTGGTG GAAGAACTGG CGACCGGAGC GGCCACCGAT GTGGAAGGAC GGTATACGAT CTCCGGACTG CGGCCGGGTA CCTACACGCT GCGGGCCTCG TTTGTGGGCT ATGAAGACCA GACGCAAGCG GTTGCGGTGC GTGCCGGAGA GACGGTGGAG GTGAATTTTG CCCTGCAGCC CACGACCCTG GGCTTGCAGG AGGTGGTCGT GGTGGGGTAT GGCACGCAGC GGTGGGAGGA TGTGACGGGT TCGGTGGCGG CAGTCCGTAC GGAAAACCTG GCGTTGATGC CCGTGACGGG GCCCGATCAA GCGCTGGCCG GTCAGGTGGC CGGGGTGCAG GTGCTGCAGG GGAGCGGCAT CCCCGGTGGT GGTCCACAGA TTCAGGTGCG TGGTGTCGGT GCGATCGGCG CGGGAAATCA ACCGCTTTTC GTGGTGGACG GCTTCCCGCT TCCCAGCTCG ACCAATGAAA TTCGCAATCC GCTCAACGAC ATTCCCCCCG AAGACATCGA ATCGATTACA ATCCTGAAGG ACGCTTCGGC GGCAGCAATT TACGGCTCCC GGGCGGCCAA CGGCGTGGTG ATCATTACGA CAAAGAGCGG GCGGGGACAG CGGCCTACCG TAACGATCAC CTCTTCGATC GGCGTGCAGC AGATCCCGCC GGAGCGCAAG CCGGACATCA TGAGTGCCCG GGAGTTTGCC CAGTGGATGA AGGAGCGCTA CGAGGATCAC GTGCGGATCG ACCTGGGCCG GGAGCCAACC CCGGATGATA TTCCGGAGCC CTATCGCAAT CCGGAAAGCG TGCAGGGCGT GGACTGGTTC GATGCCGTAA CACGGACGGC CCTGATGTCG GACTTGAACG TAAGCGTCAG TGGAGGAAGC CAGCTCGTTA CCGCCTACTT TTCGGCCGGC GTGTTGCGGC AGGAAGGCGT GCTCAGGAAC ACCGACTTTA CGCGGTATTC CCTCCGGGCC AACCTGCGCT TCTATCCCAA CGACTGGATC AACCTGGGAT TGAACGTGTC GCCGGTATTT TCACGGCGCA GTCTGCCCGT GCAGGGGGGC GGTGGCATCT GGGATGAGAC TTTGCAGCGC AATGCAGGGG CCATTTCGAT GGGACAGATT CTGGTAACCT GCCCGATTGC CAGTCTCGAC GATGTGCAGG TGGGCTGCCC GGGCACCTTA TCCTGGCCCA ATCCCGTCCT GGCGCTGGAA AGCCTGACGC TTGGCACGGA GTCGGCCCGC TTTGTGGGCA GTTCCTTCGT GGAACTCGAA CCTGTCTCGG GTTTGCGGCT GAAGTCGCAA CTGAATACGG AAGTGTTCGG GAGCGAGACC CAGTTCTACC GGCCCTCGAC GATCGGAACA ATTAACACGC CACCTCCTCT GGCGCCGCAG GGACGCTACC AGACCAGCAG CTATTTGAAC TGGCTGAACG AAAATACGGC CACCTGGAAT CTGGCGCTCG GGGATCATTC GGTCGAGGTG CTGCTGGGCA CCAGCTTTCA GAAGCACACG CAGCGGACCG GAAGCTTTAC CGGCGAGGAA TATCCAAGCG ACGAGGTCAA GACGCTGAAC GCGGCCGCGC GCATTACGGG CCAGACGCTG ATCGAAGAAT GGGGCATGAT CTCTTATTTC GGACGGGTCA ACTACGAGTA TCAGAATAAA TACTTCCTCA CGGCGTCGCT GCGTCGCGAC GGTTCGTCCC GGTTTGGCCC GAGAAATCGA TGGGGGACGT TCCCGGCCGT CGCGCTCGGC TGGCGTCTGT CGGAAGAAGG GTTCCTGCGG GGGGTTAACT GGATTGACGA TCTGAAACTG CGGTTTTCCT GGGGCGAGAC CGGTAACAAC AACATCGGCA ACTACGATTA CATCAGCCGG GTGACCTCGC AGAATTATGT ACTGGGCGGC GGCCTGGCTT CCGGTCGCGT GGTTTCCTCG CTGGGCAACG ATCTGCTGGG ATGGGAGACC ACGCGGGAGA CCAACCTGGG GCTCGATGTG ACGCTGCTGA ACAATCGCAT CAACCTGAGC GCCGAGGTTT ACCAGAGCTA TACGACGGAT CTGCTTCTTG ACGTAGAAAT CCCGCTGTCT TCCGGTTTTA CCACGATCAA GGAGAATCGT GGCAAGGTCA GAAACCGGGG AATCGAGGTG GCGCTGCAGA CGACCCCGGT CAGAAGAAGC AATTTCGTCT GGATCTCGAG CTTTAACGTG TCGGCCAACC GAAATGTCGT CCTGGAACTG GGTCCCACCG GCGCGCCCAT CTACAGCGGT CGCAGCGGTG AGGCAAACCC CACGCATATC ACGATGATCG GCAAGCCGGT GGGTATGTTT TTCGGGTATG TGTTTGAGGG GCTTTATCGC GACTGGGATG ACGTCAACAA CAGTCCGCAT TTTCCGGGCG CGATCCCGGG GAACGTGAAG TACCGCGACG TGAACGGAGA CGGTACGATC TCACCGGTGA GCGACTTCGA CATCATCGGA AATCCCTACC CGGATCTGGT ATTCGGTATC ACCAACAGTA TCACCTATAA AAACCTGGAC GTGCAGCTTG TGCTTACGGG GCAACTGGGC GGGGAGAAGC TGATGGCCTT CAAGGAATCG CTGAACAACA TCGACGGCGT GTTCAATGTG GAGCGCGAGA TGCTGCAGCA GTGGCGCTCG CCCGAAGATC CCGGAAACGG CCGGGTGCCC ACGACGGCCG GGACCGCACT TGGCCGGGTC CTGTATCGCG ATGTCAACTC GCTGTGGGTC AAAGACGCCT CGCACCTGGC CATCAAGAAC ATCACGGTCC GCTATAACCT GCCCAACCGC TGGTTCCAGT CCTGGATGTC GATCAACCGG GCGTCCGTCT ATCTGAGCGC CCAGAACGTG TACTACTTCA CGAGCTATCC GGGCAATCCG GAGCAGACCA ACTACAGCGA TATTGTGGTT GATGCCAATA CTGCTTTGCG CTATGGCAAT CCCAACCTGA CGCCCGGGCT GGACTATGCG CCCTATCCGT TGCCCCGTAC GGTGACGCTG GGCATCGAAC TCTCGTTCTG A
|
Protein sequence | MLNDWYNVWR NGWKRFGFWL VHVWLLTASA SIAQAQTGAV TGRVTDADTG EPLPGVNVVV EELATGAATD VEGRYTISGL RPGTYTLRAS FVGYEDQTQA VAVRAGETVE VNFALQPTTL GLQEVVVVGY GTQRWEDVTG SVAAVRTENL ALMPVTGPDQ ALAGQVAGVQ VLQGSGIPGG GPQIQVRGVG AIGAGNQPLF VVDGFPLPSS TNEIRNPLND IPPEDIESIT ILKDASAAAI YGSRAANGVV IITTKSGRGQ RPTVTITSSI GVQQIPPERK PDIMSAREFA QWMKERYEDH VRIDLGREPT PDDIPEPYRN PESVQGVDWF DAVTRTALMS DLNVSVSGGS QLVTAYFSAG VLRQEGVLRN TDFTRYSLRA NLRFYPNDWI NLGLNVSPVF SRRSLPVQGG GGIWDETLQR NAGAISMGQI LVTCPIASLD DVQVGCPGTL SWPNPVLALE SLTLGTESAR FVGSSFVELE PVSGLRLKSQ LNTEVFGSET QFYRPSTIGT INTPPPLAPQ GRYQTSSYLN WLNENTATWN LALGDHSVEV LLGTSFQKHT QRTGSFTGEE YPSDEVKTLN AAARITGQTL IEEWGMISYF GRVNYEYQNK YFLTASLRRD GSSRFGPRNR WGTFPAVALG WRLSEEGFLR GVNWIDDLKL RFSWGETGNN NIGNYDYISR VTSQNYVLGG GLASGRVVSS LGNDLLGWET TRETNLGLDV TLLNNRINLS AEVYQSYTTD LLLDVEIPLS SGFTTIKENR GKVRNRGIEV ALQTTPVRRS NFVWISSFNV SANRNVVLEL GPTGAPIYSG RSGEANPTHI TMIGKPVGMF FGYVFEGLYR DWDDVNNSPH FPGAIPGNVK YRDVNGDGTI SPVSDFDIIG NPYPDLVFGI TNSITYKNLD VQLVLTGQLG GEKLMAFKES LNNIDGVFNV EREMLQQWRS PEDPGNGRVP TTAGTALGRV LYRDVNSLWV KDASHLAIKN ITVRYNLPNR WFQSWMSINR ASVYLSAQNV YYFTSYPGNP EQTNYSDIVV DANTALRYGN PNLTPGLDYA PYPLPRTVTL GIELSF
|
| |