Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_1070 |
Symbol | |
ID | 8567711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 1227151 |
End bp | 1230126 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003290350 |
Protein GI | 268316631 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGGACA TGAAGCGGTT TTGTCTGGTG GTGCTCGGGC TGCTGTGGCT CGGCACAGCT TCGGAGGTGC TGGGGCAACG GACGGGACGC ATTACCGGCG TGGTTGTCGA TGCCAGTAAC GGCATGCCGT TGCCCGGCGC CAATGTGCTG GTGGCAGGCA CGACGGTCGG GGCCGCCACC GACCTGGAAG GGAAGTTTAT CATCCTGAAT GCGCCGGCCG GTCCTCAAAC GCTGGTGATT TCCTACATCG GATACCAGCG AAAAGAGGTG CCGGTGGAAG TGGTGCCGGG AGGCGAGGTC AGTGTGGAGG TAGCGCTTCA ATGGGCCGGT ATCGAAACCG GAGAGGTTGT GATCACCGCG CAGGCGGCCG GCCAGCTTCA GGCGATCAAC GAGCAGCTCA CGGCACGCAA GATCGTCAAT GTCGTATCAG CCGAACGCAT CCGTGAGCTG CCGGACGAAA GCGCGGCCGC CGCGGTCAGC CGCCTGCCCG GTATCTCCAT CCAGAACGGC GATCAGATCG TCATTCGAGG CGTCGAGGCC AAGTACAACA CCGTCACCGT CAACGGCATC CAGTTGCCGT CCACCACGCT CAACCGGACC ACCGGGCTGG GATTCATTTC GGCCAACATG CTCTCCAGCA TTGAGGTGGC CAAGACGGTG ACGCCCGATA TGGACGCCAA CACCATCGGG GGTAACGTCA ACCTGCGCCT CCGTGAGGCG CCGGAAGGGC TGCACTATGA CGCGCTGGTC TTCGGCGACT ACAACACGCA GGACCATACG GCCGACAACT ACCGGGCCTG GGCGAGCGTC AGCAATCGGT TCTGGAACAA CCGTCTGGGT GTGTTTCTGC AGGCAAACGC CCGCCGTTTC AACGGCGGGG GAGACATTGC TTCGGCTACC TGGGCCGAGT TGCCCCAGGC CGACCCGGTG GCCGGTAGAC GCCCCTACGG ACTGAACCAG TACGACCTGG AAGATCAGGT CAACATCGAC AATGAGTACG GCGCCAGCAT GCTCGTGGAT TACCGGCTGC CGAATCGCGG CAAGCTCATC CTGCAGAATA CCTACTCGGC CGAAGAGTTC GACAACGTCA GCTTCATCGA CCGACTGTAC CTGACTACCG GCGAGCGGCG GTTCCGGATC AATCGGGTGA TCGGCAGCCG CTATCTCCTG GTGAATGCCC TGCAGGGTGA ACACTGGCTC GGGGATGTGG CCAAAGTGGA CTGGGCCCTT TCCCATGCAA AAAGTCGGCG CAAGGACGAT CTGGGGTATG AGACGGAGTT TGCCGGCACG AACTACTTCC AAGGACAGCC GCTGACGTAC TGGACCTCGG AAGACCAGGT CTTCGATATC GAACTGCAAC CAGGGGTCCC GGGAGCGGTG GGCGACGGCC GCACCTTCTA CGAAGATTTC GGAGAGCGGC GGTTGGTCGG GGCCTTTAAC ATCCGCGTGC CCATTACGGT AGGCCCCATT TCGGGTGCGC TGCAGGGCGG CGGCAAATAT ACCCAGCTAA ACCGCGATCG GGACCTGCTC CAGTACTATC GCCGGCTGGG CGACGGGGGC GGACAGAACG TCGGCGCCAA AGACTTTCTG GCGAGCATTG GAGCGGATCC GGAGGCCGCC CTCAACCTTC GCTACTTCAT CGACAGCAGT TATGTCGACG AGCGGGGACA GTATTACCTG GAGGGGCGCT GGCCTTACAG TGGCGCGCTG CGGGTAGATT ATCTGGACAC GTACTTCCGT CTGGCACAGC AGGGATGGGC CACGCCGGCC CTGGCGCAGT CGAATCGGTA TGACTACGAG GCGGAAGAGC GGGTCTCGGC CGGCTACATC ATGGCCGATC TGGACATCGG GCGGCACCTG TCGGTGATCG GTGGGGTGCG CTACGAGAAG TTTAGCTTCA CGAATCGGGC GCCGTTCGTC AATCAGGTGC TTTACGACGG ATCCGGTGAC GTTCGGGATA CCCTGGAGGT TTCGCGCTCG CATCCCCAGT GGTTCCCGAA CATTCAGCTG CGCATCAGCC CGATCGAATG GCTCAACATC CGGCTGGCCT ATACGAAGAC GACCTCTCGC CCGGACTATC AGTACCTGCT GCCCAGCACC TGGGTCGACT CGGGTGAGCG CGGGGAGGCC GGCAACCCCA ACCTGAAGCC GACGCTGGCC GACAACTACG ACGCGTACAT TTCGGTGCAC CACGACCGAA TCGGGCTCTT TACGGTGGGC ATTTTCCGGA AGGTGCTCTC CAACGTGGTG CGTCCGATTT CCATCCAGCG GCGCACGCTC GACCAGTTCG AGGGCACGTT CTGGGCGCCG GAGGCGGCCG GTTATCCGGA GTGCGACGAC GGACGGAAAC ACATCTACTG CCCCGACGGC CCCCTGGTGC CGGATATCAA CCCCGTCGGT CTGATCACTA CCTATGTCAA CAATCCCTAC AAAGGGTATA TCAACGGCTT TGAAATCGAC TGGCAGACCA ACTTCTGGTA TCTGCCGCGG CCCTTCAACA GCCTGGTGCT CAACTTCAAC TACACGCGCC TGCGCTCCAA GATGGACTAC CAGTCCATCT TCCTGGTGCG GACCAGTCCC TTTAGCCCGC CTACCCAGGT AGATACGGTG CGAACGGGGC GACTCTATCA GCAGCCGGAC GATATCCTGA ACATCACGAT CGGCGTGGAT ATCGGAGGCT TTTCGGGTCG CCTGTCCTTC CGCTATCAGG GAGAAGTGCT GGCCAACCTG GACCAGCGCG ATCCGGCCAA CGACGCTTTC ACACGGGCGA TTTATGGCTG GGATTTCTCC CTGCGGCAGC GGCTTCCGAT CAAAGGACTG TCGCTCTTCT TCAACGGCAT CAACATCACG CATGCCGGTA GCTTCGATTA TCGGCGGCTG GTCGTCGGAC CCAATGCCAC CGGGGTCAGC GAGGCCATCA CGCGCATGGC CTACTACCCG CGGCGGTTCC AGCTGGGTAT CCGTTACGGG ATGTAA
|
Protein sequence | MWDMKRFCLV VLGLLWLGTA SEVLGQRTGR ITGVVVDASN GMPLPGANVL VAGTTVGAAT DLEGKFIILN APAGPQTLVI SYIGYQRKEV PVEVVPGGEV SVEVALQWAG IETGEVVITA QAAGQLQAIN EQLTARKIVN VVSAERIREL PDESAAAAVS RLPGISIQNG DQIVIRGVEA KYNTVTVNGI QLPSTTLNRT TGLGFISANM LSSIEVAKTV TPDMDANTIG GNVNLRLREA PEGLHYDALV FGDYNTQDHT ADNYRAWASV SNRFWNNRLG VFLQANARRF NGGGDIASAT WAELPQADPV AGRRPYGLNQ YDLEDQVNID NEYGASMLVD YRLPNRGKLI LQNTYSAEEF DNVSFIDRLY LTTGERRFRI NRVIGSRYLL VNALQGEHWL GDVAKVDWAL SHAKSRRKDD LGYETEFAGT NYFQGQPLTY WTSEDQVFDI ELQPGVPGAV GDGRTFYEDF GERRLVGAFN IRVPITVGPI SGALQGGGKY TQLNRDRDLL QYYRRLGDGG GQNVGAKDFL ASIGADPEAA LNLRYFIDSS YVDERGQYYL EGRWPYSGAL RVDYLDTYFR LAQQGWATPA LAQSNRYDYE AEERVSAGYI MADLDIGRHL SVIGGVRYEK FSFTNRAPFV NQVLYDGSGD VRDTLEVSRS HPQWFPNIQL RISPIEWLNI RLAYTKTTSR PDYQYLLPST WVDSGERGEA GNPNLKPTLA DNYDAYISVH HDRIGLFTVG IFRKVLSNVV RPISIQRRTL DQFEGTFWAP EAAGYPECDD GRKHIYCPDG PLVPDINPVG LITTYVNNPY KGYINGFEID WQTNFWYLPR PFNSLVLNFN YTRLRSKMDY QSIFLVRTSP FSPPTQVDTV RTGRLYQQPD DILNITIGVD IGGFSGRLSF RYQGEVLANL DQRDPANDAF TRAIYGWDFS LRQRLPIKGL SLFFNGINIT HAGSFDYRRL VVGPNATGVS EAITRMAYYP RRFQLGIRYG M
|
| |