Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_1517 |
Symbol | |
ID | 8568168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 1768432 |
End bp | 1771434 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | outer membrane assembly lipoprotein YfiO |
Protein accession | YP_003290792 |
Protein GI | 268317073 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.261003 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCGCA TCATCGGGCG GCTCTGCACG CTGAGCCTGT TGTTCTTCGG ACCTGTGGCA CTGCAGGCGC AGCCTGCGCC GCCGGGTCCG GCCGTTTCGT TCGCCCATGC CCTGGCGCTG CACAGCGACG GATTCTACAC GCTCTCGGCC CAGACGTTTG CGCAGTTTCG GAGCACGTAC CCGGACGATC CGCGGGTCCC CGAGGCGCTG TTTTACGAAG CCGAGGCGCG GCTGGCACTG GGACAGACCG ACGAGGCGGC CGCCCTGCTC CGCGTGTTCG CCGCCCGCTA CCCGACGCAT CCGCTGGTCT ACGAGGCCCA GCTTGCCCTG GGCAAGTACT TTTTCGATAC CGGGCGCTAT GACGACGCCC GCCAGGCCTT CGGCCAGGCG CTGCGTCCCG GCGTGCCCGC CACTCAGGCT GCCCGAGCGC TGTTCTGGAT GGCCGAGTCG GCGCAACGGC TGGGACGTCC GGCCGAGGCC ATCGGCTACT ATCGGCGGCT GGCCGACACG TACCCGAACA CGCGGCTGGC CCCACAGGCG CTGCTGGCAA TGGCTTACAC GCAGGTGGAG ATGGGCGCCT ACGACGAGGC CGCCCGCACC TTCGAAGTGC TGGCCGCACG CTACCCCGCC GCCCCGGAAG CCCGGGGACT GGGTCTGGCG CTGGCGCAGG TTTACTACGA GCTGGGCGAC TACCGCCGTG CCATCGACGA AGTGCAACGT CGCCTGCCCG ACCTGAAAGG CGAGGCGCAG CAACAGGCCT GGCTACTGCT GGCCGAGTCC TACAACCAGC TCCGCGACAG CGAAAACGCC ATCGTCTACT ACCGTCGCGT ACTGGAAGAC CCCGACAGTC CCTACTACCG CCGGGCGCTC TACGGTCTGG CGTGGAATTA CTACTTCGAG GGTGTTTACC AGTGGGCGGC CGACCATTTC CGCCAGGTGC GCGAAGGCCG GCGCGACACG CTCGCCATGA AGGCCACTTA CTACGAGGCG GTCTGTCGCA AGCTGGCCCG CGAGCCGCAG CAGGCACTGG AGCTGTTCCG GACTGTCGTG CTCGAATGGC CCGACAGTCC GCTGGCCCCC CACGCCCAGT ACGAACTGGC CCTGCTGCTC TACGAAATGC GACGGTGGGA AGAAGCGCAT GACGCGTTCG ACTTTCTCGT GCGCACCTAC CCCGACAGCG AACTGCTCGG CGACGCGCTC CGGATGCGCG GCTACACGGC CATCGCGCTC GGCCACTTCG ACGAAGCCTA CGAGAGCTTC GACCGCGCCG TGGCCCTACA GGCCGCCTCG CCACAACTCC GCACCGAGAT CGCCTTCCAG AAGGCCTGGC TTCAGTACCG CCAGCAGAAC TACGCGGCCG CCAGCGAGGC CTTTCTGGAA CTGTACCGTC AGGATCCACG AGGCCCTAAA GCCGGCGATG CCCTTTTCTG GGCGGCCGAA AGCTTCTACC AGCTGGGCCG GCTCGACCGC GCCGAAGCGC TCTTTCGCGA CTACCTCCGG AGTTTTCCCG ATGGCGCCCA CGTCGAAGCC GCCCACTACG CGCTGGGCTG GGTCTATTTC CGCCAGCAGC GCTACGAAGC CGCCATTCAG GCCTTCCAGC AGTTTCTGCG GGCCTATCGC CGCACCGAAG AGGCGGTCCC CTACCGGCTC GACGCGCTGC TCCGGCTGGC CGACAGCTAC TACGCGCTCA AGCGCTATCC CGAGGCCATC CGCTACTATC GCCAGGCGGC CGCCGAAGGC GAAAGCGACT ATGCGCTCTA TCAGATCGGC CAGGCCTACT ACAACGCAGG CAACTACGAG GAAGCCCTGC GCACCTTCAA CCGATTGCTG GAAGAACACC CCGAAAGCAC CTGGCGCGAG GAGGCGCTCT ACCAGATCGG CTACATCCAT TTCCTGAACC AGGAATACGA TCAGGCCATC GCGGCCTACC GGCGGCTGCT CGAACTGGCC CCGAACGATC CGCTGGCCGC CAAGGCGCAG TACGGCATCG GCGACGCGCT CTTCAATGCG GGACGCCTGG AAGCGGCCGT CAACGCCTAC AAACGCGTGC TGGAGCGCTA TCCGCAGAGT CCGTTCGTGG CCGACGCCGC CACCAGCATC CACTTTGCGC TGATCGCCGC CGGCAACGAA GCGCGCGCGC AGGCGCTGAT CGATTCGTTC GCCACGGCCT ACCCGGACAC GCGCATCGTG GACGAACTGC GCTTTCGGCG GGCCGAGGCA CTCTACCGCA GCGGCCGCTC CGAGGAGGCC ATCCGGGCAC TCGAAGCCTT CGTGCGCGGG AGCCATGCGC CGGACCTGAT GGGCGAGGCG CTTTACTACC TGGCCACGCT CTACGCAGAA CAGGAACTGT ACGACGAGGC CGAGCGCACG CTGCAGCAAC TGCTGGCCGC CCATGCCGAG CACCGAAGGA TGCCCGAAGC ACTGCTCCTG CTGGGTAATG TCCAGCTGAA GCAGGAGCGC TACGAAGCCG CCCTGGTGAG CTTCCGGCGC CTGGCGTCAA TGGCACCGGA GCGCTCCGAG CTGCTGGCCC GTGCCCTCTA CGGCCAGAGC GTCGCCCTGC TGGAGCTGGG CCGCTTCGCC GAGGCACGCC AGGCGCTCAC AGAAGCACAG GCGCGTTTTC CGGAAGATGG CCAGCCCGCC ATCCTGCTGC TCGGGCAGGC ACGCCTGGCC GAGGCCGAGG GCCGTCCGGA CGAAGCGGAG CGCCTCTACC GGGCGGTGGT CGGTCGCGCC CAGGACGAAG CCGGTGCCGA GGCGCTCTAT CGCCTCGGGG AACTGCTGCT ACGCCGGGGC GACCCGCATC GGGCCATCGA AGAGCTGAGC CGTCTGCCCA CACTCTTCCC GGGCTATCCA GAATGGCTGG CCCGTGGCTA TCTGGCCCAG GCGCGGGCCT TCCTTGCGCT CGGACAGCGC GGCGAGGCCA CCCGCCTTTA TGATCTGGTC ATCACCGAAT TTCCCAACAC GTCGTTTGCC CGCATTGCTG CCCAGGAAAA AGCCCGGTTG TAA
|
Protein sequence | MLRIIGRLCT LSLLFFGPVA LQAQPAPPGP AVSFAHALAL HSDGFYTLSA QTFAQFRSTY PDDPRVPEAL FYEAEARLAL GQTDEAAALL RVFAARYPTH PLVYEAQLAL GKYFFDTGRY DDARQAFGQA LRPGVPATQA ARALFWMAES AQRLGRPAEA IGYYRRLADT YPNTRLAPQA LLAMAYTQVE MGAYDEAART FEVLAARYPA APEARGLGLA LAQVYYELGD YRRAIDEVQR RLPDLKGEAQ QQAWLLLAES YNQLRDSENA IVYYRRVLED PDSPYYRRAL YGLAWNYYFE GVYQWAADHF RQVREGRRDT LAMKATYYEA VCRKLAREPQ QALELFRTVV LEWPDSPLAP HAQYELALLL YEMRRWEEAH DAFDFLVRTY PDSELLGDAL RMRGYTAIAL GHFDEAYESF DRAVALQAAS PQLRTEIAFQ KAWLQYRQQN YAAASEAFLE LYRQDPRGPK AGDALFWAAE SFYQLGRLDR AEALFRDYLR SFPDGAHVEA AHYALGWVYF RQQRYEAAIQ AFQQFLRAYR RTEEAVPYRL DALLRLADSY YALKRYPEAI RYYRQAAAEG ESDYALYQIG QAYYNAGNYE EALRTFNRLL EEHPESTWRE EALYQIGYIH FLNQEYDQAI AAYRRLLELA PNDPLAAKAQ YGIGDALFNA GRLEAAVNAY KRVLERYPQS PFVADAATSI HFALIAAGNE ARAQALIDSF ATAYPDTRIV DELRFRRAEA LYRSGRSEEA IRALEAFVRG SHAPDLMGEA LYYLATLYAE QELYDEAERT LQQLLAAHAE HRRMPEALLL LGNVQLKQER YEAALVSFRR LASMAPERSE LLARALYGQS VALLELGRFA EARQALTEAQ ARFPEDGQPA ILLLGQARLA EAEGRPDEAE RLYRAVVGRA QDEAGAEALY RLGELLLRRG DPHRAIEELS RLPTLFPGYP EWLARGYLAQ ARAFLALGQR GEATRLYDLV ITEFPNTSFA RIAAQEKARL
|
| |