Gene Information |
Locus tag | Rmar_0222 |
Symbol | |
ID | 8566852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 236133 |
End bp | 239252 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Fe-S-cluster-containing hydrogenase |
Protein accession | YP_003289516 |
Protein GI | 268315797 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
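The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: `span_length` and `expected_protein_length` are hypothetical helper names, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length does not count the terminal stop codon.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 236133
END_BP = 239252
GENE_LENGTH_BP = 3120
PROTEIN_LENGTH_AA = 1039


def span_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed convention)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # 239252 - 236133 + 1 = 3120 bp, and 3120 / 3 - 1 = 1039 aa.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the values listed in the record.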
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAAC TGCCTGTGGT CAATCCTGAC GGTGCCGAGA CGCCCGGTTC GGGCAAGCGC CTCTGGCGCA GCACGGCCGA CCTGCGCCGG GATCCGGAAT GGGTGAAGCT GGCGCACGAC GAGTTCATGC CGGGGGTGGC GGAGCCGCCG AGCGGTACCT CGCGGCGCCA GTTTTTGCAA ATCATGGGGG CGTCGATGGC GCTGGCCGGA CTGACGGCCT GTCGCCGTCC CGTCGAGAAG ATCCTGCCCT ACGTGCGCCA GCCCGAAGAG ATCATTCCGG GCATTCCGCT CTACTACGCC ACGGCCATGC CCTTCCGGGG CAGCGTGCGG CCGCTGCTGG TCGAAAGCCA CGAGGGGCGC CCGACCAAGA TCGAGGGCAA CCCGGATCAT CCGCTCAGCC GGGGTGCGAC GGGCGTCTTC GAGCAGGCTT CGCTGCTGAA TCTGTACGAT CCGGACCGCT CGCAGCAGGT GCTCCGCAAG GGTGAGCCGG CTTCGTGGGG CGACTTCGTG CAGTTTGCCC GGTCGCTGGC CGCCGAGGCG GGCACAAAGC GGCTGGCCGT GCTCTGCGAG CCGAGCAGTT CGCCCACGCT GGCCGCGCTG CGCCGGGAGC TGGAGCGGCG CTACGCACAG GTGCGCTGGG TCACCTACCG TCCGGAGGGC GACGACCACG AGGCGCTGGG ATTGCAGCAG GCCTTCGGCC GTCCGGTGCG GGCCCGCTAC CGCTTTTCGG AGGCCCGTGT GATCGTCAGC CTGGACGCCG ACTTTCTGGG ACCGACCGAC CGCAACTTTG TCGAGAACAC GCGTGAGTTT GCCGCCAGCC GGCGCATGGA GCGGCCTGAA GATGAGATCA GCCGCCTGTA CGTGATCGAA AGCACCTACA CGGTCACGGG CGGCATGGCC GACCACCGGC TGCGGCTGCG CGCCGGCGAC ATTCCGGCGT TCGCCGCGGC GCTGGCGGCC GAGCTGGGCG TCGGCGAACT CCGCGAAGCG GGCGCCCGTT TTGCCGGGCA TCCGTACGTG GTGGAGATTG CCCGCGACCT GCGGGCGGCC GGTGCGCGCG GCGTGGTGCT GGCGGGCGAA ACGCAGCCGC CGGCCGTGCA CGCGCTCTGC GCCGTCATCA ACGACCTGCT GGGAAGCCTG GGCCGCACGG TGATCCTGCA TGCGCTGGAC GAGCCGGCCA CCGCTCAGCA TGCGGCACTG GCCGAGCTGG TGCAGGCCAT GCAGGCCGGT GCGGTGGACG CGCTGCTGCT GCTGAACGTC AACCCGGTCT ACGACGCTCC GGCGGCGCTG GGCTTTGCCG AGGCACTGGC GCAGGTGCCC GAGGTGATCC ACCTGGGACT GCATGTGGAC GAGACGGCCC GCCGGAGCAC CTGGCACCTG CCCTCCACGC ACTACCTGGA AGCCTGGGGC GACGGACGCG CCTACGACGG CACGCTCTCG GTCATCCAGC CGCTGATCGC CCCGCTCTAC GAGGCCGCCC ACTCGCCGCT GGAGGTGCTG GCCCTGCTGG CCACCGGCGA AGAGCAGAGC GCCTACGACC TGGTGCGTAA CACCTGGCGG CGGCTGCTGG CAGGCCGGGG GGCCTTCGAG CAGGCCTGGC AGCGCGTGCT GCACGACGGC TTCCTGCCGG ACTCGGGCTA TCCGACCGTT TCGCTGCGCC CGAACCGTCA GGCCCTGGCC GACTGGCCGC AGGCAGCGGA AGGCGGTCTG GAGGTGGTCT TCCGGCTGGA TCCGACCGTA CTGGACGGCA GCTTCGCCAA CAACGCCTGG GCGCAGGAGC TTCCCGATCC GATCACGAAG 
ATCGTCTGGG ATAACGTCGC GATCCTGAGC CCGAAGACGG CCGCGGCGCT GGGCGTCAAA GCCGAATACC ACAAGGGCGT CTACATCGCG GACGTGATCG AGCTGTCGCT GGACGGCCGC GCGGTGGAGC TGCCCGTCTG GGTGTTGCCC GGCCATCCGG ACGACTCGAT CACCGTCTAT CTGGGCTACG GTCGCGAGAT CACCTCGACG CGGCCCGAGC GGAAGACGCC CTTCTTCGAC CTGGACGACT ACACGGACAT CTACGGCCAC GGCGCCATTG CCACCGGCGT GGGCGTGAAC GTGGCCCCGC TGCGGCGGCC CGACAACACC TGGGTGGCCT ATGGGGCGCA GGTGCGCAAG ACGGGACGCA CCTACAAGAT CGTGACCACG CAGGACCACG GCTCCATGGT GGGGCGGCCG CTGGTGCGCC TGGCCACGGT GGAGGAATTC CGGAAAAACC CGGACTTCGC AAAAGAGGCC GAGCCCCCGC TCGAAGGTCT GGAGCCGTGG GACCAGTATC CCACGCTCTG GGAGGAAAAT CACCCGAGCA AACAGCCCGC CTTCCAGGAC AGCGATTACT ACCGCAACCA GTGGGCGATG GTCATCGACC TGAACGCCTG CACGGGCTGC AATGCGTGCA TCGTGGCCTG CGATAGCGAG AATAATATTC CGATGGTGGG CAAAAACGAG GTGGGCCGCG GGCGCGAGAT GCACTGGCTG CGCATCGACC GCTACTTCGT GAGCGACGAG GCGCATGCCG ACGATCCGCA GATCGTGGTG CAGCCGGTGC CCTGCATGCA CTGCGAGAAC GCGCCCTGCG AGTCGGTCTG CCCGGTGGCC GCCACGGTGC ACTCGCCGGA CGGGCTCAAC GAAATGGTCT ACAACCGCTG CATCGGTACG CGCTACTGCT CGAACAACTG CCCCTACAAG GTGCGGCGGT TCAACTGGTT CAACTGGGTC AAGACGCTGC CCATTCAGGT GCAGATGGCC CAGAACCCGG ACGTGACCGT GCGCTTCCGC GGGGTGATGG AAAAATGCAC CTACTGCGTG CAGCGCATCC GCGAGGCGCA GCGGCAGGCC AATATCGAAA AGCGGCCGCT CAGGGACGGC GAGGTCAAGA CGGCCTGCCA GCAGGCCTGC CCGGCCGAAG CGATCACGTT CGGTGACCTG AACGACCCGA ACAATGCCGT GGTGAAGCAG CGGCAGAACG CGCGGCGGTA CGAGATGCTG GCGGCGCTCA ACGTCAAGCC GCGCACCTCG TACCTGGCCC GCATTACGAA TCCGAATCCC CGGCTGCTGG AGCAGGAACC GGTGGCCTGA
|
Protein sequence | MIELPVVNPD GAETPGSGKR LWRSTADLRR DPEWVKLAHD EFMPGVAEPP SGTSRRQFLQ IMGASMALAG LTACRRPVEK ILPYVRQPEE IIPGIPLYYA TAMPFRGSVR PLLVESHEGR PTKIEGNPDH PLSRGATGVF EQASLLNLYD PDRSQQVLRK GEPASWGDFV QFARSLAAEA GTKRLAVLCE PSSSPTLAAL RRELERRYAQ VRWVTYRPEG DDHEALGLQQ AFGRPVRARY RFSEARVIVS LDADFLGPTD RNFVENTREF AASRRMERPE DEISRLYVIE STYTVTGGMA DHRLRLRAGD IPAFAAALAA ELGVGELREA GARFAGHPYV VEIARDLRAA GARGVVLAGE TQPPAVHALC AVINDLLGSL GRTVILHALD EPATAQHAAL AELVQAMQAG AVDALLLLNV NPVYDAPAAL GFAEALAQVP EVIHLGLHVD ETARRSTWHL PSTHYLEAWG DGRAYDGTLS VIQPLIAPLY EAAHSPLEVL ALLATGEEQS AYDLVRNTWR RLLAGRGAFE QAWQRVLHDG FLPDSGYPTV SLRPNRQALA DWPQAAEGGL EVVFRLDPTV LDGSFANNAW AQELPDPITK IVWDNVAILS PKTAAALGVK AEYHKGVYIA DVIELSLDGR AVELPVWVLP GHPDDSITVY LGYGREITST RPERKTPFFD LDDYTDIYGH GAIATGVGVN VAPLRRPDNT WVAYGAQVRK TGRTYKIVTT QDHGSMVGRP LVRLATVEEF RKNPDFAKEA EPPLEGLEPW DQYPTLWEEN HPSKQPAFQD SDYYRNQWAM VIDLNACTGC NACIVACDSE NNIPMVGKNE VGRGREMHWL RIDRYFVSDE AHADDPQIVV QPVPCMHCEN APCESVCPVA ATVHSPDGLN EMVYNRCIGT RYCSNNCPYK VRRFNWFNWV KTLPIQVQMA QNPDVTVRFR GVMEKCTYCV QRIREAQRQA NIEKRPLRDG EVKTACQQAC PAEAITFGDL NDPNNAVVKQ RQNARRYEML AALNVKPRTS YLARITNPNP RLLEQEPVA
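The relation between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced this record: `translate` is a hypothetical helper, it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code, and it does not handle alternative start codons (this gene starts with ATG, so that does not matter here). Whitespace is stripped because the record prints sequences in blocks of ten.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGATTGAAC TGCCTGTGGT CAATCCTGAC"))  # prints MIELPVVNPD
```

The output matches the first ten residues of the listed protein sequence; passing the final bases of the gene (ending in the TGA stop codon) yields the C-terminal residues in the same way.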