Gene Rmar_0222 details


Gene Information

Locus tag: Rmar_0222
Symbol:
ID: 8566852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodothermus marinus DSM 4252
Kingdom: Bacteria
Replicon accession: NC_013501
Strand:
Start bp: 236133
End bp: 239252
Gene Length: 3120 bp
Protein Length: 1039 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: Fe-S-cluster-containing hydrogenase
Protein accession: YP_003289516
Protein GI: 268315797
COG category:
COG ID:
TIGRFAM ID:
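
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(236133, 239252)
    print(length, protein_length_aa(length))
```

With the values listed for Rmar_0222, both results agree with the record: 3120 bp and 1039 aa.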


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGAAC TGCCTGTGGT CAATCCTGAC GGTGCCGAGA CGCCCGGTTC GGGCAAGCGC 
CTCTGGCGCA GCACGGCCGA CCTGCGCCGG GATCCGGAAT GGGTGAAGCT GGCGCACGAC
GAGTTCATGC CGGGGGTGGC GGAGCCGCCG AGCGGTACCT CGCGGCGCCA GTTTTTGCAA
ATCATGGGGG CGTCGATGGC GCTGGCCGGA CTGACGGCCT GTCGCCGTCC CGTCGAGAAG
ATCCTGCCCT ACGTGCGCCA GCCCGAAGAG ATCATTCCGG GCATTCCGCT CTACTACGCC
ACGGCCATGC CCTTCCGGGG CAGCGTGCGG CCGCTGCTGG TCGAAAGCCA CGAGGGGCGC
CCGACCAAGA TCGAGGGCAA CCCGGATCAT CCGCTCAGCC GGGGTGCGAC GGGCGTCTTC
GAGCAGGCTT CGCTGCTGAA TCTGTACGAT CCGGACCGCT CGCAGCAGGT GCTCCGCAAG
GGTGAGCCGG CTTCGTGGGG CGACTTCGTG CAGTTTGCCC GGTCGCTGGC CGCCGAGGCG
GGCACAAAGC GGCTGGCCGT GCTCTGCGAG CCGAGCAGTT CGCCCACGCT GGCCGCGCTG
CGCCGGGAGC TGGAGCGGCG CTACGCACAG GTGCGCTGGG TCACCTACCG TCCGGAGGGC
GACGACCACG AGGCGCTGGG ATTGCAGCAG GCCTTCGGCC GTCCGGTGCG GGCCCGCTAC
CGCTTTTCGG AGGCCCGTGT GATCGTCAGC CTGGACGCCG ACTTTCTGGG ACCGACCGAC
CGCAACTTTG TCGAGAACAC GCGTGAGTTT GCCGCCAGCC GGCGCATGGA GCGGCCTGAA
GATGAGATCA GCCGCCTGTA CGTGATCGAA AGCACCTACA CGGTCACGGG CGGCATGGCC
GACCACCGGC TGCGGCTGCG CGCCGGCGAC ATTCCGGCGT TCGCCGCGGC GCTGGCGGCC
GAGCTGGGCG TCGGCGAACT CCGCGAAGCG GGCGCCCGTT TTGCCGGGCA TCCGTACGTG
GTGGAGATTG CCCGCGACCT GCGGGCGGCC GGTGCGCGCG GCGTGGTGCT GGCGGGCGAA
ACGCAGCCGC CGGCCGTGCA CGCGCTCTGC GCCGTCATCA ACGACCTGCT GGGAAGCCTG
GGCCGCACGG TGATCCTGCA TGCGCTGGAC GAGCCGGCCA CCGCTCAGCA TGCGGCACTG
GCCGAGCTGG TGCAGGCCAT GCAGGCCGGT GCGGTGGACG CGCTGCTGCT GCTGAACGTC
AACCCGGTCT ACGACGCTCC GGCGGCGCTG GGCTTTGCCG AGGCACTGGC GCAGGTGCCC
GAGGTGATCC ACCTGGGACT GCATGTGGAC GAGACGGCCC GCCGGAGCAC CTGGCACCTG
CCCTCCACGC ACTACCTGGA AGCCTGGGGC GACGGACGCG CCTACGACGG CACGCTCTCG
GTCATCCAGC CGCTGATCGC CCCGCTCTAC GAGGCCGCCC ACTCGCCGCT GGAGGTGCTG
GCCCTGCTGG CCACCGGCGA AGAGCAGAGC GCCTACGACC TGGTGCGTAA CACCTGGCGG
CGGCTGCTGG CAGGCCGGGG GGCCTTCGAG CAGGCCTGGC AGCGCGTGCT GCACGACGGC
TTCCTGCCGG ACTCGGGCTA TCCGACCGTT TCGCTGCGCC CGAACCGTCA GGCCCTGGCC
GACTGGCCGC AGGCAGCGGA AGGCGGTCTG GAGGTGGTCT TCCGGCTGGA TCCGACCGTA
CTGGACGGCA GCTTCGCCAA CAACGCCTGG GCGCAGGAGC TTCCCGATCC GATCACGAAG
ATCGTCTGGG ATAACGTCGC GATCCTGAGC CCGAAGACGG CCGCGGCGCT GGGCGTCAAA
GCCGAATACC ACAAGGGCGT CTACATCGCG GACGTGATCG AGCTGTCGCT GGACGGCCGC
GCGGTGGAGC TGCCCGTCTG GGTGTTGCCC GGCCATCCGG ACGACTCGAT CACCGTCTAT
CTGGGCTACG GTCGCGAGAT CACCTCGACG CGGCCCGAGC GGAAGACGCC CTTCTTCGAC
CTGGACGACT ACACGGACAT CTACGGCCAC GGCGCCATTG CCACCGGCGT GGGCGTGAAC
GTGGCCCCGC TGCGGCGGCC CGACAACACC TGGGTGGCCT ATGGGGCGCA GGTGCGCAAG
ACGGGACGCA CCTACAAGAT CGTGACCACG CAGGACCACG GCTCCATGGT GGGGCGGCCG
CTGGTGCGCC TGGCCACGGT GGAGGAATTC CGGAAAAACC CGGACTTCGC AAAAGAGGCC
GAGCCCCCGC TCGAAGGTCT GGAGCCGTGG GACCAGTATC CCACGCTCTG GGAGGAAAAT
CACCCGAGCA AACAGCCCGC CTTCCAGGAC AGCGATTACT ACCGCAACCA GTGGGCGATG
GTCATCGACC TGAACGCCTG CACGGGCTGC AATGCGTGCA TCGTGGCCTG CGATAGCGAG
AATAATATTC CGATGGTGGG CAAAAACGAG GTGGGCCGCG GGCGCGAGAT GCACTGGCTG
CGCATCGACC GCTACTTCGT GAGCGACGAG GCGCATGCCG ACGATCCGCA GATCGTGGTG
CAGCCGGTGC CCTGCATGCA CTGCGAGAAC GCGCCCTGCG AGTCGGTCTG CCCGGTGGCC
GCCACGGTGC ACTCGCCGGA CGGGCTCAAC GAAATGGTCT ACAACCGCTG CATCGGTACG
CGCTACTGCT CGAACAACTG CCCCTACAAG GTGCGGCGGT TCAACTGGTT CAACTGGGTC
AAGACGCTGC CCATTCAGGT GCAGATGGCC CAGAACCCGG ACGTGACCGT GCGCTTCCGC
GGGGTGATGG AAAAATGCAC CTACTGCGTG CAGCGCATCC GCGAGGCGCA GCGGCAGGCC
AATATCGAAA AGCGGCCGCT CAGGGACGGC GAGGTCAAGA CGGCCTGCCA GCAGGCCTGC
CCGGCCGAAG CGATCACGTT CGGTGACCTG AACGACCCGA ACAATGCCGT GGTGAAGCAG
CGGCAGAACG CGCGGCGGTA CGAGATGCTG GCGGCGCTCA ACGTCAAGCC GCGCACCTCG
TACCTGGCCC GCATTACGAA TCCGAATCCC CGGCTGCTGG AGCAGGAACC GGTGGCCTGA
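
The gene sequence above is laid out in space-separated blocks of 10 bases. As a rough sketch, the snippet below shows one way to strip that layout and compute length and GC content from the cleaned string. The helper names are hypothetical, and the GC figure is a plain G+C fraction, which may not match exactly how the source database computed its value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = ("ATGATTGAAC TGCCTGTGGT CAATCCTGAC "
                  "GGTGCCGAGA CGCCCGGTTC GGGCAAGCGC")
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_content(seq), 1))
```

Running the same two functions on the full gene sequence gives values that can be compared with the Gene Length and GC content fields.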
 
Protein sequence
MIELPVVNPD GAETPGSGKR LWRSTADLRR DPEWVKLAHD EFMPGVAEPP SGTSRRQFLQ 
IMGASMALAG LTACRRPVEK ILPYVRQPEE IIPGIPLYYA TAMPFRGSVR PLLVESHEGR
PTKIEGNPDH PLSRGATGVF EQASLLNLYD PDRSQQVLRK GEPASWGDFV QFARSLAAEA
GTKRLAVLCE PSSSPTLAAL RRELERRYAQ VRWVTYRPEG DDHEALGLQQ AFGRPVRARY
RFSEARVIVS LDADFLGPTD RNFVENTREF AASRRMERPE DEISRLYVIE STYTVTGGMA
DHRLRLRAGD IPAFAAALAA ELGVGELREA GARFAGHPYV VEIARDLRAA GARGVVLAGE
TQPPAVHALC AVINDLLGSL GRTVILHALD EPATAQHAAL AELVQAMQAG AVDALLLLNV
NPVYDAPAAL GFAEALAQVP EVIHLGLHVD ETARRSTWHL PSTHYLEAWG DGRAYDGTLS
VIQPLIAPLY EAAHSPLEVL ALLATGEEQS AYDLVRNTWR RLLAGRGAFE QAWQRVLHDG
FLPDSGYPTV SLRPNRQALA DWPQAAEGGL EVVFRLDPTV LDGSFANNAW AQELPDPITK
IVWDNVAILS PKTAAALGVK AEYHKGVYIA DVIELSLDGR AVELPVWVLP GHPDDSITVY
LGYGREITST RPERKTPFFD LDDYTDIYGH GAIATGVGVN VAPLRRPDNT WVAYGAQVRK
TGRTYKIVTT QDHGSMVGRP LVRLATVEEF RKNPDFAKEA EPPLEGLEPW DQYPTLWEEN
HPSKQPAFQD SDYYRNQWAM VIDLNACTGC NACIVACDSE NNIPMVGKNE VGRGREMHWL
RIDRYFVSDE AHADDPQIVV QPVPCMHCEN APCESVCPVA ATVHSPDGLN EMVYNRCIGT
RYCSNNCPYK VRRFNWFNWV KTLPIQVQMA QNPDVTVRFR GVMEKCTYCV QRIREAQRQA
NIEKRPLRDG EVKTACQQAC PAEAITFGDL NDPNNAVVKQ RQNARRYEML AALNVKPRTS
YLARITNPNP RLLEQEPVA
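
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is illustrative only: it builds the codon table from the standard NCBI ordering (translation table 11 assigns the same amino acids to codons as the standard table and differs only in permitted start codons), and the `translate` function is a hypothetical helper, not a database API.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, with block spacing removed.
    print(translate("ATGATTGAACTGCCTGTGGTCAATCCTGAC"))
```

The first 30 bases translate to MIELPVVNPD, matching the first block of the protein sequence. The sketch assumes an unambiguous uppercase A/C/G/T sequence and will raise a `KeyError` on any other symbol.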