Gene Rpal_0137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0137 
Symbol 
ID6407780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp150131 
End bp151990 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content66% 
IMG OID642710046 
Producthydrogenase, Fe-only 
Protein accessionYP_001989175 
Protein GI192288570 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCACGC CCGATCAGGC CAGCCTGTCT GCCCGCGATC CCGCCGAGGC AACGATCACG 
CTCTCGATCA ACGGCGTCGC CTGCGCCGGC TTCGCCAACG AGACCATCCT GTCCTGCGCG
CGGCGCTACG ACGTCTACAT CCCGACGCTG TGCGAACTGG AAGACATCGA TCACACCCCC
GGCGCCTGCC GGGTCTGCCT GGTCGAGATC CTGCAGGCCG GCAAGGACAC CCCGCAGATC
GTCACCGCCT GCAACACCCC GGTGCGCGAC GGCATGGAGG TGCAGACCCG CTCCAAGAAG
GCGCGGGACA TGCAGCGTCT TCAGGTCGAA CTGCTGATGG CCGATCATCT GCAGGATTGC
GCCACCTGCA TCCGGCACGG CAGTTGCGAG CTGCAGGATT TGGCACAGTT CGTCGGGCTG
CAGCAGAACC GGTTCTTCGA TCGGGAACGG ACCGAGTCGC GCCCGGTCGA CCACAGCTCG
CCGTCGATGG TGCGCGACAT GCGGCGCTGC GTCCGCTGCC AGCGCTGCGT CGCGATCTGC
CGTTATCACC AGAAGATCGA CGCGCTGGCG ATCGAGGGCA GCGGGCTGGA GCGGATGGTG
GCGCTGCGCG ACGCCGATGG CTACCCGAAT TCGGTGTGCG TCTCCTGCGG CCAATGCGTA
CTGGTGTGCC CGACCGGCGC GCTCGGCGAG CGCGATGAGA CCGATCGGGC GCTCGACTAC
ATCTGCGATC CGGAGGTCGT CACCGTGGTG CAGTTCGCCC CTGCGGTGCG GGTGGCGTTC
GGCGAGGAAT TCGGCCTGCC TGCCGGCACC AATGTCGAAG GCCAGATCAT CGCCGCCTGC
CGCAAGCTCG GCGTCGATGT GGTGCTCGAT ACCAATTTCG CCGCCGACGT GGTGATCATG
GAGGAGGGTG CCGAACTGCT GGCGCGGCTG AAGCAGGGGC GGCGCCCGAC CTTCACGTCC
TGCTGCCCGG CCTGGATCAA CTTCGCCGAG ATCCACTATC CGGACGTGCT GCCGCTGCTG
TCCTCGACCA AGTCGCCGCA GCAGGTACTG TCGACGATCG CCAAGAGCTA TCTGCCGGCA
CAGCTCGGCG TTCCGGCCGA GCGTATCCGG GTGATTTCGA TCATGCCGTG CATCGCCAAG
AAGGACGAGG CGGTGCGGCC GCAGATGGTC CATGACGGGC AGCCCGAAAC CGACCTGGTG
CTGACCACGC GCGAATTCGC CCGGCTGCTG CGGCGCGAGG GCATCGATCT GAAGGATCTG
CCGTCGTCGC AGTTCGATCG TCCGTTCCTC AGCGCCTATT CCGGTGCCGG TGCCATCTTC
GGCACCACCG GCGGCGTGAT GGAAGCCGCG GTGCGGACCA TCTACGCGCT GGTGAACGGC
CGCGAACTGG ATCGGATCGA GCTGACGCAG CTGCGCGGCT TCGAAGGGCT GCGCGAGGCA
ACTGTCGATC TCGGCGGCCC GGTCGGCGAG GTCAAGGTCG CGATGGTTCA CGGCCTCGGC
GACACGCGCC GGCTGGTGGA ATCGGTGCTG AGCGGCGAAG CCAACTACGA TTTCATCGAG
GTGATGGCCT GTCCGGGCGG CTGCGTCGAC GGCGGCGGGT CGCTGCGTTC GAAGAAGGCG
TATCTGCCGC TGGCGCTGAA GCGCCGCGAG ACCATCTACA ATGTCGACCG CGCCGCCAAG
GTCCGGCAGT CGCACAACAA TCCGCAGGTG CAGGTGCTGT ACCGCGAACT GCTGCAGGCG
CCCAATTCGG AAATCGCGCA TCGGCTGCTG CACACCCACT ACGCGTCGCG CAAACGCGAG
CTGCAGCACA CCGTGAAGGA GATCTGGGAC GATCTCACCA TGAGCACGAT CCTGTACTGA
 
Protein sequence
MCTPDQASLS ARDPAEATIT LSINGVACAG FANETILSCA RRYDVYIPTL CELEDIDHTP 
GACRVCLVEI LQAGKDTPQI VTACNTPVRD GMEVQTRSKK ARDMQRLQVE LLMADHLQDC
ATCIRHGSCE LQDLAQFVGL QQNRFFDRER TESRPVDHSS PSMVRDMRRC VRCQRCVAIC
RYHQKIDALA IEGSGLERMV ALRDADGYPN SVCVSCGQCV LVCPTGALGE RDETDRALDY
ICDPEVVTVV QFAPAVRVAF GEEFGLPAGT NVEGQIIAAC RKLGVDVVLD TNFAADVVIM
EEGAELLARL KQGRRPTFTS CCPAWINFAE IHYPDVLPLL SSTKSPQQVL STIAKSYLPA
QLGVPAERIR VISIMPCIAK KDEAVRPQMV HDGQPETDLV LTTREFARLL RREGIDLKDL
PSSQFDRPFL SAYSGAGAIF GTTGGVMEAA VRTIYALVNG RELDRIELTQ LRGFEGLREA
TVDLGGPVGE VKVAMVHGLG DTRRLVESVL SGEANYDFIE VMACPGGCVD GGGSLRSKKA
YLPLALKRRE TIYNVDRAAK VRQSHNNPQV QVLYRELLQA PNSEIAHRLL HTHYASRKRE
LQHTVKEIWD DLTMSTILY