Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0137 |
Symbol | |
ID | 6407780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 150131 |
End bp | 151990 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642710046 |
Product | hydrogenase, Fe-only |
Protein accession | YP_001989175 |
Protein GI | 192288570 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | [TIGR02512] hydrogenases, Fe-only |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCACGC CCGATCAGGC CAGCCTGTCT GCCCGCGATC CCGCCGAGGC AACGATCACG CTCTCGATCA ACGGCGTCGC CTGCGCCGGC TTCGCCAACG AGACCATCCT GTCCTGCGCG CGGCGCTACG ACGTCTACAT CCCGACGCTG TGCGAACTGG AAGACATCGA TCACACCCCC GGCGCCTGCC GGGTCTGCCT GGTCGAGATC CTGCAGGCCG GCAAGGACAC CCCGCAGATC GTCACCGCCT GCAACACCCC GGTGCGCGAC GGCATGGAGG TGCAGACCCG CTCCAAGAAG GCGCGGGACA TGCAGCGTCT TCAGGTCGAA CTGCTGATGG CCGATCATCT GCAGGATTGC GCCACCTGCA TCCGGCACGG CAGTTGCGAG CTGCAGGATT TGGCACAGTT CGTCGGGCTG CAGCAGAACC GGTTCTTCGA TCGGGAACGG ACCGAGTCGC GCCCGGTCGA CCACAGCTCG CCGTCGATGG TGCGCGACAT GCGGCGCTGC GTCCGCTGCC AGCGCTGCGT CGCGATCTGC CGTTATCACC AGAAGATCGA CGCGCTGGCG ATCGAGGGCA GCGGGCTGGA GCGGATGGTG GCGCTGCGCG ACGCCGATGG CTACCCGAAT TCGGTGTGCG TCTCCTGCGG CCAATGCGTA CTGGTGTGCC CGACCGGCGC GCTCGGCGAG CGCGATGAGA CCGATCGGGC GCTCGACTAC ATCTGCGATC CGGAGGTCGT CACCGTGGTG CAGTTCGCCC CTGCGGTGCG GGTGGCGTTC GGCGAGGAAT TCGGCCTGCC TGCCGGCACC AATGTCGAAG GCCAGATCAT CGCCGCCTGC CGCAAGCTCG GCGTCGATGT GGTGCTCGAT ACCAATTTCG CCGCCGACGT GGTGATCATG GAGGAGGGTG CCGAACTGCT GGCGCGGCTG AAGCAGGGGC GGCGCCCGAC CTTCACGTCC TGCTGCCCGG CCTGGATCAA CTTCGCCGAG ATCCACTATC CGGACGTGCT GCCGCTGCTG TCCTCGACCA AGTCGCCGCA GCAGGTACTG TCGACGATCG CCAAGAGCTA TCTGCCGGCA CAGCTCGGCG TTCCGGCCGA GCGTATCCGG GTGATTTCGA TCATGCCGTG CATCGCCAAG AAGGACGAGG CGGTGCGGCC GCAGATGGTC CATGACGGGC AGCCCGAAAC CGACCTGGTG CTGACCACGC GCGAATTCGC CCGGCTGCTG CGGCGCGAGG GCATCGATCT GAAGGATCTG CCGTCGTCGC AGTTCGATCG TCCGTTCCTC AGCGCCTATT CCGGTGCCGG TGCCATCTTC GGCACCACCG GCGGCGTGAT GGAAGCCGCG GTGCGGACCA TCTACGCGCT GGTGAACGGC CGCGAACTGG ATCGGATCGA GCTGACGCAG CTGCGCGGCT TCGAAGGGCT GCGCGAGGCA ACTGTCGATC TCGGCGGCCC GGTCGGCGAG GTCAAGGTCG CGATGGTTCA CGGCCTCGGC GACACGCGCC GGCTGGTGGA ATCGGTGCTG AGCGGCGAAG CCAACTACGA TTTCATCGAG GTGATGGCCT GTCCGGGCGG CTGCGTCGAC GGCGGCGGGT CGCTGCGTTC GAAGAAGGCG TATCTGCCGC TGGCGCTGAA GCGCCGCGAG ACCATCTACA ATGTCGACCG CGCCGCCAAG GTCCGGCAGT CGCACAACAA TCCGCAGGTG CAGGTGCTGT ACCGCGAACT GCTGCAGGCG CCCAATTCGG AAATCGCGCA TCGGCTGCTG CACACCCACT ACGCGTCGCG CAAACGCGAG CTGCAGCACA CCGTGAAGGA GATCTGGGAC GATCTCACCA TGAGCACGAT CCTGTACTGA
|
Protein sequence | MCTPDQASLS ARDPAEATIT LSINGVACAG FANETILSCA RRYDVYIPTL CELEDIDHTP GACRVCLVEI LQAGKDTPQI VTACNTPVRD GMEVQTRSKK ARDMQRLQVE LLMADHLQDC ATCIRHGSCE LQDLAQFVGL QQNRFFDRER TESRPVDHSS PSMVRDMRRC VRCQRCVAIC RYHQKIDALA IEGSGLERMV ALRDADGYPN SVCVSCGQCV LVCPTGALGE RDETDRALDY ICDPEVVTVV QFAPAVRVAF GEEFGLPAGT NVEGQIIAAC RKLGVDVVLD TNFAADVVIM EEGAELLARL KQGRRPTFTS CCPAWINFAE IHYPDVLPLL SSTKSPQQVL STIAKSYLPA QLGVPAERIR VISIMPCIAK KDEAVRPQMV HDGQPETDLV LTTREFARLL RREGIDLKDL PSSQFDRPFL SAYSGAGAIF GTTGGVMEAA VRTIYALVNG RELDRIELTQ LRGFEGLREA TVDLGGPVGE VKVAMVHGLG DTRRLVESVL SGEANYDFIE VMACPGGCVD GGGSLRSKKA YLPLALKRRE TIYNVDRAAK VRQSHNNPQV QVLYRELLQA PNSEIAHRLL HTHYASRKRE LQHTVKEIWD DLTMSTILY
|
| |