Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_0623 |
Symbol | |
ID | 4021092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 705647 |
End bp | 708493 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637960811 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_567762 |
Protein GI | 91975103 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.816724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCTGG TCCACGAAAT CGATTTCGGC ACGCCGCGTT CGCCATCCGA AACGATGGTG ACGCTGACGA TCGACGGCCG CAGCGTGTCG GTGCCCGAAG GCACCTCGAT CATGCGTGCG GCGATGGAGA TCGGCACCGC GATCCCGAAA CTGTGCGCCA CCGACATGGT CGACGCGTTC GGCTCCTGCC GGCTGTGCCT GGTCGAGATC GACGGCCGCA GCGGCACGCC AGCGTCCTGC ACCACGCCGG TCGCCGACGG CCTCGTGGTG AAGACCCAAA CCGAGCGGCT GAAGCAGATC CGCAAGGGCG TGATGGAGCT GTATATCTCC GACCATCCGC TCGACTGCCT GACCTGTTCG GCCAACGGCG ATTGCGAACT GCAGGACATG GCCGGCGCGG TCGGCCTGCG CGACGTGCGC TACGGTTACT CCGGCAACAA GCATCCCAAT CCGGGGCTGG ATGAGTCCAA CCCGTATTTC ACTTACGATG CGTCGAAGTG CATCGTCTGC TCGCGCTGCG TGCGCGCCTG CGAGGAGGTG CAAGGCACCT TCGCACTGAC CATCGCCGGT CGCGGCTTCG GCTCGGTGGT GTCGCCGGGG ATGCAGGAAT CCTTCCTCGG CTCGGAATGC GTTTCCTGCG GCGCCTGCGT GCAGGCCTGC CCGACCGCGA CGCTGAACGA GAAATCGGTG ATCGAGATCG GCACGCCGGA GCGATCGGTG GTGACGACCT GCGCCTATTG CGGCGTCGGC TGCACCTTCA AGGCGGAGAT GCGTGGCGAG GAAGTGGTCC GCATGGTGCC GTACAAGGAC GGCAAGGCCA ATCGCGGCCA TTCCTGCGTC AAGGGCCGGT TCGCCTGGGG CTACACCAAC CACAAGGAAC GCATCCTCAA GCCGATGATC CGCGCCAGGA TCACCGACCC GTGGCGCGAG GTAAGCTGGG ACGAGGCGTT CTCCCATGCG GCCTCGGAGC TGAAGCGCAT CCAGGCCAAA TACGGTCGCG ACTCGATCGG TGGCATCACC TCGTCGCGCT GCACCAATGA AGAGACCTTC CTGGTGCAGA AGCTGATCCG CGCCGGCTTC GGTAACAACA ATGTCGACAC CTGCGCGCGG GTCTGCCACT CGCCGACCGG CTACGGCCTC TCCACCGCCT TCGGCACCTC GGCCGGCACC CAGGATTTCG ACTCGGTCGA GCACACCGAC GTGGTGATGA TCATCGGCGC CAATCCGACC GACGGCCATC CGGTGTTCGC CTCACGGTTG AAGAAGCGGC TGCGCGCCGG CGCCAAACTG ATCGTGGTCG ACCCGCGCCG GATCGACCTG GTGCGCTCGG CCCATGTCGA GGCGGCGCAG CATCTGCCGC TGAAGCCCGG CACAAACGTC GCGGTGCTGA CCGCGCTGGC GCATGTGATC GTCACCGAGG GGCTCGCCAA CGAAGCCTTC GTGCGCGAAC GCTGCGACTG GAGCGAATAC GAGCACTGGG CGTCGTTCGT GGCGCAGCCG AACAACAGCC CGGAAGCGAC CGCAGCGATG ACCGGCGTCG ATCCGCAAGC GCTGCGCGAA GCGGCGCGGC TCTACGCCAC CGGCGGCAAC GGCGCGATCT ATTACGGCCT CGGCGTCACC GAACACAGCC AGGGCTCGAC CACCGTGATG GCGATCGCCA ACCTGGCGAT GGTCACCGGC AATCTCGGCC GCCAAGGTGT CGGCGTGAAC CCGCTGCGCG GCCAGAACAA CGTTCAGGGC GCCTGCGACA TGGGCTCGTT CCCGCACGAG CTGCCCGGCT ATCGCCACAT CTCCACCGAC GCGGTGCGCG ACAGCTTCGA GGCGCTGTGG GGCGTGACGC TGAACAGCGA ACCGGGCCTA CGCATTCCCA ACATGCTCGA CGCCGCGGTC GATGGCTCGT TCAAGGCGCT CTATGTCCAG GGCGAGGACA TTCTGCAGTC CGATCCCAAC ACCAAGCACG TCGCCGCCGG CCTCGAAGCG ATGGAATGCG TGATCGTGCA CGATCTCTTC CTCAACGAGA CCGCGAACTA CGCGCATATC TTCCTGCCGG GTTCGACCTT CCTGGAGAAG AACGGCACCT TCACCAATGC CGAGCGCCGC ATCCAGCGGG TCCGCAAGGT GATGACGCCG AAGAACGGGC TGGAGGACTG GGAAGTCACG CTTCGCCTCG CCGAGGCGAT GGGTTTCAAG ATGAGCTACG ATCACCCGTC GCAGATCATG GACGAGATCG CCACGCTGAC GCCGACCTTC ACCGGCGTCT CCTACGCAAG ACTCGACGAG CTCGGCTCGA TCCAGTGGCC GTGCAACGCC AACGCGCCGG AGGGCACGCC GGTGATGCAC ATCGATCACT TCGTCCGCGG CAAGGGCAAG TTCGTCATCA CCGAATATGT CGCGACCGAC GAGCGCACCG GGCCGCGCTT CCCGCTGCTG CTGACCACCG GCCGCATCCT GTCGCAGTAC AATGTCGGCG CGCAGACGCG GCGCACCGCC AATACGGTAT GGCACGACGA GGACCGGCTC GAAATCCATC CGCACGACGC CGAGCAGCGC GGCGTCAGGG ACGGCGATTG GGTGCGGCTG GCGAGCCGCG CCGGCGAGAC GACGTTGCGC GCGCTGATCA CCGATCGGGT CGCGCCGGGC GTGGTCTACA CCACGTTCCA CCATCCCGAC ACGCAAGCCA ATGTCGTCAC CACCGAATAT TCCGACTGGG CGACCAATTG CCCGGAATAC AAGGTGACCG CGGTACAGGT GTCGCCGTCC AATGGTCCGT CGGAGTGGCA GCGCGCCTAT GAGGAGCAGG CGACCGCGGC GCGCAGGATC GCCTCGGCCG CCGAAGCCGC GGAGTGA
|
Protein sequence | MALVHEIDFG TPRSPSETMV TLTIDGRSVS VPEGTSIMRA AMEIGTAIPK LCATDMVDAF GSCRLCLVEI DGRSGTPASC TTPVADGLVV KTQTERLKQI RKGVMELYIS DHPLDCLTCS ANGDCELQDM AGAVGLRDVR YGYSGNKHPN PGLDESNPYF TYDASKCIVC SRCVRACEEV QGTFALTIAG RGFGSVVSPG MQESFLGSEC VSCGACVQAC PTATLNEKSV IEIGTPERSV VTTCAYCGVG CTFKAEMRGE EVVRMVPYKD GKANRGHSCV KGRFAWGYTN HKERILKPMI RARITDPWRE VSWDEAFSHA ASELKRIQAK YGRDSIGGIT SSRCTNEETF LVQKLIRAGF GNNNVDTCAR VCHSPTGYGL STAFGTSAGT QDFDSVEHTD VVMIIGANPT DGHPVFASRL KKRLRAGAKL IVVDPRRIDL VRSAHVEAAQ HLPLKPGTNV AVLTALAHVI VTEGLANEAF VRERCDWSEY EHWASFVAQP NNSPEATAAM TGVDPQALRE AARLYATGGN GAIYYGLGVT EHSQGSTTVM AIANLAMVTG NLGRQGVGVN PLRGQNNVQG ACDMGSFPHE LPGYRHISTD AVRDSFEALW GVTLNSEPGL RIPNMLDAAV DGSFKALYVQ GEDILQSDPN TKHVAAGLEA MECVIVHDLF LNETANYAHI FLPGSTFLEK NGTFTNAERR IQRVRKVMTP KNGLEDWEVT LRLAEAMGFK MSYDHPSQIM DEIATLTPTF TGVSYARLDE LGSIQWPCNA NAPEGTPVMH IDHFVRGKGK FVITEYVATD ERTGPRFPLL LTTGRILSQY NVGAQTRRTA NTVWHDEDRL EIHPHDAEQR GVRDGDWVRL ASRAGETTLR ALITDRVAPG VVYTTFHHPD TQANVVTTEY SDWATNCPEY KVTAVQVSPS NGPSEWQRAY EEQATAARRI ASAAEAAE
|
| |