Gene Information |
Locus tag | Rpal_0805 |
Symbol | |
ID | 6408458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 843769 |
End bp | 846615 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642710718 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001989838 |
Protein GI | 192289233 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
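The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. The helper names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 843769
end_bp = 846615
gene_length_bp = 2847
protein_length_aa = 948


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

If either comparison were false, that would suggest a different coordinate convention or a partial CDS rather than an error in the record itself.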
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.624345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCTGG TCCACGAAAC CGATTTCGGC ACGCCGCCCT CGCTCTCCGA AAACATGGTG ACGCTGACGA TTGACGGTCG CAGCGTCAGC GTACCCGAGG GCACCTCGAT TATGCGCGCC GCGATGGAGA TCGGCACGGC GATCCCGAAA CTCTGTGCCA CCGACATGGT CGATGCGTTC GGCTCATGTC GGCTGTGCCT TGTCGAGATC GATGGCCGCA GCGGCACCCC GGCCTCGTGC ACCACGCCGG TCGCCGACGG TCTCGTCGTC CACACCCAGA CCGAGCGGTT GAAGCAGATC CGCAAAGGCG TGATGGAGCT GTACATCTCC GATCACCCGC TCGACTGCCT AACCTGCTCG GCCAATGGCG ATTGCGAGCT GCAGGACATG GCCGGCGCGG TCGGCCTGCG CGACGTGCGC TATGGATACT CCGGCGAGAA GCATCCCAAT CCAGGGCTCG ACGAGTCGAA TCCGTACTTC ACTTACGACG CGTCGAAGTG CATCGTGTGC TCGCGCTGCG TCCGCGCCTG CGAGGAAGTG CAAGGCACCT TCGCGCTCAC CATCGCAGGC CGCGGCTTCG ACTCTGTCGT GTCGCCCGGC ATGCAGGAGA GTTTCCTCGG TTCGGAATGC GTGTCCTGCG GCGCCTGCGT GCAGGCCTGT CCGACCGCGA CGCTGAACGA GAAGAGCGTG ATCGAGATCG GCACGCCGGA GCGCTCGGTC GTGACCACCT GCGCCTATTG CGGCGTCGGC TGCACCTTCA AGGCCGAGAT GCGCGGCGAA GAGCTGGTGC GCATGGTGCC TTACAAGGAC GGCAAGGCCA ATCGCGGCCA TTCCTGCGTC AAGGGCCGCT TCGCCTGGGG CTACGCCAAT CACAAGGAGC GCATTCTCAA GCCGATGATC CGCGGCAAGA TCACCGAGCC GTGGCGCGAG GTGAGCTGGG ACGAGGCGTT CGCCTATGCG GCGTCCGAAC TGAAGCGGAT CCAGGCCAAA TACGGCCGCG ATTCGATCGG CGGTATCACC TCGTCGCGCT GTACCAATGA AGAAACCTTC CTGGTGCAGA AGTTGATCCG CGCCGGCTTC GGCAACAACA ATGTCGATAC CTGCGCCCGC GTCTGCCACT CGCCGACTGG TTACGGTCTG TCGACCGCGT TCGGCACCTC GGCCGGCACC CAGAATTTCG ACTCGGTCGA GCATGCCGAC GTGGTGATGA TCATCGGCGC CAACCCGACC GACGGCCACC CGGTGTTCGG TTCGCGCCTG AAGAAGCGGC TGCGCCAGGG CGCCAAGCTG ATCGTGGTCG ACCCGCGCCG GATCGATCTG GTGCGCTCGG CCCATATCGA GGCCGCGCAG CATCTGCCGC TGAAGCCCGG CACCAACGTC GCGGTGTTGA CCGCGCTGGC CCACGTCATC GTCACCGAAG GACTCGCCAA CGAAGCCTTC GTCCGCGAGC GCTGCGACTG GAGCGAATAC GAGCACTGGG CGGCGTTCGT CTCGGGCGAG CGGCACAGCC CGGAAGCCAC CGCGGCCTTC ACCGGCGTCG ATCCGCAAGC GCTGCGCGAA GCCGCGCGGC TCTACGCCAC CGGCGGCAAC GGCGCGATTT ATTACGGCCT CGGCGTCACC GAGCACAGCC AGGGTTCGAC CACCGTGATG GCGATCGCCA ACCTCGCGAT GGCCACCGGC AATCTCGGCC GCCCCGGCGT CGGCGTAAAC CCGCTGCGCG GCCAGAACAA CGTGCAGGGC TCGTGCGACA TGGGCTCGTT CCCGCACGAA CTGCCGGGCT ATCGCCACAT CTCGACCGAC 
TCGGTGCGCG AGAGCTTCGA GGCGCTATGG AACGTCAAAC TCGACAACGA GCCGGGCCTA CGTATTCCCA ACATGCTCGA CGCCGCGGTC GAAGGCTCGT TCAAGGCGCT GTACGTGCAG GGCGAGGATA TCCTGCAGTC CGACCCCAAC ACCAAGCACG TCGCCGCCGG TCTCGAAGCG ATGGAGTGCG TCATCGTCCA TGACCTGTTC CTCAACGAGA CCGCCAATTA CGCCCATATC TTGCTGCCGG GTTCGACCTT CCTTGAGAAG AACGGCACCT TCACAAATGC CGAGCGCCGC ATCCAACGTG TCCGCAAGGT GATGTCGCCG AAGAATGGTC TGGAAGACTG GGAAGTCACG CTGCGGCTTG CCGAGGCGGT CGGTTACAGG ATGAACTACA CGCATCCGTC GCAGATCATG GACGAGATTG CGGCGCTGAC GCCGACTTTC GCGGGCGTGT CCTACGACAG GCTGGAAGAA CTCGGCTCGA TCCAGTGGCC GTGCAACGAC AAGGCGCCGG AAGGCACGCC GGTGATGCAC ATCGATCATT TCGTCCGCGG CAAGGGCAAG TTCGTCATCA CCGAATATGT CGCCACCGAC GAGCGCACCG GGCCACGCTT CCCGCTGCTT CTGACGACCG GGCGCATCCT GTCGCAATAC AATGTCGGGG CGCAGACGCG GCGCACCGCC AATACGGTGT GGCACGACGA GGATCGGCTC GAGATCCATC CGCACGACGC CGAGCAGCGC GGCGTGCGGG ACGGCGACTG GGTGCGACTC GCCAGCCGCG CCGGCGAGAC CACGCTGCGC GCGCTGATCA CCGACCGCGT CGCGCCGGGC GTCGTTTATA CGACGTTCCA TCACCCCGAC ACCCAGGCCA ACGTCGTCAC GACCGAGTAC TCGGACTGGG CCACGAACTG CCCGGAATAC AAGGTGACAG CCGTGCAGGT CGCTCCGTCG AACGGCCCCT CGGAGTGGCA GCGCTCCTAC GAGCAGCAGG CGGCGGCCGC GCGGCGGATT GCTGTGCCGG CGGAGGCCGC GGAGTGA
|
Protein sequence | MALVHETDFG TPPSLSENMV TLTIDGRSVS VPEGTSIMRA AMEIGTAIPK LCATDMVDAF GSCRLCLVEI DGRSGTPASC TTPVADGLVV HTQTERLKQI RKGVMELYIS DHPLDCLTCS ANGDCELQDM AGAVGLRDVR YGYSGEKHPN PGLDESNPYF TYDASKCIVC SRCVRACEEV QGTFALTIAG RGFDSVVSPG MQESFLGSEC VSCGACVQAC PTATLNEKSV IEIGTPERSV VTTCAYCGVG CTFKAEMRGE ELVRMVPYKD GKANRGHSCV KGRFAWGYAN HKERILKPMI RGKITEPWRE VSWDEAFAYA ASELKRIQAK YGRDSIGGIT SSRCTNEETF LVQKLIRAGF GNNNVDTCAR VCHSPTGYGL STAFGTSAGT QNFDSVEHAD VVMIIGANPT DGHPVFGSRL KKRLRQGAKL IVVDPRRIDL VRSAHIEAAQ HLPLKPGTNV AVLTALAHVI VTEGLANEAF VRERCDWSEY EHWAAFVSGE RHSPEATAAF TGVDPQALRE AARLYATGGN GAIYYGLGVT EHSQGSTTVM AIANLAMATG NLGRPGVGVN PLRGQNNVQG SCDMGSFPHE LPGYRHISTD SVRESFEALW NVKLDNEPGL RIPNMLDAAV EGSFKALYVQ GEDILQSDPN TKHVAAGLEA MECVIVHDLF LNETANYAHI LLPGSTFLEK NGTFTNAERR IQRVRKVMSP KNGLEDWEVT LRLAEAVGYR MNYTHPSQIM DEIAALTPTF AGVSYDRLEE LGSIQWPCND KAPEGTPVMH IDHFVRGKGK FVITEYVATD ERTGPRFPLL LTTGRILSQY NVGAQTRRTA NTVWHDEDRL EIHPHDAEQR GVRDGDWVRL ASRAGETTLR ALITDRVAPG VVYTTFHHPD TQANVVTTEY SDWATNCPEY KVTAVQVAPS NGPSEWQRSY EQQAAAARRI AVPAEAAE
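As a rough illustration of how the gene and protein sequences above relate, the sketch below translates the first 30 bases of the gene sequence and computes a GC fraction. It is a minimal example, not the pipeline that produced this record. The function names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard set of codon assignments, which translation table 11 is assumed to share (table 11 differs mainly in which codons may act as start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
fragment = "ATGGCTCTGG TCCACGAAAC CGATTTCGGC"
print(translate(fragment))
print(round(gc_fraction(fragment), 3))
```

Applied to the full gene sequence, the same functions could be used to compare the translation with the listed protein sequence and the GC fraction with the reported GC content.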