Gene Information |
Locus tag | RPC_4645 |
Symbol | |
ID | 3972234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 5193842 |
End bp | 5196688 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637927757 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_534486 |
Protein GI | 90426116 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
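The length fields above can be cross-checked against the coordinates with simple arithmetic. The sketch below is a minimal illustration, not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS length includes the stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_len = gene_length_bp(5193842, 5196688)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)
```

Under these assumptions the results agree with the Gene Length (2847 bp) and Protein Length (948 aa) fields.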
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTGA TTCACGAAAT CGATTTCGGC ACGCCGCGCT CCACCTCAAC GACCATGGTG ACGCTGTCGA TCGACGGCCA GAGCGTCACG GTGCCGGAAG GCACCTCGAT CATGCGCGCG GCGATGCAGA TCGGGACGCA AATTCCGAAA CTCTGCGCCA CCGACATGAT GGACGCGTTC GGCTCCTGCC GGCTCTGCCT CGTCGAAGTC GAGGGACGCA ATGGCACGCC GGCGTCCTGC ACCACGCCGG TGGCGGAAGG AATTTCCGTC CGCACCCAGA CCGAGCGGCT GAAGGCGCTG CGCAAGGGCG TGATGGAGCT CTACATCTCC GATCATCCGC TGGACTGCCT GACCTGTTCG GCCAATGGCG ATTGCGAATT GCAGGACATG GCGGGCGCGG TCGGCCTGCG CGACGTGCGC TACGGCTACA AGGGTGAGAC CCATCCCAAT CCGGGCGTCG ACGACTCCAA TCCCTATTTC ACCTTTGACG CCTCGAAATG CATCGTCTGC TCGCGCTGCG TGCGCGCCTG TGAGGACGTG CAGGGCACCT TCGCGTTGAC CATCGCCGGC CGCGGCTTCG GCTCGGTGGT GTCTGCCGGG ATGCAGGAAA GCTTCCTCGG CTCGGAATGC GTGTCCTGCG GCGCCTGCGT GCAGGCCTGC CCGACCGCGA CGCTGAACGA AAAGTCGATC ATCGAGATCG GCACCCCGGA ACATTCGGTG GTGACGACCT GCGCCTATTG CGGCGTCGGC TGCTCGTTCA AGGCGGAGAT GCGCGGCGAG GAAGTGGTGC GCATGCTGCC CTACAAGGAC GGCAAGGCCA ATCGCGGCCA TTCCTGCGTC AAAGGCCGCT TCGCCTGGGG CTACGCCACC CACAAGGAAC GCATCCTCAA CCCGATGATC CGCGACAAGA TCACCGACCC GTGGCGCGAA GTGTCGTGGG ACGAGGCGTT CTCCTACGCC GCGTCGGAGT TCAAGCGCAT CCAGGCCAAA TACGGCCGCG ACTCGGTCGG CGGCATCACC TCGTCGCGCT GCACCAATGA AGAGACTTTC CTGGTGCAGA AGCTGATCCG CGCCGGCTTC GGCAACAACA ACGTCGATAC TTGCGCGCGG GTCTGCCACT CGCCGACCGG CTACGGGCTG TCCACCACCT TCGGCACCTC GGCGGGCACG CAGGATTTCG ACTCCGTCGA GCACACCGAC GTGGTGCTGA TCATCGGCGC CAATCCGACC GACGGCCATC CGGTGTTCGC CTCGCGCTTG AAGAAGCGGT TGCGCGCCGG CGCCAAGCTG ATCGTGGTCG ATCCGCGCCG CACCGACATC GTGCGCTCGG CGCGGGTCGA GGCGGCGCAT CACCTGCCGC TGCAGCCCGG CACCAACGTC GCGGTGCTGA CCGCGATGGC GCATGTCATC GTCACCGAAG GGCTGGTCGA CGAGGCGTTT GTGCGCGAGC GCTGCGACTG GGACGAATAT GAGGGCTGGG CGAGTTTCGT GGCGCTGCCG CAGAACAGCC CGGAGCAGAC GCAAAGTGCG ACCGGCGTCG ATCCGACCGA ATTGCGGCAG GCGGCGCGAC TGTTCGCCAC CGGCGGCAAC GGCGCGATCT ATTACGGGCT CGGCGTCACC GAGCACAGCC AAGGCTCGAC CACCGTGATG GCGATCGCCA ATTTGGCGAT GGCCACCGGC AATATCGGCC GGCCGGGCGT CGGCGTGAAC CCGCTGCGCG GCCAGAACAA CGTGCAGGGC GCCTGCGACA TGGGCTCGTT CCCGCACGAA TTGCCGGGCT ATCGGCATAT CGGCACCGAT 
TCGGTGCGTG AGAGTTTCGA GGCGCTGTGG GGCGTGCATC TCAACAAGGA GCCGGGCCTG CGGATTCCGA ACATGCTGGA TGCCGCGGTC GACGGTTCGT TCAAGGCGCT CTACGTGCAG GGCGAAGACA TCCTGCAATC CGACCCCAAC ACCAAGCACG TCGCCGCCGG GCTTGAGGCG ATGGAATGCG TCATCGTCCA CGACCTGTTC CTCAACGAGA CCGCGAACTA CGCGCATATC TTCCTGCCGG GCTCCACCTT CCTGGAGAAG AACGGCACCT TCACCAATGC CGAGCGCCGC ATCCAGCGGG TGCGCAAGGT GATGACACCG CGCAACGGCT TGGAGGACTG GCAGGTCACG CTGGGGCTGG CCAAGGCGAT GGGCTATCCG ATGAGCTATG AGCATCCCTC GCAGATCATG GATGAGATCG CGGCGTTGAC GCCGACCTTC GCCGGCGTGT CCTACGCCAA GCTCGACGAA CTCGGCTCGA TCCAGTGGCC GTGCAACGAC AACGCGCCGG AGGGCACCCC GGTGATGCAC ATCGATCACT TCGTCCGCGG CAAGGGCAAA TTCGTCATCA CCGAATATGT CGCCACCGAC GAGCGCACCG GGCCGCGCTA TCCGCTATTG CTGACCACCG GGCGAATCCT CAGCCAGTAC AATGTCGGCG CGCAGACCAG GCGCACCGCC AATGTGGTGT GGCACGACGA GGATCGGCTC GAGATCCATC CGCACGACGC CGAGCAGCGC GGAGTGAGGG ATGGCGATTG GGTGCGGCTG GCGAGCCGCG CCGGCGAAAC CACGCTGCGC GCGCTGATCA CCGAGCGCGT CGCGCCGGGC GTGGTCTATA CCACCTTCCA CCATCCGGAT ACGCAGGCGA ACGTGATCAC CACCGACTAT TCCGACTGGG CAACCAATTG CCCGGAATAC AAGGTCACCG CGGTGCAGGT GGCGCCGTCG AACGGCCCGT CGGAATGGCA GAAGGCCTAT GACGAGCAGG CCCGCCAGGC GCGCCGCATC GCCTCCACCA TCGAAGCCGC GGAGTAA
|
Protein sequence | MSLIHEIDFG TPRSTSTTMV TLSIDGQSVT VPEGTSIMRA AMQIGTQIPK LCATDMMDAF GSCRLCLVEV EGRNGTPASC TTPVAEGISV RTQTERLKAL RKGVMELYIS DHPLDCLTCS ANGDCELQDM AGAVGLRDVR YGYKGETHPN PGVDDSNPYF TFDASKCIVC SRCVRACEDV QGTFALTIAG RGFGSVVSAG MQESFLGSEC VSCGACVQAC PTATLNEKSI IEIGTPEHSV VTTCAYCGVG CSFKAEMRGE EVVRMLPYKD GKANRGHSCV KGRFAWGYAT HKERILNPMI RDKITDPWRE VSWDEAFSYA ASEFKRIQAK YGRDSVGGIT SSRCTNEETF LVQKLIRAGF GNNNVDTCAR VCHSPTGYGL STTFGTSAGT QDFDSVEHTD VVLIIGANPT DGHPVFASRL KKRLRAGAKL IVVDPRRTDI VRSARVEAAH HLPLQPGTNV AVLTAMAHVI VTEGLVDEAF VRERCDWDEY EGWASFVALP QNSPEQTQSA TGVDPTELRQ AARLFATGGN GAIYYGLGVT EHSQGSTTVM AIANLAMATG NIGRPGVGVN PLRGQNNVQG ACDMGSFPHE LPGYRHIGTD SVRESFEALW GVHLNKEPGL RIPNMLDAAV DGSFKALYVQ GEDILQSDPN TKHVAAGLEA MECVIVHDLF LNETANYAHI FLPGSTFLEK NGTFTNAERR IQRVRKVMTP RNGLEDWQVT LGLAKAMGYP MSYEHPSQIM DEIAALTPTF AGVSYAKLDE LGSIQWPCND NAPEGTPVMH IDHFVRGKGK FVITEYVATD ERTGPRYPLL LTTGRILSQY NVGAQTRRTA NVVWHDEDRL EIHPHDAEQR GVRDGDWVRL ASRAGETTLR ALITERVAPG VVYTTFHHPD TQANVITTDY SDWATNCPEY KVTAVQVAPS NGPSEWQKAY DEQARQARRI ASTIEAAE
|
| |
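The gene sequence is displayed in space-separated blocks of 10 bases, so it needs cleaning before any downstream use. The following is a minimal sketch of that step plus a few sanity checks. The helper names are hypothetical, and the short input is a toy fragment stitched from the first two blocks and the final seven bases of the record, used only to keep the example small. The stop codon set is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def check_cds(seq: str) -> dict:
    """Basic consistency checks for an unspliced coding sequence."""
    return {
        "length_bp": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }


# Toy fragment (not a real gene): first two blocks plus the last 7 bases.
toy = clean_sequence("ATGTCGCTGA TTCACGAAAT GGAGTAA")
report = check_cds(toy)
print(report)
```

Applied to the full gene sequence, the same checks would be expected to report a length matching the Gene Length field, assuming the displayed sequence is complete.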