Gene Information |
Locus tag | RPB_0724 |
Symbol | |
ID | 3910020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 812186 |
End bp | 815032 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882616 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_484346 |
Protein GI | 86747850 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
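The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions, not statements made by the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 812186
end_bp = 815032
gene_length_bp = 2847
protein_length_aa = 948

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and does not code for a residue.
residues = codons - 1

print(f"span = {span} bp, codons = {codons}, residues = {residues}")
print("gene length consistent:   ", span == gene_length_bp)
print("protein length consistent:", residues == protein_length_aa)
```

Under these assumptions the span works out to 2847 bp, that is 949 codons, or 948 residues plus one stop codon, which agrees with the listed Gene Length and Protein Length.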
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.696908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.792249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCTGG TTCACGAAAC CGATTTCGGC ACGCCGCGCT CGCCGTCCGA GACGATGGTG ACGCTGACGA TCGACGGCCG CAACGTCAGC GTGCCCGAGG GCACCTCGAT CATGCGCGCC GCGATGGAGA TCGGCACCGC GATTCCGAAA CTCTGCGCCA CCGACATGGT CGACGCGTTC GGCTCGTGCC GGCTGTGCCT GGTCGAGATC GACGGCCGCA GCGGCACGCC GGCGTCCTGC ACCACGCCGG TCGCCGACGG GCTGGTGGTG AAGACCCAGA CCGCGCGGCT GAAGCAGATC CGCAAAGGCG TGATGGAGCT GTATATCTCC GACCACCCGC TCGACTGCCT GACCTGCTCG GCCAATGGCG ATTGCGAATT GCAGGACATG GCCGGCGCGG TCGGCCTGCG CGACGTGCGC TATGGCTATA GCGGCGAGAA GCACCCGAAT CCGGGGCTCG ATGAGTCCAA CCCTTATTTC ACCTACGATC CGTCGAAATG CATCGTCTGC TCGCGCTGCG TCCGCGCGTG TGAAGAAGTG CAAGGCACCT TCGCGCTGAC CATCGCCGGT CGCGGCTTCG ACTCGGTCGT CTCACCGGGC ATGCAGGAGA GCTTCCTCGG CTCGGAATGC GTCTCCTGCG GCGCCTGCGT GCAGGCCTGC CCGACCGCGA CGCTGAACGA GAAGAGCGTG ATCGAGATCG GCACGCCGGA GCGCTCGGTG GTGACGACCT GCGCGTATTG CGGCGTCGGC TGCACCTTCA AGGCCGAGAT GCGCGGCGAG GAAGTCGTCC GCATGGTGCC CTTCAAAGAC GGCAAGGCCA ATCGCGGCCA TTCCTGCGTC AAGGGCCGCT TTGCCTGGGG CTACGCCAAC CACAAGGAAC GCATCCTCAA CCCGATGATC CGCGCCAGCA TCAGCGAGCC GTGGCGCGAA GTGAGCTGGG ACGAAGCGTT TGCTTACGCA GCTTCGGAGT TGAAGCGGAT CCAGGCCACA TACGGCCGCG ATTCGATCGG CGGCATCACC TCGTCGCGCT GCACCAACGA AGAAACCTTC CTGGTGCAGA AGCTGATCCG CGCCGGCTTC GGCAACAACA ATGTCGACAC CTGTGCCCGG GTCTGCCATT CGCCGACCGG CTACGGCCTC TCCACCGCAT TCGGCACCTC GGCCGGCACC CAGGATTTCG ACTCGGTCGA GCACACCGAC GTGGTGATGC TGATCGGCGC CAATCCGACC GACGGCCACC CGGTGTTCGC CTCGCGGCTG AAAAAGCGGC TGCGCGCCGG CGCCAAGCTG ATCGTGGTCG ATCCGCGCCG GATCGATCTG GTGCGCTCGG CGCATGTCGA GGCGGCGCAG CATCTGCCGC TGAAGCCCGG CACCAACGTC GCGGTGCTGA CGTCGATCGC CCATGTGATC GTCACCGAGG GACTCACCCA CGAGGCCTTC GTGCGCGAGC GCTGCGACTG GAGCGAATAC GAGCACTGGG CGGCGTTCGT GGCGCAGCCG GCCAATAGTC CGGAAGCGAC CAGCGCGATG ACCGGCGTCG ATCCGCAGGC GTTGCGCGAG GCGGCAAGGC TGTACGCCAC CGGCGGCAAC GGCGCGATCT ATTACGGCCT CGGCGTCACC GAGCACAGCC AGGGCTCGAC CACCGTGATG GCGATCGCCA ACCTCGCAAT GGCCACCGGC AATCTCGGCC GGCCCGGCGT CGGGGTGAAC CCGCTGCGCG GCCAGAACAA TGTGCAGGGC GCCTGCGACA TGGGCTCGTT CCCGCACGAA CTGCCGGGCT ATCGCCACAT CTCCAGCGAC 
GCGGTGCGCG AGAGCTTCGA GGCGCTGTGG GGCGTGACGC TGAACAGCGA GCCGGGGCTG CGCATCCCCA ACATGCTCGA CGCCGCGGTC GATGGTTCGT TCAAGGCGCT CTACGTGCAG GGCGAGGACA TCCTGCAATC CGACCCCAAC ACCAGACACG TCGCCGCCGG GCTCGAAGCG ATGGAATGCG TCATCGTGCA CGATCTGTTT CTCAACGAGA CCGCGAACTA CGCGCATATT TTCCTGCCCG GCTCGACCTT CCTGGAGAAG AACGGCACCT TCACCAATGC GGAGCGCCGC ATCCAGCGCG TCCGCAAGGT GATGACGCCG CGCAATGGGC AGGAGGACTG GGAAGTCACG CAGCGGCTCG CCAATGCGAT GGGCTTTTCG ATGAGCTACG AGCATCCGTC GCAGATCATG GACGAGATCG CGGCGCTGAC GCCGACCTTT GCGGGCGTGT CCTACGACCG GCTCGAGCAG CTCGGCTCGA TCCAATGGCC GTGCAACGAG CGCGCGCCGG ACGGCACGCC GGTGATGCAC ATCGACGCTT TCGTCCGCGG CAAGGGCAAG TTCGTCATCA CCGAATATGT CGCCACCGAC GAGCGCACCG GGCCACGCTT CCCGCTGCTG CTGACCACCG GCCGCATCCT GTCGCAGTAC AATGTCGGCG CGCAGACGCG ACGGACCGCC AATACGGTGT GGCACGACGA GGACCGTCTC GAAATCCATC CGCACGATGC CGAGCAGCGC GGCGTCAGGG ATGGCGATTG GGTGCGACTC GCCAGCCGCG CCGGCGAGAC CACGCTGCGC GCGCTGATCA CCGATCGGGT CGCGCCGGGC GTGGTCTACA CCACGTTCCA CCACCCGGAC ACGCAGGCCA ACGTCGTGAC GACCGAGTAT TCCGACTGGG CCACCAACTG CCCGGAATAC AAAGTGACGG CGGTGCAGGT GACGCCGTCC AATGGTCCAT CGGAATGGCA GCGCGACTAT GCCCAGCAGG CGGCCGCCGC TCGCCGGATC GCGACGCCTG CAGAGGCCGC AGAATGA
|
Protein sequence | MALVHETDFG TPRSPSETMV TLTIDGRNVS VPEGTSIMRA AMEIGTAIPK LCATDMVDAF GSCRLCLVEI DGRSGTPASC TTPVADGLVV KTQTARLKQI RKGVMELYIS DHPLDCLTCS ANGDCELQDM AGAVGLRDVR YGYSGEKHPN PGLDESNPYF TYDPSKCIVC SRCVRACEEV QGTFALTIAG RGFDSVVSPG MQESFLGSEC VSCGACVQAC PTATLNEKSV IEIGTPERSV VTTCAYCGVG CTFKAEMRGE EVVRMVPFKD GKANRGHSCV KGRFAWGYAN HKERILNPMI RASISEPWRE VSWDEAFAYA ASELKRIQAT YGRDSIGGIT SSRCTNEETF LVQKLIRAGF GNNNVDTCAR VCHSPTGYGL STAFGTSAGT QDFDSVEHTD VVMLIGANPT DGHPVFASRL KKRLRAGAKL IVVDPRRIDL VRSAHVEAAQ HLPLKPGTNV AVLTSIAHVI VTEGLTHEAF VRERCDWSEY EHWAAFVAQP ANSPEATSAM TGVDPQALRE AARLYATGGN GAIYYGLGVT EHSQGSTTVM AIANLAMATG NLGRPGVGVN PLRGQNNVQG ACDMGSFPHE LPGYRHISSD AVRESFEALW GVTLNSEPGL RIPNMLDAAV DGSFKALYVQ GEDILQSDPN TRHVAAGLEA MECVIVHDLF LNETANYAHI FLPGSTFLEK NGTFTNAERR IQRVRKVMTP RNGQEDWEVT QRLANAMGFS MSYEHPSQIM DEIAALTPTF AGVSYDRLEQ LGSIQWPCNE RAPDGTPVMH IDAFVRGKGK FVITEYVATD ERTGPRFPLL LTTGRILSQY NVGAQTRRTA NTVWHDEDRL EIHPHDAEQR GVRDGDWVRL ASRAGETTLR ALITDRVAPG VVYTTFHHPD TQANVVTTEY SDWATNCPEY KVTAVQVTPS NGPSEWQRDY AQQAAAARRI ATPAEAAE
|
| |
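As a rough illustration of how the Gene sequence and Protein sequence fields relate, the sketch below translates the first 30 and the last 12 nucleotides of the gene sequence, using the amino acid assignments of translation table 11 (which are the same as those of the standard code). It assumes the gene sequence is shown in the coding orientation, which is suggested by its ATG start and TGA end but is not stated in the record. The helper name `translate` is hypothetical and not part of any library.

```python
from itertools import product

# Amino acid assignments for translation table 11, with codons ordered
# by base in the order T, C, A, G at each position; '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups and the final 12 bases of the gene sequence.
head = translate("ATGGCTCTGG TTCACGAAAC CGATTTCGGC")
tail = translate("GCCGC AGAATGA")
print(head)  # matches the first 10 residues of the protein sequence
print(tail)  # matches the last 3 residues; the TGA stop is not emitted
```

The same function could be applied to the full gene sequence once its two lines are joined. A library such as Biopython would normally be preferable for real work, since it also handles ambiguous bases and alternative start codons.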