Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4002 |
Symbol | |
ID | 3911809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4568688 |
End bp | 4569626 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885906 |
Product | chlorophyll synthesis pathway protein BchC |
Protein accession | YP_487606 |
Protein GI | 86751110 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.232331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000580255 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGACACCA TCGCGGTCGT ACTCAAGCAG CCACAACACG TCGAACTCAG TCGCCTGGCC CTCACGGCCC CGACTGCGGA TGACGTTGTC GTTGATGTTG CCTGGAGCGG GGTCAGCACC GGTACCGAGC GGCTGTTGTG GTCCGGCCGG ATGCCGGCGT TCCCCGGAAT GGGGTACCCG CTGGTGCCGG GATATGAGTC GGTGGGCGAA GTGGTCGAGG CCGGATCGGC GACCGATCTG CAGCCCGGCC AGATGGTCTT CGTACCCGGC GCAAAGTGTT TCGGCGAAGT CCGCGGTCTG TTCGGAGCCT CCGCATCGCG GCTGGTCGTG CCGGCCAAAC GCGTCGTGCC GCTGGATCAG CAACTCGGCG AGCGCGGTAT CCTGATAGCT CTTGCTGCCA CCGCCTATCA CGCGATTGCC GCGCGCCATG CGACGCCGCC GGACTGCATC GTCGGTCACG GCGTGCTCGG CCGCCTGCTG GCGCGGATTT CGATCGCGCT CGGCAATCCG CCGCCGGTGG TGTGGGAGAA GAACCCGATC CGCAGCGGCG GCGCCGTTGG CTACGAAGTG ATCGACCCCG AGGCCGACCA GCGTCGCGAC TACAAAAGCA TCTACGACGT CAGCGGCGAT CCGAAGCTGC TCGATTCTTT GATCTGCCGC ATCGCGTCGA CCGGCGAGAT CGTGCTCGCT GGCTTCTACA GCGAGCCGCT GTCGTTCGCG TTCCCGCCGG CCTTCATGCG CGAAGCCCGG ATCCGGATCG CAGCGGAATG GCAACCGGCG GACATCGGCG CCACCAAGGC GCTGATCGAT TCCGGCAAGC TCTCGCTCGA CGGACTGATT ACGCATCATC AGGAAGCGGC TTCCGCACCT GATGCCTATC GCATCGCCTT CGAAGATCCC GCCTGCCTCA AGATGGTTCT GAACTGGAGA TTGAGCTGA
|
Protein sequence | MDTIAVVLKQ PQHVELSRLA LTAPTADDVV VDVAWSGVST GTERLLWSGR MPAFPGMGYP LVPGYESVGE VVEAGSATDL QPGQMVFVPG AKCFGEVRGL FGASASRLVV PAKRVVPLDQ QLGERGILIA LAATAYHAIA ARHATPPDCI VGHGVLGRLL ARISIALGNP PPVVWEKNPI RSGGAVGYEV IDPEADQRRD YKSIYDVSGD PKLLDSLICR IASTGEIVLA GFYSEPLSFA FPPAFMREAR IRIAAEWQPA DIGATKALID SGKLSLDGLI THHQEAASAP DAYRIAFEDP ACLKMVLNWR LS
|
| |