Gene Information |
Locus tag | RPD_3757 |
Symbol | |
ID | 4024273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4194194 |
End bp | 4195132 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637963961 |
Product | chlorophyll synthesis pathway protein BchC |
Protein accession | YP_570879 |
Protein GI | 91978220 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
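The coordinate and length fields above can be checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (RPD_3757).
length_bp = gene_length(4194194, 4195132)
print(length_bp)                  # 939
print(protein_length(length_bp))  # 312
```

Both results agree with the Gene Length (939 bp) and Protein Length (312 aa) fields of the record.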
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00485804 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATACCA TCGCGGTCGT CCTCAAGCAG CCGCAACACG TCGAACTCAG TCGCCTGGCT CTCACCCCGC CGACTGCTGA CGACGTTGTT GTCGATGTTG CATGGAGCGG TGTCAGCACC GGTACCGAGC GGCTGTTGTG GTCCGGCCGG ATGCCGGCGT TCCCCGGAAT GGGGTACCCG CTTGTGCCTG GCTATGAGTC CGTCGGCGAA GTGGTCGAGG CCGGATCGGC AACCGATCTG AAGCCCGGCC AAATGGTGTT CGTGCCCGGG GCGAAATGCT TCGGCGAAGT CCGCGGTCTG TTCGGCGCCT CGGCGTCGCG TCTCGTCGTT CCGGCCAAGC GCGTGGTGCC GCTCGATCAG CAGCTCGGTG AGCGCGGCAT CCTGATCGCG CTCGCCGCCA CTGCCTATCA CGCCATTGCC GCCCGCAATG CTGCGCCGCC GGACTGCATC GTCGGTCACG GCGTGCTCGG ACGCCTGCTG GCGCGGATCT CGATCGCGCT CGGCAATCCG CCGCCGGTGG TGTGGGAGAA GAACCCGATC CGCAGCGGCG GGGCCGATGG CTACGCCGTG GTCGATCCCG AGACCGACGA GCGCCGCGAC TACAAGAGCA TCTACGATGT CAGCGGCGAT CCAAAATTGC TCGATTCTTT GATCTGTCGC ATCGCGGCGA CCGGAGAGAT CGTGCTCGCT GGCTTCTACA GCGAGCCTTT GTCGTTCGCG TTCCCTCCGG CCTTCATGCG CGAAGCCCGG ATCCGGATCG CGGCGGAATG GCAGCCGGCG GATATCGGCG CCACCAAGGC GCTGATCGAC ACCGGCAAGC TCTCGCTGGA CGGATTGATT ACTCACCATC AGGAAGCGGC TTCCGCGCCT GACGCCTACC GCATCGCGTT CGAAGATCCC GCCTGCCTCA AAATGGTTCT GAACTGGAGA TTGTCCTGA
|
Protein sequence | MDTIAVVLKQ PQHVELSRLA LTPPTADDVV VDVAWSGVST GTERLLWSGR MPAFPGMGYP LVPGYESVGE VVEAGSATDL KPGQMVFVPG AKCFGEVRGL FGASASRLVV PAKRVVPLDQ QLGERGILIA LAATAYHAIA ARNAAPPDCI VGHGVLGRLL ARISIALGNP PPVVWEKNPI RSGGADGYAV VDPETDERRD YKSIYDVSGD PKLLDSLICR IAATGEIVLA GFYSEPLSFA FPPAFMREAR IRIAAEWQPA DIGATKALID TGKLSLDGLI THHQEAASAP DAYRIAFEDP ACLKMVLNWR LS
|
| |
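The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in the coding orientation even though the gene lies on the minus strand. The sketch below shows one way to compute a GC percentage and to translate such a sequence, assuming the codon assignments of translation table 11, which match the standard code for all 64 codons. The names `clean`, `gc_content`, and `translate` are hypothetical helpers introduced for illustration; a library such as Biopython could be used instead.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# these same amino-acid assignments; it differs from the standard code only
# in which codons may serve as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 15 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGATACCA TCGCG"))  # MDTIA
print(translate("TTGTCCTGA"))         # LS
```

The two fragments reproduce the first five residues (MDTIA) and the last two residues (LS) of the protein sequence listed above. Passing the full gene sequence to `gc_content` should give a value comparable to the GC content field, though that has not been recomputed here.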