Gene RPB_4002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4002 
Symbol 
ID3911809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4568688 
End bp4569626 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content65% 
IMG OID637885906 
Productchlorophyll synthesis pathway protein BchC 
Protein accessionYP_487606 
Protein GI86751110 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.232331 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000580255 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGACACCA TCGCGGTCGT ACTCAAGCAG CCACAACACG TCGAACTCAG TCGCCTGGCC 
CTCACGGCCC CGACTGCGGA TGACGTTGTC GTTGATGTTG CCTGGAGCGG GGTCAGCACC
GGTACCGAGC GGCTGTTGTG GTCCGGCCGG ATGCCGGCGT TCCCCGGAAT GGGGTACCCG
CTGGTGCCGG GATATGAGTC GGTGGGCGAA GTGGTCGAGG CCGGATCGGC GACCGATCTG
CAGCCCGGCC AGATGGTCTT CGTACCCGGC GCAAAGTGTT TCGGCGAAGT CCGCGGTCTG
TTCGGAGCCT CCGCATCGCG GCTGGTCGTG CCGGCCAAAC GCGTCGTGCC GCTGGATCAG
CAACTCGGCG AGCGCGGTAT CCTGATAGCT CTTGCTGCCA CCGCCTATCA CGCGATTGCC
GCGCGCCATG CGACGCCGCC GGACTGCATC GTCGGTCACG GCGTGCTCGG CCGCCTGCTG
GCGCGGATTT CGATCGCGCT CGGCAATCCG CCGCCGGTGG TGTGGGAGAA GAACCCGATC
CGCAGCGGCG GCGCCGTTGG CTACGAAGTG ATCGACCCCG AGGCCGACCA GCGTCGCGAC
TACAAAAGCA TCTACGACGT CAGCGGCGAT CCGAAGCTGC TCGATTCTTT GATCTGCCGC
ATCGCGTCGA CCGGCGAGAT CGTGCTCGCT GGCTTCTACA GCGAGCCGCT GTCGTTCGCG
TTCCCGCCGG CCTTCATGCG CGAAGCCCGG ATCCGGATCG CAGCGGAATG GCAACCGGCG
GACATCGGCG CCACCAAGGC GCTGATCGAT TCCGGCAAGC TCTCGCTCGA CGGACTGATT
ACGCATCATC AGGAAGCGGC TTCCGCACCT GATGCCTATC GCATCGCCTT CGAAGATCCC
GCCTGCCTCA AGATGGTTCT GAACTGGAGA TTGAGCTGA
 
Protein sequence
MDTIAVVLKQ PQHVELSRLA LTAPTADDVV VDVAWSGVST GTERLLWSGR MPAFPGMGYP 
LVPGYESVGE VVEAGSATDL QPGQMVFVPG AKCFGEVRGL FGASASRLVV PAKRVVPLDQ
QLGERGILIA LAATAYHAIA ARHATPPDCI VGHGVLGRLL ARISIALGNP PPVVWEKNPI
RSGGAVGYEV IDPEADQRRD YKSIYDVSGD PKLLDSLICR IASTGEIVLA GFYSEPLSFA
FPPAFMREAR IRIAAEWQPA DIGATKALID SGKLSLDGLI THHQEAASAP DAYRIAFEDP
ACLKMVLNWR LS