Gene RPC_3743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3743 
Symbol 
ID3970338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4168327 
End bp4169307 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content69% 
IMG OID637926853 
Productaminotransferase, class I and II 
Protein accessionYP_533597 
Protein GI90425227 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01140] L-threonine-O-3-phosphate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000372169 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACG GCGGCGATTT GACGGACGCG ATGGCGCGGC ACGGCGGCGC GCCGCAAGCC 
TGGATCGATC TGTCCACCGG CATCAATCCG TGGCCGTGGC CGATTCCCGC GATCGCAGAC
GAGGCCTGGC AGCGGCTGCC GTCGCGTGGC GATGAAGTCG CACTGATCGA CGCCGCGCGC
GCGGCCTATC GCGTGCCCGC GGAGATAGCG ATCGTCGCAG CCTCCGGCAC CCAGGCGCTG
ATCCAGTGGC TGCCGCATCT TGCTGCGCCC GGCGCGGTCG CGATTGTTGG CCCGACCTAC
AGCGAGCACG CGAGCGCGTG GCGCAACGCC GGCCGCGAGG TGATCGCCAT TGATGATGCT
TGCGCATTGC CGAGCAGCGC GCGCCATGCG GTGATCGTCA ACCCGAACAA TCCCGACGGC
CGCGTCGTCG ATCGCGCGCA GCTCGCGGAT GTCGCCGCCG TGCTGCAGGC CCGCGGCGGC
TGGCTGGTGA TCGACGAAGC CTTTGCCGAC GTGACACCTG ACATCAGCGC GACGGCGCTG
TGCGCGGCGT TGCCGATCGT GATCCTGCGC TCGTTCGGCA AGTTCTATGG CCTCGCCGGA
TTGCGGCTCG GCTTCGCGCT GGCCGCACCA TCGATCGCCG ATCGCATCGA AGCTGCGATC
GGGCCGTGGT GCTGCTCCGG ACCGGCGCTG CGGATCGGCG CTGCCGCGCT GCGCGACCGG
GCCTGGGCAG ACGCGACGCG CACCGCCTTG ACGCAACAAG CCATACGTCT CGATGCGGTG
CTGAACAAAG CCGGGCTTAA CGTCGTCGGC GGCACCGCGC TGTATCGATT GACCCGGCAT
CGCGAGGCGT TGCGGATCCA CGATGGCTTG GCGCGACAAC AGATCTGGTG CCGCCGCTTC
GATTGGGCCG ACGATCTGCT GCGGTTCGGC CTGCCTCCGG ACGAGGCCGC ACTGGATCGG
CTGGCGGCTG CGCTGGGATA G
 
Protein sequence
MKHGGDLTDA MARHGGAPQA WIDLSTGINP WPWPIPAIAD EAWQRLPSRG DEVALIDAAR 
AAYRVPAEIA IVAASGTQAL IQWLPHLAAP GAVAIVGPTY SEHASAWRNA GREVIAIDDA
CALPSSARHA VIVNPNNPDG RVVDRAQLAD VAAVLQARGG WLVIDEAFAD VTPDISATAL
CAALPIVILR SFGKFYGLAG LRLGFALAAP SIADRIEAAI GPWCCSGPAL RIGAAALRDR
AWADATRTAL TQQAIRLDAV LNKAGLNVVG GTALYRLTRH REALRIHDGL ARQQIWCRRF
DWADDLLRFG LPPDEAALDR LAAALG