Gene RPD_3160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3160 
Symbol 
ID4023665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3512678 
End bp3513697 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content63% 
IMG OID637963361 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_570287 
Protein GI91977628 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.277203 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.200349 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGATCC AGAAAAATTG GCAAGAACTG ATTCGACCGA ACAAGCTTCA GGTTTCGCCG 
GGCAGTGATG CGACGCGGTT CGCGACGCTG GTCGCCGAGC CGCTCGAGCG CGGCTTCGGC
CAGACGCTGG GCAACGCGCT GCGGCGCGTG TTGCTGTCCT CGCTGCAGGG CGCTGCGGTG
CAGTCGGTGC AGATCGACGG CGTGCTGCAC GAGTTCTCCT CGATTGCCGG CGTGCGCGAG
GACGTCACCG ACATCGTGCT GAACATCAAG GACATCTCGC TGAAGATGCA GGGCGAAGGC
CCGAAGCGGA TGGTCGTCAA GAAGCAGGGT CCGGGCGTCG TCACCGCCGG CGACATCCAG
ACCGTCGGCG ATATCGTCGT GCTGAACCCC GACCTGCAGA TCTGCACCCT GGACGAGGGC
GCGGAGATCC GCATGGAGTT CACCGTCAAC ACCGGCAAGG GCTACGTCGC CGCCGAGCGT
AACCGTCCCG AGGACGCGCC GATCGGCCTG ATCCCGGTCG ACAGCCTGTA CTCGCCGGTT
CGCAAGGTGT CGTACAAGGT CGAGAACACC CGCGAGGGCC AGATCCTCGA CTACGACAAG
CTGACCATGA CGGTCGAGAC CAACGGCGCG CTGACGCCGG ATGACGCGGT GGCCTTCGCC
GCCCGCATCC TGCAGGATCA GCTCAACGTC TTCGTCAACT TCGAAGAGCC GCGCAAGGAA
GTCACCCAGG AGATCATTCC GGATCTCGCC TTCAACCCGG CTTTCCTCAA GAAGGTGGAC
GAGCTCGAGC TGTCGGTGCG TTCGGCGAAC TGCCTGAAGA ACGACAACAT CGTCTATATC
GGCGATCTGG TGCAGAAGTC GGAAGCAGAG ATGCTGCGCA CTCCGAACTT CGGCCGCAAG
TCGCTGAACG AGATCAAGGA AGTGCTGGCG CAGATGGGCC TGCATCTCGG CATGGAAGTG
CCGGGCTGGC CGCCGGAGAA CATCGACGAG CTCGCCAAGC GTTTCGAAGA TCACTACTGA
 
Protein sequence
MTIQKNWQEL IRPNKLQVSP GSDATRFATL VAEPLERGFG QTLGNALRRV LLSSLQGAAV 
QSVQIDGVLH EFSSIAGVRE DVTDIVLNIK DISLKMQGEG PKRMVVKKQG PGVVTAGDIQ
TVGDIVVLNP DLQICTLDEG AEIRMEFTVN TGKGYVAAER NRPEDAPIGL IPVDSLYSPV
RKVSYKVENT REGQILDYDK LTMTVETNGA LTPDDAVAFA ARILQDQLNV FVNFEEPRKE
VTQEIIPDLA FNPAFLKKVD ELELSVRSAN CLKNDNIVYI GDLVQKSEAE MLRTPNFGRK
SLNEIKEVLA QMGLHLGMEV PGWPPENIDE LAKRFEDHY