Gene RPC_3424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3424 
Symbol 
ID3970179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3810682 
End bp3811701 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content62% 
IMG OID637926535 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_533283 
Protein GI90424913 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.127359 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGATCC AGAAAAATTG GCAAGAACTG ATTCGGCCGA ACAAGCTCGT GGTCACGCCG 
GGCTCCGACC CGACGCGTTT TGCGACCCTC GTTGCCGAAC CGCTCGAGCG CGGCTTCGGC
CAGACGCTGG GCAACGCGCT GCGGCGCGTG CTGCTGTCAT CGCTGCAGGG TGCCGCGGTG
CAGTCGGTTC ATATCGACGG CGTGCTGCAT GAGTTCTCCT CGATCGCCGG CGTGCGCGAA
GACGTTACCG ACATCGTGCT GAACATTAAG GACATCTCGA TCAAGATGCA GGGCGAAGGC
CCGAAGCGCA TGGTCGTGAA GAAGCAGGGT CCCGGCACCG TCACCGCCGG CGACATCCAG
ACCGTCGGCG ACGTCGTGGT GCTCAATCCG GACTTGCAGA TCTGCACGCT GGACGAGGGC
GCCGAGATCC GCATGGAATT CACCGTGGCC GGCGGCAAGG GCTACGTCGC CGCCGAGCGC
AACCGTCCCG AGGACGCGCC GATCGGCCTG ATCCCGGTCG ACAGCCTGTT CTCCCCGGTG
CGCAAGGTCT CCTACAAGGT CGAGAACACC CGCGAGGGCC AGATCCTCGA CTACGACAAA
TTGACCATGA CGATCGAGAC CAACGGCGCG ATCTCGCCGG ACGACGCGGT GGCCTATGCC
GCCCGCATCC TGCAGGATCA GCTCAACGTG TTCGTCAACT TCGAAGAGCC GCGCAAGGAA
GTCACCCAGG AGATCATCCC GGATCTGGCG TTCAACCCGG CGTTCCTCAA GAAGGTCGAC
GAGTTGGAGT TGTCGGTGCG TTCGGCGAAC TGCCTGAAGA ACGATAACAT CGTCTATATC
GGCGACCTGG TGCAGAAGTC GGAAGCGGAA ATGCTGCGCA CCCCGAACTT CGGCCGCAAG
TCGCTCAACG AGATCAAGGA AGTGCTGGCC CAGATGGGTC TGCATCTCGG CATGGAAGTG
CCGGGCTGGC CGCCGGAAAA CATCGACGAA TTGGCCAAGC GCTTCGAGGA TCATTACTGA
 
Protein sequence
MTIQKNWQEL IRPNKLVVTP GSDPTRFATL VAEPLERGFG QTLGNALRRV LLSSLQGAAV 
QSVHIDGVLH EFSSIAGVRE DVTDIVLNIK DISIKMQGEG PKRMVVKKQG PGTVTAGDIQ
TVGDVVVLNP DLQICTLDEG AEIRMEFTVA GGKGYVAAER NRPEDAPIGL IPVDSLFSPV
RKVSYKVENT REGQILDYDK LTMTIETNGA ISPDDAVAYA ARILQDQLNV FVNFEEPRKE
VTQEIIPDLA FNPAFLKKVD ELELSVRSAN CLKNDNIVYI GDLVQKSEAE MLRTPNFGRK
SLNEIKEVLA QMGLHLGMEV PGWPPENIDE LAKRFEDHY