Gene Rpal_3642 details

Gene Information

Locus tag: Rpal_3642
Symbol:
ID: 6411318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3902350
End bp: 3903369
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 64%
IMG OID: 642713522
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_001992617
Protein GI: 192292012
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
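
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3902350
end_bp = 3903369

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS encodes one residue per codon,
# and the final stop codon is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1020 339
```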


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.236787
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGATCC AGAAAAATTG GCAGGAATTG ATTCGGCCGA ACAAGCTCCA GGTCACCCCC 
GGGAGCGATG CCACCCGGTT CGCGACGCTG GTCGCCGAGC CGCTCGAGCG CGGCTTCGGT
CAGACGCTCG GCAACGCGCT GCGTCGCGTG CTGTTGTCGT CGCTGCAGGG CGCGGCGGTG
CAGTCGGTCC ACATCGACGG CGTTCTGCAC GAGTTCTCCT CGATCGCCGG CGTGCGCGAG
GACGTCACCG ACATCGTGCT GAACATCAAG GACATCTCGC TGAAGATGCA GGGCGAAGGC
CCGAAGCGGA TGGTCGTGAA GAAGCAGGGT CCGGGTGCCG TCACCGCTGG TGACATCCAG
ACCGTCGGCG ACATCGTCGT GCTGAACCCG GATCTGCAGC TCTGCACGCT GGACGACGGC
GCCGAGATCC GCATGGAGTT CACCGTCAAC ACCGGCAAGG GCTACGTCGC CGCCGAGCGC
AACCGTCCCG AGGACGCGCC GATCGGCCTG ATCCCGGTCG ACAGCCTGTA CTCGCCGGTC
CGCAAGGTGT CGTACAAGGT CGAGAACACC CGCGAGGGCC AGATCCTCGA CTACGACAAG
CTCACCATGA CGGTCGAGAC CAACGGCGCG ATCTCGCCGG AAGACGCGGT GGCGTTCGCC
GCCCGCATCC TCCAGGATCA GCTCAACGTC TTCGTCAACT TCGAAGAGCC GCGCAAGGAA
GTCACCCAGG AGATCATCCC GGATCTGGCC TTCAACCCGG CCTTCCTCAA GAAGGTGGAC
GAGCTCGAGC TGTCGGTGCG TTCGGCGAAC TGCCTGAAGA ACGACAACAT CGTCTACATT
GGCGACCTGG TGCAGAAGTC GGAAGCGGAA ATGCTCCGCA CCCCGAACTT CGGGCGCAAG
TCGCTGAACG AGATCAAGGA AGTGCTGGCG CAGATGGGCC TGCATCTCGG CATGGAAGTG
CCGGGCTGGC CGCCGGAGAA CATCGACGAG CTCGCCAAGC GCTTCGAAGA TCACTACTAA
 
Protein sequence
MTIQKNWQEL IRPNKLQVTP GSDATRFATL VAEPLERGFG QTLGNALRRV LLSSLQGAAV 
QSVHIDGVLH EFSSIAGVRE DVTDIVLNIK DISLKMQGEG PKRMVVKKQG PGAVTAGDIQ
TVGDIVVLNP DLQLCTLDDG AEIRMEFTVN TGKGYVAAER NRPEDAPIGL IPVDSLYSPV
RKVSYKVENT REGQILDYDK LTMTVETNGA ISPEDAVAFA ARILQDQLNV FVNFEEPRKE
VTQEIIPDLA FNPAFLKKVD ELELSVRSAN CLKNDNIVYI GDLVQKSEAE MLRTPNFGRK
SLNEIKEVLA QMGLHLGMEV PGWPPENIDE LAKRFEDHY
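
The protein sequence begins with M although the gene sequence begins with GTG rather than ATG. Under translation table 11 (the bacterial code), GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation step, not the database's own pipeline: the function name `translate` and the set `ALT_STARTS` are hypothetical, and `ALT_STARTS` is a simplified subset of the initiation codons table 11 allows. The elongation codon assignments are those of the standard genetic code, which table 11 shares.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of table 11 initiation codons.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGACGATCC AGAAAAATTG GCAGGAATTG ATTCGGCCGA ACAAGCTCCA GGTCACCCCC"
print(translate(first_line))  # prints: MTIQKNWQELIRPNKLQVTP
```

The output matches the first 20 residues of the protein sequence listed above.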