Gene RPD_3972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3972 
Symbol 
ID4024489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4419952 
End bp4422087 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content63% 
IMG OID637964175 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_571092 
Protein GI91978433 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.379149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGCA AGGCGAAGAC GGTTCAGTTG AAAGACAAGG AAAAGGACGA CAAGGCCGAC 
GCGCCGGAGA AGGACTCCGC CGACGCTCCC TCGCCGTTGC TCGACCTGTC GGACGCGGCC
GTCAAGAAGA TGATCAAGCA GGCCAAGAAG CGCGGCTTCG TGACCTTCGA TCAGCTCAAC
GAAGTGCTGC CCTCCGACAC CACGTCGCCG GAGCAGATCG AGGACATCAT GTCGATGCTG
TCCGATATGG GCATCAACGT TTCCGAGGCC GAGGAAAGCG ACAGCGAGGA CGAAGAGTCC
AAGGACGAGG CCGAGGAAGA GCCCGATAAC GACCTCGTCG AGGTCACCCA AAAGGCCGTC
ACCGAGACCA AGAAGTCCGA GCCCGGCGAG CGCACCGACG ATCCCGTCCG GATGTATCTG
CGCGAGATGG GCACCGTCGA GCTGTTGTCG CGCGAGGGCG AAATCGCCAT CGCGAAACGG
ATCGAGGCCG GGCGCGAGGC GATGATCGCC GGGCTGTGCG AAAGCCCGCT GACCTTCCAG
GCGATCATCA TCTGGCGCGA CGAGCTCAAC GAAGGAAAGA TCTTCCTTCG CGACATCATC
GATCTCGAAG CGACCTATGC GGGTCCCGAC GCCAAGAACA ACATGAACCC GGCGATGGCC
GGCGAGACCG GCGAAGAAGC CTCGGCCGAA GGCGAGGGCG GAGCGCCCGC GCATCTCGCG
CCGCCGGCCG CGCCGCCGTC GGCGACGCCG TTCCGCCCCG CGCAGCAGCG CGCCGCGCCG
TCTCAGGCCC CCGCCGGAGA AGGCGGAGGT GAAGGCGCCG CCGAAGGCGA CATGGACGAC
GACGAGTTCG AAAACCAGAT GTCGCTCGCC GCGATCGAGG CCGAACTCAA GCCGAAGGTG
GTCGAGACCT TCGACAAGAT CGCCGACAAC TACAAGAAGC TCCGCAAGCT GCAGGAGCAG
GACATCGCCA ACCAGCTCGA GAGCGCGTCG CAGGGACCAT CGCTGTCGCC GTCGCAGGAG
CGCAAGTACA AGAAGCTCAA GGACGAAATC ATCGTCGAGG TGAAGTCGCT GCGGCTCAAT
CAGGCCCGTA TCGATTCGCT GGTCGAGCAG CTCTACGACA TCAACAAAAA GCTGGTGTCG
TTCGAAGGCC GCCTGCTGCG GCTCGGCGAC AGCCACGGCG TCGCCCGCGA AGACTTTCTG
CGCAACTATC AGGGCTCCGA GCTCGATCCG CGCTGGCTCA ACCGCGTCTC GAAACTGAGC
GCTAAAGGCT GGAAGAACTT CGTCCACCAC GAGAAGGACC GGATCAAGGA ATTGCGCCAG
GAGATCCAGT CGATGGCCGC ATTGACCGGC CTCGAGATCG GCGAATTCCG CAAGATCGTG
CACTCGGTGC AGAAGGGCGA GCGCGAAGCC CGCCAGGCCA AGAAGGAAAT GGTCGAGGCC
AATCTGCGTC TCGTGATCTC GATCGCCAAG AAATACACCA ATCGCGGCCT GCAGTTCCTC
GATCTCATTC AAGAGGGCAA TATCGGCCTG ATGAAGGCGG TCGACAAATT CGAATATCGC
CGCGGCTACA AATTCTCGAC CTACGCGACG TGGTGGATCC GGCAGGCGAT CACGCGCTCG
ATCGCCGACC AGGCCCGCAC GATTCGCATC CCGGTGCACA TGATCGAGAC GATCAACAAG
ATCGTGCGCA CCTCGCGGCA GATGCTCAAC GAGATCGGCC GCGAACCGAC CCCGGAGGAG
CTTGCCGAAA AGCTCGGCAT GCCGCTGGAG AAGGTGCGCA AGGTCCTAAA GATCGCCAAG
GAGCCGCTGT CGCTCGAAAC CCCGGTGGGT GACGAAGAGG ACAGCCATCT CGGCGATTTC
ATCGAGGACA AGAACGCGGT GCTGCCGATC GATGCCGCGA TCCAGTCGAA CCTGCGCGAG
ACCACCACGC GCGTGCTCGC CTCCCTGACG CCGCGCGAAG AACGCGTACT CCGGATGCGC
TTCGGCATCG GCATGAACAC CGACCACACG CTGGAAGAAG TCGGCCAGCA GTTTTCGGTG
ACCCGCGAAC GTATCCGCCA GATCGAAGCC AAGGCGCTGC GCAAGCTGAA GCATCCGTCA
CGGTCGCGGA AGCTGCGGAG CTTCTTGGAT AACTGA
 
Protein sequence
MASKAKTVQL KDKEKDDKAD APEKDSADAP SPLLDLSDAA VKKMIKQAKK RGFVTFDQLN 
EVLPSDTTSP EQIEDIMSML SDMGINVSEA EESDSEDEES KDEAEEEPDN DLVEVTQKAV
TETKKSEPGE RTDDPVRMYL REMGTVELLS REGEIAIAKR IEAGREAMIA GLCESPLTFQ
AIIIWRDELN EGKIFLRDII DLEATYAGPD AKNNMNPAMA GETGEEASAE GEGGAPAHLA
PPAAPPSATP FRPAQQRAAP SQAPAGEGGG EGAAEGDMDD DEFENQMSLA AIEAELKPKV
VETFDKIADN YKKLRKLQEQ DIANQLESAS QGPSLSPSQE RKYKKLKDEI IVEVKSLRLN
QARIDSLVEQ LYDINKKLVS FEGRLLRLGD SHGVAREDFL RNYQGSELDP RWLNRVSKLS
AKGWKNFVHH EKDRIKELRQ EIQSMAALTG LEIGEFRKIV HSVQKGEREA RQAKKEMVEA
NLRLVISIAK KYTNRGLQFL DLIQEGNIGL MKAVDKFEYR RGYKFSTYAT WWIRQAITRS
IADQARTIRI PVHMIETINK IVRTSRQMLN EIGREPTPEE LAEKLGMPLE KVRKVLKIAK
EPLSLETPVG DEEDSHLGDF IEDKNAVLPI DAAIQSNLRE TTTRVLASLT PREERVLRMR
FGIGMNTDHT LEEVGQQFSV TRERIRQIEA KALRKLKHPS RSRKLRSFLD N