Gene Ppro_3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpro_3049 
Symbol 
ID4574176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelobacter propionicus DSM 2379 
KingdomBacteria 
Replicon accessionNC_008609 
Strand
Start bp3336693 
End bp3338441 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content49% 
IMG OID639757106 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_902702 
Protein GI118581452 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000323778 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAAAA AAAATATGGA CGGCGTTAAG CAGTTGATTG ATCTCGGCAA GGAAAAGGGT 
TTCCTGACCT TCGAAGAGGT TAACGACATC CTGCCGCCAG ATATTGCCAC GGAACAGATA
GATGACGTCA TGGGCATGTT TGGCGATCTG GATATCGAGA TCGTTGACTC TGCCCAGAAA
GTCAAGATTC CGAAAATGAA GCTTGAGATG GAAGAAGAAG AGGAAGCGGA AGGTGAGGCC
GAAGAGGTCG AGTTTGAGCC TGGCTCCATC GGGCGTACAA GCGATCCGGT GCGCATGTAT
TTGCGTGAAA TGGGCTCTGT TTCCCTTTTG ACCCGCGAAG GCGAAGTTGA GATTGCCAAG
CGTATTGAGG ATGGAGAGCG TGATGTTGCC AGCGTTATTC TTAATACGCC GATTACCATA
AAAGAGGTGC TGCAACTGGG AGATAAGCTT CGCAAATTCC AGCTGAATGC TTCTGAAATC
AGCAAAGAGG TTGAGGAAGA GTCTCTGGAT GATGAGGAAG AAGATCTTCC CAAGCTGCGC
ATCCTTGAGA TTATCGATGC AATTTCAAGT GGTGATGAGG AATTGACTGG CCTTTCTGAA
GAGCTTGAAA AATGTGCTGC CGCCGCCAAA AAGAAGGAAC TTGAGAAAAA AATTCAGGAA
CTCAGGGAGT CGCAGGCAAA ACTGCTGATC TCGCTGCGCC TGAAGGATCG CCACATAAAT
AAAATATCAG AACGCCTGAA GGAGTTGTCG TTTAAGGTCG ACAGGCTCAT GAAGGAATTG
GCTGATCTGG AGGATCTGAT ACCAGCTGAG AAACTCAGAT TTTTCATGGA CAGTTACCAT
AAGGATGAAG AAAAAGCTCA GGCCGAGTTG CGTAAGCTCA AACTGGGCGC AGAAGAACAG
GATAGAGTTG ATGCCCGATT GCTTTCCACA GCGCGAAAGC TGAAGAAAAT CGAACTGGAA
TCCGGTTTCA AGGCCAGCGA ACTGTCAGCG GCGCTTCAGG CGATTGAAGA AGGTGAGTGT
AAAGCGCGTA TAGCGAAATC CGAACTTATC GAAGCGAATC TTCGCCTGGT TGTATCCATT
GCGAAAAAGT ACACCAATCG AGGCTTGCAG TTTCTTGACC TGATCCAAGA GGGCAACATC
GGCCTTATGA AGGCGGTGGA CAAGTTCGAA TACCAGCGTG GTTACAAATT TTCTACTTAT
GCCACCTGGT GGATACGTCA GGCAATTACC CGTGCAATTG CCGATCAGGC ACGAACCATA
CGGATTCCAG TGCACATGAT CGAGACCATC AACAAGCTCA TCCGCACCAG TCGCCAGTTG
GTACAGGAAA ATGGTCGTGA GCCGGCTCCT GAAGAAATTG CGGAACGGAT GCAATTGCCT
CTGGATAAGG TTCGTAAAGT TTTGAAGATT GCCAAGGAGC CTATTTCGCT GGAGACTCCG
ATTGGCGAGG AAGAAGACTC GCATTTGGGC GATTTTATTG AGGATAAAGC GGTGATTTCT
CCCATAGAAG CGGTCATTAA GGCCAATCTG TCGGAGCAGA CATCCAGGGT ATTATCCACT
CTTACTCCTC GCGAGGAGAA GGTCCTGCGT ATGCGTTTCG GTATCGGCGA AAAGAGTGAC
CACACTCTCG AGGAGGTTGG GCAGGACTTC GCCGTTACCC GTGAGCGGAT TCGCCAGATA
GAGGCAAAGG CGCTGCGCAA ACTGCGGCAT CCCAGTCGCA GTAAAAAGCT CAAGAGCTTT
GTTGAATAA
 
Protein sequence
MVKKNMDGVK QLIDLGKEKG FLTFEEVNDI LPPDIATEQI DDVMGMFGDL DIEIVDSAQK 
VKIPKMKLEM EEEEEAEGEA EEVEFEPGSI GRTSDPVRMY LREMGSVSLL TREGEVEIAK
RIEDGERDVA SVILNTPITI KEVLQLGDKL RKFQLNASEI SKEVEEESLD DEEEDLPKLR
ILEIIDAISS GDEELTGLSE ELEKCAAAAK KKELEKKIQE LRESQAKLLI SLRLKDRHIN
KISERLKELS FKVDRLMKEL ADLEDLIPAE KLRFFMDSYH KDEEKAQAEL RKLKLGAEEQ
DRVDARLLST ARKLKKIELE SGFKASELSA ALQAIEEGEC KARIAKSELI EANLRLVVSI
AKKYTNRGLQ FLDLIQEGNI GLMKAVDKFE YQRGYKFSTY ATWWIRQAIT RAIADQARTI
RIPVHMIETI NKLIRTSRQL VQENGREPAP EEIAERMQLP LDKVRKVLKI AKEPISLETP
IGEEEDSHLG DFIEDKAVIS PIEAVIKANL SEQTSRVLST LTPREEKVLR MRFGIGEKSD
HTLEEVGQDF AVTRERIRQI EAKALRKLRH PSRSKKLKSF VE