Gene Plav_2159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2159 
Symbol 
ID5456223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2340055 
End bp2341077 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content66% 
IMG OID640877736 
ProductRNA polymerase, sigma 32 subunit, RpoH 
Protein accessionYP_001413430 
Protein GI154252606 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCCGG GGACCGGGCC GGATCAGGGG TTTGCCATGG CCGAGGCCAA GACAGAGAAA 
ACAGCAGGCA AGGCCGCCGA AAAGGCGCCC AAGGCGCCGA GGGCGCTGGT GAAGGCGCCG
GGGACGCGGC TCGCGCCCGC GCCTAACCCG GAGGGCTCGC TGTCGCGCTA CCTGCAGGAA
ATCCGCAAGT TTCCGATGCT GGAGCCGCAG GAGGAATACA TGCTCGCCAA ACGCTGGCGG
GAGCATGAGG ACACCGAGGC GGCGCACCAG CTTGTGACGT CGCATCTCCG CCTCGTGGCG
AAGATCGCCA TGGGCTATCG CGGCTATGGG CTGCCGATCG GCGAGGTGAT CTCGGAAGGC
AATGTGGGCC TGATGCAGGC GGTGAAGCGC TTCGAGCCGG AGCGCGGCTT CCGGCTTGCG
ACCTATGCCA TGTGGTGGAT CCGCGCCTCG ATCCAGGAAT ATATCCTGCG GTCATGGTCG
CTCGTGAAAA TGGGCACGAC CGCGGCGCAG AAGAAGCTCT TCTTCAATCT CCGCAAGGTG
AAGGGCCAGA TCCAGGCGAT CGAGGAAGGC GACCTCCACC CCGAACAGGT GAAGGTGATT
TCCGACCGGC TCGGCGTGAC CGAGGACGAG GTGATCTCGA TGAACCGGCG GCTTTCCGGC
CCCGACAATT CGCTGAACGC ACCCTTGCGG CAGGACTCGG AAGGCGGCGA GTGGCAGGAC
TGGCTGGTGG ACGACACCGC CGATCAGGAA GCGACGCTGG GCGAGGCCGA GGAGCTGGAC
CAGCGCCGCG AAATGCTGGC CGCCGCCATG GCGACGCTGA ACGAGCGCGA GATGCACATT
CTGACGGAGC GCAGGCTGAA GGACGAGCCG GCGACGCTGG AAGACCTCAG CCAGGAATAT
GGCATCAGCC GAGAGCGCGT GCGCCAGATC GAGGTGCGCG CCTTCGAGAA GCTGCAGAAA
GCCATGAAAA ACGCCATGCG CGCCGATGCG GACGCAAGGC GGGACGCGCT GGCCGGCGTG
TGA
 
Protein sequence
MGPGTGPDQG FAMAEAKTEK TAGKAAEKAP KAPRALVKAP GTRLAPAPNP EGSLSRYLQE 
IRKFPMLEPQ EEYMLAKRWR EHEDTEAAHQ LVTSHLRLVA KIAMGYRGYG LPIGEVISEG
NVGLMQAVKR FEPERGFRLA TYAMWWIRAS IQEYILRSWS LVKMGTTAAQ KKLFFNLRKV
KGQIQAIEEG DLHPEQVKVI SDRLGVTEDE VISMNRRLSG PDNSLNAPLR QDSEGGEWQD
WLVDDTADQE ATLGEAEELD QRREMLAAAM ATLNEREMHI LTERRLKDEP ATLEDLSQEY
GISRERVRQI EVRAFEKLQK AMKNAMRADA DARRDALAGV