Gene Pden_4072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_4072 
Symbol 
ID4582623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008687 
Strand
Start bp1225149 
End bp1227143 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content64% 
IMG OID639771381 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_917834 
Protein GI119386779 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.518001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCCA ACGACGACGA CCAGAAGCCC GACCCCAAGG ACGACGACCC TTCGCTGGAC 
ATGTCGCAGG CCGCAGTCAA GCGCATGATC GCCGAAGGGC GCGAGCGTGG CTTCATCACC
TATGACCAGC TCAATGCCGT CCTGCCGCCC GAACAGGTCA GTTCCGACCA GATCGAGGAC
GTCATGTCCA TGCTGTCGGA AATGGACATC CGCGTGGTCG AGAGCGACGA GGAGGCCGAT
GAGGCCGAAC CGGGCGGCGA ACTGGCGACC ACGGCTTCGA CCTCGCGCGA GCTGACCGTC
GCCACCACCG AAACGGAAAA GCTGGACCGC ACCGACGACC CGGTGCGCAT GTATCTGCGC
GAGATGGGCT CGGTCGAGCT GCTGAGCCGC GAGGGCGAGA TCGCCATCGC CAAGCGCATC
GAGGCCGGCC GCAACACCAT GATTGCCGGG CTTTGCGAAA GCCCCCTGAC CTTCAAGGCC
ATCACCATGT GGCGCCAGGA GCTTCTGGAC GAGGACATCC TGCTGCGCGA CGTGATCGAC
CTCGAGACCA CCTTCGGCCG CGCGATGGAC GAGGACGGCG AGGAAGAGGA GGTGCTGCCG
ACCGAGACCC GCCCGGCCGC GAATACCGCC GGCGCCGCCG GCACCGCGCG CGAGGAGGAG
GAATTCGACG CCGACGGCAA CCCGATGGGC CGCGAGGAGG ACGAGGAGGA CGACGACGGC
CAGAACCTGT CGCTGGCCGC CATGGAAGCC TCGCTCAAGC CCAAGGTGCT GGAGACGCTG
GAGGCCATCG CACGGGGCTA TGAAGAGCTG TCGCACATGC AGGACAACCG CATGTCGGCG
ACGCTGAACG AGGACGGCAC CTTCTCGGCC GCGCAGGAAA GCGCCTATCA GGTGCTGCGC
TCGCATATCG TCGCGCTGGT GAACGAACTG CACCTGCACA ACAACCGCAT CGAGGCGCTG
GTCGACCAGA TCTATGGCAT CAACCGCCGT ATCGTCACCA TCGATTCGAA CGTGGTGAAG
CTGGCCGACG CCGCGCGCAT CAACCGGCGC GAATTCATCG ACGCCTATCG CGGCAGCGAA
CTCGACCCGA CCTGGGTCGA GCGCATGATG CAGAACAAGG GCCGCGGCTG GCAGGCGCTG
TTCGAGAAAT CCCGCCCCCA GGTGGAGAAC CTGCGCGCCG AGATGGCCAT GGTCGGCCAG
CATGTCGGCG TCGATATCGA GGAATTCCGC CGCATCGTCA GCCAGGTCCA GCGCGGCGAA
AAGGAATCGC GCCAGGCCAA GAAGGAAATG GTCGAGGCGA ACCTGCGGCT GGTGATCTCG
ATCGCCAAGA AATACACGAA CAGGGGCCTG CAATTCCTTG ATCTCATTCA GGAAGGCAAT
ATCGGCCTGA TGAAGGCCGT GGACAAGTTC GAGTATCGGC GCGGCTACAA GTTCTCGACC
TATGCGACCT GGTGGATCCG GCAGGCGATC ACGCGCTCGA TCGCCGACCA GGCCCGCACC
ATCCGCATCC CGGTCCACAT GATCGAGACG ATCAACAAGC TGGTGCGCAC CGGCCGCCAG
ATGCTGCACG AGATCGGCCG CGAGCCGACG CCCGAGGAAC TGGCCGAAAA GCTGCAGATG
CCGCTGGAAA AGGTCCGCAA GGTGATGAAG ATCGCCAAGG AGCCGATCAG CCTCGAAACC
CCCATCGGGG ACGAAGAGGA CAGCCAGCTT GGCGATTTCA TCGAGGACAA GAACGCCGTC
CTGCCGCTGG ACAGCGCCAT TCAGGAAAAC CTGAAGGAGA CCACCACGCG CGTCCTGGCC
AGCCTGACCC CGCGCGAGGA GCGGGTGCTG CGCATGCGCT TCGGCATCGG CATGAACACC
GACCACACTC TGGAAGAGGT CGGCCAGCAG TTCAGCGTGA CGCGCGAGCG CATCCGCCAG
ATCGAGGCCA AGGCGCTGCG CAAGCTCAAG CATCCCAGCC GGTCGCGCAA GCTGCGGTCC
TTCTTGGACC AGTAA
 
Protein sequence
MAANDDDQKP DPKDDDPSLD MSQAAVKRMI AEGRERGFIT YDQLNAVLPP EQVSSDQIED 
VMSMLSEMDI RVVESDEEAD EAEPGGELAT TASTSRELTV ATTETEKLDR TDDPVRMYLR
EMGSVELLSR EGEIAIAKRI EAGRNTMIAG LCESPLTFKA ITMWRQELLD EDILLRDVID
LETTFGRAMD EDGEEEEVLP TETRPAANTA GAAGTAREEE EFDADGNPMG REEDEEDDDG
QNLSLAAMEA SLKPKVLETL EAIARGYEEL SHMQDNRMSA TLNEDGTFSA AQESAYQVLR
SHIVALVNEL HLHNNRIEAL VDQIYGINRR IVTIDSNVVK LADAARINRR EFIDAYRGSE
LDPTWVERMM QNKGRGWQAL FEKSRPQVEN LRAEMAMVGQ HVGVDIEEFR RIVSQVQRGE
KESRQAKKEM VEANLRLVIS IAKKYTNRGL QFLDLIQEGN IGLMKAVDKF EYRRGYKFST
YATWWIRQAI TRSIADQART IRIPVHMIET INKLVRTGRQ MLHEIGREPT PEELAEKLQM
PLEKVRKVMK IAKEPISLET PIGDEEDSQL GDFIEDKNAV LPLDSAIQEN LKETTTRVLA
SLTPREERVL RMRFGIGMNT DHTLEEVGQQ FSVTRERIRQ IEAKALRKLK HPSRSRKLRS
FLDQ