Gene Pnec_1416 details


Gene Information

Locus tag: Pnec_1416
Symbol:
ID: 6183377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. necessarius STIR1
Kingdom: Bacteria
Replicon accession: NC_010531
Strand:
Start bp: 1210896
End bp: 1213598
Gene Length: 2703 bp
Protein Length: 900 aa
Translation table: 11
GC content: 45%
IMG OID: 641671957
Product: RNA polymerase, sigma 70 subunit, RpoD
Protein accession: YP_001798129
Protein GI: 171464016
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
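
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    coding_bp = gene_length - 3 if includes_stop else gene_length
    if coding_bp % 3 != 0:
        raise ValueError("coding length is not a multiple of 3")
    return coding_bp // 3


# Values taken from the record above.
length = gene_length_bp(1210896, 1213598)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values listed in this record, the sketch reproduces the stated gene length of 2703 bp and protein length of 900 aa.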


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.322339
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAATA CCAAGACCAA AAAACCAGCT CCCAAAGCGA AAACTACTGC TAAGCTAGTA 
AAAGCTGCCG CTAAACTAGC TCCAAAAGCT AAGGCACCAG CCAAACCAGT TAAGACTGTT
GCAAAAGCAC CAGCCAAACC AGTTAAGACT GTTGCAAAAG CACCAGCCAA ACCAGTTAAG
ACTGTTGCAA AAGCACCAGC CAAACCAGTT AAGACTGTTG CAAAAGCACC AGCCAAACCA
GTTAAGACTG TTGCAAAAGC ACCAGCCAAA CCAGTTAAGA CTGTTGCAAA AGCACCAGCC
AAACCAGTTA AGACTGTTGC AAAAGCACCA GCCAAACCAG TTAAGACTGT TGCAAAAGCA
CCAGCCAAAC CAGTTAAGAC TGTTGCAAAA GCACCAGCCA AACCAGTTAA GACTGTTGCA
AAAGCACCAG CCAAACCAGT TAAGACTGTT GCAAAAGCAC CAGCCAAACC AGTTAAGACT
GTTGCAAAAG CACCAGCCAA ACCAGTTAAG ACTGCAGCTA AACCAGTTGT AAAAGCTCCC
GCGAAATCTG CCAAGGCAGA AGTCAAGGTT GAAAAAGGTA AAGTAGTAGT AGCTCCTAAG
GTTGAGTCAG TTAAAGAATC AAAAGCAGCA AAAGAGGCTA AAGAAACAAA AGAAGTCAAA
GGTAAAAAAG CCAAGACTGC AGAACAAGAA GCGCCCGTAG TTGTTGAGGA GGAAAAGAAG
CGTGGCCGCA AAGCGAAAGT GGATGCTTCT GCTGAAGCAG TTGAGCCAGT ATTAACGGAT
CGCCAAAAAG CACGTGAGCG CAAAGCTAAA GAGAAGGCCC TTTTAAAAGA ATTTGCTGCA
CAACAGCTAG GCACCGAAGA GCAACAAGAA TTACGTCGTG CTCGCTTGAA AACTTTGATC
AAGATGGGTA TGTCCAAGGG GTATTTAACC CAAGGTGAGA TGAATGACGT GATGTCTGAC
GAGCTGTCTG ATGCAGACGC ACTGGAAACT TTGATTAGCT TGCTTAATGA CATCGGTATT
ACTGTTTACG AACAGGCTCC TGATGCAGAA ACATTAATCC TTTCTGACAA TACAGCAGCA
GCCGCTTCAG AAGAAGAGGC TGAAGCTGCT TTATCTACTG TTGACTCAGA ATTCGGACGC
ACTACAGACC CAGTACGTAT GTATATGCGT GAAATGGGTA CGGTTGACTT GTTGACTCGC
GAAGGCGAGA TTGTCATTGC GAAGAAAATT GAAGCCGGCC TCAAAGATAT GGTGATGGCT
TTGGCTGCTT GTCCTGTAAC GATTGCTGAG ATTCTGAGTA ACGTTGACAA GATTGCTAGC
GGTGAAATAG AGATTGACCA GTTTGTAGAT GGTTTGGTTG ATCCAAATGC AGAAGATATT
AAGCTTGGAC CTGAAGAGCC AGAAGTTGAT CCAGATGCCG AAGAGGGCGA AGAAGATGGA
GAAGATGAAG GCGGCGGTGG TGCTGCAACA GCAAATGCTA AACAGTTGGA GGAGTTGAAG
CAAATCTCTT TAGAGAAATT TGCGATTGTT CGTGCGCAAG CGAAAAAAAT GCGTCGTGCG
TTTGATAAAG ATGGTTACAA CTGCCCTGCA TACATTAAGG CGCAGGATGC AATTCGTGCT
GAATTGCTCG GTTTCCGTTT AACTGCCAAG AGTGTCGAAA AGTTATGTGA CACCATGCGT
TCACAAGTGG ACCAAGTATG GAAGCTTGAG CGCGGTATTG TTAGCTTGTT AGTGGACAAG
GTTGGTGTAA ATCGTGGTGA AGTATTAAAA GACTTCCCTA AGATGTCTAT GAATTTAGCT
TGGACTGACA AGTTGCTCAA AGAGAACAAG CCATATAGCG CTCTATTGCA ACGCAACGTA
CCAGCTATTC AAGAGCTTCA GCAGAAGTTG ATCGATATTC AGAAGAATGT TGTTATTCCA
CTTCCAGAGT TAAAAGAAGT AAATAAGCAA ATGATCGCGG GCGAGAAGCG TGCTCGTGAA
GCAAAACGTG AGATGACAGT AGCCAACTTA CGTTTGGTAA TCTCGATTGC TAAGAAATAC
ACTAATCGTG GTTTGCAATT CTTGGATTTG ATTCAAGAAG GTAATATCGG TTTGATGAAG
GCGGTAGATA AATTTGAATA TCGTCGTGGT TATAAGTTCT CCACCTATGC AACTTGGTGG
ATTCGTCAGG CGATTACTCG TTCGATTGCC GACCAGGCAC GTACGATCCG TATACCGGTT
CACATGATTG AGACGATCAA CAAGATGAAC CGTATCAGCC GTCAGATCTT GCAAGAAACT
GGTCATGAGC CAGATGCTGC AACCTTGGCG CTGAAGATGG AGATTCCGGA AGACAAGATC
CGTAAGATCA TGAAGATCGC TAAAGAGCCT GTCTCTATGG AAACACCGAT TGGTGACGAT
GAAGATTCTC ATTTGGGCGA CTTCATTGAG GATGGCAATA CATTAGCCCC AGCTGAAGCA
GCATTGCATA ACTCTATGCG CGATGTTGTG AAGGATGTTC TCGATTCATT AACACCGCGT
GAAGCCAAAG TATTGCGCAT GCGCTTTGGT GTTGAGATGA GTACCGATCA CACTCTCGAA
GAGGTAGGTA AACAGTTTGA CGTAACGCGT GAGCGTATTC GTCAGATTGA GGCTAAGGCC
CTAAGAAAGA TGCGTCATCC AAGCCGTAGT GACAAACTCA AGACCTTCCT CGAAGAGGAT
TAA
 
Protein sequence
MLNTKTKKPA PKAKTTAKLV KAAAKLAPKA KAPAKPVKTV AKAPAKPVKT VAKAPAKPVK 
TVAKAPAKPV KTVAKAPAKP VKTVAKAPAK PVKTVAKAPA KPVKTVAKAP AKPVKTVAKA
PAKPVKTVAK APAKPVKTVA KAPAKPVKTV AKAPAKPVKT VAKAPAKPVK TAAKPVVKAP
AKSAKAEVKV EKGKVVVAPK VESVKESKAA KEAKETKEVK GKKAKTAEQE APVVVEEEKK
RGRKAKVDAS AEAVEPVLTD RQKARERKAK EKALLKEFAA QQLGTEEQQE LRRARLKTLI
KMGMSKGYLT QGEMNDVMSD ELSDADALET LISLLNDIGI TVYEQAPDAE TLILSDNTAA
AASEEEAEAA LSTVDSEFGR TTDPVRMYMR EMGTVDLLTR EGEIVIAKKI EAGLKDMVMA
LAACPVTIAE ILSNVDKIAS GEIEIDQFVD GLVDPNAEDI KLGPEEPEVD PDAEEGEEDG
EDEGGGGAAT ANAKQLEELK QISLEKFAIV RAQAKKMRRA FDKDGYNCPA YIKAQDAIRA
ELLGFRLTAK SVEKLCDTMR SQVDQVWKLE RGIVSLLVDK VGVNRGEVLK DFPKMSMNLA
WTDKLLKENK PYSALLQRNV PAIQELQQKL IDIQKNVVIP LPELKEVNKQ MIAGEKRARE
AKREMTVANL RLVISIAKKY TNRGLQFLDL IQEGNIGLMK AVDKFEYRRG YKFSTYATWW
IRQAITRSIA DQARTIRIPV HMIETINKMN RISRQILQET GHEPDAATLA LKMEIPEDKI
RKIMKIAKEP VSMETPIGDD EDSHLGDFIE DGNTLAPAEA ALHNSMRDVV KDVLDSLTPR
EAKVLRMRFG VEMSTDHTLE EVGKQFDVTR ERIRQIEAKA LRKMRHPSRS DKLKTFLEED
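
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in the permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, and the sketch does not handle ambiguous bases.

```python
from itertools import product

# Codons in the conventional NCBI order (T, C, A, G at each position),
# paired with the amino-acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCTGAATA CCAAGACCAA AAAACCAGCT"))
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing the spaces between 10-residue groups) is one way to confirm that the two fields agree.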