Gene Information |
Locus tag | Pnec_1416 |
Symbol | |
ID | 6183377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polynucleobacter necessarius subsp. necessarius STIR1 |
Kingdom | Bacteria |
Replicon accession | NC_010531 |
Strand | - |
Start bp | 1210896 |
End bp | 1213598 |
Gene Length | 2703 bp |
Protein Length | 900 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641671957 |
Product | RNA polymerase, sigma 70 subunit, RpoD |
Protein accession | YP_001798129 |
Protein GI | 171464016 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
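
The coordinate and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention, so both are assumptions here, and the function names are hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1210896, 1213598)
print(length)                  # 2703
print(protein_length(length))  # 900
```

The strand value does not affect this calculation; it only determines which strand of the replicon is read.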
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.322339 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAATA CCAAGACCAA AAAACCAGCT CCCAAAGCGA AAACTACTGC TAAGCTAGTA AAAGCTGCCG CTAAACTAGC TCCAAAAGCT AAGGCACCAG CCAAACCAGT TAAGACTGTT GCAAAAGCAC CAGCCAAACC AGTTAAGACT GTTGCAAAAG CACCAGCCAA ACCAGTTAAG ACTGTTGCAA AAGCACCAGC CAAACCAGTT AAGACTGTTG CAAAAGCACC AGCCAAACCA GTTAAGACTG TTGCAAAAGC ACCAGCCAAA CCAGTTAAGA CTGTTGCAAA AGCACCAGCC AAACCAGTTA AGACTGTTGC AAAAGCACCA GCCAAACCAG TTAAGACTGT TGCAAAAGCA CCAGCCAAAC CAGTTAAGAC TGTTGCAAAA GCACCAGCCA AACCAGTTAA GACTGTTGCA AAAGCACCAG CCAAACCAGT TAAGACTGTT GCAAAAGCAC CAGCCAAACC AGTTAAGACT GTTGCAAAAG CACCAGCCAA ACCAGTTAAG ACTGCAGCTA AACCAGTTGT AAAAGCTCCC GCGAAATCTG CCAAGGCAGA AGTCAAGGTT GAAAAAGGTA AAGTAGTAGT AGCTCCTAAG GTTGAGTCAG TTAAAGAATC AAAAGCAGCA AAAGAGGCTA AAGAAACAAA AGAAGTCAAA GGTAAAAAAG CCAAGACTGC AGAACAAGAA GCGCCCGTAG TTGTTGAGGA GGAAAAGAAG CGTGGCCGCA AAGCGAAAGT GGATGCTTCT GCTGAAGCAG TTGAGCCAGT ATTAACGGAT CGCCAAAAAG CACGTGAGCG CAAAGCTAAA GAGAAGGCCC TTTTAAAAGA ATTTGCTGCA CAACAGCTAG GCACCGAAGA GCAACAAGAA TTACGTCGTG CTCGCTTGAA AACTTTGATC AAGATGGGTA TGTCCAAGGG GTATTTAACC CAAGGTGAGA TGAATGACGT GATGTCTGAC GAGCTGTCTG ATGCAGACGC ACTGGAAACT TTGATTAGCT TGCTTAATGA CATCGGTATT ACTGTTTACG AACAGGCTCC TGATGCAGAA ACATTAATCC TTTCTGACAA TACAGCAGCA GCCGCTTCAG AAGAAGAGGC TGAAGCTGCT TTATCTACTG TTGACTCAGA ATTCGGACGC ACTACAGACC CAGTACGTAT GTATATGCGT GAAATGGGTA CGGTTGACTT GTTGACTCGC GAAGGCGAGA TTGTCATTGC GAAGAAAATT GAAGCCGGCC TCAAAGATAT GGTGATGGCT TTGGCTGCTT GTCCTGTAAC GATTGCTGAG ATTCTGAGTA ACGTTGACAA GATTGCTAGC GGTGAAATAG AGATTGACCA GTTTGTAGAT GGTTTGGTTG ATCCAAATGC AGAAGATATT AAGCTTGGAC CTGAAGAGCC AGAAGTTGAT CCAGATGCCG AAGAGGGCGA AGAAGATGGA GAAGATGAAG GCGGCGGTGG TGCTGCAACA GCAAATGCTA AACAGTTGGA GGAGTTGAAG CAAATCTCTT TAGAGAAATT TGCGATTGTT CGTGCGCAAG CGAAAAAAAT GCGTCGTGCG TTTGATAAAG ATGGTTACAA CTGCCCTGCA TACATTAAGG CGCAGGATGC AATTCGTGCT GAATTGCTCG GTTTCCGTTT AACTGCCAAG AGTGTCGAAA AGTTATGTGA CACCATGCGT TCACAAGTGG ACCAAGTATG GAAGCTTGAG CGCGGTATTG TTAGCTTGTT AGTGGACAAG GTTGGTGTAA ATCGTGGTGA AGTATTAAAA GACTTCCCTA AGATGTCTAT GAATTTAGCT 
TGGACTGACA AGTTGCTCAA AGAGAACAAG CCATATAGCG CTCTATTGCA ACGCAACGTA CCAGCTATTC AAGAGCTTCA GCAGAAGTTG ATCGATATTC AGAAGAATGT TGTTATTCCA CTTCCAGAGT TAAAAGAAGT AAATAAGCAA ATGATCGCGG GCGAGAAGCG TGCTCGTGAA GCAAAACGTG AGATGACAGT AGCCAACTTA CGTTTGGTAA TCTCGATTGC TAAGAAATAC ACTAATCGTG GTTTGCAATT CTTGGATTTG ATTCAAGAAG GTAATATCGG TTTGATGAAG GCGGTAGATA AATTTGAATA TCGTCGTGGT TATAAGTTCT CCACCTATGC AACTTGGTGG ATTCGTCAGG CGATTACTCG TTCGATTGCC GACCAGGCAC GTACGATCCG TATACCGGTT CACATGATTG AGACGATCAA CAAGATGAAC CGTATCAGCC GTCAGATCTT GCAAGAAACT GGTCATGAGC CAGATGCTGC AACCTTGGCG CTGAAGATGG AGATTCCGGA AGACAAGATC CGTAAGATCA TGAAGATCGC TAAAGAGCCT GTCTCTATGG AAACACCGAT TGGTGACGAT GAAGATTCTC ATTTGGGCGA CTTCATTGAG GATGGCAATA CATTAGCCCC AGCTGAAGCA GCATTGCATA ACTCTATGCG CGATGTTGTG AAGGATGTTC TCGATTCATT AACACCGCGT GAAGCCAAAG TATTGCGCAT GCGCTTTGGT GTTGAGATGA GTACCGATCA CACTCTCGAA GAGGTAGGTA AACAGTTTGA CGTAACGCGT GAGCGTATTC GTCAGATTGA GGCTAAGGCC CTAAGAAAGA TGCGTCATCC AAGCCGTAGT GACAAACTCA AGACCTTCCT CGAAGAGGAT TAA
|
Protein sequence | MLNTKTKKPA PKAKTTAKLV KAAAKLAPKA KAPAKPVKTV AKAPAKPVKT VAKAPAKPVK TVAKAPAKPV KTVAKAPAKP VKTVAKAPAK PVKTVAKAPA KPVKTVAKAP AKPVKTVAKA PAKPVKTVAK APAKPVKTVA KAPAKPVKTV AKAPAKPVKT VAKAPAKPVK TAAKPVVKAP AKSAKAEVKV EKGKVVVAPK VESVKESKAA KEAKETKEVK GKKAKTAEQE APVVVEEEKK RGRKAKVDAS AEAVEPVLTD RQKARERKAK EKALLKEFAA QQLGTEEQQE LRRARLKTLI KMGMSKGYLT QGEMNDVMSD ELSDADALET LISLLNDIGI TVYEQAPDAE TLILSDNTAA AASEEEAEAA LSTVDSEFGR TTDPVRMYMR EMGTVDLLTR EGEIVIAKKI EAGLKDMVMA LAACPVTIAE ILSNVDKIAS GEIEIDQFVD GLVDPNAEDI KLGPEEPEVD PDAEEGEEDG EDEGGGGAAT ANAKQLEELK QISLEKFAIV RAQAKKMRRA FDKDGYNCPA YIKAQDAIRA ELLGFRLTAK SVEKLCDTMR SQVDQVWKLE RGIVSLLVDK VGVNRGEVLK DFPKMSMNLA WTDKLLKENK PYSALLQRNV PAIQELQQKL IDIQKNVVIP LPELKEVNKQ MIAGEKRARE AKREMTVANL RLVISIAKKY TNRGLQFLDL IQEGNIGLMK AVDKFEYRRG YKFSTYATWW IRQAITRSIA DQARTIRIPV HMIETINKMN RISRQILQET GHEPDAATLA LKMEIPEDKI RKIMKIAKEP VSMETPIGDD EDSHLGDFIE DGNTLAPAEA ALHNSMRDVV KDVLDSLTPR EAKVLRMRFG VEMSTDHTLE EVGKQFDVTR ERIRQIEAKA LRKMRHPSRS DKLKTFLEED
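
Both sequences above are displayed in blocks of 10 characters separated by spaces, so they need to be joined before any length or composition check. The following is a minimal sketch, with hypothetical helper names, that removes the block spacing and computes the GC fraction. It is run only on the first three blocks of the gene sequence as a short sample, not on the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence above, used only as a short sample.
sample = clean_sequence("ATGCTGAATA CCAAGACCAA AAAACCAGCT")
print(len(sample))          # 30
print(gc_fraction(sample))  # 0.4
```

Applied to the full gene sequence, the same two functions could be used to compare the result against the Gene Length and GC content fields of the record.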