Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1963 |
Symbol | |
ID | 6164367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1727137 |
End bp | 1730520 |
Gene Length | 3384 bp |
Protein Length | 1127 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641669126 |
Product | DNA-directed RNA polymerase subunit B |
Protein accession | YP_001795324 |
Protein GI | 171186405 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR03670] DNA-directed RNA polymerase subunit B |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGATC TCCTCCCCCT TCCCGTCATC GTAAGTCAGC AGGACGGGAA GTTCCCCACG AGAGACGACA GGTGGGTTCT CGTGGAGAGG TTTATAAAAG ACAAGGGGCT TGCTAACCAC CAGATAAAGT CCTTTAACGA TTTCTTGGAT AAGAAGTTGC CCAAGATAGT GGAGGACTTC AAGGTGGTTG AGACGGAGAT AAAGGGTCTC AAGTTGGTTC TTGAGCGGAT TGAGGTGGGG TGGCCCAAGA TTAAGGAGAG CGACGGTTCC GAATCCATAA TATATCCCAT GGAGGCAAGG TTGCGCAACG CCACGTATTC GGCTCCGCTG TATCTCACCG CGACTCTATA CGTCGACGAC GAGCCGTATG TCACCGAGAC TTTCTACATC GGCGAGTTGC CGATAATGGT TAAGTCGAAG AGGTGTAACC TCACCAGGTT GAGGCCTGGC GAGTATATAA AGAAGTTTGA AGATCCGCAG GACTTCGGCG GCTACTTCAT CATCAACGGG AGCGAGCGGG TGATAATATC CCAGGAGGAT CTGGTGGCGG ATAGGCCTAT CTACGACAGG GGGGATAAGC CCTCTGTAAA ATACATCGCC AAGACGATAT CCACAGGCAT AGGGTATAGA AGCACGCTGA CTGTTGAGCT TAACCGAGAC GGCGTTATCT ACGTCACCCT TTCCGCGATG CCGGTCAAGA TACCGTTCCC GATCTACATG AAGGCCCTGG GGCTTGAGAC GGACGAAGAC GTCGTCAAGG CCGTGTCCGA CGACCCGGAG ATCCAGAAGG AGCTTCTGCC GTCTCTCATC GCCGCCAACC AAATAGCGAT TACGAGGGAA GACGCCCTAG ACTTCATCGG CGGCAAGGTG GCGGTGGGTC AGCCAAAGCC CATTAGGGTA GAGAAGGCGT ATCAGCTCCT CGACAAGTAC TTCCTCCCGC ATCTGGGCAC TACGACCCCC GACGAGAAGA GGCAGCAGGA GATTAGGCTT AAGAAGGCGT TGATGTTGGG GCAGATCGTA AAGGGCTTGG TCGAGTTGCA ACTGGGCCGT AGGAAGCCCG ACGATAAAGA CCATGTTGCC AACAAGAGGG TTAGGCTAGT CGGCGATTTG ATGACCCAGC TGTTCCGCAC CGTCTTAAAA CAGTTTCTAC AGGAGCTCAA GAGCCAGCTT GAGAAGTACT ACGCCAGGGG GAGGATACCC CACATACAGA CGATAGTGAG GCCCGACATC ATCACCGAGC GCGTGAGGCA GGCCTTGGCC ACCGGCAACT GGGTAGGCGG GAAGACGGGC GTGTCCCAGA TCCTAGACCG CACCAACTAC ATGTCTACCC TGAGCTACCT CAGGCGGGTG GTGTCGTCGC TGTCTAGGAC CCAGCCTCAC TTCGAGGCGC GCGACCTACA CCCCACTCAG TGGGGCAGAC TGTGCGCCGT GGAGACCCCG GAAGGCCAGA ACGTGGGCTT GGTCAAGAAC CTGGCGCTAC TCGCCGAGAT CACCACCGGG GTAGACGAGG GGGAGGTGGA GCAGTTGCTG TACAACCTGG GCGTGGTGCC TCTGTTGAAG GCCAGGGAGG AGGGCATCCG CGGCGCCGAA GTCTACATAA ACGGCAGATT GATCGGCGTG CATCCAAACC CGGAGGAGCT CGCGAAAACC GTCAGGGGCA TGAGGAGGCA GGGGAAGATC AGCGACGAGA TAAACATAGC ATACCTAGAC GGCTCCGTGT ATGTGAACTG CGACGGCGGC CGCATCAGGA GGCCTTTACT CGTCGTCGAG GGCGGGAGGC TCAAGCTGAC GAAGGAGATC GTGGAAAAGG TGAGACGTGG CGACTTGACT TGGGACGACT TGGTCAAGAT GGGGGTTATC GAGTACCTAG ACGCCGACGA GGAGGAGAAC GCCCACATCG CCGTTGACCC AGACGTCGAC CTCAGCAACT ACACACATGT GGAGATCGTG CCGTCGTCTA TACTCGGCGC CGTGGGCTCC ATTATCCCCT ACCTCGAGCA CAACCAGTCG CCGCGTAACC AGTACGAAGC AGCTATGGCG AAGCAGTCGC TCGGCCTACC CCAGTCCAAC TTCTTGTACA AGCTGGACTC GCGGGGCCAC ATGTTGTACT ACCCCGAGAG GCCTATAGTG ACCACGCGCG GTTTGGAGCT CGTGGGCTAC TCCAGAAGGC CGGCGGGCCA AAACGCGGTT GTGGCGCTAC TCACCTACAC GGGCTACAAC ATAGAGGACG CCGTGATCTT GAACAAGGCC TCTGTGGAGC GCGGCATGTT CCGCTCGGTC TTCTACAGAA CGTACGAGAC CGAGGAGCAG AAGTACCCAG GCGGGGAGGA GGACAAGATA GAGATCCCCG ACAGCTCCGT CAAGGGCTAC AGGGGCCCGG AGGCCTACAG CCACCTGGAC GAGGATGGAA TAGCCCCGCC CGAGGTCTAC GTCAGTAGCA ACGAGGTCCT GGTGGGGAAG ACCTCTCCGC CGAGGTTCTA CACAACGCTG GAGGCCGAGC GTATCTTGAA GGAGAGGAGG GACGCCTCTG TGGCTGTGAG AAGGGGGGAG AAGGGCATCG TGGACAGGGT CATCATTACG GAGTCTCCAG AGGGCAACAA GCTCGTCAAG GTGAGGTTGA GGGAGCTGAG GGTGCCGGAG CTTGGGGATA AGTTCGCCTC CCGCCACGGG CAGAAGGGGG TCGTGGGCAT GTTGCTTAGG CATGAGGACA TGCCGTTTAC CGAAGACGGC ATCGTGCCCG ACATAATAGT CAACCCACAC GCGTTGCCGT CGCGTATGAC CGTGGCGCAA CTACTCGAGA GCATGGCCGG CAAAGTAGGC GCCTCCCTCG GCGCTTTGGT AGACGCCACA GCCTTCGAGG GGGTGAAGGA GGAGGATCTG AGGAAGCTGT TGCTTAAGCT CGGCTACAAG TGGGACGGCA AGGAGGTCAT GTACAGCGGC ATCACCGGCG AGAGGCTGGT GGCCGACATA TTCATAGGCG TTGTCTACTA CCAGAAGCTA CACCACATGG TTGCCGACAA GATCCACGCG AGGGCAAGGG GCCCCGTCCA GATCTTGACT AGGCAGCCGA CCGAAGGCCG CTCCCGCGAG GGCGGTCTGA GGCTTGGCGA AATGGAGAGA GACGTGTTGA TAGCCCACGG CGCCTCGGCC CTGCTGTATG AGAGGCTTGT GGAGTCCAGC GACAAATACA CCATGTATGT CTGCGAGCTC TGTGGACTAC CGGCGTATCT AGACGCCAAG TCAAACAAGC CTAAGTGCCC CATACACGGC GACACAGGCC AATTCGCCAA GGTGGTTGTG CCCTATGCCT ACAAGCTACT TCTACAGGAG CTCATAGCAC TTGGGATATA TCCAAAGCTG GAGCTCACCG AGGTGTTGGA GTAG
|
Protein sequence | MVDLLPLPVI VSQQDGKFPT RDDRWVLVER FIKDKGLANH QIKSFNDFLD KKLPKIVEDF KVVETEIKGL KLVLERIEVG WPKIKESDGS ESIIYPMEAR LRNATYSAPL YLTATLYVDD EPYVTETFYI GELPIMVKSK RCNLTRLRPG EYIKKFEDPQ DFGGYFIING SERVIISQED LVADRPIYDR GDKPSVKYIA KTISTGIGYR STLTVELNRD GVIYVTLSAM PVKIPFPIYM KALGLETDED VVKAVSDDPE IQKELLPSLI AANQIAITRE DALDFIGGKV AVGQPKPIRV EKAYQLLDKY FLPHLGTTTP DEKRQQEIRL KKALMLGQIV KGLVELQLGR RKPDDKDHVA NKRVRLVGDL MTQLFRTVLK QFLQELKSQL EKYYARGRIP HIQTIVRPDI ITERVRQALA TGNWVGGKTG VSQILDRTNY MSTLSYLRRV VSSLSRTQPH FEARDLHPTQ WGRLCAVETP EGQNVGLVKN LALLAEITTG VDEGEVEQLL YNLGVVPLLK AREEGIRGAE VYINGRLIGV HPNPEELAKT VRGMRRQGKI SDEINIAYLD GSVYVNCDGG RIRRPLLVVE GGRLKLTKEI VEKVRRGDLT WDDLVKMGVI EYLDADEEEN AHIAVDPDVD LSNYTHVEIV PSSILGAVGS IIPYLEHNQS PRNQYEAAMA KQSLGLPQSN FLYKLDSRGH MLYYPERPIV TTRGLELVGY SRRPAGQNAV VALLTYTGYN IEDAVILNKA SVERGMFRSV FYRTYETEEQ KYPGGEEDKI EIPDSSVKGY RGPEAYSHLD EDGIAPPEVY VSSNEVLVGK TSPPRFYTTL EAERILKERR DASVAVRRGE KGIVDRVIIT ESPEGNKLVK VRLRELRVPE LGDKFASRHG QKGVVGMLLR HEDMPFTEDG IVPDIIVNPH ALPSRMTVAQ LLESMAGKVG ASLGALVDAT AFEGVKEEDL RKLLLKLGYK WDGKEVMYSG ITGERLVADI FIGVVYYQKL HHMVADKIHA RARGPVQILT RQPTEGRSRE GGLRLGEMER DVLIAHGASA LLYERLVESS DKYTMYVCEL CGLPAYLDAK SNKPKCPIHG DTGQFAKVVV PYAYKLLLQE LIALGIYPKL ELTEVLE
|
| |