Gene Tneu_1963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1963 
Symbol 
ID6164367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1727137 
End bp1730520 
Gene Length3384 bp 
Protein Length1127 aa 
Translation table11 
GC content58% 
IMG OID641669126 
ProductDNA-directed RNA polymerase subunit B 
Protein accessionYP_001795324 
Protein GI171186405 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR03670] DNA-directed RNA polymerase subunit B 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGATC TCCTCCCCCT TCCCGTCATC GTAAGTCAGC AGGACGGGAA GTTCCCCACG 
AGAGACGACA GGTGGGTTCT CGTGGAGAGG TTTATAAAAG ACAAGGGGCT TGCTAACCAC
CAGATAAAGT CCTTTAACGA TTTCTTGGAT AAGAAGTTGC CCAAGATAGT GGAGGACTTC
AAGGTGGTTG AGACGGAGAT AAAGGGTCTC AAGTTGGTTC TTGAGCGGAT TGAGGTGGGG
TGGCCCAAGA TTAAGGAGAG CGACGGTTCC GAATCCATAA TATATCCCAT GGAGGCAAGG
TTGCGCAACG CCACGTATTC GGCTCCGCTG TATCTCACCG CGACTCTATA CGTCGACGAC
GAGCCGTATG TCACCGAGAC TTTCTACATC GGCGAGTTGC CGATAATGGT TAAGTCGAAG
AGGTGTAACC TCACCAGGTT GAGGCCTGGC GAGTATATAA AGAAGTTTGA AGATCCGCAG
GACTTCGGCG GCTACTTCAT CATCAACGGG AGCGAGCGGG TGATAATATC CCAGGAGGAT
CTGGTGGCGG ATAGGCCTAT CTACGACAGG GGGGATAAGC CCTCTGTAAA ATACATCGCC
AAGACGATAT CCACAGGCAT AGGGTATAGA AGCACGCTGA CTGTTGAGCT TAACCGAGAC
GGCGTTATCT ACGTCACCCT TTCCGCGATG CCGGTCAAGA TACCGTTCCC GATCTACATG
AAGGCCCTGG GGCTTGAGAC GGACGAAGAC GTCGTCAAGG CCGTGTCCGA CGACCCGGAG
ATCCAGAAGG AGCTTCTGCC GTCTCTCATC GCCGCCAACC AAATAGCGAT TACGAGGGAA
GACGCCCTAG ACTTCATCGG CGGCAAGGTG GCGGTGGGTC AGCCAAAGCC CATTAGGGTA
GAGAAGGCGT ATCAGCTCCT CGACAAGTAC TTCCTCCCGC ATCTGGGCAC TACGACCCCC
GACGAGAAGA GGCAGCAGGA GATTAGGCTT AAGAAGGCGT TGATGTTGGG GCAGATCGTA
AAGGGCTTGG TCGAGTTGCA ACTGGGCCGT AGGAAGCCCG ACGATAAAGA CCATGTTGCC
AACAAGAGGG TTAGGCTAGT CGGCGATTTG ATGACCCAGC TGTTCCGCAC CGTCTTAAAA
CAGTTTCTAC AGGAGCTCAA GAGCCAGCTT GAGAAGTACT ACGCCAGGGG GAGGATACCC
CACATACAGA CGATAGTGAG GCCCGACATC ATCACCGAGC GCGTGAGGCA GGCCTTGGCC
ACCGGCAACT GGGTAGGCGG GAAGACGGGC GTGTCCCAGA TCCTAGACCG CACCAACTAC
ATGTCTACCC TGAGCTACCT CAGGCGGGTG GTGTCGTCGC TGTCTAGGAC CCAGCCTCAC
TTCGAGGCGC GCGACCTACA CCCCACTCAG TGGGGCAGAC TGTGCGCCGT GGAGACCCCG
GAAGGCCAGA ACGTGGGCTT GGTCAAGAAC CTGGCGCTAC TCGCCGAGAT CACCACCGGG
GTAGACGAGG GGGAGGTGGA GCAGTTGCTG TACAACCTGG GCGTGGTGCC TCTGTTGAAG
GCCAGGGAGG AGGGCATCCG CGGCGCCGAA GTCTACATAA ACGGCAGATT GATCGGCGTG
CATCCAAACC CGGAGGAGCT CGCGAAAACC GTCAGGGGCA TGAGGAGGCA GGGGAAGATC
AGCGACGAGA TAAACATAGC ATACCTAGAC GGCTCCGTGT ATGTGAACTG CGACGGCGGC
CGCATCAGGA GGCCTTTACT CGTCGTCGAG GGCGGGAGGC TCAAGCTGAC GAAGGAGATC
GTGGAAAAGG TGAGACGTGG CGACTTGACT TGGGACGACT TGGTCAAGAT GGGGGTTATC
GAGTACCTAG ACGCCGACGA GGAGGAGAAC GCCCACATCG CCGTTGACCC AGACGTCGAC
CTCAGCAACT ACACACATGT GGAGATCGTG CCGTCGTCTA TACTCGGCGC CGTGGGCTCC
ATTATCCCCT ACCTCGAGCA CAACCAGTCG CCGCGTAACC AGTACGAAGC AGCTATGGCG
AAGCAGTCGC TCGGCCTACC CCAGTCCAAC TTCTTGTACA AGCTGGACTC GCGGGGCCAC
ATGTTGTACT ACCCCGAGAG GCCTATAGTG ACCACGCGCG GTTTGGAGCT CGTGGGCTAC
TCCAGAAGGC CGGCGGGCCA AAACGCGGTT GTGGCGCTAC TCACCTACAC GGGCTACAAC
ATAGAGGACG CCGTGATCTT GAACAAGGCC TCTGTGGAGC GCGGCATGTT CCGCTCGGTC
TTCTACAGAA CGTACGAGAC CGAGGAGCAG AAGTACCCAG GCGGGGAGGA GGACAAGATA
GAGATCCCCG ACAGCTCCGT CAAGGGCTAC AGGGGCCCGG AGGCCTACAG CCACCTGGAC
GAGGATGGAA TAGCCCCGCC CGAGGTCTAC GTCAGTAGCA ACGAGGTCCT GGTGGGGAAG
ACCTCTCCGC CGAGGTTCTA CACAACGCTG GAGGCCGAGC GTATCTTGAA GGAGAGGAGG
GACGCCTCTG TGGCTGTGAG AAGGGGGGAG AAGGGCATCG TGGACAGGGT CATCATTACG
GAGTCTCCAG AGGGCAACAA GCTCGTCAAG GTGAGGTTGA GGGAGCTGAG GGTGCCGGAG
CTTGGGGATA AGTTCGCCTC CCGCCACGGG CAGAAGGGGG TCGTGGGCAT GTTGCTTAGG
CATGAGGACA TGCCGTTTAC CGAAGACGGC ATCGTGCCCG ACATAATAGT CAACCCACAC
GCGTTGCCGT CGCGTATGAC CGTGGCGCAA CTACTCGAGA GCATGGCCGG CAAAGTAGGC
GCCTCCCTCG GCGCTTTGGT AGACGCCACA GCCTTCGAGG GGGTGAAGGA GGAGGATCTG
AGGAAGCTGT TGCTTAAGCT CGGCTACAAG TGGGACGGCA AGGAGGTCAT GTACAGCGGC
ATCACCGGCG AGAGGCTGGT GGCCGACATA TTCATAGGCG TTGTCTACTA CCAGAAGCTA
CACCACATGG TTGCCGACAA GATCCACGCG AGGGCAAGGG GCCCCGTCCA GATCTTGACT
AGGCAGCCGA CCGAAGGCCG CTCCCGCGAG GGCGGTCTGA GGCTTGGCGA AATGGAGAGA
GACGTGTTGA TAGCCCACGG CGCCTCGGCC CTGCTGTATG AGAGGCTTGT GGAGTCCAGC
GACAAATACA CCATGTATGT CTGCGAGCTC TGTGGACTAC CGGCGTATCT AGACGCCAAG
TCAAACAAGC CTAAGTGCCC CATACACGGC GACACAGGCC AATTCGCCAA GGTGGTTGTG
CCCTATGCCT ACAAGCTACT TCTACAGGAG CTCATAGCAC TTGGGATATA TCCAAAGCTG
GAGCTCACCG AGGTGTTGGA GTAG
 
Protein sequence
MVDLLPLPVI VSQQDGKFPT RDDRWVLVER FIKDKGLANH QIKSFNDFLD KKLPKIVEDF 
KVVETEIKGL KLVLERIEVG WPKIKESDGS ESIIYPMEAR LRNATYSAPL YLTATLYVDD
EPYVTETFYI GELPIMVKSK RCNLTRLRPG EYIKKFEDPQ DFGGYFIING SERVIISQED
LVADRPIYDR GDKPSVKYIA KTISTGIGYR STLTVELNRD GVIYVTLSAM PVKIPFPIYM
KALGLETDED VVKAVSDDPE IQKELLPSLI AANQIAITRE DALDFIGGKV AVGQPKPIRV
EKAYQLLDKY FLPHLGTTTP DEKRQQEIRL KKALMLGQIV KGLVELQLGR RKPDDKDHVA
NKRVRLVGDL MTQLFRTVLK QFLQELKSQL EKYYARGRIP HIQTIVRPDI ITERVRQALA
TGNWVGGKTG VSQILDRTNY MSTLSYLRRV VSSLSRTQPH FEARDLHPTQ WGRLCAVETP
EGQNVGLVKN LALLAEITTG VDEGEVEQLL YNLGVVPLLK AREEGIRGAE VYINGRLIGV
HPNPEELAKT VRGMRRQGKI SDEINIAYLD GSVYVNCDGG RIRRPLLVVE GGRLKLTKEI
VEKVRRGDLT WDDLVKMGVI EYLDADEEEN AHIAVDPDVD LSNYTHVEIV PSSILGAVGS
IIPYLEHNQS PRNQYEAAMA KQSLGLPQSN FLYKLDSRGH MLYYPERPIV TTRGLELVGY
SRRPAGQNAV VALLTYTGYN IEDAVILNKA SVERGMFRSV FYRTYETEEQ KYPGGEEDKI
EIPDSSVKGY RGPEAYSHLD EDGIAPPEVY VSSNEVLVGK TSPPRFYTTL EAERILKERR
DASVAVRRGE KGIVDRVIIT ESPEGNKLVK VRLRELRVPE LGDKFASRHG QKGVVGMLLR
HEDMPFTEDG IVPDIIVNPH ALPSRMTVAQ LLESMAGKVG ASLGALVDAT AFEGVKEEDL
RKLLLKLGYK WDGKEVMYSG ITGERLVADI FIGVVYYQKL HHMVADKIHA RARGPVQILT
RQPTEGRSRE GGLRLGEMER DVLIAHGASA LLYERLVESS DKYTMYVCEL CGLPAYLDAK
SNKPKCPIHG DTGQFAKVVV PYAYKLLLQE LIALGIYPKL ELTEVLE