Gene Hore_01090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_01090 
Symbol 
ID7313236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp124653 
End bp127925 
Gene Length3273 bp 
Protein Length1090 aa 
Translation table11 
GC content41% 
IMG OID643610531 
ProductDNA-directed RNA polymerase, beta subunit 
Protein accessionYP_002507865 
Protein GI220930957 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000929297 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGG GATTGAGGAG AGAAAGGTAT AGTTTTGCTA AAATTGAGGA TGCACATGAA 
GCACCTTATT TACTTAACAC ACAGATAAGC TCTTATAATT GGTTTCTGGA GGAAGGGTTA
AAAGAAGTAT TTGAAGAAAT TTCACCAATT GAGGATTTTT CTGAGAACCT TGTCCTGGAA
TTTGTCGATT ACCATCTGGG AGAGCCCAAA TACAATGAAG AAGAGTGCCG GGATAGGGAT
GCTACCTATG CAGCACCCTT ACAGGTAAAA GTAAGATTAA TAAATAAAGA TACCGGTGAA
GTTAAAGAAC AGGAAGTTTT TATGGGTGAT TTCCCTTTAA TGACAGACAA GGGAACCTTT
ATTATTAATG GTGCCGAAAG GGTAGTTGTT AATCAGTTAA TCCGGTCTTC AGGTGTTTAT
TTCGGGGAAG AGAGAACTAA AGATGGTAGG CGTCTCGTTT CTGCCAATAT AATTCCTAAT
AGAGGGGCCT GGATTGAATT CGAGTATGAT AAAAAACGAA TTGTTTCAGT CAGGGTTGAC
AGAACCCGTA AAATGCCATC AACTGTACTC TTTAGAGCTC TTGGTTATGG AACAGATGCT
GAATTAATAG ATTTATTCGG TGAACATGAA GTTATACATG ATACCCTTGA AAGGGATAAC
ACTGATTCCC AGGAAGAAGC CCTGATTGAA TTATATAAAA GGTTAAGGCC GGGTGAACCT
CCCACTGTTG AAAGTGCTAA AAATTTACTA GAATCATTAT TCTTTGATCC CAAACGGTAT
GACCTGGCCC TTGTCGGAAG GTATAAAATT AATAAAAAGC TGGGCCTTGA CATTGATCCT
GATAAAAAAT GCCTTGATAA GAGGGATATT GTAGAAACAG TACGTTATTT GCTGAGAATT
ATAGATAATG ATCCGGAGGC CAGTATTGAT GATATAGATC ATCTGGGGAA TCGTCGTTTA
AAGACTGTTG GAGAATTGTT ACAAAATCAG TTCAGGATCG GTTTATCCAG AATGGAACGG
GTTGTCAAGG AAAGAATGAC TATTCAGGAC ATAGATGTAG TAACACCCCA GGCTTTAATT
AATACCAGAC CAGTAGTGGC ATCTATTCAG GAGTTTTTTG GAAGCAGTCA GTTATCTCAG
TTTATGGACC AGACCAATCC CCTGTCTGAA CTGACCCATA AGAGGAGATT AAGTGCTCTT
GGACCGGGTG GGTTAAGTCG AGACCGGGCC GGTTTCGAAG TCAGGGACGT ACATCACTCT
CATTATGGTA GGATCTGTCC AATTGAGACA CCGGAAGGTC CCAATATTGG TCTAATAGGA
TCAATGAGTA CTTATGCCCG TACCAATAAA TTTGGTTTTC TGGAAACACC ATATAGAAAA
GTAGTTAATG GTAAGGCAAC CAATGAAATA GAATATTTAA CTGCTGATGA AGAAGATAAA
TACACCATTG CCCAGGCTAA TGAACCCTTT GATGAAGATG GTAATTTCTT AAATGACCTT
GTAATTGCCA GACATCGTGG TGATATTCTG GAAGTATCTC CTGAAAAAGT TGATTATATG
GATGTTTCTC CAAAACAGCT AGTTGGGGTT TCTGCTTCAA TGATACCCTT TTTAGAAAAT
GACGATGCTA ACAGGGCCCT AATGGGAGCA AACATGCAGC GGCAGGCTGT TCCCCTTATT
AAACCTGATG CCCCCATTGT GGCTACTGGA ATGGAATATA GAGCGGCTAA AGACTCCGGG
GTTGTAATTA TTGCAAAAAA TTCAGGTGTA GTTACCAGGG TTACCGCTGA TGAGATTGTC
ATTAAAACAG ATGATGGTAA GATTGATACC TACAAAATCC TTAAATTTAA GCGTTCAAAC
CAGGGTAGTT GTATAAATCA GCGTCCTATA GTCCGGAAAG GCCAGCGTGT TGACAAAGGC
GATGTTATCG CTGATGGTCC GTCAACAGAC CATGGTGAAA TGGCCCTGGG ACGTAATACT
TTGATTGCCT TTATGCCCTG GGAAGGTTAT AATTATGAGG ATGCTATTTT AATCAGTGAA
AAACTGGTTA AAGAAGATGC CTTTACTTCA GTTCACATAG AAGAGTATGA GGCTGAAGCC
CGGGATACAA AACTTGGCCC TGAAGAAATT ACCAGGGATA TCCCCAATGT TGGCGAAAAT
GCTCTTAAAA ACCTTGATGA ACGCGGTATA ATCAGGGTAG GAGCAGAAGT AAAAGAAGGA
GATATTCTGG TCGGCAAGGT TACCCCTAAA GGTGAAACCG AGTTATCAGC CGAAGAAAGG
TTGTTAAGAG CTATCTTTGG TGAAAAGGCC AGGGAAGTCA GGGATACTTC CCTGAAAGTA
CCCCATGGTG AAGAAGGTAT TATTGTTGAT GTTAAAGTCT TTTCCAGGGA AAATGGAGAC
GAATTAAAGC CCGGTGTAAA TAAACTGGTC AGGGTTTATG TAGCTACCAA ACGCAAGATT
TCTGTTGGTG ATAAAATGGC CGGACGTCAC GGTAATAAAG GTGTAATATC CAGGATTTTA
CCGGAAGAAG ATATGCCCTT TTTACCGAAT GGTGAACCAG TTGAAGTAGT ACTGAATCCC
CTGGGTGTAC CATCACGTAT GAATATCGGG CAGGTTCTGG AAACTCATCT TGGACTTGCT
GCCAAAGCCC TGGGGCTTTA TGTTGAGACA CCTGTATTTA ATGGAGCTCT TGAAGAAGAA
GTAGAAGATT TACTCGAAAA AGCCGGGTTT GACAGGGATG GCAAGACTGT TCTTTATGAT
GGGCGAACAG GTGAGCCCTT TGATAATAGA GTAACTGTTG GTTATATGTA TGTTCTGAAA
CTCCACCATC TGGTAGATGA TAAGATTCAT GCCCGTTCTA CCGGACCTTA TTCACTGGTT
ACCCAGCAGC CTCTTGGTGG TAAGGCTCAG TTTGGTGGCC AGAGGTTCGG TGAGATGGAA
GTCTGGGCTC TGGAAGCATA TGGTGCTGCC TACAGTCTGC AGGAAATGTT AACTATCAAA
TCAGACGATG TTGTGGGCAG GGTAAAAACC TATGAGGCCA TTGTAAAAGG TGAAAATGTT
CCTGAACCAG GAATTCCCGA ATCGTTTAAA GTACTAATCA AAGAGATGCA GAGTCTTGGA
CTTGATGCCA AAATCTTCAC CGAGGATGAG GAAGAACTGC AAATCGCAGA GGAAGAAGAA
GAGTTTACTG ATACTGCAAA AAAACTGGGA CTTGATATGG ATCTAAGTGA TAACAATAAG
AAAGGTTCTG AAAAAGATTC CAGTGAAGAA TAA
 
Protein sequence
MAKGLRRERY SFAKIEDAHE APYLLNTQIS SYNWFLEEGL KEVFEEISPI EDFSENLVLE 
FVDYHLGEPK YNEEECRDRD ATYAAPLQVK VRLINKDTGE VKEQEVFMGD FPLMTDKGTF
IINGAERVVV NQLIRSSGVY FGEERTKDGR RLVSANIIPN RGAWIEFEYD KKRIVSVRVD
RTRKMPSTVL FRALGYGTDA ELIDLFGEHE VIHDTLERDN TDSQEEALIE LYKRLRPGEP
PTVESAKNLL ESLFFDPKRY DLALVGRYKI NKKLGLDIDP DKKCLDKRDI VETVRYLLRI
IDNDPEASID DIDHLGNRRL KTVGELLQNQ FRIGLSRMER VVKERMTIQD IDVVTPQALI
NTRPVVASIQ EFFGSSQLSQ FMDQTNPLSE LTHKRRLSAL GPGGLSRDRA GFEVRDVHHS
HYGRICPIET PEGPNIGLIG SMSTYARTNK FGFLETPYRK VVNGKATNEI EYLTADEEDK
YTIAQANEPF DEDGNFLNDL VIARHRGDIL EVSPEKVDYM DVSPKQLVGV SASMIPFLEN
DDANRALMGA NMQRQAVPLI KPDAPIVATG MEYRAAKDSG VVIIAKNSGV VTRVTADEIV
IKTDDGKIDT YKILKFKRSN QGSCINQRPI VRKGQRVDKG DVIADGPSTD HGEMALGRNT
LIAFMPWEGY NYEDAILISE KLVKEDAFTS VHIEEYEAEA RDTKLGPEEI TRDIPNVGEN
ALKNLDERGI IRVGAEVKEG DILVGKVTPK GETELSAEER LLRAIFGEKA REVRDTSLKV
PHGEEGIIVD VKVFSRENGD ELKPGVNKLV RVYVATKRKI SVGDKMAGRH GNKGVISRIL
PEEDMPFLPN GEPVEVVLNP LGVPSRMNIG QVLETHLGLA AKALGLYVET PVFNGALEEE
VEDLLEKAGF DRDGKTVLYD GRTGEPFDNR VTVGYMYVLK LHHLVDDKIH ARSTGPYSLV
TQQPLGGKAQ FGGQRFGEME VWALEAYGAA YSLQEMLTIK SDDVVGRVKT YEAIVKGENV
PEPGIPESFK VLIKEMQSLG LDAKIFTEDE EELQIAEEEE EFTDTAKKLG LDMDLSDNNK
KGSEKDSSEE