Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_2331 |
Symbol | |
ID | 7318116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 2452871 |
End bp | 2456947 |
Gene Length | 4077 bp |
Protein Length | 1358 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643617227 |
Product | DNA-directed RNA polymerase, beta subunit |
Protein accession | YP_002514398 |
Protein GI | 220935499 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR02013] DNA-directed RNA polymerase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.524263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTACA GCTACACCGA AAAGAAACGC ATCCGCAAGG ATTTCGGCAA ACGCCCCCAG ATCCTCGATG TCCCCTACCT GTTGACCACG CAGCTGGATT CATACCGCCA GTTTCTGCAG GCCGACCGCA GCGAGGATAA TCGACAGGAC GTGGGCCTGC ACGCGGCGTT CAAGACGGTG TTCCCCATTG TGAGTTACTC GGGCACCGTT GAACTCGAAT ACGTGAGCTA CCGCCTCGGC AAGCCGGTGT TCGACGTCAA GGAGTGCCAG CTGCGCGGCA TGACCTATGC CGCTCCCCTG CGCGTGCTGC TGCGCCTGGT GATCTATGAC AAGGACGCCC CTGCGGGCTC CCGCGTGGTC AAGGACATCA AGGAGCAGGA AGTCTACATG GGCGAACTGC CGCTCATGAC CGAGAACGGC ACCTTCGTGA TCAACGGTAC CGAGCGGGTG ATCGTCTCCC AGCTGCACCG TTCGCCGGGC GTGTTCTTCG ACCACGACAA GGGCAAGACA CACAGCTCGG GCAAGCTGCT GTTCAATGCC CGCGTGATCC CCTACCGCGG TTCCTGGCTG GACTTCGAGT TCGATCCCAA GGACAGCGTG TTCGTGCGCA TCGACCGCCG CCGCAAGCTG CCGGCCACGG TGCTGCTGCG CGCCCTGGGC ATGGAGACCG AGGAGATCCT CGCCACCTTC TTCGAGACCA ACACCGTCAG CATCACCAAG GACGGTTTCG ACATGGAGCT GATCCCCGAG CGTCTGCGCG GTGAAGTGGC AGCCTTCGAC TTCAAGGTGA AGAACAAGGT GCTGGTGGAA AGCGGGCGTC GCATCACCGC CCGTCACGTG CGCGAGCTGG AGGCCGCCGG GATCAAGTCC CTGGAAGTGC CCGCCGAGTA CCTGGTGGGC AAGGTGCTGG CCCACGCGGT CATCGACGAG GACAGCGGCG AGCTGGTGGC CAACGCCAAC GACGAGATCA CCGATGAGCT GCTGAAGAAG CTGCGCGCCG CCGGCATCAA GTCGTTCAAG ACCCTGTACA CCAACGACCT GGACCACGGT CCGTATATCT CCACCACCCT GAACATCGAC ACCTGCCGCT CCCAGCTGGA GGCGCAGGTG GAGATCTACC GCATGATGCG TCCCGGCGAG CCCCCCACCA AGGAGGCCGC GGAGAACCTG TTCAACAACC TGTTCTTCAC CGAGGAGCGC TACGACCTCT CCGCCGTGGG CCGCATGAAG TTCAACCGTC GCGTGGGCCG CGAGGAGATC ACCGGTCCCG GCGTGCTGGA CAAGGACGAC ATCCTGGCGG TGCTCAAGAC CCTGATCGAC ATCCGCAACG GCAACGGCCA GGTGGATGAC ATCGACCACC TGGGCAACCG CCGCGTGCGT TCCGTGGGCG AGATGGCGGA GAACGTCTTC CGTGTCGGCC TGGTGCGCGT CGAGCGCGCC GTGAAGGAGC GCCTGAGCGT CGCCGAGAGC GAAGGCCTCA TGCCCCAGGA ACTGATCAAC GCCAAGCCCG TGGCGGCGGC GGTGAAGGAG TTCTTCGGCT CCAGCCAGCT GTCCCAGTTC ATGGACCAGA ACAACCCGCT CTCCGAGGTG ACCCACAAGC GCCGCATCTC GGCCCTGGGC CCCGGCGGCC TGACCCGCGA GCGTGCCGGC TTCGAGGTGC GCGACGTGCA CCCGACCCAT TACGGCCGCG TGTGCCCGAT CGAGACCCCT GAAGGTCCGA ACATCGGTCT GATCAACTCC CTCGCCGTGT ATGCACGTAC CAACGATTAC GGTTTCCTGG AGACCCCCTA CCGCAAGGTG GAAAACGGCA AGGTGACCAA CGAGATCGTG TACCTGTCCG CCATCGAGGA AGGCCAGTAC GTGATCGCCC AGGCCAACGC CAGTCTGGAT GCCAAGGGCA ACCTGGTGGA CGAACTGGTG TCCTGCCGCC ACGCCAATGA ATTCACCATG TCCACCCCGG ACAAGATCGA GTTCATGGAC ATCTCGCCCA AGCAGATCGT GTCCGTGGCC GCCGCGCTGA TCCCGTTCCT GGAGCACGAC GACGCCAACC GCGCGCTGAT GGGCTCCAAC ATGCAGCGCC AGGCCGTGCC CTGCCTGCGT GCCGAGACCG CCGTGGTGGG GACCGGCATC GAGCGCACCG TGGCCATCGA CTCCGGTTCC TCCATCGTCG CCCGCCGTGG CGGCGTGGTG GACTCCGTGG ACGCCGCCCG CATCGTGGTG CGCGTCAATG ACGATGAGAC CGAGGCTGGC GAGCCGGGCG TGGACATCTA CAACCTGACC AAGTACACCC GCTCCAACCA GAACACCTGC ATCAACCAGC GTCCGCTGGT GAACGTGGGG GACGTGCTGG CCCGGGGCGA CGTGCTGGCC GACGGTTCCT CCACGGATCT GGGCGAGATG GCCCTGGGGC AGAACATGAT GGTCGCCTTC ATGCCCTGGA ACGGCTACAA CTTCGAGGAC TCCATCCTCA TCTCCGAACG CGTGGTGCAG GAGGATCGTT TCACCTCCAT CCACATCGAG GAGCTGACCT GCGTGGCCCG CGACACCAAG CTCGGGCCGG AGGAGATCAG CGCGGACATC CCGAACGTGT CCGAGAGCCT GCTCTCCAAG CTGGATGAGT CCGGCATCGT GTACGTGGGC GCCGAGGTCA AGCCCAACGA CATCCTGGTG GGCAAGGTCA CGCCCAAGGG CGAGACCCAG CTGACCCCGG AAGAGAAGCT GCTGCGCGCC ATCTTCGGTG AGAAGGCCTC CGACGTGAAG GACACCTCCC TGCGCGTGCC CTCCGGCATG GAAGGCACTG TGATCGACGT GCGCGTGTTC ACCCGCGACG GCGTCGACAA GGACAAGCGT GCCCTGCAGA TCGAGGAGGC GGCGCTGGCC GCCGTGCGCA AGGATCTCAA GGACCAGCTG CGCATCTACG AGGACGACAT CTACGACCGC GTCGAGAAGC TGCTGGTGGG CAAGCTGGCC GCCGGTGGCC CCAACAAGCT CAAGGACGGC ACCAAGGTCA CCAAGACCTA CCTTACCGAG GTGCCCCGCG AGAAGTGGTT CGAGGTGCGC ATGCGCACCG AAGAGGTCAA CGAGCAGCTG GAGAAGATGG CTGCATCCCT GAAGGAGCAG CACGAGGCCT TCGAGACCCG CTTCAAGGAG CAGAAGGAAA AGCTCACTCA GGGTGACGAT CTCGCCCCCG GCGTGCTGAA GATGGTCAAG GTCTACCTGG CCGTGAAGCG CCGCATGCAG CCCGGCGACA AGATGGCCGG CCGCCACGGT AACAAGGGTG TGGTCTCCAT GATCGTGCCC GTGGAAGACA TGCCGTATCT GGAAGACGGC ACCCCCGTGG ACATCGTGCT CAACCCGCTC GGCGTGCCTT CCCGTATGAA CGTGGGTCAG GTGCTCGAGA CCCATCTGGG CTGGGCCGCC AAGGGTCTGG GCCAGAAGAT CGGCCGCATG GTCGAGGCCA AGGCCAAGAT CGACGAACTG CGCAAGTTCC TCGACAAGAT CTACAACCAC AGTGCCAAGA AGGTGGATCT GTCCGATTTC AGCGACGAGG AGATCCTCAA GCTGTGCGCC AACCTGAAGA AGGGCGTGCC CATGGCGACC CCGGTGTTCG ACGGTGCCGA GGAAGAAGAG ATCAAGGCCA TGCTGAAGCT GGCCGATCTG CCCGAGAGCG GTCAGACCAC CCTGTTCGAC GGTCGTACCG GCGAGTCCTT CGACCGTCCC GTGACCGTGG GTTACATGCA CATGCTCAAG CTCAACCACC TGGTGGATGA CAAGATGCAT GCCCGTTCCA CCGGTCCGTA CAGCCTGGTG ACCCAGCAGC CGCTGGGTGG TAAGGCGCAG TTCGGTGGTC AGCGCTTCGG CGAGATGGAG GTCTGGGCGC TGGAGGCCTA TGGCGCCGCC TACACCCTGC AGGAGATGCT CACGGTGAAG TCCGACGACG TTCAGGGCCG CAACAAGATG TACAAGAACA TCGTCGACGG CGACCACCGG ATGGAGGCGA ACATTCCGGA GTCCTTCAAC GTGCTGATGA AGGAAATCCG TTCGCTGGCC ATCAACATCG AGCTGGAACA GGACTAA
|
Protein sequence | MAYSYTEKKR IRKDFGKRPQ ILDVPYLLTT QLDSYRQFLQ ADRSEDNRQD VGLHAAFKTV FPIVSYSGTV ELEYVSYRLG KPVFDVKECQ LRGMTYAAPL RVLLRLVIYD KDAPAGSRVV KDIKEQEVYM GELPLMTENG TFVINGTERV IVSQLHRSPG VFFDHDKGKT HSSGKLLFNA RVIPYRGSWL DFEFDPKDSV FVRIDRRRKL PATVLLRALG METEEILATF FETNTVSITK DGFDMELIPE RLRGEVAAFD FKVKNKVLVE SGRRITARHV RELEAAGIKS LEVPAEYLVG KVLAHAVIDE DSGELVANAN DEITDELLKK LRAAGIKSFK TLYTNDLDHG PYISTTLNID TCRSQLEAQV EIYRMMRPGE PPTKEAAENL FNNLFFTEER YDLSAVGRMK FNRRVGREEI TGPGVLDKDD ILAVLKTLID IRNGNGQVDD IDHLGNRRVR SVGEMAENVF RVGLVRVERA VKERLSVAES EGLMPQELIN AKPVAAAVKE FFGSSQLSQF MDQNNPLSEV THKRRISALG PGGLTRERAG FEVRDVHPTH YGRVCPIETP EGPNIGLINS LAVYARTNDY GFLETPYRKV ENGKVTNEIV YLSAIEEGQY VIAQANASLD AKGNLVDELV SCRHANEFTM STPDKIEFMD ISPKQIVSVA AALIPFLEHD DANRALMGSN MQRQAVPCLR AETAVVGTGI ERTVAIDSGS SIVARRGGVV DSVDAARIVV RVNDDETEAG EPGVDIYNLT KYTRSNQNTC INQRPLVNVG DVLARGDVLA DGSSTDLGEM ALGQNMMVAF MPWNGYNFED SILISERVVQ EDRFTSIHIE ELTCVARDTK LGPEEISADI PNVSESLLSK LDESGIVYVG AEVKPNDILV GKVTPKGETQ LTPEEKLLRA IFGEKASDVK DTSLRVPSGM EGTVIDVRVF TRDGVDKDKR ALQIEEAALA AVRKDLKDQL RIYEDDIYDR VEKLLVGKLA AGGPNKLKDG TKVTKTYLTE VPREKWFEVR MRTEEVNEQL EKMAASLKEQ HEAFETRFKE QKEKLTQGDD LAPGVLKMVK VYLAVKRRMQ PGDKMAGRHG NKGVVSMIVP VEDMPYLEDG TPVDIVLNPL GVPSRMNVGQ VLETHLGWAA KGLGQKIGRM VEAKAKIDEL RKFLDKIYNH SAKKVDLSDF SDEEILKLCA NLKKGVPMAT PVFDGAEEEE IKAMLKLADL PESGQTTLFD GRTGESFDRP VTVGYMHMLK LNHLVDDKMH ARSTGPYSLV TQQPLGGKAQ FGGQRFGEME VWALEAYGAA YTLQEMLTVK SDDVQGRNKM YKNIVDGDHR MEANIPESFN VLMKEIRSLA INIELEQD
|
| |