Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0864 |
Symbol | |
ID | 4709608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 932224 |
End bp | 936585 |
Gene Length | 4362 bp |
Protein Length | 1453 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639855323 |
Product | DNA-directed RNA polymerase subunit beta' |
Protein accession | YP_001002442 |
Protein GI | 121997655 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.213662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCGGG GTCCGAGCCG CCGGGCTAGC GGCGGCGACT TGAGATCAAC GCGCGAGCGG GCAGCAGACA TGAGAGATCT TCTCAACCTT TTCAAGCAGC CGGGTGCTCA GCTGGAGGAC TTCGATGCGA TTCGGATCGG GCTGGCATCG CCCGAGATGA TCCGCTCCTG GTCCTACGGC GAGGTCAAGA AGCCGGAGAC CATCAACTAC CGCACCTTCA AGCCGGAGCG TGACGGGCTC TTCTGCGCCA AGATCTTCGG CCCGGTGAAG GACTACGAGT GCCTGTGCGG CAAGTACAAG CGCCTCAAGC ACCGCGGCGT GGTCTGTGAG AAGTGCGGCG TCGAGGTGAC CGTCGCCAAG GTACGGCGCG AGCGCATGGG CCATATCGAC CTGGCCAGCC CCGTGGCGCA CATCTGGTTC CTCAAGAGCC TGCCGTCGCG CATCGGCCTG CTGCTGGACA TGACCCTACG CGACGTCGAG CGGGTGCTCT ACTTCGAGGC GTACATCGTC ATCGAGCCCG GCATGACGCC GCTCGAGCAG GGGCAGCTGC TCACCGACGA GCAGTACCTG GAGGCGGTCG AGGAGCACGG CGACGAGTTC GACGCCCGCA TGGGCGCCGA GGCGGTGCTC GAGATCCTCA AGGGCATGGA CCTGGAGGCC GAGGCTCGGC GTCTGCGCGA CGACATCGAG GCCACCGGCT CTGAGTCCAA GATCAAGCGC CTGTCCAAGC GGCTCAAGCT CCTCGAGGCC TTCCTTGAGT CGGGGAACAA GCCCGAGTGG CTGATCATGA CCGTCCTGCC CGTGCTGCCG CCGGACCTGC GGCCGCTGGT GCCCCTCGAC GGCGGCCGGT TCGCCACGTC GGACCTCAAC GACCTCTACC GGCGGGTGAT CAACCGCAAC AACCGCCTCA AGCGGCTGCT CGAACTGGCC GCGCCGGACA TCATCGTGCG CAACGAGAAG CGCATGCTCC AGGAGTCCGT CGACGCCCTG CTCGACAACG GTCGCCGCGG CCGGGCGATC ACGGGCACCA ACAAGCGCCC GCTCAAGTCC CTGGCCGACA TGATCAAGGG CAAGCAGGGT CGCTTCCGCC AGAACCTGCT GGGCAAGCGT GTCGACTACT CCGGCCGCTC GGTCATCGTC GTCGGCCCTA CCCTGCGCCT GCACCAGTGT GGTCTGCCCA AGCGCATGGC CCTGGAGCTG TTCAAGCCGT TCATCTTCTC CAAGCTGCAG CGCCGCGGCC TGGCCACCAC CATCAAGGCG GCCAAGAAGA TGGTCGAGCG CGAGACCGGC GAGGTCTGGG ACATCCTCGA CGAGGTCATC CGCGAGCACC CGGTGCTGCT CAACCGCGCG CCGACGCTCC ACCGCCTGGG TATCCAGGCC TTCGAGCCGG TGCTCATCGA GGGCAAGGCC ATCCAGCTCC ACCCGCTGGT GTGCACCGCC TACAACGCCG ACTTCGACGG CGACCAGATG GCGGTCCACG TGCCGCTGTC GCTCGAGGCG CAGCTCGAGT CCCGGGCGAT GATGATGTCC ACCAACAACA TCCTCTCGCC GGCCTCGGGC GAGCCGATCA TCGTCCCCTC CCAGGACGTG GTGCTCGGCA TCTATTACAT GACCCGCGAG CGCGTAAACG CCCGCGGCGA AGGCATGCGC CTGGCCGGCG TCGAAGAGGT TCACCGCGCC TACCAGACCG GCGCCGCGGA ACTCGGCGCG CGGGTCGAGG TGCGCATCCG CGAGCGGGTC TTCGACGACA GCGGTGAGAT GGTCGAGCGC GTTCAGCGCC GGCAGACCAC CATCGGCCGG GCCCTGCTGT TCGACATCGT GCCGGACGGT CTGCCCTTCG AGGCCGTCGA CCGTGAGCTC GACAAGAAAG GCGTCTCGGG GCTGGTGAAC GCCTGCTACC GCCGCGTCGG CCTGAAGGGC ACGGTGGTCT TCGCTGACCA GCTGATGTAC ACCGGCTTCT TCTATTCCAC CAAGGCCGGC GTCTCCATCG GTGTCGACGA CATGGAGGTG CCCACCGACA AGGAAGAGGC CCTGCGCTCG GCCGAGGAAG AGGTCCGCGA GATCGAGGAC CAGTACGCCT CGGGCCTGGT CACCAGCGGC GAGCGTTACA ACAAGGTCGT CGACATCTGG GCGCACACCA ACGACCAGGT GGCTGGCGCC ATGATGGAGA AGATGGGCAA GGAGACGGTC GTCGATGCCG AGGGCAACGA GACCGAGCAG AAGTCGCTCA ACTCCATCTT CATCATGGCC GACTCCGGCG CCCGTGGTTC GGCGGCGCAG ATCCGTCAGC TCGCCGGCAT GCGCGGCCTG ATGGCCAAGC CGGACGGCTC GATCATCGAG ACGCCGATCA CCGCCAACTT CCGCGAGGGC CTCAACGTCC TGCAGTACTT CATCTCCACC CACGGCGCCC GTAAGGGTCT GGCCGACACG GCGCTAAAGA CCGCCAACTC CGGCTACCTG ACCCGCCGCT TGGTGGACGT CTCCCAGGAC CTGGTGGTCA CCGAGGAAGA CTGCGGCACC ACCGATGGGT TGGTGCAGAC CCCGATCATT GAGGGCGGCG ACGTGGTCGA GACCCTCGCT GAGCGGGTGC TGGGCCGCGT GGTGGCCGAG GACGTGGCCG TACCGGGGAG CACCGACATC GCCGTCGAGG CCGGGACGCT GCTCGACGAG GACTGGGTCG AGCGCCTCGA GCGCATGGGC GTCGACGAGA TCAAGGTCCG CTCCGCGGTC ACCTGTGAGA CCCGTCACGG CGTCTGCGCC AAGTGCTACG GCCGCGACCT GGCCCGTGGC CACGGCGTGA ACATCGGCGA GGCCGTCGGC GTCATCGCCG CGCAGTCCAT CGGCGAGCCG GGTACGCAGC TGACCATGCG GACCTTCCAC ATTGGCGGCG CCGCCTCGCG GGCGGCCGCG GTCTCCCAGG TGGAGGTGCG CAACACCGGT AAGGCGCGAC TGCACAACAT CAAGACCGTG CAGCACCACT CCGGCAGCTA CGTGGCCGTG TCGCGCTCCG GCGAGCTCAC CGTCATGGAC GAGTACGGCC GCGAGCGGGA GCGCTACAAG ATCCCCTACG GCGCCGTGCT CAGCGTCGGC GACGAGGACC CGGTGGAGGC CGGGCAGGTG GTGGCCAACT GGGACCCCCA TACCCACCCG ATCGTCACCG AGGTGGACGG CTACGTGCGC TTCCACGACT TCGTCGAGGG CGTCACGGTC CAGCGCGAGG TCGACGAGGT CACCGGCCTG TCGAGCCTGG TGGTCACCGA CCCGAAGAGC CGGGGCACCG GCGAGCACAA GCGTCAGGTC ACCAACGCCA ACGGCAAGGT CACCGAGGAG CGGGTCGCCT ACAAGGACCT GCGGCCGATG ATCAAGCTGG TCGACGGTGA CGGCAACGAC CTGAACATCG CCGGTACGCA GATCCCGGCC CACTACTACC TGCCGGCCGG GGCCATCGTC TCGCTCGAGG ACAACGCCGA GGTGCGCGTG GGTGACGCCC TGGCGCGTAT CCCGCAAGAG GCGTCCAAGA CGCGTGACAT CACCGGTGGT CTGCCGCGGG TGGCCGACCT GTTCGAGGCG CGTAAGCCGA AGGAGCCGGC GATCCTCGCC GAGGCCTCCG GCACCGTCGG CTTCGGCAAG GAGACCAAGG GCAAGCAGCG GCTGATCATC ACCAAGGCCG ACGGCGAGAC CCACGAGGAG CTCATCCCGA AGTGGCGGAA CGTCACCGTC TTTGAGGGCG AGCACGTGGA GAAGGGCGAG ACCATCGCCG ATGGCGAGCC CAACCCCCAC GACATCCTGC GGCTGCTCGG CGTGACCAAG CTCGCCGAGT ACATCGTCCA GGAGATCCAG GACGTCTTCC GCCTCCAGGG TGTGGGCATC AACGACAAGC ACATCGAGGT GATCGTCCGG CAGATGCTGC GCAAGACCAT CGTCTCCGAT CCGGGCGACT CGCTGCACCT CAAGGGCGAG CAGGTGGATC GGGCGAAGCT GCTCGAAGAG AACGAGCAAC TCCAGGCGCA GGACAAGCAG CCGGCGCAGT GGGAGCCGTC GCTGCTGGGC ATCACCAAGG CGTCGCTGTC CACCGAGTCG TTCATCTCGG CAGCCTCCTT CCAGGAGACC ACCCGGGTGC TGACCGAGGC GGCGACTCGC GGTATCCGCG ACGACCTGCG TGGCCTCAAG GAGAACGTCA TCGTCGGCCG GCTCATCCCG GCGGGCACCG GCTTCGCCTA CCACGCAGCC CGTCGCCAGG AGGCACCCGC GCCGGCGGCG ACGCCGGAGC AGCAGGCCGA GGAGGTCTTC GCCTCCCTCG GCCAAGGCGA GGGCGAGGGC CCCAGCCCGT CCGATGAGGC GAGCGGGCCC GAGGTCGAGT AG
|
Protein sequence | MPRGPSRRAS GGDLRSTRER AADMRDLLNL FKQPGAQLED FDAIRIGLAS PEMIRSWSYG EVKKPETINY RTFKPERDGL FCAKIFGPVK DYECLCGKYK RLKHRGVVCE KCGVEVTVAK VRRERMGHID LASPVAHIWF LKSLPSRIGL LLDMTLRDVE RVLYFEAYIV IEPGMTPLEQ GQLLTDEQYL EAVEEHGDEF DARMGAEAVL EILKGMDLEA EARRLRDDIE ATGSESKIKR LSKRLKLLEA FLESGNKPEW LIMTVLPVLP PDLRPLVPLD GGRFATSDLN DLYRRVINRN NRLKRLLELA APDIIVRNEK RMLQESVDAL LDNGRRGRAI TGTNKRPLKS LADMIKGKQG RFRQNLLGKR VDYSGRSVIV VGPTLRLHQC GLPKRMALEL FKPFIFSKLQ RRGLATTIKA AKKMVERETG EVWDILDEVI REHPVLLNRA PTLHRLGIQA FEPVLIEGKA IQLHPLVCTA YNADFDGDQM AVHVPLSLEA QLESRAMMMS TNNILSPASG EPIIVPSQDV VLGIYYMTRE RVNARGEGMR LAGVEEVHRA YQTGAAELGA RVEVRIRERV FDDSGEMVER VQRRQTTIGR ALLFDIVPDG LPFEAVDREL DKKGVSGLVN ACYRRVGLKG TVVFADQLMY TGFFYSTKAG VSIGVDDMEV PTDKEEALRS AEEEVREIED QYASGLVTSG ERYNKVVDIW AHTNDQVAGA MMEKMGKETV VDAEGNETEQ KSLNSIFIMA DSGARGSAAQ IRQLAGMRGL MAKPDGSIIE TPITANFREG LNVLQYFIST HGARKGLADT ALKTANSGYL TRRLVDVSQD LVVTEEDCGT TDGLVQTPII EGGDVVETLA ERVLGRVVAE DVAVPGSTDI AVEAGTLLDE DWVERLERMG VDEIKVRSAV TCETRHGVCA KCYGRDLARG HGVNIGEAVG VIAAQSIGEP GTQLTMRTFH IGGAASRAAA VSQVEVRNTG KARLHNIKTV QHHSGSYVAV SRSGELTVMD EYGRERERYK IPYGAVLSVG DEDPVEAGQV VANWDPHTHP IVTEVDGYVR FHDFVEGVTV QREVDEVTGL SSLVVTDPKS RGTGEHKRQV TNANGKVTEE RVAYKDLRPM IKLVDGDGND LNIAGTQIPA HYYLPAGAIV SLEDNAEVRV GDALARIPQE ASKTRDITGG LPRVADLFEA RKPKEPAILA EASGTVGFGK ETKGKQRLII TKADGETHEE LIPKWRNVTV FEGEHVEKGE TIADGEPNPH DILRLLGVTK LAEYIVQEIQ DVFRLQGVGI NDKHIEVIVR QMLRKTIVSD PGDSLHLKGE QVDRAKLLEE NEQLQAQDKQ PAQWEPSLLG ITKASLSTES FISAASFQET TRVLTEAATR GIRDDLRGLK ENVIVGRLIP AGTGFAYHAA RRQEAPAPAA TPEQQAEEVF ASLGQGEGEG PSPSDEASGP EVE
|
| |