Gene Information |
Locus tag | Huta_0782 |
Symbol | |
ID | 8383053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 761116 |
End bp | 764157 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644971846 |
Product | DNA-directed RNA polymerase subunit A' |
Protein accession | YP_003129700 |
Protein GI | 257051867 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02390] DNA-directed RNA polymerase subunit A' |
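
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record states neither convention explicitly. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(761116, 764157)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed gene length of 3042 bp and protein length of 1013 aa.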
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0720962 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCACAG GACAATCACC CAAGTCGATC GGGCGTCTCA GCTTCGGGCT GATGAACCCC GAGGAGTACC GGGATATGAG TGCCACGAAG GTCATCACGG CCGACACCTA CGACGACGAC GGCTTCCCCA TCGACATGGG GCTGATGGAC CCGCGACTCG GCGTGATCGA CCCCGGACTG GAGTGTAAGA CCTGTGGCAA ACACTCGGGC TCGTGTAACG GCCACTTCGG ACATATCGAA CTCGCCGCTC CCGTCATCCA CGTCGGCTTC TCGAAGCTCA TCCGACGGCT CCTTCGCGGA ACGTGTCGGG AGTGTTCGCG GTTGACAGCC GTCGACGGCT CGAAGTACAC TGATGGCGAC GGCAACGTCG ATCTCGAAGC GGCTTCGGCC GCTGCCGAGG ACAAGAAACG CGAATTCAAG GAGGAACTCC GGCGGGCGCG AGAACTCTCG AACGACCCTT CGGACGTCCT CAAGTCCGCG ATCCGGCAGG CCCGCTCGGC CAGCCGGTGT CCCCACTGCG GCGAAAAGCA GTTCGACGTC AAACACGAGA AACCGACGAC GTACTACGAG ATCATCGAGG TTCCCCACGC CGAACTCGCC GAGCGGCTCA GCATGGCGAT GGACCCGCCG GAAGGTGAGC CAGTCACGCC CGACGAACTC GAAGAGGCGA CTGACGGTGG GTTCGACGCC GGTCGGATCC GGGAACTCCT CTCGGACACC TACCGCCCCG ACCGCAATGA CATCCCCGCG CTCCAGGAGA TCGGCAGCGC GCTCCATCGG TTCAACGACC TCGAAGAGCA CCTCGGCGAC GAACTCGACA TCGAGAGTCC GTCGTCGTTC CTCCTCGAGG AGGACATGGA CAAGCTGATG CCCAGCGACA TCCGGGACTG GTTCGAGGAG ATCCCGGACG ACGACCTCGA GGCGATCGGG GTCAATCCTG AAACGTCCCG GCCGGAGTGG ATGATCCTCA CCGTCCTGCC GGTGCCGCCG GTGACGGCCC GTCCCTCGAT CACGCTGGAC AACGGCCAGC GGAGCGAGGA CGACCTGACT CACAAGCTGG TGGACATCAT CCGGATCAAC CAGCGGTTCA TGGAGAACCG CGAGGCCGGC GCGCCACAGC TGATCATCGA GGACCTCTGG GAACTGCTGC AGTACCACGT CACCACGTTC ATGGACAACG AGATCAGCGG GACGCCGCCG GCCCGACACC GCTCCGGACG GCCGCTGAAG ACACTCTCCC AGCGCCTGAA GGGCAAGGAA GGTCGCTTCC GTGGGAGCCT CTCCGGGAAG CGCGTGAACT TCTCGGCCCG GACCGTCATC TCCCCGGACC CGACCCTTAG CCTCAACGAG GTCGGGGTGC CCGACCGGGT CGCAAGCGAG ATGACCCAGA CGATGAACGT CAACGAGCGC AACCTCGCGG AGGCCCGCCA GTACGTGTCC AACGGACCGG AGGCCCATCC GGGTGCGAAC TACGTCCGCC GGCCGGACGG TCGGCGGCTC AAGGTGACCG AGAAGAACTG CGAGGAGCTC GCCGAGAAGG TCGAGCCGGG CTGGGAAGTG GCTCGCCACC TGCTCGACGG CGACATCGTG ATCTTCAATC GCCAGCCGAG TCTCCACCGG ATGTCCATCA TGGCCCACGA GGTCGTGGTG ATGCCCTACA AGACGTTCCG GCTGAACACC GTCGTCTGTC CGCCGTACAA CGCTGACTTC GACGGCGACG AGATGAACAT GCACGCCCTC CAGAACGAGG AGGCCCGTGC CGAGGCTCGC GTCCTCATGC GCGTCCAGGA ACAGATGCTC 
TCGCCGCGCT TCGGCGAGAA CATCATCGGG GCGATCCAGG ACCACATCAC GTCGGTGTAC CTGCTGACCC ACCAGAACCC CCACTTCAAC GAGACCCAGG CCCTGGACCT CCTGCGGGCG ACGAACGTCG ACGAACTGCC GGAACCTGAT GGAGAGGAAG ATGGCCGGGC CTACTGGACG GGTCGGTCGA TCTTCTCGGA ACTGTTGCCG GACGACCTCG ACCTCGAATT CACGAGCGAG GCCGGCGACG ACGTCGTCAT CGAGGACGGC CAGTTGCTCG AAGGGACGAT CGACGACAGC GCAGTCGGGG CCTTCGGCGG CGAGATCGTC GACACGATCG CGAAGGTCTA CGACAAGACG CGGGCGCGGA TCTTCATCAA CGAGGTCTCG ACGCTCGCGG TCCGGACGAT CATGCACTTC GGGTTCTCGA TCGGGATCGA CGACGAGTCG ATCCCGCCGG CAGCCCAGGA GCAGGTCGAG GAGGCAATCG GCAGCGCCTA CGACCGCGTC CAGGAACTCA TCGAGACCTA CGAACAGGGC GATCTCGAAT CGCTCCCGGG TCGGACGGTC GACGAGACCC TGGAGATGAA GATCATGCAG ACGCTCGGCC AGGCCCGTGA CAGCGCGGGT GACATCGCCG AGGACCACTT CGCCGAGGAC AACCCGGCAG TGGTGATGGC CGACTCCGGT GCGCGTGGGT CGATGCTCAA CCTGACCCAG ATGGCCGGAT GTGTCGGCCA GCAGGCAGTC CGCGGCGAGC GGATCAACCG TGGCTACGAG GACCGCACGC TCAGCCACTT CGAAGAGAAC GACCTCTCGG CGGAGGCCCA CGGCTTCGTC GAGCACTCCT ATCGTGAGGG CCTCGGCCCG AAGGAGTTCT TCTTCCACGC GATGGGTGGC CGCGAGGGGC TGGTCGACAC GGCAGTCCGG ACCTCCAAGT CCGGGTACCT CCAGCGTCGG CTGATCAACG CCCTCTCCGA ACTGGAAACT CAGTACGACG GCACCGTTCG GGACACCAGC GATACCGTCG TCCAGTTCGA GTTCGGCGAG GACGGCACCT CGCCCGTGGA TGTTTCCTCC AACGAGGACT TCGACATCGA CGTCGAGGCG ATCACCGAGC GGATCGTCGA GGCCGAGTTC GACGATGAAA GCGAGAAGGC GACGTTCCTC GAGCGCGAGG CGAGTCCGAC CAACCTCTCG GAACACGCAG ACGAGTGGTG GCACGCGGAG GCCGGAGACT GA
Protein sequence | MSTGQSPKSI GRLSFGLMNP EEYRDMSATK VITADTYDDD GFPIDMGLMD PRLGVIDPGL ECKTCGKHSG SCNGHFGHIE LAAPVIHVGF SKLIRRLLRG TCRECSRLTA VDGSKYTDGD GNVDLEAASA AAEDKKREFK EELRRARELS NDPSDVLKSA IRQARSASRC PHCGEKQFDV KHEKPTTYYE IIEVPHAELA ERLSMAMDPP EGEPVTPDEL EEATDGGFDA GRIRELLSDT YRPDRNDIPA LQEIGSALHR FNDLEEHLGD ELDIESPSSF LLEEDMDKLM PSDIRDWFEE IPDDDLEAIG VNPETSRPEW MILTVLPVPP VTARPSITLD NGQRSEDDLT HKLVDIIRIN QRFMENREAG APQLIIEDLW ELLQYHVTTF MDNEISGTPP ARHRSGRPLK TLSQRLKGKE GRFRGSLSGK RVNFSARTVI SPDPTLSLNE VGVPDRVASE MTQTMNVNER NLAEARQYVS NGPEAHPGAN YVRRPDGRRL KVTEKNCEEL AEKVEPGWEV ARHLLDGDIV IFNRQPSLHR MSIMAHEVVV MPYKTFRLNT VVCPPYNADF DGDEMNMHAL QNEEARAEAR VLMRVQEQML SPRFGENIIG AIQDHITSVY LLTHQNPHFN ETQALDLLRA TNVDELPEPD GEEDGRAYWT GRSIFSELLP DDLDLEFTSE AGDDVVIEDG QLLEGTIDDS AVGAFGGEIV DTIAKVYDKT RARIFINEVS TLAVRTIMHF GFSIGIDDES IPPAAQEQVE EAIGSAYDRV QELIETYEQG DLESLPGRTV DETLEMKIMQ TLGQARDSAG DIAEDHFAED NPAVVMADSG ARGSMLNLTQ MAGCVGQQAV RGERINRGYE DRTLSHFEEN DLSAEAHGFV EHSYREGLGP KEFFFHAMGG REGLVDTAVR TSKSGYLQRR LINALSELET QYDGTVRDTS DTVVQFEFGE DGTSPVDVSS NEDFDIDVEA ITERIVEAEF DDESEKATFL EREASPTNLS EHADEWWHAE AGD
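
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method, of how one might strip the block spacing, compute a GC fraction, and translate the gene sequence for comparison with the listed protein. It assumes that the displayed gene sequence is already in coding orientation (it begins with ATG even though the strand is listed as "-") and that the codon assignments of translation table 11 match the standard genetic code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (standard assignments,
# assumed here to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last two blocks of the gene sequence shown above.
first_blocks = "ATGAGCACAG GACAATCACC CAAGTCGATC"
last_blocks = "GCCGGAGACT GA"
print(translate(first_blocks))
print(translate(last_blocks))
print(gc_fraction(first_blocks))
```

The first three blocks translate to the opening residues of the listed protein, and the final blocks translate to its closing residues followed by a stop codon. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field, which the record reports rounded to a whole percentage.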