Gene Huta_0782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0782 
Symbol 
ID8383053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp761116 
End bp764157 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content65% 
IMG OID644971846 
ProductDNA-directed RNA polymerase subunit A' 
Protein accessionYP_003129700 
Protein GI257051867 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02390] DNA-directed RNA polymerase subunit A' 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0720962 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAG GACAATCACC CAAGTCGATC GGGCGTCTCA GCTTCGGGCT GATGAACCCC 
GAGGAGTACC GGGATATGAG TGCCACGAAG GTCATCACGG CCGACACCTA CGACGACGAC
GGCTTCCCCA TCGACATGGG GCTGATGGAC CCGCGACTCG GCGTGATCGA CCCCGGACTG
GAGTGTAAGA CCTGTGGCAA ACACTCGGGC TCGTGTAACG GCCACTTCGG ACATATCGAA
CTCGCCGCTC CCGTCATCCA CGTCGGCTTC TCGAAGCTCA TCCGACGGCT CCTTCGCGGA
ACGTGTCGGG AGTGTTCGCG GTTGACAGCC GTCGACGGCT CGAAGTACAC TGATGGCGAC
GGCAACGTCG ATCTCGAAGC GGCTTCGGCC GCTGCCGAGG ACAAGAAACG CGAATTCAAG
GAGGAACTCC GGCGGGCGCG AGAACTCTCG AACGACCCTT CGGACGTCCT CAAGTCCGCG
ATCCGGCAGG CCCGCTCGGC CAGCCGGTGT CCCCACTGCG GCGAAAAGCA GTTCGACGTC
AAACACGAGA AACCGACGAC GTACTACGAG ATCATCGAGG TTCCCCACGC CGAACTCGCC
GAGCGGCTCA GCATGGCGAT GGACCCGCCG GAAGGTGAGC CAGTCACGCC CGACGAACTC
GAAGAGGCGA CTGACGGTGG GTTCGACGCC GGTCGGATCC GGGAACTCCT CTCGGACACC
TACCGCCCCG ACCGCAATGA CATCCCCGCG CTCCAGGAGA TCGGCAGCGC GCTCCATCGG
TTCAACGACC TCGAAGAGCA CCTCGGCGAC GAACTCGACA TCGAGAGTCC GTCGTCGTTC
CTCCTCGAGG AGGACATGGA CAAGCTGATG CCCAGCGACA TCCGGGACTG GTTCGAGGAG
ATCCCGGACG ACGACCTCGA GGCGATCGGG GTCAATCCTG AAACGTCCCG GCCGGAGTGG
ATGATCCTCA CCGTCCTGCC GGTGCCGCCG GTGACGGCCC GTCCCTCGAT CACGCTGGAC
AACGGCCAGC GGAGCGAGGA CGACCTGACT CACAAGCTGG TGGACATCAT CCGGATCAAC
CAGCGGTTCA TGGAGAACCG CGAGGCCGGC GCGCCACAGC TGATCATCGA GGACCTCTGG
GAACTGCTGC AGTACCACGT CACCACGTTC ATGGACAACG AGATCAGCGG GACGCCGCCG
GCCCGACACC GCTCCGGACG GCCGCTGAAG ACACTCTCCC AGCGCCTGAA GGGCAAGGAA
GGTCGCTTCC GTGGGAGCCT CTCCGGGAAG CGCGTGAACT TCTCGGCCCG GACCGTCATC
TCCCCGGACC CGACCCTTAG CCTCAACGAG GTCGGGGTGC CCGACCGGGT CGCAAGCGAG
ATGACCCAGA CGATGAACGT CAACGAGCGC AACCTCGCGG AGGCCCGCCA GTACGTGTCC
AACGGACCGG AGGCCCATCC GGGTGCGAAC TACGTCCGCC GGCCGGACGG TCGGCGGCTC
AAGGTGACCG AGAAGAACTG CGAGGAGCTC GCCGAGAAGG TCGAGCCGGG CTGGGAAGTG
GCTCGCCACC TGCTCGACGG CGACATCGTG ATCTTCAATC GCCAGCCGAG TCTCCACCGG
ATGTCCATCA TGGCCCACGA GGTCGTGGTG ATGCCCTACA AGACGTTCCG GCTGAACACC
GTCGTCTGTC CGCCGTACAA CGCTGACTTC GACGGCGACG AGATGAACAT GCACGCCCTC
CAGAACGAGG AGGCCCGTGC CGAGGCTCGC GTCCTCATGC GCGTCCAGGA ACAGATGCTC
TCGCCGCGCT TCGGCGAGAA CATCATCGGG GCGATCCAGG ACCACATCAC GTCGGTGTAC
CTGCTGACCC ACCAGAACCC CCACTTCAAC GAGACCCAGG CCCTGGACCT CCTGCGGGCG
ACGAACGTCG ACGAACTGCC GGAACCTGAT GGAGAGGAAG ATGGCCGGGC CTACTGGACG
GGTCGGTCGA TCTTCTCGGA ACTGTTGCCG GACGACCTCG ACCTCGAATT CACGAGCGAG
GCCGGCGACG ACGTCGTCAT CGAGGACGGC CAGTTGCTCG AAGGGACGAT CGACGACAGC
GCAGTCGGGG CCTTCGGCGG CGAGATCGTC GACACGATCG CGAAGGTCTA CGACAAGACG
CGGGCGCGGA TCTTCATCAA CGAGGTCTCG ACGCTCGCGG TCCGGACGAT CATGCACTTC
GGGTTCTCGA TCGGGATCGA CGACGAGTCG ATCCCGCCGG CAGCCCAGGA GCAGGTCGAG
GAGGCAATCG GCAGCGCCTA CGACCGCGTC CAGGAACTCA TCGAGACCTA CGAACAGGGC
GATCTCGAAT CGCTCCCGGG TCGGACGGTC GACGAGACCC TGGAGATGAA GATCATGCAG
ACGCTCGGCC AGGCCCGTGA CAGCGCGGGT GACATCGCCG AGGACCACTT CGCCGAGGAC
AACCCGGCAG TGGTGATGGC CGACTCCGGT GCGCGTGGGT CGATGCTCAA CCTGACCCAG
ATGGCCGGAT GTGTCGGCCA GCAGGCAGTC CGCGGCGAGC GGATCAACCG TGGCTACGAG
GACCGCACGC TCAGCCACTT CGAAGAGAAC GACCTCTCGG CGGAGGCCCA CGGCTTCGTC
GAGCACTCCT ATCGTGAGGG CCTCGGCCCG AAGGAGTTCT TCTTCCACGC GATGGGTGGC
CGCGAGGGGC TGGTCGACAC GGCAGTCCGG ACCTCCAAGT CCGGGTACCT CCAGCGTCGG
CTGATCAACG CCCTCTCCGA ACTGGAAACT CAGTACGACG GCACCGTTCG GGACACCAGC
GATACCGTCG TCCAGTTCGA GTTCGGCGAG GACGGCACCT CGCCCGTGGA TGTTTCCTCC
AACGAGGACT TCGACATCGA CGTCGAGGCG ATCACCGAGC GGATCGTCGA GGCCGAGTTC
GACGATGAAA GCGAGAAGGC GACGTTCCTC GAGCGCGAGG CGAGTCCGAC CAACCTCTCG
GAACACGCAG ACGAGTGGTG GCACGCGGAG GCCGGAGACT GA
 
Protein sequence
MSTGQSPKSI GRLSFGLMNP EEYRDMSATK VITADTYDDD GFPIDMGLMD PRLGVIDPGL 
ECKTCGKHSG SCNGHFGHIE LAAPVIHVGF SKLIRRLLRG TCRECSRLTA VDGSKYTDGD
GNVDLEAASA AAEDKKREFK EELRRARELS NDPSDVLKSA IRQARSASRC PHCGEKQFDV
KHEKPTTYYE IIEVPHAELA ERLSMAMDPP EGEPVTPDEL EEATDGGFDA GRIRELLSDT
YRPDRNDIPA LQEIGSALHR FNDLEEHLGD ELDIESPSSF LLEEDMDKLM PSDIRDWFEE
IPDDDLEAIG VNPETSRPEW MILTVLPVPP VTARPSITLD NGQRSEDDLT HKLVDIIRIN
QRFMENREAG APQLIIEDLW ELLQYHVTTF MDNEISGTPP ARHRSGRPLK TLSQRLKGKE
GRFRGSLSGK RVNFSARTVI SPDPTLSLNE VGVPDRVASE MTQTMNVNER NLAEARQYVS
NGPEAHPGAN YVRRPDGRRL KVTEKNCEEL AEKVEPGWEV ARHLLDGDIV IFNRQPSLHR
MSIMAHEVVV MPYKTFRLNT VVCPPYNADF DGDEMNMHAL QNEEARAEAR VLMRVQEQML
SPRFGENIIG AIQDHITSVY LLTHQNPHFN ETQALDLLRA TNVDELPEPD GEEDGRAYWT
GRSIFSELLP DDLDLEFTSE AGDDVVIEDG QLLEGTIDDS AVGAFGGEIV DTIAKVYDKT
RARIFINEVS TLAVRTIMHF GFSIGIDDES IPPAAQEQVE EAIGSAYDRV QELIETYEQG
DLESLPGRTV DETLEMKIMQ TLGQARDSAG DIAEDHFAED NPAVVMADSG ARGSMLNLTQ
MAGCVGQQAV RGERINRGYE DRTLSHFEEN DLSAEAHGFV EHSYREGLGP KEFFFHAMGG
REGLVDTAVR TSKSGYLQRR LINALSELET QYDGTVRDTS DTVVQFEFGE DGTSPVDVSS
NEDFDIDVEA ITERIVEAEF DDESEKATFL EREASPTNLS EHADEWWHAE AGD