Gene Information |
Locus tag | Huta_0781 |
Symbol | |
ID | 8383052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 758382 |
End bp | 761105 |
Gene Length | 2724 bp |
Protein Length | 907 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644971845 |
Product | RNA polymerase Rpb1 domain 5 |
Protein accession | YP_003129699 |
Protein GI | 257051866 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR01443] intein C-terminal splicing region [TIGR01445] intein N-terminal splicing region [TIGR02389] DNA-directed RNA polymerase, subunit A'' |
|
|
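The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names `gene_length` and `protein_length` are hypothetical and not part of the record. It assumes that start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that does not appear in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp, End bp.
length_bp = gene_length(758382, 761105)
print(length_bp)                  # 2724, matching "Gene Length"
print(protein_length(length_bp))  # 907, matching "Protein Length"
```

The strand does not affect this check: on the minus strand the start and end fields still bound the same interval on the replicon.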
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0893602 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGACA CGCCAACCGA ATACGATCCG ATGAACCGCT ATGCGGCCGT CGACACGGAC ATCGCGGCCG TCGTCGAGGA CACGGACCTG CCGCGCCGGC TGAAAACTCG CGTCTACGAG GCCATCGAGG ACCGCGACGG CGTCACGGTC GAGCAGGCCG ACGACATCGC CAAGGCCGTC GAGTCCCGCT ATCTCGACAC GCGCGTCGAT CCGCTCGACC CCGTCGGGAC GGTTTCGGCC CAGTCGATCG GGGAACCCGG CACCCAGATG TCGATTCCTG GTGACGAGAG GGTTATCGTC CGCCGGGATG GCGAGACCGA CGTGACCGAG ATCGGCTCGC TCGTCGACGG TCTCGCGTCC GTACGAGAAT CCACGACTGT AGAGGGCCAC GAGGTAGTCT CTGCGCCCGA AGAGATGGAA GTCCCGTCTC TCCGGGCTGA CGAGCGCGTC GAGTGGAAGC CACTCGAACA GGTAAGTCGT CACGAAGCCC CCGACGAGCT CCTCCAGTTC GAACTCGAAT CCGGCCGGAC GATCCGGGCG ACAAAAGCAC ACTCGTTTGT CACACGGGAA GACAACGAGG TCATCCCTGT CGAAGGGAGT GATCTCGAGG AAGGCGACTG GTTGCCCGTC GTCCGTTCGC TCGACAGTGA CGGCCAGGAT TCCGTCGACC TGCGGGAGTA TCTCCCGGCG AGTGACTACT GGTACACCTC GACCCTGACC GACGGTGGGG CCACGACGCT CCCGGGCGGT GAAGACCAGA TCCGGAACAA ACGCCAGGCG CTCGAAGCCG GCGAGATCGA GGAACACGCT GTCTATCCCG TCCAGGGAAC GGTCGGACTA CCCGAACGGT TCCCGCTCGA CGAGGAAACC GGATTCTTCG TCGGGGCGTT CCTCGCCGAG GGGAACCTGA CGGATCATTA CGTTTCCATC TCGAACGTCG ACGAGACGTT CCAGAATCGG GTCCGTGCGT TCGCGGAGCG CTTCGATCTC TCCGTCAACG AATACGAGAA TGACAGCGGA TTTGCCACTG GTTACGACAT TCGCGTCAAT GGAACCGTTC TCGCTGACTT CCTGCGAGAA GTCTGCACCG TAGACGGCGA GAAGGCTATT CCGGACTTCG CGTTCGGTGC AACGTCCTCG TTTGTCCGGG GGCTCCTTTC AGGATACTTC AGTGGAGACG GAAACGTCGG CAAAAACGCC GTCCGAGCTA GTTCGACGTC CAACGAACTG ATCGACGGTG TCACGCTTCT GCTGGCACGA TTCGATATTT ACGCGACGTT CGGCACGCAA GACGACTCGA AGACGCTTCG GGTGCCCAAG AAGTTCGTTC CGGCGTTCAC AGATCGCATC GGGATGATCG GTGAGCGCGG TGACGAACTC GAGGATCTGG CGACCGACGT CGACACCGAC GGCCCGGATA CGACGGATCA GATCCCGAAC TTCGGGGATT CGCTGAAACA GGCGGCCAAA GCGGCGGGCA TCCCATCACG CCAGGTCAAT AGCGCCCACA AGCGCCAGCG AGTCGGTCGC AACCGTCTCC GTTCGCTCGT CGAGCAGATC GACGAAACGG CCGAGGAACG TCCCCCGCAA CTGGGTGCAC TCGAACGGGC AGTCGACGGC GATGTCGTCT GGGAACGGAT CGAATCCATC CGGACCGTCG AACCCGAGAG CGGGTACGTA TACGACTTCT CTGTGGCCGG TCTGGAGACG TTCACGACCG CGCAGGGCGT CGTGACTCAC AACACGATGA ACACCTTCCA CTATGCGGGT GTCGCCGAGA TCGACGTCAC CCAGGGGCTA 
CCGCGGCTCA TCGAGCTGGT CGACGCCCGC AAGGAGCCCG ACACGCCGAT GATGACCGTC AATCTGGACG GCGAGTACGC CACCGAGCGC GAGCGCGCCC ACGAGGTCGT CTGGAAGATC GAGGCGACGC GCATCCTCGC GCTGGGCGAC GTCTCGACGA ACGTCGCGGA CATGCTCGTT CAGGTCGATC TCAATCCCGA CACGCTCGAG GAGCGCTGGC CGACGGCCGA CAGCGTCGAG AGCGTCGCCG AGGAGATCGC CGCCACGATC GAGTCCCAGC TGGGCGTCGA AACCCAGCGC AAGGAGACGG TCATCCAGTT CGGCCCAGAG GAACCGTCCT ACCGGGATCT CCTCCAGCTG GTCGAGGAGT TACGCGAGAT CGTCTTCAAA GGGATCGAGG AAGTCTCCCG GGTCGTCATC CGCAAGGAGG AGATGGACGA GGACCACGAG AACGACGAGG AGTTCGTCCT CTACACCGAG GGGTCGGCCT TCGGCGACGT CCTCGGGATC GAGGGGGTCG ACGCCTCGCG GACGACGTGT AACAACATCC ACGAGATCTA CAGCAATCTC GGCGTCGAGG CGGCACGCGA GTCGATCATC GAGGAGACGA TGAACACCCT CGAGGAACAG GGGCTGGACG ACGTGAACGT CCGTCACCTG ATGCTGGTCG CCGACATCAT GACCAACAAC GGGGTCATCG AGTCGATCGG TCGCCACGGC ATCTCCGGCA GCAAGGACTC CGTCCTTGCA CGGGCTGCCT TCGAGGTCAC CGTCTCACAC CTGCTGGACG CCGCCATCCA CGGCGAGATC GACGCCCTCA ACGGCGTCAC CGAGAACGTC ATCGTCGGCA AGCCGATCAA GCTCGGCACC GGCGACGTCA ACCTCCGGAT GGGCAGTCAC ACCGGCGGCG GCGCGGCCGA CTGA
|
Protein sequence | MTDTPTEYDP MNRYAAVDTD IAAVVEDTDL PRRLKTRVYE AIEDRDGVTV EQADDIAKAV ESRYLDTRVD PLDPVGTVSA QSIGEPGTQM SIPGDERVIV RRDGETDVTE IGSLVDGLAS VRESTTVEGH EVVSAPEEME VPSLRADERV EWKPLEQVSR HEAPDELLQF ELESGRTIRA TKAHSFVTRE DNEVIPVEGS DLEEGDWLPV VRSLDSDGQD SVDLREYLPA SDYWYTSTLT DGGATTLPGG EDQIRNKRQA LEAGEIEEHA VYPVQGTVGL PERFPLDEET GFFVGAFLAE GNLTDHYVSI SNVDETFQNR VRAFAERFDL SVNEYENDSG FATGYDIRVN GTVLADFLRE VCTVDGEKAI PDFAFGATSS FVRGLLSGYF SGDGNVGKNA VRASSTSNEL IDGVTLLLAR FDIYATFGTQ DDSKTLRVPK KFVPAFTDRI GMIGERGDEL EDLATDVDTD GPDTTDQIPN FGDSLKQAAK AAGIPSRQVN SAHKRQRVGR NRLRSLVEQI DETAEERPPQ LGALERAVDG DVVWERIESI RTVEPESGYV YDFSVAGLET FTTAQGVVTH NTMNTFHYAG VAEIDVTQGL PRLIELVDAR KEPDTPMMTV NLDGEYATER ERAHEVVWKI EATRILALGD VSTNVADMLV QVDLNPDTLE ERWPTADSVE SVAEEIAATI ESQLGVETQR KETVIQFGPE EPSYRDLLQL VEELREIVFK GIEEVSRVVI RKEEMDEDHE NDEEFVLYTE GSAFGDVLGI EGVDASRTTC NNIHEIYSNL GVEAARESII EETMNTLEEQ GLDDVNVRHL MLVADIMTNN GVIESIGRHG ISGSKDSVLA RAAFEVTVSH LLDAAIHGEI DALNGVTENV IVGKPIKLGT GDVNLRMGSH TGGGAAD
|
| |
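The relationship between the gene sequence and the protein sequence can be spot-checked by translating the nucleotides codon by codon. The following is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and the amino acid assignments are those of the standard genetic code, which translation table 11 shares for internal codons. The sketch ignores table 11's alternative start codons, which is harmless here because the gene begins with ATG. It also assumes the listed gene sequence is already given in the coding orientation, which its ATG start and TGA end suggest.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order, last base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate("ATGACAGACA CGCCAACCGA ATACGATCCG"))  # MTDTPTEYDP
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) would extend the same check to the whole record.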