Gene Huta_0781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0781 
Symbol 
ID8383052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp758382 
End bp761105 
Gene Length2724 bp 
Protein Length907 aa 
Translation table11 
GC content63% 
IMG OID644971845 
ProductRNA polymerase Rpb1 domain 5 
Protein accessionYP_003129699 
Protein GI257051866 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR01443] intein C-terminal splicing region
[TIGR01445] intein N-terminal splicing region
[TIGR02389] DNA-directed RNA polymerase, subunit A'' 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0893602 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGACA CGCCAACCGA ATACGATCCG ATGAACCGCT ATGCGGCCGT CGACACGGAC 
ATCGCGGCCG TCGTCGAGGA CACGGACCTG CCGCGCCGGC TGAAAACTCG CGTCTACGAG
GCCATCGAGG ACCGCGACGG CGTCACGGTC GAGCAGGCCG ACGACATCGC CAAGGCCGTC
GAGTCCCGCT ATCTCGACAC GCGCGTCGAT CCGCTCGACC CCGTCGGGAC GGTTTCGGCC
CAGTCGATCG GGGAACCCGG CACCCAGATG TCGATTCCTG GTGACGAGAG GGTTATCGTC
CGCCGGGATG GCGAGACCGA CGTGACCGAG ATCGGCTCGC TCGTCGACGG TCTCGCGTCC
GTACGAGAAT CCACGACTGT AGAGGGCCAC GAGGTAGTCT CTGCGCCCGA AGAGATGGAA
GTCCCGTCTC TCCGGGCTGA CGAGCGCGTC GAGTGGAAGC CACTCGAACA GGTAAGTCGT
CACGAAGCCC CCGACGAGCT CCTCCAGTTC GAACTCGAAT CCGGCCGGAC GATCCGGGCG
ACAAAAGCAC ACTCGTTTGT CACACGGGAA GACAACGAGG TCATCCCTGT CGAAGGGAGT
GATCTCGAGG AAGGCGACTG GTTGCCCGTC GTCCGTTCGC TCGACAGTGA CGGCCAGGAT
TCCGTCGACC TGCGGGAGTA TCTCCCGGCG AGTGACTACT GGTACACCTC GACCCTGACC
GACGGTGGGG CCACGACGCT CCCGGGCGGT GAAGACCAGA TCCGGAACAA ACGCCAGGCG
CTCGAAGCCG GCGAGATCGA GGAACACGCT GTCTATCCCG TCCAGGGAAC GGTCGGACTA
CCCGAACGGT TCCCGCTCGA CGAGGAAACC GGATTCTTCG TCGGGGCGTT CCTCGCCGAG
GGGAACCTGA CGGATCATTA CGTTTCCATC TCGAACGTCG ACGAGACGTT CCAGAATCGG
GTCCGTGCGT TCGCGGAGCG CTTCGATCTC TCCGTCAACG AATACGAGAA TGACAGCGGA
TTTGCCACTG GTTACGACAT TCGCGTCAAT GGAACCGTTC TCGCTGACTT CCTGCGAGAA
GTCTGCACCG TAGACGGCGA GAAGGCTATT CCGGACTTCG CGTTCGGTGC AACGTCCTCG
TTTGTCCGGG GGCTCCTTTC AGGATACTTC AGTGGAGACG GAAACGTCGG CAAAAACGCC
GTCCGAGCTA GTTCGACGTC CAACGAACTG ATCGACGGTG TCACGCTTCT GCTGGCACGA
TTCGATATTT ACGCGACGTT CGGCACGCAA GACGACTCGA AGACGCTTCG GGTGCCCAAG
AAGTTCGTTC CGGCGTTCAC AGATCGCATC GGGATGATCG GTGAGCGCGG TGACGAACTC
GAGGATCTGG CGACCGACGT CGACACCGAC GGCCCGGATA CGACGGATCA GATCCCGAAC
TTCGGGGATT CGCTGAAACA GGCGGCCAAA GCGGCGGGCA TCCCATCACG CCAGGTCAAT
AGCGCCCACA AGCGCCAGCG AGTCGGTCGC AACCGTCTCC GTTCGCTCGT CGAGCAGATC
GACGAAACGG CCGAGGAACG TCCCCCGCAA CTGGGTGCAC TCGAACGGGC AGTCGACGGC
GATGTCGTCT GGGAACGGAT CGAATCCATC CGGACCGTCG AACCCGAGAG CGGGTACGTA
TACGACTTCT CTGTGGCCGG TCTGGAGACG TTCACGACCG CGCAGGGCGT CGTGACTCAC
AACACGATGA ACACCTTCCA CTATGCGGGT GTCGCCGAGA TCGACGTCAC CCAGGGGCTA
CCGCGGCTCA TCGAGCTGGT CGACGCCCGC AAGGAGCCCG ACACGCCGAT GATGACCGTC
AATCTGGACG GCGAGTACGC CACCGAGCGC GAGCGCGCCC ACGAGGTCGT CTGGAAGATC
GAGGCGACGC GCATCCTCGC GCTGGGCGAC GTCTCGACGA ACGTCGCGGA CATGCTCGTT
CAGGTCGATC TCAATCCCGA CACGCTCGAG GAGCGCTGGC CGACGGCCGA CAGCGTCGAG
AGCGTCGCCG AGGAGATCGC CGCCACGATC GAGTCCCAGC TGGGCGTCGA AACCCAGCGC
AAGGAGACGG TCATCCAGTT CGGCCCAGAG GAACCGTCCT ACCGGGATCT CCTCCAGCTG
GTCGAGGAGT TACGCGAGAT CGTCTTCAAA GGGATCGAGG AAGTCTCCCG GGTCGTCATC
CGCAAGGAGG AGATGGACGA GGACCACGAG AACGACGAGG AGTTCGTCCT CTACACCGAG
GGGTCGGCCT TCGGCGACGT CCTCGGGATC GAGGGGGTCG ACGCCTCGCG GACGACGTGT
AACAACATCC ACGAGATCTA CAGCAATCTC GGCGTCGAGG CGGCACGCGA GTCGATCATC
GAGGAGACGA TGAACACCCT CGAGGAACAG GGGCTGGACG ACGTGAACGT CCGTCACCTG
ATGCTGGTCG CCGACATCAT GACCAACAAC GGGGTCATCG AGTCGATCGG TCGCCACGGC
ATCTCCGGCA GCAAGGACTC CGTCCTTGCA CGGGCTGCCT TCGAGGTCAC CGTCTCACAC
CTGCTGGACG CCGCCATCCA CGGCGAGATC GACGCCCTCA ACGGCGTCAC CGAGAACGTC
ATCGTCGGCA AGCCGATCAA GCTCGGCACC GGCGACGTCA ACCTCCGGAT GGGCAGTCAC
ACCGGCGGCG GCGCGGCCGA CTGA
 
Protein sequence
MTDTPTEYDP MNRYAAVDTD IAAVVEDTDL PRRLKTRVYE AIEDRDGVTV EQADDIAKAV 
ESRYLDTRVD PLDPVGTVSA QSIGEPGTQM SIPGDERVIV RRDGETDVTE IGSLVDGLAS
VRESTTVEGH EVVSAPEEME VPSLRADERV EWKPLEQVSR HEAPDELLQF ELESGRTIRA
TKAHSFVTRE DNEVIPVEGS DLEEGDWLPV VRSLDSDGQD SVDLREYLPA SDYWYTSTLT
DGGATTLPGG EDQIRNKRQA LEAGEIEEHA VYPVQGTVGL PERFPLDEET GFFVGAFLAE
GNLTDHYVSI SNVDETFQNR VRAFAERFDL SVNEYENDSG FATGYDIRVN GTVLADFLRE
VCTVDGEKAI PDFAFGATSS FVRGLLSGYF SGDGNVGKNA VRASSTSNEL IDGVTLLLAR
FDIYATFGTQ DDSKTLRVPK KFVPAFTDRI GMIGERGDEL EDLATDVDTD GPDTTDQIPN
FGDSLKQAAK AAGIPSRQVN SAHKRQRVGR NRLRSLVEQI DETAEERPPQ LGALERAVDG
DVVWERIESI RTVEPESGYV YDFSVAGLET FTTAQGVVTH NTMNTFHYAG VAEIDVTQGL
PRLIELVDAR KEPDTPMMTV NLDGEYATER ERAHEVVWKI EATRILALGD VSTNVADMLV
QVDLNPDTLE ERWPTADSVE SVAEEIAATI ESQLGVETQR KETVIQFGPE EPSYRDLLQL
VEELREIVFK GIEEVSRVVI RKEEMDEDHE NDEEFVLYTE GSAFGDVLGI EGVDASRTTC
NNIHEIYSNL GVEAARESII EETMNTLEEQ GLDDVNVRHL MLVADIMTNN GVIESIGRHG
ISGSKDSVLA RAAFEVTVSH LLDAAIHGEI DALNGVTENV IVGKPIKLGT GDVNLRMGSH
TGGGAAD