Gene Huta_0888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0888 
Symbol 
ID8383161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp856228 
End bp859365 
Gene Length3138 bp 
Protein Length1045 aa 
Translation table11 
GC content63% 
IMG OID644971952 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_003129804 
Protein GI257051971 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.628916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGA CAGCCGAATC AGTCCGCATC CTCCACGTCG ACGACGACCC GGACTTTCTC 
GATATCACGG CCGAATTTTT ACAGCGGGAA AACGAGCAGT TCACGGTCGA CGTTGCGACG
AGTGCCGCGG ATGGGTTGGA TCGCCTCACG GAGAATGCCT ACGACTGCAT CGTTTCGGAC
TACGAGATGC CGGGCCAAGA CGGGATCGAA TTTCTCAAAG CCGTTCGCAA GGACCACCCC
GACCTCCCGT TTGTCCTATA CACCGGCAAA GGAAGCGAGA CGATCGCCAG TGAGGCAATC
TCGGCCGGGG TGAGCGAGTA CCTCCAGAAG GGAAGCGGGA CCGATCAGTA CACGGTGCTT
GCCAACCGGA TCGAAAACCT CGTCTCCCGC CGCGAAGCCG AGATCACGGT CGCCCGGTAC
GACCAACGGG AGCGCGAAAG CGAGCGGTAT CGGCGGGAAC TTCTGGATAT CACCGCCGAT
CCGGACACCA CAACCGACGA GAAAGCCCAT CAGTTGCTCG AACTGGGCCG CGAGCGCCTC
GGGGCTGAGA ACGGCCATCT CGTCAAGATC GACGAGGATC GGAACCGACA CGAAGTGATC
AGCGTGACCG GGTCCGAAAT CGTCCAGAAG GGAGTAACCG ACCTCTCGAA AACGTACTGT
CGACAAACGA TCGACTCCGA CTCGATCTTC GACGTTCACG ACGCCCCAGG AGCTGGCTGG
GAGGACGATC CTGCCTACCA GCAATTCGAG TTGGGCTGTT ACATCGGCCG GAAACTGCTC
GTGGAGGGGG AGCTGTTCGG CACACTGTGT TACGTCAGCA GCGACCCGCG TGAGGAACCG
TTCACCAACG ACGAGAAGAC GTTCTTCGAG CTGCTCGCCC GGTGGTTCAC CCAATTGCTG
GAACGGAAAC GCTATCGGTG CCAGGCGGAG ACGGTGTTCG AACACGCCCA AGATGGGATC
TTCCTGATCG ACGTCAGCCC CGAGCAGCGG TTTCGGATCC GGCGGGTAAA TCGGGCGTAC
GAGGCACTGA CCGGCCACTC GACCAAGGAC ATCGAAGGGA AAAGCCCCAC CGACATCTAC
GGCGCTGACA TCGGGACCGA AATCGAGACC CAGTATCGTG AGTGCGTCGA CCGAGAAGAA
CCGATCGAAT ACGAAGTGGC GTTGCCGGTC GGTCCCCACG ACGAGCCCAG GCAGTTTCAC
ACGAAACTCG CACCCGTCGT CGAGGCGGGC ACGGTCGTCG AGCTCGTCGG TGCAACCAGA
GATGTTACCC AGCGCAAGGA ACGACAGTCG AAACTTGAAG CCGAGCGTGC GTTCATCGAG
GAAACACTCA ACAGCCTCGA AGATGTGCTA TACCTGATCA ATCCCGATGG CAGCCTCCGT
CGCTGGAACG ATCGCCTCGG TGCGGTGACC GGCTACGACG ACGAGGAGAT CGAGACGATG
GCGGCGACTG ACTTTTTCCC ACCAGAAGAA CGCGAGCGCA TCGCCGACGC GATCGACGAG
GCGCTTGCGA CTGGCAGGGC AGTCGTCGAG GCAGACGTCC TCACCGCCGC GGGGGAACGG
ACCCCCTACG AGTTCACCGG CACGCGATTG ACCGACTCGT CCGGCGACGT GCTCGGGGTG
GTCGGGATCG GCCGGGACAT CGCCGGCCAC AAGGAACGCG AACGCGAGCT CAGACAGTAC
CAGCAGATCC TCGACGCGAT GCTCGACCCG GCGACCGTCA TGAACGAGGA CGGGGAGTAC
ACTGTCGTCA ACAACGCGAT GGCAGCAGTC CACGAGATGC AGGCCGAAGA ACTGATCGGC
GAACCCAGCC CGTTCATCCG GGAGCTTCGC GAAGCGCGTT CGGACGACCC CTACCGGGAA
CTCGTCGATG GCGAACGCGA GGAGTATCGC GGGGAGTACA CGATGGAGAT CCCCGACGCC
GACCCCATCT ACTTCGAGTA TCGGCTCAGC CGGCTGACCA TCGACGGGCG CTTCTGTGGG
ATTGTCGCGG TGGGCCGTGA CGTCACCGAC CGGAAGCGCC GCGAGGAGAC GCTCGAAGCC
CTCCACAAAC GGACGCGCCC GTTCATCACG ACGCCGGATC AGGAGGTCGT TGCCGAGCAC
GCCGTCGAGA CGGTCGCCAG TGTCCTGGAC CAGGGGATCA ACACTGTCTG GCTCTACGAC
GAGGACAGCG AGACACTGGA ACCGGCCGCC TGGACCGACG CTGCCGCGGA GCTGCTGGGC
GAGATGCCGA GCTATACCGG CGAAGGAAGC CTCACGTGGG ACGCCTTCCG GAGCGGAGAG
GTACTGGTGA TCGACGAGAT GAGTACGGTG AACGGCCGGC ACAACCCCCA AACGCCGATC
CGGAGTGAAA TCATCCTCCC GCTGGGCGAG TACGGCGTAA TGAACATCGG ATCGACGGAA
CCGGACGCCT TCGAAGACAT CGACATCTCG CTCGCCCGTG TCCTCGGCGA CATGGTCGAA
GCAGCTCTCA TCCGGACGGA CCGGGAGGAG GAACTCCGCG ACAGACGTCG TGAGCTCGAA
CGCCAGAACG AGCGTCTCGA AGAGTTCGCC AGCCTCGTCA GCCACGACCT CCGCAACCCG
TTGAACGTCG CCGAGGCACG GGTCCAACTC GCCCTGGACG AGCGCGACAG CGAGCACCTG
GACGTCGCGG CGAAAGCGAT CGACCGAATG GGTGTCCTGA TCGACGATAT CCTTCAACTG
GCCCGGGAAG GCGAACGGAT CGACGAGATG GAACGCATCG ACCTCGAAAC GATCTGTGCC
GATTGCTGGG ACGCCGTCGA GACAACCGAG GCGACGCTAT CCGTCGAATC GAACCGGCCG
ATCCGTGCCG ACCGGAGCCG GGTTCGACAG CTGCTCGAGA ACCTCTTTCG AAACGCGGTC
GAACACGGGG GCGAAGACGT ACAGATCACG GTCGGTGCCC TGGAGGAGGG CTTTTTCGTC
GCCGACGACG GGCCGGGTAT CCCGCCGGAC GAACGAGAAA CCGTCTTCGA GAGTGGCTAT
ACGACCCGCG AGGACGGGAC CGGGTTCGGG CTCGCGATCG TCGCCGAGAT CGCCGACGCA
CACGGCTGGG ACGTCGCTGT GACCGACAGC GAGGACGGCG GCGCTCGCTT CGAGATCACC
GCCGTCGAAG CCCCGTAG
 
Protein sequence
MSETAESVRI LHVDDDPDFL DITAEFLQRE NEQFTVDVAT SAADGLDRLT ENAYDCIVSD 
YEMPGQDGIE FLKAVRKDHP DLPFVLYTGK GSETIASEAI SAGVSEYLQK GSGTDQYTVL
ANRIENLVSR REAEITVARY DQRERESERY RRELLDITAD PDTTTDEKAH QLLELGRERL
GAENGHLVKI DEDRNRHEVI SVTGSEIVQK GVTDLSKTYC RQTIDSDSIF DVHDAPGAGW
EDDPAYQQFE LGCYIGRKLL VEGELFGTLC YVSSDPREEP FTNDEKTFFE LLARWFTQLL
ERKRYRCQAE TVFEHAQDGI FLIDVSPEQR FRIRRVNRAY EALTGHSTKD IEGKSPTDIY
GADIGTEIET QYRECVDREE PIEYEVALPV GPHDEPRQFH TKLAPVVEAG TVVELVGATR
DVTQRKERQS KLEAERAFIE ETLNSLEDVL YLINPDGSLR RWNDRLGAVT GYDDEEIETM
AATDFFPPEE RERIADAIDE ALATGRAVVE ADVLTAAGER TPYEFTGTRL TDSSGDVLGV
VGIGRDIAGH KERERELRQY QQILDAMLDP ATVMNEDGEY TVVNNAMAAV HEMQAEELIG
EPSPFIRELR EARSDDPYRE LVDGEREEYR GEYTMEIPDA DPIYFEYRLS RLTIDGRFCG
IVAVGRDVTD RKRREETLEA LHKRTRPFIT TPDQEVVAEH AVETVASVLD QGINTVWLYD
EDSETLEPAA WTDAAAELLG EMPSYTGEGS LTWDAFRSGE VLVIDEMSTV NGRHNPQTPI
RSEIILPLGE YGVMNIGSTE PDAFEDIDIS LARVLGDMVE AALIRTDREE ELRDRRRELE
RQNERLEEFA SLVSHDLRNP LNVAEARVQL ALDERDSEHL DVAAKAIDRM GVLIDDILQL
AREGERIDEM ERIDLETICA DCWDAVETTE ATLSVESNRP IRADRSRVRQ LLENLFRNAV
EHGGEDVQIT VGALEEGFFV ADDGPGIPPD ERETVFESGY TTREDGTGFG LAIVAEIADA
HGWDVAVTDS EDGGARFEIT AVEAP