Gene Huta_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2079 
Symbol 
ID8384373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2108995 
End bp2110995 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content64% 
IMG OID644973148 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003130979 
Protein GI257053146 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0273003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAGCA GGGAGTGCTA CAGCAGCGGC TGGTGCACAA TGAGCGGGAA TGATGCGCCC 
GTAGTGAGTG GGGACGCCAT CCACACGCTG TTAGTCGACG ATAGCGACCG ATGGGCCTCG
GTCGCTGCGA AGCGAATCGA ACGCGAGACA GACGCGGTCA GTGTCTCGGT CGCGAACGAT
GCCAACGAGG CGCTCGTACA TCTCGCGGAG GCCGACCGAG TCGACTGTCT GCTCGTGGAC
TACATGATGC CGGGCATCAC CGGCCTGGAG TTGCTCGAAC GCGTTCGCGA GGATCGTCCC
GATATTCCGT TCATCCTCAT CACCAGCGAG GGAAACGAGG ACGTGGCTGC CCGGGCGATC
GACGCCGGTG TGACCGATTA CGTCGTCAAA GAGCCGGCAA CCGATCAAAC CCCACTCCTC
GCCGAGAAAA TCGAGTCAGT TGTGCGCCAG CATCGCTTAC AATGCCAACT TGAAGAAAGC
GAACGGCGCT ATCGATCCGT CGTCGAGGAG AGCCGTGACG CGATCTGTCT GCTCGAAGAC
GGTCGCGTGC AGTTTTGTAA CGAGCACTTT GCCGAACTCA CGGGTAGCGA CTGTGCGTCC
TGGATCGGCG AGGAGCTCGT CGAAGACGCA ATTCACCCGG ACGACCGGAC CGCAGTGCGT
GAGGCATTCG CGAACTGGGA CGACGGGAGC GCCGAGCCGG AACTGCAAGA AGCGCGCCTT
CGTCGTTCCG ACGGATCGGT TCGGGCCTGT GAGTTCACTG GCCGGCGGAT CACCGACGAG
GGTGAGCCGA CACTGCTCGT CTCGATCCGG GACGTCAGCG AGCGACGACG CCACGAGCGT
GAGCTACAGT GGGAGCGAGA CCTCAATCGA ACGGTCCAGG AAGCGCTGGT CGAATCGCGG
ACGCGAGACA CATTGGAAGA AGACGTCGTT ACCCATTTAC ACGAGTACGG GTATCCGGTC
GCGTGGGTTG CCGAACGGGG GGCCGGTGGA CTGTCCCCGC GAGCCGTCGC CGGGGATCAG
GCATTCGTCG ACGCCATCTC CTCGACGGCA GAAGCCGACG AGGTGCAGGG CGAACCCGCC
ACGTGGGCTG CCCAGTCGGG AGAGCCACAG TTCGTCCAAG ATGTCGCTGA GTTGTTTCCC
TCGGCGTGGC GAGAAGTCGT CACAGACGAC GGGTATCGGT CCGGCGGGGC GGTTCCGCTC
GAACACAACG ACGTCCCCTA CGGCGTATTG GCGGTCTATC ATGACGAGCC GGACCATTTC
GGCGAGACCG AGCGCCGGTT GCTCACGGAG CTCGGGGATA CCGTCGCGTT CGGGATCCAC
AGCCTCGCGA CGGAGAATAG CCTCGCTGCA GACCGGACTG TCACGGCCCG GTTTCGCGTC
GGCGACGACG CGTACTATCT CGCGGCACTC GCGATAGACG GGGCCTTTCG GGACTGCAAG
CGGGTCACCG TCCAGGGGAC TGTCCCGGAC GACGAGGACG GGATCGTCCA GTATCTCCGT
ATCGCAGGCG CGACGGACGC GATCCAGGAC GTGCTGGCTG CTCACCCCGA TGTGACAGCA
GTCCACGGGA TCGACATTGA GCCACGCAGA CTCCAGGTGA CGGTCAGCGG TCCCAGTCCG
GAAGCCCATC TCGCGACGCG CGGCGTCGTC GTCGATACCA CGACCCTCGA CCCCAACGGG
GCCGTCGTCG AGGCACAGCT CCAGTCCCGG GAGACCGTCA CGCCGACACT CGAGCGGTTA
GAAGCGGCGT TCGACGACGT CTCCATACTC GCGATCGGTA ACGAGGACAC GGTGAGTGAC
ACCGGCGGCC AGCTGCGAGC GGCCCGACTC ACCGACAAGC AACGCCAGGC TCTCCGCGCG
GCGTACCATC ACGGGTACTT CGAACAGCCT CGCGGGGCCA CCGCGGCGGA GATCGCCGAG
ACCCTGGGCG TTGCCCACTC GACGTTCCTC CAGCACCTCC ACCGCGCCCA GCAGAAAGTC
TTCGAAGCGC GCTTCGAGTG A
 
Protein sequence
MASRECYSSG WCTMSGNDAP VVSGDAIHTL LVDDSDRWAS VAAKRIERET DAVSVSVAND 
ANEALVHLAE ADRVDCLLVD YMMPGITGLE LLERVREDRP DIPFILITSE GNEDVAARAI
DAGVTDYVVK EPATDQTPLL AEKIESVVRQ HRLQCQLEES ERRYRSVVEE SRDAICLLED
GRVQFCNEHF AELTGSDCAS WIGEELVEDA IHPDDRTAVR EAFANWDDGS AEPELQEARL
RRSDGSVRAC EFTGRRITDE GEPTLLVSIR DVSERRRHER ELQWERDLNR TVQEALVESR
TRDTLEEDVV THLHEYGYPV AWVAERGAGG LSPRAVAGDQ AFVDAISSTA EADEVQGEPA
TWAAQSGEPQ FVQDVAELFP SAWREVVTDD GYRSGGAVPL EHNDVPYGVL AVYHDEPDHF
GETERRLLTE LGDTVAFGIH SLATENSLAA DRTVTARFRV GDDAYYLAAL AIDGAFRDCK
RVTVQGTVPD DEDGIVQYLR IAGATDAIQD VLAAHPDVTA VHGIDIEPRR LQVTVSGPSP
EAHLATRGVV VDTTTLDPNG AVVEAQLQSR ETVTPTLERL EAAFDDVSIL AIGNEDTVSD
TGGQLRAARL TDKQRQALRA AYHHGYFEQP RGATAAEIAE TLGVAHSTFL QHLHRAQQKV
FEARFE