Gene Dret_0082 details

Gene Information

Locus tag: Dret_0082
Symbol:
ID: 8417886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 109380
End bp: 112220
Gene Length: 2841 bp
Protein Length: 946 aa
Translation table: 11
GC content: 63%
IMG OID: 645036647
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_003196962
Protein GI: 258404220
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
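
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 109380
end_bp = 112220
gene_length_bp = 2841
protein_length_aa = 946


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)      # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)  # Gene Length agrees with Protein Length
```

Under these assumptions, 112220 - 109380 + 1 = 2841 bp, and 2841 / 3 - 1 = 946 aa, which matches the listed values.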


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGGTG GATCGGGGTT GAGTCATTTC GGGGCGCTGG TGGCCTGTCT TCTGGTGCTG 
CTCGGGATGG TGCCCTCTGT TCCGGCTGCG GACAACGGCT TCGCCGCCCG GGAGCAATGG
GCAGTCGGCA CGGCGCAATT CGTTGTCGCC CCGGACCCGC ATTGGCCGCC GATTGAATTT
TTCGACGCCC AGGGCCGTTA CCAGGGCTTG GCCGCCGATT ATCTCCGCCT GGCCGCCAAA
GAACTCGGCC TGTCGTTGCG GGTCCGGCGG CTGGACAATT GGGCTCAGGC CGTCGACGCC
GTGCGCAGCG GCGAGGCCGA TGTGCTGGGG GCGGCCCGCA AGACCGCAGA ACGGGAACGC
TTCGCCCGCT TTACCCAGCC GTTTTATGAG GTGCCCACTG TGCTGGTGGT TCGCCAGGGG
GACCGGACAA TCACCTCTCT TGGCGATCTT GAGGCCGTTG GAGCCGTGCG CGGGTATGCG
GTGGCCGAGT ATCTGCGCCG CACTGCCCCG GAATTGACCC TCCATCTGGT ACCCGATACC
GCCACCGGTC TGCGCCAACT GTCCCTCGGC CACCTTCCGG CCCTGGTCAC CGAACTCCCG
GCCGCGACCT TCCTGCAGAG CAAACTGGGG CTGAGCAATC TCCGGTTTGT CGAGACCCTG
GACTACAGTT ACCCCCTCTC CTTTGCCGTC CGGCGTGACG CGCCCGGACT TTACACCCGT
CTCGAAGCGG CTCTGGCCGG TATTGCAAAC GAGCAACGCC GCGCCCTGGA GCGGAAGTGG
ATCGGGCTGG ACCACAGCGA ACTGCTGCGC GATTCCTGGT TTTGGAAACA GGTTTTGGGT
GCGACCCTGC TCGTGGTCCT GGTCGTGTTT GTTCCCCTGT ACGGACTGCA ACGCGAGAAG
ATCCGCCGGC GGACGCGGCA GTTGCAGACC TCGGAAGACA AGTTTCGTTC CGTTTTCGAG
CAATCGCACC AGCTGATGGC GCTGTTGAGC GTGGACGGGA CCTTGATTGA GGCCAACGAC
CGGGCTCTGG AGTTCATGGG GGCTGACCGG GCGTTTTTGC GGGGCCATGA GCTGACCGCC
CTGCCGTGGT GGACGGGGGT GGAGGACGAC AACGAGGAAA TGGCGGCGGC CATCCGTCTT
GCGGCCACAG GTGAGGTGGT CCACGGCCGG ACCCGGAATA TTGCGCCGAG CGGCAGAACG
GTCTGGATCG ATTATTCGGT CAAGCCGGTT TTCGGCGAGG AAGAGACACC GCACTATCTG
GTCGTTGAAG GACGGGACGT CACCGCGTAT CTCCAGGCCA AACAGGCGCT GGAGGAAAAC
GAGCGCATGC TCTCGACGCT GCTCGCCAAC CTGCCGGGGA TGGTCTACCG ATGTTCCGAC
GACCCGCGCT GGAAGATGGA ATATGTCAGC CGCGGGTGTA TGGATTTGAC GGGCTACGAC
GAGGCCACGC TCACGACCCG CATGGACCCG GAATACGGTG ACCTTATCCT GGACGAAGAC
AAGCCTTTTG TCCTGGAATC TGTGGAGCAG GCCTTGACCT CAAAGGAGTC GTTTCAAGTC
ACGTATCGCA TCCAGGACGC CGATGGCGTG GTGAAATGGG TCTGGGAACA GGGACGCGGG
GTCTGGTCCG AGGGGGGCGA TCTGGAAAGT ATTGAAGGGT TTGTGACCGA TGTGACTGCC
CAACGCCAAT TGGAGGAGCA GTTGCGCCGC TCGCAGCGCC TGGAGTCCGT GGGACGGCTG
GCGGGCTGGG CCGCCCACGA TTTCAACAAT ATGCTGACCC CCATTTTGGG GTATGCCGAG
ATCCTGTTGG CCGGGGCCCC TGAAGGGGGC AAAGAGGCCG CCAAGCTGGA ACAGATCCAT
GCTGCGGCCG AGCGGGCTCG GCAATTGGCC CGGCAACTCC TCGCTTTCAG CCGCAATCAG
GTCCTGGCCA AAGCCCACCT GGATCTCCGG GAAATGGTCC AGGGGTTTTG GGATATCCTG
AGCCAGGGCG TGCGGGAGGA CATCTTTCTC GAACTCACGC TCAGTGAAGA GCCGTGCCGG
GTCATGGGTG ACGCCCACCA ACTCGAGCAG GTGCTCATGA ATCTCGTGCT CAATTCCCAG
GACGCCATGC TTGAGGGGGG CTCGATCCAG GTCGCGGTTG CCCCGATCGA ACTCGACGCT
GGCTATCAAA AGCTCCATCC GGAGATCGCT CCCGGCAGGT ACATCCAACT CAGTGTGGCT
GACACCGGCG TGGGGATGGA CCGAGAAACC CAGCGCCAGG CCTTTGACCC CTTTTTTACC
ACCAAGGGCG AAGAAGGGAC CGGTCTGGGA TTGTCCTCAG TGCACGGCAT CTTGAAACAG
CACGGCGGGC ACGCCGAAAT CTATTCCGAG CCGGGCGCGG GTACGGTGGT CAAACTCTTT
TTGCCGCGGG TGGATGAGGA CTCCAGTCGC AGTGAGGGAC CGCAGACCCT GCAGGAGCAG
ACATTTGTCT CCGGCAGTGA GACGGTGCTC GTGGTCGAAG ATAATGAGAT GGTCCGGGAG
TTGACCTGCG AGGTCCTGTT TTCGCTCGGC TACAGCGTCG TCTCCGCCGC CGAGCCCCGG
GAAGGGATGC GGGTGGCCAG CCAGACCGAG GAGGAGATTG CCTTGCTGGT CACCGATGTG
GTCATGCCGT ATATGGACGG CAGGAAATTG TACCAAGAGC TGCAGGCCGA CCGGCCCGGA
TTGCGCGCTT TGTTCATCTC CGGGTACACT GACAATCACA TCGCCAATAC CGGGGGCATG
GAAGCGGGCA CGGCCTTTTT GCAAAAACCG TTTACCAGTC AGGAACTGGC CCGGAGCGTG
CGACAGGTGC TTGACGCCTG A
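
The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleanup before any analysis. The following is a minimal sketch, not a definitive tool: the helper names are hypothetical, the start-codon set is a simplification (translation table 11 permits additional start codons), and the GC calculation counts only G and C against the total length.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}    # stop codons in translation table 11
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # simplification: most frequent bacterial starts


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Rough check: length divisible by 3, common start codon, terminal stop codon."""
    seq = clean_sequence(raw)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


# The last printed line of the gene sequence, used here as a small demo input.
tail = clean_sequence("CGACAGGTGC TTGACGCCTG A")
print(len(tail), tail[-3:])
```

Passing the full gene sequence text to `gc_percent` and `looks_like_complete_cds` would allow a comparison with the GC content and CDS type listed in the Gene Information fields.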
 
Protein sequence
MRGGSGLSHF GALVACLLVL LGMVPSVPAA DNGFAAREQW AVGTAQFVVA PDPHWPPIEF 
FDAQGRYQGL AADYLRLAAK ELGLSLRVRR LDNWAQAVDA VRSGEADVLG AARKTAERER
FARFTQPFYE VPTVLVVRQG DRTITSLGDL EAVGAVRGYA VAEYLRRTAP ELTLHLVPDT
ATGLRQLSLG HLPALVTELP AATFLQSKLG LSNLRFVETL DYSYPLSFAV RRDAPGLYTR
LEAALAGIAN EQRRALERKW IGLDHSELLR DSWFWKQVLG ATLLVVLVVF VPLYGLQREK
IRRRTRQLQT SEDKFRSVFE QSHQLMALLS VDGTLIEAND RALEFMGADR AFLRGHELTA
LPWWTGVEDD NEEMAAAIRL AATGEVVHGR TRNIAPSGRT VWIDYSVKPV FGEEETPHYL
VVEGRDVTAY LQAKQALEEN ERMLSTLLAN LPGMVYRCSD DPRWKMEYVS RGCMDLTGYD
EATLTTRMDP EYGDLILDED KPFVLESVEQ ALTSKESFQV TYRIQDADGV VKWVWEQGRG
VWSEGGDLES IEGFVTDVTA QRQLEEQLRR SQRLESVGRL AGWAAHDFNN MLTPILGYAE
ILLAGAPEGG KEAAKLEQIH AAAERARQLA RQLLAFSRNQ VLAKAHLDLR EMVQGFWDIL
SQGVREDIFL ELTLSEEPCR VMGDAHQLEQ VLMNLVLNSQ DAMLEGGSIQ VAVAPIELDA
GYQKLHPEIA PGRYIQLSVA DTGVGMDRET QRQAFDPFFT TKGEEGTGLG LSSVHGILKQ
HGGHAEIYSE PGAGTVVKLF LPRVDEDSSR SEGPQTLQEQ TFVSGSETVL VVEDNEMVRE
LTCEVLFSLG YSVVSAAEPR EGMRVASQTE EEIALLVTDV VMPYMDGRKL YQELQADRPG
LRALFISGYT DNHIANTGGM EAGTAFLQKP FTSQELARSV RQVLDA