Gene Dret_1849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1849 
Symbol 
ID8419690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2120534 
End bp2121568 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content54% 
IMG OID645038433 
Producthistidine kinase 
Protein accessionYP_003198711 
Protein GI258405969 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000588786 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCAAC CCCGATCGTT TGTTGTCCAA GTGCTGATTT TTGTTTTGGC GCAGGTGGCC 
TGGCTGGTGC TGCTCGGGCT GTGGATTTAT TGGTATACGA GCAATTATAT GGTCATCAGC
GAGGTCGGGC CTCGCCTGCA TTCCCAACTC ATGTCCGAGG GGCTCAATCA CCTGCTCCTG
ATCGGCGGGC TGATTCTGCT CATCGCCATT TCCACGGGGA TGTCACTTCT TTTTCACCGC
TTGAGCGTCC AGTTCAAATT GACCCGGCTG TACGACAATT TTATCGCCAA TGTGACCCAC
GAACTCAAAT CTCCGCTGGC CTCGATACAG CTTTCCATCG AGACCATGCG TATGCATGAA
CTCCCCCGGG AGAAGCAGGA GGAGTTCTTC AACATGATGC TCAAGGATAC CGACCGGCTC
AATAACCTCA TCAGTGCGAT CCTGCAGGTC CCGGCTCTGG AGCAGAAAAA AATTGCTCAC
GATTTCCAGG TCCACCGCAT GGAGGAACTG GTTCCAGAAC TTGTTCACGA GTCGCGTGAG
CAGTTTTCCT TGCCTGAAAA GGCGATACAG ATATCCGGGG ATGGAGGGTG TGATTGCGTT
CTCGACCGCA ACGCCTTCCG CATTGTTCTG GACAATCTTG TAGACAACAG CATCAAATAC
AGCCGAGAGG GAGTGGATAC AGCAATCCAT ATCAGGATGG CCTGCGAACG GGGGAAATTT
ATTCTCCGTT TTGCTGACAA TGGCGTCGGC ATTCCACTCC AGCATCAGGA ACAGGTCTTT
GAGAAATTTT TCCGCAGTCA TGACACGGCC ATGCCGAGCG TTAAAGGGAC CGGCCTGGGA
CTGTACTGGG TCAAAGAGAT TATCCGCATC CACCAGGGGG CGATCCGGGT GTCCAGCCGG
GGGACGAACA AAGGCTCCAC CTTCCGCATC GAATTGCCCC AATACGCCAA GGGCACAGAG
CGCGCGGCGC AACGGTTGCT GCGCTTGAGC CGTAAACAAA AACAAAAGGA CACCGCACAT
GGTGAGGGCG CCTGA
 
Protein sequence
MRQPRSFVVQ VLIFVLAQVA WLVLLGLWIY WYTSNYMVIS EVGPRLHSQL MSEGLNHLLL 
IGGLILLIAI STGMSLLFHR LSVQFKLTRL YDNFIANVTH ELKSPLASIQ LSIETMRMHE
LPREKQEEFF NMMLKDTDRL NNLISAILQV PALEQKKIAH DFQVHRMEEL VPELVHESRE
QFSLPEKAIQ ISGDGGCDCV LDRNAFRIVL DNLVDNSIKY SREGVDTAIH IRMACERGKF
ILRFADNGVG IPLQHQEQVF EKFFRSHDTA MPSVKGTGLG LYWVKEIIRI HQGAIRVSSR
GTNKGSTFRI ELPQYAKGTE RAAQRLLRLS RKQKQKDTAH GEGA