Gene Dret_0868 details

Gene Information

Locus tag: Dret_0868
Symbol:
ID: 8418687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1025856
End bp: 1027808
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 58%
IMG OID: 645037437
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_003197737
Protein GI: 258404995
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
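
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1025856
END_BP = 1027808
GENE_LENGTH_BP = 1953
PROTEIN_LENGTH_AA = 650


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the end minus start plus one equals the 1953 bp gene length, and 1953 / 3 = 651 codons, that is 650 residues plus one stop codon.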


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAATG CGCTGTTACT TGTCGACGAC GAAGAAGGGA TTCGCAAAGT CCTCGGCATC 
ACCCTGGCCG ATAGCGGCTA CACCGTCCAT ACCGCGGCCA ACGGCAATGA GGCCCTGGAG
GTCTGGCGCG AACACAGGCC AGGCATCGTA CTCACCGATA TCAAAATGCC GGGCATGGAC
GGCCTGGAAC TTCTCCAGGC GCTCAAGGCT GAAGATCCGG AAACCGAAGT GATCATGATC
ACCGGACACG GGGACATGGA CTCGGCCATC GAAAGCCTGC GCAACGACGC CGCCGACTTC
ATCACCAAGC CCATCAACGA CGAAGTCCTC GAATTTTCTC TGCGCCGGGT GCGTGAACGC
ATTCGCATGC GTCGCCAATT GCAGGAATAC ACACAGAATC TGGAAAACCT GGTCCGGGAA
AAGTCCCAGC GCGTGGTCCA ATTGGAACGC CAGCTGGCCG TGGGCCAGGT CGTCGAGGCT
CTCGGCTCGG CTGTGGCGGG ACTCAGCCGG GAAATGGATC ACGGCGATCC GCCCGTTTTC
AACGAGTTGC CGTGTTTTGT GGCCATCCAC AACCGGGACC AGAAGGTCGT TGCCACGAAT
GCGCTCTATG CGGAACGGTT CGGCGATCGC AGCGGGAAAC CGAGCGCCGA GGTGTATTGC
GGCCAAGCCG CTTCCCCGGA TGAATGTCCT GTGGGCAGAA CCTTGGCCAC CGGTCAGCCC
CAGCGCTTCC AGGTTACGGC GTGCAGTGCC GAAGGACGGG AATTGCCGCT TATGGTCCAC
ACTGCGCCAG TGCGCAATCA CGACGGTGAT GTGGACCTGG TCCTCGAAAT CTCGGTCGAT
GTCAGTGAAA TAACGCGCCT GCGCGACGAA TTGCGTTTTA CCCGGCAACG CTATCAGCAG
CTGTTCGACG CCGCTCCTTG CTATATCTCC GTGCAGGATA TGGACCTGCG CATCGCTGAA
ACCAACGATC TCTTCAAACA GGATTTCGGG GATTTTTCCG GGGAGTATTG CTACCAGGTC
TACCGTCACC GGAGCCGCCC CTGCGACAAC TGCCCAGCCT TCGAGACCTT TGAAAGCGGA
GAGCCGTTTC ATACTGAAGA GATTGTCACT TCCCGCACCG GAGAGCGCTA CAACGTTCTG
GTCTGGACCG CTCCGATCCG CGATACCAGC GGGGAAGTGG TCCAGGTCAT GGAACTGGCC
ACCAACATCA CCCAGATCCG TCAACTCCAG GACCATTTGA CCTCCTTGGG CATGCTCCTT
GGTTCGGTTT CCCACGGCGT CAAGGGGATG TTGACCGCCC TGGACGGCGG AATCTATCGC
CTGGAGTCCG GACTGAAAAA ACAGGATTTC GAGCGTATCT CCTCCGCCTC CCAGGTGATC
AAGAAGCAAG TTGGCAAAAT CCAGCGTATG GTCCTGGATA TCCTGTATTA TGCCAAATCC
CGCGAAATCA ATTGGGACGT CGTCTCCGCC CGGGAATTGG CCGAGGAAAT CTTGGACACG
GCCGGCTCCC GGGCCGAACA GGCCGGCGTG CGGCTGCATT CAGACATCCA GGAAGACCTC
CCGAAATTCG AAGCCGATCC CACGGCTGTC TCCTCGGCCG TGGTCAACTT CCTGGAAAAC
GGCGTCGATG CCTGCACAAC AGGCCGCAAC GCGGACCCAC AGATCCATTT TAGCGTCGAG
AGCACAAAAG ACCACGTCGT CTTCAGCGTT GAGGACAACG GCCCTGGCAT GGACCAGGAG
ACCCAGGACA AGATCTTCAG TCTGTTCTTT TCCACCAAGG GCAATAAAGG TACCGGGCTG
GGCCTTTTTA TCGCCAACCA GGTCATTGAA CAGCACGGTG GCCGCATTGA GGTTTCCTCC
GAACTCGGCA ATGGCACCCG ATTTATCATC CGCTTACCGC GGCAACTGCC GTCGCATATC
AAAAACGCCC CACAACCCCG TTCGACATCC TGA
 
Protein sequence
MDNALLLVDD EEGIRKVLGI TLADSGYTVH TAANGNEALE VWREHRPGIV LTDIKMPGMD 
GLELLQALKA EDPETEVIMI TGHGDMDSAI ESLRNDAADF ITKPINDEVL EFSLRRVRER
IRMRRQLQEY TQNLENLVRE KSQRVVQLER QLAVGQVVEA LGSAVAGLSR EMDHGDPPVF
NELPCFVAIH NRDQKVVATN ALYAERFGDR SGKPSAEVYC GQAASPDECP VGRTLATGQP
QRFQVTACSA EGRELPLMVH TAPVRNHDGD VDLVLEISVD VSEITRLRDE LRFTRQRYQQ
LFDAAPCYIS VQDMDLRIAE TNDLFKQDFG DFSGEYCYQV YRHRSRPCDN CPAFETFESG
EPFHTEEIVT SRTGERYNVL VWTAPIRDTS GEVVQVMELA TNITQIRQLQ DHLTSLGMLL
GSVSHGVKGM LTALDGGIYR LESGLKKQDF ERISSASQVI KKQVGKIQRM VLDILYYAKS
REINWDVVSA RELAEEILDT AGSRAEQAGV RLHSDIQEDL PKFEADPTAV SSAVVNFLEN
GVDACTTGRN ADPQIHFSVE STKDHVVFSV EDNGPGMDQE TQDKIFSLFF STKGNKGTGL
GLFIANQVIE QHGGRIEVSS ELGNGTRFII RLPRQLPSHI KNAPQPRSTS
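
The relationship between the gene sequence and the protein sequence above can be reproduced with a small translation routine. This is a minimal sketch, not the database's own pipeline: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. It strips the spaces that separate the 10-base blocks before processing.

```python
# Codon table built in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence from this record.
FIRST_LINE = "ATGGACAATG CGCTGTTACT TGTCGACGAC GAAGAAGGGA TTCGCAAAGT CCTCGGCATC"
LAST_LINE = "AAAAACGCCC CACAACCCCG TTCGACATCC TGA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MDNALLLVDDEEGIRKVLGI
    print(translate(LAST_LINE))   # KNAPQPRSTS
```

The two outputs match the first twenty and last ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the reported GC content field.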