Gene Dret_0423 details


Gene Information

Locus tag: Dret_0423
Symbol:
ID: 8418228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 519291
End bp: 520787
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 59%
IMG OID: 645036984
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_003197298
Protein GI: 258404556
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID: [TIGR00229] PAS domain S-box
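
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes one terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, when has_stop is True,
    that the last codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# Values taken from the record above.
length = gene_length(519291, 520787)
print(length, protein_length(length))  # prints "1497 498"
```

Both results match the Gene Length (1497 bp) and Protein Length (498 aa) fields of this record.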


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.268173
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCAAC ACAGGCTTCT TGAAACTTAT GGACTTGATC TTTTGCGGGC GGTGCTCGAG 
ACGACCTCGG ATGCCATCTA CGTCCTGGAC AGTGAGGATC TCGTTGTCGA GCTCAATGCC
CCTGCGGCCG TGCAAAGGGG AGCCCCGTTA GAAACGTTGC GGGGCAGGCC GTGGTATGAC
CTCTTTGACA ATGAGGTCGG GCAGCGGCGG CGACAGATCC TGGCCATGGC CCGGAACCAG
AAAGAGATGC TCCACTATGA GGACCGCATT GAGCCGGGGC TGGTCTTTGA TGTCCGTATC
CAGCCGCTAT TTGACGCCTG CGCCTCGGTG GTGGGCCGGG TGATCTGCAT GCGCGATATT
TCCCAGGCCC GCAAGCAGGA GCGGGAGCGT CTCCGGCTGG GAACGGCCAT CGAGCAGGCG
GTTGAAGCCG TGCTTATCAT GGATGAAAAC CTGGTCATCC AGTACGTCAA CCAATCCTTT
GAGGATCTGA CGGGATATTC CCAGCAGCAG GCCAAGGGGC TGCCTTTGGA TGTCCTGTAC
CAGGATAGCG AGCAGCGGCG TTGGTATGCC GCGATTTCAG CGACGCTGAG CCAGGGCGAG
ATCTGGGTCG GACGGACGCG CAATACCGGC AAGGACGGCC GGTTGTTTCG GGTCGAGAAG
ACCGTGGCCC CAATACGTGG TCAATCCGGG GTTATCCTCG GGTATGTGAG CGTGTGGCGG
GATCTGGGAC CGGTGGAACA GCTGGAACGG CAATTGCGTA CGGCCCAGAA AATGGAGGCC
ATTGGGACCT TGGCTAGTGG GATCGCTCAT GATTTCAATA ACATTTTAGG GCCGATCCTC
TTGAATGCCC AACTCCTCCT GGAGCGATCT CCGCTTTGTG ACGAGGAGCG AGATCTGCTC
CAGGAAATCA ACGATGCCGC TGAGCGGGCC CGCCACTTGG TCCGGCAGAT TCTCCACCTG
GGGCGCCGAC GGGAGGCAGA GCAGCCGGTG CCTTTCCGCT TGAGCACAAT TATCAAGGAG
TGTTGCAAGC TCCTGCGCCC GACCTTGCCC GAGACCCTGC GCATCGAACA CCGTCTGCAT
ACAGAGCAGG ACACCATCGT CGCCGATCCA ACCCAGATCC ACCAGGCAAT CATGAATCTG
TGCACCAATG CAGTGCACGC CATGGATCGG CGGCCGGGGG TCTTGCGGTT TGTCCTCCAA
ACGGCCGATC CGGAAACCAT CAGGACCAAG CCCCAACTCA ACGTCGGCCG TCCGTATATT
TGTTTGGCCA TTGAAGATTC TGGCCGGGGG ATGTCCGAGG AAGTGTTGAG CCAGGTCTTC
GATCCTTTTT TCACCACAAA AAACGACGGG TTGGGCACCG GATTGGGGTT GCCGGTGGTC
CAGAATATCA TCACCCAACT CGGCGGGACA GTAACCGTGC AGAGCGCCCC CGGTCAGGGC
AGTGTTTTTG CTTTGTACCT GCCGCTGGCC CCGGCGGAGG CGTTTATAAA CGAGTGA
 
Protein sequence
MAQHRLLETY GLDLLRAVLE TTSDAIYVLD SEDLVVELNA PAAVQRGAPL ETLRGRPWYD 
LFDNEVGQRR RQILAMARNQ KEMLHYEDRI EPGLVFDVRI QPLFDACASV VGRVICMRDI
SQARKQERER LRLGTAIEQA VEAVLIMDEN LVIQYVNQSF EDLTGYSQQQ AKGLPLDVLY
QDSEQRRWYA AISATLSQGE IWVGRTRNTG KDGRLFRVEK TVAPIRGQSG VILGYVSVWR
DLGPVEQLER QLRTAQKMEA IGTLASGIAH DFNNILGPIL LNAQLLLERS PLCDEERDLL
QEINDAAERA RHLVRQILHL GRRREAEQPV PFRLSTIIKE CCKLLRPTLP ETLRIEHRLH
TEQDTIVADP TQIHQAIMNL CTNAVHAMDR RPGVLRFVLQ TADPETIRTK PQLNVGRPYI
CLAIEDSGRG MSEEVLSQVF DPFFTTKNDG LGTGLGLPVV QNIITQLGGT VTVQSAPGQG
SVFALYLPLA PAEAFINE
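
The gene and protein sequences above are printed in blocks of ten characters, so they need to be de-blocked before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the NCBI amino-acid string shared by translation tables 1 and 11. Alternative start codons permitted by table 11 are not special-cased, which is harmless here because this gene starts with ATG.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G; amino acids for TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
fragment = "ATGGCTCAAC ACAGGCTTCT TGAAACTTAT"
print(translate(fragment))  # prints "MAQHRLLETY"
```

The translated fragment matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field of the record.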