Gene Dret_0201 details


Gene Information

Locus tag: Dret_0201
Symbol:
ID: 8418005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 253513
End bp: 254739
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 56%
IMG OID: 645036766
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003197081
Protein GI: 258404339
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
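
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the source record: the variable names are illustrative, and it assumes that the start and end coordinates are 1-based and inclusive and that the reported gene length includes one terminal stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 253513
end_bp = 254739
gene_length_bp = 1227
protein_length_aa = 408

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not part of the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, span_bp % 3, coding_codons)
```

Under these assumptions the span equals the reported gene length, is a multiple of three, and yields the reported protein length once the stop codon is excluded.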


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACCAC AAACAACCCT TGAAGACATC ATCGGCATCG AGCATTCCAA ACTCGGTTTT 
TTCCAGGAAG TCCAGCGCAA GGTCGCTGAA CTCAGACACT CCAACGAAGA GCTTGAACAC
AAGCAACGCG AAATCCAGGC CATTTTGGAC GGCATCTCGG ATATCATGCT CGTGCTCTCG
GCAGATATGC GCATTTTGTC CGTCAATCAG GTCTTTTTCG ACCATTTTGA AGAGCCCAAC
CCGGTCGGAA AATATTGTTT TGAGGTTTTC CGCGGCCAGG CCAAGGCGTG CCGGGGGTGT
CCGGCCCGCG AATCCATGCG CTCCGGGGAA GTCTGCAAGG AAACCGCCAT CTTCAAAGTC
AATGGACGCA ACCGGCAATA CGATATGTTG GCCTCTCCCT TGCAGGACAC CCCAGGACAA
GAAGGCCGGG TCCTGTTGTT CAAGCGGGAC GTGACCCTGG AAAAGGAGTA TCAGGCCCAA
TTCTATCAAG CCGAGAAAAT GGCCACCATT GGCATGCTCG CCGCAGGTGT GGCCCATGAA
GTCAACAATC CCCTGGCCGG CATCCAGGGC TTCGCTCAGG GGATCTTGCG GCGTCTGCCC
CGGGTTCAAG AGGCTGTGGA CGAGGAATTG GCCAACGACT TTCAGGAGTA CACCGAGACC
ATCCTCCAGG AATGCAACCG GTGCCAGGAA ATCGTGCGCA CCCTGTTGAC CTTCAGCCGC
CCCAGGGAAT CCGCCTTTTG CTCCTTGAGC ATGAACAACC TCATCCGGGA CACCTTGAAA
GTTCTCCAGC ACCGGATTAA ACGCCATCAG GGTCTGTTGC TCCAGGAAGA CCTGGAACCG
GAATTGCCCT TGGTTTGCGG CGACGAGCCC CATCTGAAGC AGGTCATTCT CAATCTGTTG
ACCAATGCCC TGGACGCCAT CGGCGAAGAG GGGGTGATCA CCATCCGTAC CTCGAGTACG
GAGAGTCAGG TCGCATTGTG CGTCGAGGAT ACCGGCGAGG GGATCCCGGA AGAACATCTG
GACAAACTCT TCGAGCCCTT TTTCACGACG AAGCCGGTCG GGCGTGGCAC GGGCATCGGC
CTGTCAACCT GCTACACCAT CGTCCATAAC CACGGCGGTG AGATTTCCGT CAAAAGCGCC
CCTGAAAAGG GGAGCCGCTT TTGTGTTGTC CTCCCTAAAC GGCAAGGAAA CGCTTGTGAA
CAGCCAATCA CTTGTACTTA TCGTTGA
 
Protein sequence
MAPQTTLEDI IGIEHSKLGF FQEVQRKVAE LRHSNEELEH KQREIQAILD GISDIMLVLS 
ADMRILSVNQ VFFDHFEEPN PVGKYCFEVF RGQAKACRGC PARESMRSGE VCKETAIFKV
NGRNRQYDML ASPLQDTPGQ EGRVLLFKRD VTLEKEYQAQ FYQAEKMATI GMLAAGVAHE
VNNPLAGIQG FAQGILRRLP RVQEAVDEEL ANDFQEYTET ILQECNRCQE IVRTLLTFSR
PRESAFCSLS MNNLIRDTLK VLQHRIKRHQ GLLLQEDLEP ELPLVCGDEP HLKQVILNLL
TNALDAIGEE GVITIRTSST ESQVALCVED TGEGIPEEHL DKLFEPFFTT KPVGRGTGIG
LSTCYTIVHN HGGEISVKSA PEKGSRFCVV LPKRQGNACE QPITCTYR
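
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch in Python. This is an assumption-laden example rather than the database's own method: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is built from the standard NCBI ordering of bases (`TCAG`). Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, so this sketch would translate an alternative start codon such as GTG literally rather than as methionine. That does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (first, second, third base each cycling through T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()

def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 27 bases of the gene sequence listed above.
first_block = "ATGGCACCAC AAACAACCCT TGAAGACATC"
last_block = "CAGCCAATCA CTTGTACTTA TCGTTGA"

print(translate(first_block))  # MAPQTTLEDI
print(translate(last_block))   # QPITCTYR
```

The two printed fragments match the first ten and last eight residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the reported GC content.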