Gene Dret_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0053 
Symbol 
ID8417856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp67565 
End bp68833 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content59% 
IMG OID645036617 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003196933 
Protein GI258404191 
COG category[T] Signal transduction mechanisms 
COG ID[COG2202] FOG: PAS/PAC domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAG AGTACGCTAC AGACGACCGC ATCCGGGAAT TGGAAGCTGA GTTGGCCGCG 
GCCAAGAAAC GCCTTGCCGC ATTGGAAAAG CGGCAGGAAT CCGACGCCCC GCCGGCCATG
CTCCAGGACG TCCTGGACGG CATCGATATG GATATCCATA TCACCGATAG TCACGAGGGC
ACCATTCTCA TGGCCAATGC CCGGATGTAC ACGACCTATT CGGGGACCAT CGTCGGCCGC
CATTGCTGGG ATGTCATTCG CAACGCGAAT TCGCCTTGCC CGGGGTGCAT GGTCGAGCAG
TTGGATGACC CAAAGGTCCC GTCAGTGCGC TGGGAAGATT ACAATCTCGC CGCCGGCGGC
TGGTTCGAGC ACATCGCCCG CCGCATCACT TGGCAGGACG GCCGTCCGGC GGTCCTGCAC
ATTTCCCGGA ACACAACTGA GGCCCACGAA ACAGCTCGGG CCTATCGACG CAGTGAGCGC
AAATACCGGC GGATGAACCG GCTCTTGCGG CTCATGGCCG ACAACGTCCC GGACCTGATC
TGGGCCAAGG ATCTGGACGA CCAATATTTG TTCGCCAACA AGGCCATTTG TAAGCAATTG
CTGCGTTGCT CAGACCCTGA ATTGCCCCTT GGCAAAACGG ATCTGTATTT CGCTGAGCGC
GAGCGTCGCT GGGGTCACGA CCACACCTTT GGTGAGATCT GTATCGACTC GGACCAAGTG
GTCAAGGCCA ACCGCAAAAG CGGCCGCTTC CTGGAAGACG GGAAGGTCCG CGGCCGATAC
TTGGTGCTCG ATGTTCATAA GGCGCCGTTT TTCGACGAAG ATGGAGCCAT GATCGGCACC
GTGGGCGCGG CACGGGAGGT GACCGAGGAG ATCCAGCGCC AGCAGGAACT CGACGACATT
CAGAGCAAGT ACCGTCTCAT CCTCGAAAAC TGCAACGACG GCATTGGCGT CAACGGTCCC
GACGGCTTCT TTCGCTGGGT TAACCCCAGC CTGTGCCGTA TTTGTGGGTT CTCTGAACAG
GAACTGCTCA AGCACAGGAC AACCACTTTC GTCGACCCGC GGGACCAAAA CTGGGTCTGG
GACTATTTCC GGCAACGACT CGATCCCGAG CAACCCGATC CGGAGCAGGT CTATCCCTTT
CGTATCGTGC GCAAAGATGG GGTCGTCCGT TGGCTCCAGG CCTCGGTGGT CAAGATTGAG
TGGAACGATC GGCCTGCCGC GTTGGTTTTT TATACCGACA TCACAGGTCG GTATCCTCAT
CTGGACTGA
 
Protein sequence
MSEEYATDDR IRELEAELAA AKKRLAALEK RQESDAPPAM LQDVLDGIDM DIHITDSHEG 
TILMANARMY TTYSGTIVGR HCWDVIRNAN SPCPGCMVEQ LDDPKVPSVR WEDYNLAAGG
WFEHIARRIT WQDGRPAVLH ISRNTTEAHE TARAYRRSER KYRRMNRLLR LMADNVPDLI
WAKDLDDQYL FANKAICKQL LRCSDPELPL GKTDLYFAER ERRWGHDHTF GEICIDSDQV
VKANRKSGRF LEDGKVRGRY LVLDVHKAPF FDEDGAMIGT VGAAREVTEE IQRQQELDDI
QSKYRLILEN CNDGIGVNGP DGFFRWVNPS LCRICGFSEQ ELLKHRTTTF VDPRDQNWVW
DYFRQRLDPE QPDPEQVYPF RIVRKDGVVR WLQASVVKIE WNDRPAALVF YTDITGRYPH
LD