Gene Dret_0447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0447 
Symbol 
ID8418252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp547927 
End bp549243 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content60% 
IMG OID645037008 
Productputative sigma54 specific transcriptional regulator 
Protein accessionYP_003197322 
Protein GI258404580 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTTC CCGACAATTT GCCCTGCGAG GCCATCCTGG ACAGCATTGC CGACGGGGTG 
TTCACCGTGG ACCTCGAATG GACGATCACC TCGTTCAACC GGGCCGCCAC TCAGATTACC
GGCATCGCCC GCCAGGATGC CGTGGGGCAA AAATGCTGGG AGGTCTTGCG GTCTTCGCTG
TGTGACGGCT CTTGTGCCCT GGAGACCTGC CTGCGGGATG CCACCTCCAT CAGCAACAAA
TCCATTTTTA TTATCCGCCC CGACGGGAGC AAAGTGCCCA TCTCCATCAG CGCGGCCCCA
TTGCACAACC ACCAGGGCGA ATGCATCGGC GGCGTGGAGA CCTTTCGGGA CCTGAGCGCC
ATCCAGGTCA TGCGCCAGGA GATGGAGCAA CGCTACACCT TTGAAGACAT CGTCGGCAAA
AGCGAAGCCC TGTCCAAGAT CTTCCGCATC TTGCCGCAGG TGGCCCAGAG TCCGTCCTCG
GTTCTGCTGA CCGGCGAATC GGGCACAGGC AAGGAATTGT TTGCCCGCGC CCTGCACAAC
CTCAGCCCGC GCCGACAGGG CCCTCTGGTT GTAGTCAACT GCGGCGCCCT TCCCGAGCAC
CTCCTGGAAT CCGAACTTTT CGGCTACAAG GCCGGTGCGT TCACCGACGC CAAAAAAGAC
AAACCCGGCC GATTCCAACT CGCCGATGGC GGGACCCTGT TTTTGGACGA GATCGGCGAT
CTGCCTCTGG CCCTGCAGGT CAAACTGTTG CGGGTGTTGC AGGAAAAACA GGTCGAACCC
CTCGGCGCTG TGGCTCCGGT GCCTACGGAC GTGCGGATCA TCGCCGCCAC CAACCGGGAC
CTGGAACACC TTGTTCGCGA AGGCCGGTTC CGCGAGGACC TCTTCTACCG TCTCAATGTC
GCCCAACTCC GACTGCCCCC ACTGCGCGAG CGGCAGGAAG ACATCCCCTT GTTGGCCAAC
CATTTCATAC GGCGCTTCAA CCTCCTGCAA GGTAAAGAGG TCCAGGGCAT TTCCGAGGAC
GCTCTGGCCA CCCTTATACG TCACGATTTT CCCGGCAACA TCAGGGAGTT GGAAAACATC
CTCGAATACA GTTTCATCCT GTGCTCTTCA GGATTCATCC AACTCGAGCA TTTGCCGGAA
AAATTCCATC CTCAGGAGGC AAGTTCCCCG AACACCAATG CCCCCATGAC CATGGAAGAG
GTCAAGGTCC AAGCCGCCCG GCAGGCCCTG GCCCGCAACC AGGGCAAAAA AATGGCCGCC
TGCCGCGAAC TCGATATCAG CAAGGATACC CTGCGCCGCC TTTTGCGCCA GGGGTAA
 
Protein sequence
MAFPDNLPCE AILDSIADGV FTVDLEWTIT SFNRAATQIT GIARQDAVGQ KCWEVLRSSL 
CDGSCALETC LRDATSISNK SIFIIRPDGS KVPISISAAP LHNHQGECIG GVETFRDLSA
IQVMRQEMEQ RYTFEDIVGK SEALSKIFRI LPQVAQSPSS VLLTGESGTG KELFARALHN
LSPRRQGPLV VVNCGALPEH LLESELFGYK AGAFTDAKKD KPGRFQLADG GTLFLDEIGD
LPLALQVKLL RVLQEKQVEP LGAVAPVPTD VRIIAATNRD LEHLVREGRF REDLFYRLNV
AQLRLPPLRE RQEDIPLLAN HFIRRFNLLQ GKEVQGISED ALATLIRHDF PGNIRELENI
LEYSFILCSS GFIQLEHLPE KFHPQEASSP NTNAPMTMEE VKVQAARQAL ARNQGKKMAA
CRELDISKDT LRRLLRQG