Gene Dret_0831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0831 
Symbol 
ID8418649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp983387 
End bp984940 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content58% 
IMG OID645037399 
Producttranscriptional regulator, NifA subfamily, Fis Family 
Protein accessionYP_003197700 
Protein GI258404958 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID[TIGR01817] Nif-specific regulatory protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0724194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.196373 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGT GCATGCCGTT TCTCAATACC CTCAAGCAGG TTTTGAGCGA ACTCGATCCC 
CAAAGCCCGT TGCAAACCGG TTTGCAACGG CTTTTGGATA TCATTGCCGC CAACCACGGG
TACGAACGGC TCTCTCTGGC CATCTTCGAC CCCCAGACCG CAACCCTGCA ATTCCATCTC
AACTACGGAG ACGACGCAGC TGCGGACGTG CGCTACGCTC CGGGCCAGGG TATTTCCGGC
CAGGTACTGG CCTCGGGGTC ACCGTCTATT ATTCCGCGCA TGGCGGACAA CCCCGAATTC
CTGAACCGCG CCTTTGGCCG TCCTCCGGAC GAACTGGCCG CGTGCGCCTT TATCTGTGTG
CCCATTTTGT TGCCAAGCTC TGCCGAAGAG GGCAAAAGCC AGGAGACCAT CGGTGTCTTG
AGCGCTGACC TGGCCACCGC TCCCAAGGAA AAGCTCGAAG AGCACTGCCG GTTTCTGGAA
ACCTTGGCCG GGATTATCGC CCGGCAGGCC GCCCATCTCC AGGACGAACT CGCCCGGCGC
GAACAATGGC AACGCCTCGG ACTTTTGCAC CAGGACGGCG ATTTGCTCCA GGTCGACACC
GAAGAGATCA TCGCCTTCTC CAAGACCATG GGCATGGTCC TGCAACAGAT CTATCAGGTC
GCACCGAGCC GGGCCACGGT CTTACTGCGC GGCGAATCCG GGACCGGGAA AGAACTCCTG
GCTGAGGCCA TCCACCGCGC CAGCCCCCGC CGTGACAAGC CCTGCATCAA GCTCAACTGC
GCTGCGCTTC CTGCAGACCT GTTGGAAAGC GAATTGTTCG GTCACGAAAA AGGGGCCTTT
ACCGGGGCGG TCAGCGCCAA AAAAGGCCGC TTTGAAATGG CCCATGAAGG GACGCTTTTT
CTGGATGAGA TCGGCGAGCT CAGTGCCGAG GCCCAGGCCA AACTCCTGCG GGCCATCCAG
GAGGGCGAAA TCCAGCGCCT GGGCAGCGAA CGCCCCATCA AGGTCGATGT CCGCCTCGTT
TGCGCCACCC ACCGTCCTCT GGAAATGCTC CTTGAAGATG GCAGCTTTCG CGAGGATCTG
TATTACCGGA TCAATGTTTT TCCGGTTTTC ATCCCCTCCC TGCGCGAGCG CCGCGACGAT
ATCATCCCTT TGACCGAACA TTTCCTGTCC TATTTCGCCC GGGAGTACCA AAAAAGTATC
AAGCGGGTCT CCTCACCGGC TATCGATCTC CTCGTCCAAT ACCATTGGCC GGGAAACGTC
AGGGAATTGC GCAACTGCAT TGAACGCGCT GTTTTGTTAT GCAACGAAGA TGTTATCCGG
ACCTACCATC TGCCGCCGTC ATTGCAGACC GCTGAGAGTT CGGCCACGGA CACCGATCTC
TCCTTCGGCG AGGCTGTGGC CCGTTTCGAA CAGGAACTCC TGGTCGAGGC GCTGAAAAAA
ACCAAAGGAA ATATGCTCCA GGCGGCCCGC AATCTGCGCG CCAGCTACCG GATCATCAAT
TACAAGGTCA AAAAATACGG GATCGACGTC AAACGGATCT CAGGAAAAAA ATAA
 
Protein sequence
MTECMPFLNT LKQVLSELDP QSPLQTGLQR LLDIIAANHG YERLSLAIFD PQTATLQFHL 
NYGDDAAADV RYAPGQGISG QVLASGSPSI IPRMADNPEF LNRAFGRPPD ELAACAFICV
PILLPSSAEE GKSQETIGVL SADLATAPKE KLEEHCRFLE TLAGIIARQA AHLQDELARR
EQWQRLGLLH QDGDLLQVDT EEIIAFSKTM GMVLQQIYQV APSRATVLLR GESGTGKELL
AEAIHRASPR RDKPCIKLNC AALPADLLES ELFGHEKGAF TGAVSAKKGR FEMAHEGTLF
LDEIGELSAE AQAKLLRAIQ EGEIQRLGSE RPIKVDVRLV CATHRPLEML LEDGSFREDL
YYRINVFPVF IPSLRERRDD IIPLTEHFLS YFAREYQKSI KRVSSPAIDL LVQYHWPGNV
RELRNCIERA VLLCNEDVIR TYHLPPSLQT AESSATDTDL SFGEAVARFE QELLVEALKK
TKGNMLQAAR NLRASYRIIN YKVKKYGIDV KRISGKK