Gene Dret_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1201 
Symbol 
ID8419029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1412408 
End bp1413823 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content61% 
IMG OID645037776 
ProductPAS modulated sigma54 specific transcriptional regulator, Fis family 
Protein accessionYP_003198067 
Protein GI258405325 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAGC AAACACACGC GGTCTGGAAT GTCCGCTTTT TGAATCAGAT CATGGATTCC 
ATGGCCGAAG GGGTCTTCAC CCTCGACGTC CAAGGCCGGA TCACCTCCTG GAACAGGTCG
ATGGAGGACA TCACCGGCTA CAGCGCCTCA GAAGCCCTGG GCCGCTCCTG CCGGTTTCTG
GGGTTCAGCC ATTGTCTGGG CACGCTCTGC CCGGCTGATA TCCACCAGTG CGGCATCCTG
CGGCACGAAC AGCCCGAAGC CAAGGAATGT GTCCTGCGCC ATCGCGAGGG CCGGGATGTG
CCGGTCATCA AGCAGGCCCG AGTGGTCAAA GACGACAACG GAGAACTGAT CGGCATTGTC
GAGACCGTGA CCGACATGAC TGAACTCCAG AAGGCCCGGC ACAAGGCTGA AGAGGCAACC
CGACTGTTGG GGCAGCACTA CAGCCTTGGC AATATCATGG GCAAAAGCGA AGTCATGCAG
GAGGTCTTTT CCCGGGTCCG GGCCGCGGCC GCCAGCCGGT CCACCGTTCT CATCCAGGGG
GAAAGCGGGA CCGGGAAAGA ACTCATCGCC CGCGCCATTC ACTACAACAG CGACCAGGCC
GATCAGCCGT TCGTGACCGT CAATTGCTCG GCGCTGACGG AAACTCTTCT GGAAAGCGAG
CTTTTCGGGC ATGTCAAAGG GGCGTTTACC GGTGCTGTGC GCGACCGGGC CGGCCGCTTT
GAAGAAGCGC ATCGCGGATC GATTTTCCTG GATGAAATCG GGGAACTCAG CCAGACCATC
CAGGTCAAGC TGCTGCGCGT CCTGCAGGAA CGGGAAGTCG AACGGGTCGG CGACTCACAA
ACCCGCACGA TCGATATCCG GGTCATCGTG GCTACACACC GGGATCTCAA CGAGCTCTTG
GCCCAGGGCG TTTTTCGTCA GGACCTGTAT TACCGCCTCA AGGTTTTTCC TATCACCCTG
CCCCCATTGC GCCAGCGGCG CGAAGACCTG CCCCTGCTGG TCAACCACTT TATCGCCAAG
CAAAACGAAG CCACCGGAAA ACGGGTCACC GGCCTGGCAC CGGAGGCCAT GCGGCTGGTG
TTCGAGTACC ATTGGCCCGG CAATGTGCGT GAATTGGAAA ACGCCATCGA ACACGCCTTT
GTCCTGACTT CCGGGGAGCA GATCCAGGTC AACGATCTGC CGGCGGAGAT CATAACGCCC
CGGCCGAGCC CGGAAAGAGC CAGGGAAGCC GGCGTTCCCC GGACCGCTGG CAGCAGACAG
CACCATGAAC AGCCCGACCG GGAGCAGTTG CTCGCGCTGC TGGAGGCCAA CCAGTGGAAC
AAGGCCGCGG TGGCCCGGCA ACTCGGGGTG AGCCGGACGG CGGTGTGGAA GTACATGAAA
AAGTGGGGCA TTCCGCTGCA GCCGGAACCA GATTAA
 
Protein sequence
MTEQTHAVWN VRFLNQIMDS MAEGVFTLDV QGRITSWNRS MEDITGYSAS EALGRSCRFL 
GFSHCLGTLC PADIHQCGIL RHEQPEAKEC VLRHREGRDV PVIKQARVVK DDNGELIGIV
ETVTDMTELQ KARHKAEEAT RLLGQHYSLG NIMGKSEVMQ EVFSRVRAAA ASRSTVLIQG
ESGTGKELIA RAIHYNSDQA DQPFVTVNCS ALTETLLESE LFGHVKGAFT GAVRDRAGRF
EEAHRGSIFL DEIGELSQTI QVKLLRVLQE REVERVGDSQ TRTIDIRVIV ATHRDLNELL
AQGVFRQDLY YRLKVFPITL PPLRQRREDL PLLVNHFIAK QNEATGKRVT GLAPEAMRLV
FEYHWPGNVR ELENAIEHAF VLTSGEQIQV NDLPAEIITP RPSPERAREA GVPRTAGSRQ
HHEQPDREQL LALLEANQWN KAAVARQLGV SRTAVWKYMK KWGIPLQPEP D