Gene Dole_2105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2105 
Symbol 
ID5694948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2556736 
End bp2558502 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content60% 
IMG OID641264706 
ProductPAS modulated sigma54 specific transcriptional regulator 
Protein accessionYP_001529986 
Protein GI158522116 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0394] Protein-tyrosine-phosphatase
[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00514129 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGG GCACCATTCT TTTTTTGTGC AGGGATAACA GCGCCAGAAG CCAGATGGCA 
GAGGGCTTTG CCAGGCAAAT GGCCGGTGAC AATATTTCGA TTTTCAGTGC GGGCATCACG
CCCGATCAGG AGGTCCACCC CATGGCCGTG GAGGTGATGG CCGAGCATGG CATTGATATT
TCCGGCCATC GGCCCAAGGC GGTTTCAGCG TTGAGAGCGG GCCATTTCGA CCTTGCCGTG
GACCTCTGCC AGACCCTTGG CCAGGAGTTT CCCATGCTGG CCGGATTCCC CCCCCTGGTG
TGCTGGACCG TGGCCGATCC GGCGGAAGCT GTGGGGGATC TTGAGGGCCA ACGGGTGGCA
TTCCGGGAAG CGGCCCGGAT TATAAAGGAC TTGGTCCACG ACCTTCTGAA CCGGGGATAT
TACGCCTCTT TTTCTCTATA CAAGGCCAAT ATCGAACGGC TTATCGACAA CCTTCACGAG
GGGGTTCTGG CCCATGACCT GGGCCGGAAA ATCTTTTTTT TCAGCAAAGG GGCTGAAAGG
ATCACCGGCC TGTCCGCCGT GGACGTGATC GGTAAAAACT GTCACGACGT GTTTGTTCCC
CGCCTGTGCG GAGAGAACTG CTCTTTCTGT GATGGGTGCG AACCCCCGAC GTTTCAGAAA
AAGAGTTATT CCACCGTGGC GCCGGAAATT GAGGGCCAGC GCAAAGAGCT GGATGTGACG
GTGGTGCCCC TGCGGGACCC GGCCGGCCGT ATTCAGGGCG TCGTGGCGGC TCTGGCCGAC
CAGACCGCCT TCAAGGAGGC GGTCCGCGGC CAGAAGGGGG AGGATGGATT TGCCGGCATC
ATCGGCCGAA CACCGGAGAT GCGAAGCCTT TTTCACCAGA TTCGCGACCT GTCGGTCTAT
GATGTGCCGG TGAATATCAG CGGTGAGACC GGCACCGGGA AAGAACTGGT GGCCCGGGCC
ATTCACGGCG AAAGCACCCG GCGTAACGGA CCGTTTGTGC CCATCAACTG CGGTGCCCTG
CCCGAGGGGC TGGTGGAAAG CGAACTGTTC GGCCATGTGC GGGGCTCTTT TTCTGGGGCC
GTGCGTGACA AGAAAGGCCG GTTTGAGCTG GCCCATAACG GGACCATCTT TTTAGACGAG
GTGGCCGAGC TGCCCATGTC CACCCAGGTC AAGCTGCTGC GGTTTCTCCA GGAGGGGGTC
CTGGAAAAGG TGGGCAGTGA AAAACAGACC TCGGTGGATG TGCGGGTGAT CAGCGCCACC
AACAAGAACC TGAAAAAAGA GGTGGCAAAG GGGACGTTCC GCGAAGACCT TTACTACCGG
CTCAACGTGG TGCCCATTCA CCTGCCGCCG TTGCGCATGC GGAAAAACGA CATTCCCCTG
CTGGCCAACT ATTTTGTCAG GCATGCGGCC ATGGGCGCCC GAACCGGCAA TGTCACCATC
ACCGATGATG CCATGGGCCT GCTGGCGGAA TACGCGTGGC CCGGCAATGT GCGGGAGCTT
CAGAATATCA TTCAGTTCCT GGTAATCAAG GCGTCCGGTA ACAAGATCAC GGCGGCCCAT
CTGCCGCCGG AAATTCAGGG CGACGGCACG CCGCTTCCCC AGAAGCGGGG CCGGCGCAAC
AAACTGGACA CGGGCAGCGT GGAAACGGCC CTGGCAAAAG CCGGCGGCAA CAAGGCCAAG
GCGGCCCGCC TGCTGGGCGT GGGCCGGGCC ACCCTCTACC GGTTTCTCAA CGACCACCCC
GATATTGTTG CTGATGAAGA GATCTGA
 
Protein sequence
MNKGTILFLC RDNSARSQMA EGFARQMAGD NISIFSAGIT PDQEVHPMAV EVMAEHGIDI 
SGHRPKAVSA LRAGHFDLAV DLCQTLGQEF PMLAGFPPLV CWTVADPAEA VGDLEGQRVA
FREAARIIKD LVHDLLNRGY YASFSLYKAN IERLIDNLHE GVLAHDLGRK IFFFSKGAER
ITGLSAVDVI GKNCHDVFVP RLCGENCSFC DGCEPPTFQK KSYSTVAPEI EGQRKELDVT
VVPLRDPAGR IQGVVAALAD QTAFKEAVRG QKGEDGFAGI IGRTPEMRSL FHQIRDLSVY
DVPVNISGET GTGKELVARA IHGESTRRNG PFVPINCGAL PEGLVESELF GHVRGSFSGA
VRDKKGRFEL AHNGTIFLDE VAELPMSTQV KLLRFLQEGV LEKVGSEKQT SVDVRVISAT
NKNLKKEVAK GTFREDLYYR LNVVPIHLPP LRMRKNDIPL LANYFVRHAA MGARTGNVTI
TDDAMGLLAE YAWPGNVREL QNIIQFLVIK ASGNKITAAH LPPEIQGDGT PLPQKRGRRN
KLDTGSVETA LAKAGGNKAK AARLLGVGRA TLYRFLNDHP DIVADEEI