Gene Information |
Locus tag | Dole_2105 |
Symbol | |
ID | 5694948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 2556736 |
End bp | 2558502 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641264706 |
Product | PAS modulated sigma54 specific transcriptional regulator |
Protein accession | YP_001529986 |
Protein GI | 158522116 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0394] Protein-tyrosine-phosphatase [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
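The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length, codons - 1


# Values copied from the record: Start bp 2556736, End bp 2558502.
gene_length, protein_length = cds_lengths(2556736, 2558502)
print(gene_length, protein_length)  # 1767 588
```

The strand ("-") does not affect the length calculation; it only means the gene sequence is read from the reverse complement of the replicon between these two positions.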
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00514129 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGG GCACCATTCT TTTTTTGTGC AGGGATAACA GCGCCAGAAG CCAGATGGCA GAGGGCTTTG CCAGGCAAAT GGCCGGTGAC AATATTTCGA TTTTCAGTGC GGGCATCACG CCCGATCAGG AGGTCCACCC CATGGCCGTG GAGGTGATGG CCGAGCATGG CATTGATATT TCCGGCCATC GGCCCAAGGC GGTTTCAGCG TTGAGAGCGG GCCATTTCGA CCTTGCCGTG GACCTCTGCC AGACCCTTGG CCAGGAGTTT CCCATGCTGG CCGGATTCCC CCCCCTGGTG TGCTGGACCG TGGCCGATCC GGCGGAAGCT GTGGGGGATC TTGAGGGCCA ACGGGTGGCA TTCCGGGAAG CGGCCCGGAT TATAAAGGAC TTGGTCCACG ACCTTCTGAA CCGGGGATAT TACGCCTCTT TTTCTCTATA CAAGGCCAAT ATCGAACGGC TTATCGACAA CCTTCACGAG GGGGTTCTGG CCCATGACCT GGGCCGGAAA ATCTTTTTTT TCAGCAAAGG GGCTGAAAGG ATCACCGGCC TGTCCGCCGT GGACGTGATC GGTAAAAACT GTCACGACGT GTTTGTTCCC CGCCTGTGCG GAGAGAACTG CTCTTTCTGT GATGGGTGCG AACCCCCGAC GTTTCAGAAA AAGAGTTATT CCACCGTGGC GCCGGAAATT GAGGGCCAGC GCAAAGAGCT GGATGTGACG GTGGTGCCCC TGCGGGACCC GGCCGGCCGT ATTCAGGGCG TCGTGGCGGC TCTGGCCGAC CAGACCGCCT TCAAGGAGGC GGTCCGCGGC CAGAAGGGGG AGGATGGATT TGCCGGCATC ATCGGCCGAA CACCGGAGAT GCGAAGCCTT TTTCACCAGA TTCGCGACCT GTCGGTCTAT GATGTGCCGG TGAATATCAG CGGTGAGACC GGCACCGGGA AAGAACTGGT GGCCCGGGCC ATTCACGGCG AAAGCACCCG GCGTAACGGA CCGTTTGTGC CCATCAACTG CGGTGCCCTG CCCGAGGGGC TGGTGGAAAG CGAACTGTTC GGCCATGTGC GGGGCTCTTT TTCTGGGGCC GTGCGTGACA AGAAAGGCCG GTTTGAGCTG GCCCATAACG GGACCATCTT TTTAGACGAG GTGGCCGAGC TGCCCATGTC CACCCAGGTC AAGCTGCTGC GGTTTCTCCA GGAGGGGGTC CTGGAAAAGG TGGGCAGTGA AAAACAGACC TCGGTGGATG TGCGGGTGAT CAGCGCCACC AACAAGAACC TGAAAAAAGA GGTGGCAAAG GGGACGTTCC GCGAAGACCT TTACTACCGG CTCAACGTGG TGCCCATTCA CCTGCCGCCG TTGCGCATGC GGAAAAACGA CATTCCCCTG CTGGCCAACT ATTTTGTCAG GCATGCGGCC ATGGGCGCCC GAACCGGCAA TGTCACCATC ACCGATGATG CCATGGGCCT GCTGGCGGAA TACGCGTGGC CCGGCAATGT GCGGGAGCTT CAGAATATCA TTCAGTTCCT GGTAATCAAG GCGTCCGGTA ACAAGATCAC GGCGGCCCAT CTGCCGCCGG AAATTCAGGG CGACGGCACG CCGCTTCCCC AGAAGCGGGG CCGGCGCAAC AAACTGGACA CGGGCAGCGT GGAAACGGCC CTGGCAAAAG CCGGCGGCAA CAAGGCCAAG GCGGCCCGCC TGCTGGGCGT GGGCCGGGCC ACCCTCTACC GGTTTCTCAA CGACCACCCC GATATTGTTG CTGATGAAGA GATCTGA
|
Protein sequence | MNKGTILFLC RDNSARSQMA EGFARQMAGD NISIFSAGIT PDQEVHPMAV EVMAEHGIDI SGHRPKAVSA LRAGHFDLAV DLCQTLGQEF PMLAGFPPLV CWTVADPAEA VGDLEGQRVA FREAARIIKD LVHDLLNRGY YASFSLYKAN IERLIDNLHE GVLAHDLGRK IFFFSKGAER ITGLSAVDVI GKNCHDVFVP RLCGENCSFC DGCEPPTFQK KSYSTVAPEI EGQRKELDVT VVPLRDPAGR IQGVVAALAD QTAFKEAVRG QKGEDGFAGI IGRTPEMRSL FHQIRDLSVY DVPVNISGET GTGKELVARA IHGESTRRNG PFVPINCGAL PEGLVESELF GHVRGSFSGA VRDKKGRFEL AHNGTIFLDE VAELPMSTQV KLLRFLQEGV LEKVGSEKQT SVDVRVISAT NKNLKKEVAK GTFREDLYYR LNVVPIHLPP LRMRKNDIPL LANYFVRHAA MGARTGNVTI TDDAMGLLAE YAWPGNVREL QNIIQFLVIK ASGNKITAAH LPPEIQGDGT PLPQKRGRRN KLDTGSVETA LAKAGGNKAK AARLLGVGRA TLYRFLNDHP DIVADEEI
|
| |
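The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it builds the codon table from the standard NCBI ordering (translation table 11 uses the same amino-acid assignments as the standard code and differs in which start codons are permitted), strips the spaces that separate the 10-base groups, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# third position varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAATAAGG GCACCATTCT TTTTTTGTGC"))  # MNKGTILFLC
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` in the same way would be expected to give the full protein, since the gene is listed in its coding orientation.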