Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_3113 |
Symbol | |
ID | 3758782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 3102377 |
End bp | 3105265 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637784023 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_389602 |
Protein GI | 78358153 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0730498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTAC CGTTATCTGC TCCGCAGACA ACGCGCACCG TACAGTGCCC CCCCTGCGCG CCGGATAACG CCATGCACCC GCGCGCGGTT CTGCCGGCCC TGTTCCTGCT TGCGGCTGCA GTTTTCTTCC TGTGCTGGGC ACCGCCCCGG AACGCGTCGG CGCAGCCCCC TCCCGGCACA GGGCGCACGC ACTTTCTGCT GGTACATTCC TATCACCCCA ACATGCTGTG GGTGCAGGAT ATCAACAGCG GCGTTCAGGA AGTTCTGGGG CCGGCCATAG ACGCTTCCAA CGGCACCATG CGGCTTTCAG TCGAATTCAT GGACACCAAG CGGCATCCCG ATGCAGCATA CCGCGCCGAC ATACTGGCCC TGCTGCGCGA CAAATACGCG CAGGACCCGC CCGACGTCAT CATTACCGCC GACAACATCG CCCTGAACAC CCTGCTGGAC GAACACGAAG CCATCGCCCC CGGCAGCCGT GTGGTCTTCT GCGGGTACAA CAACTATACC CCCGACGCAC TGCGCGGGCA TGACGACGTC ACGGGCATCG CCGAAAAAGT CAGCATACGC GAAACCATAC AGGCTGCGGC AGCCATGCTG CCGCAGACAT CGCGGTTTAT TGTCATCAGC GACCGTTCGG AAACCGGCCG TGCCATGACG CACGAACTGG CGCTGCAGAC TGCCGGTATA CCGGAACACG GGCGCATGGA ACTGTGGGAC GCCTATACCT TTGCGGATCT GGGCAGACAG CTTGCCGCGC TGCAGGGTAC CGAAACCATT GTGCTGCTTT CGGCCCTGCA GGACACAGCG GGCAATGTGC AATCGTACAG GCATAGCCTG CGCCATATCC TTGCGGCGAC AGACAGGCCG GTTTTTGCCG TTTTCGGCTT TTATGCCGAC AAAGGCATTG TGGGCGGAAA ACTGACGGAC GGAGTCATGC AGGGACGCGA AGCCGCCCGC ATGGCGCTGC GCATAGCACA GGGCACCCCG CCATCGCGCA TCCCCGTCAT CACCGAAAAT ATCAACCGGT TTGTATTCAA CCACGAGCTG CTGGCCCGGT ACGGTATTTC CTCTGACAGC ATTCCCGCAG AAAGCACCGT GCTGAACAGG CCTCCCTCTT TCTTCACACG GTACCGGCAG GTGCTGCTGC CCGCCGGTGT GCTGGTGGCC CTGCTGGTAC TGGCACTGGC TCTTGAATCG CACCACCTTG TCAGACAAAG AGCCGTGGAG CAGAGCCTGC GCGAAGCCAA AACCCGCTAT CGCGAACTGG TGGACAACGC GCGTTCGGTC ATCATGCATA TCGACCGCAA CGGGCAGATA GAATTCATCA ACGAGTACGG ACTGTCCTTT TTCGGATACG AGGAGCATGA GCTGACAGGA AAAAGCGTTG CGGCGACAAT TCTGCCGCAA AACGCGGACG GCACCCCGTA TTCCGAACTG CTGCGGCGTA TTCTGGAAAA TCCGGAAACC TATGCCTGCC ATGAAAATGA AAACATACGC AAAAACGGCG AACGGGTCTG GATATCATGG CTCAACAGGC CGCTGCGCGA TGCTTCCGGC AACGTGACCG GATTGCTTTC CGCCGGGCAG GATGCCACCG AACGCAAAAA AGCACAGGAC GCTCTGGCAG AACGTGAACG CAGCTATTCG GTATTGCTTT CAAACCTGCC GGGCATGGCT TTCCGCAGGC AGACCGGCGG CGACGGCACG TTTCTTTTTG CCAGCGAAGG CTGTCTGCCG CTGACAGGGT TTGCCCCCGA TGATTTTGTG CAGCACGGCA GAAATCTGCG GGGGCTGGCC CACCCCGAAG ACCTGCCCCG CATTACCGGC GCCATAGACG CATCGCCGCA CCGCTACGCT GTCGAATACC GCATAATCCG TTCCGACGGT GAAACGCGCT GGGTATGGGA AGGCGGCATG ATGATATCCG GCAGCGGCAC GTCACCCTGC CGGTCGTCCG GCGACAGCAT CTGCGTTATC GAAGGATTCA TGACCGATAT CACGGCACGG ATGACCGCGC GCACCGAGCT GGAACGCCTG AATGAAGAAC TTGAAGAGCG GGTTGCTGCA CGCACCGCCG AACTGCAGAC ATCGCTGGAG CATCTGCGTC AGGCTCAGCA CCAGCTTGTG GAAACAGAAA AAATGGCAGC GCTGGGCGGA CTGGTGGCCG GTGTGGCGCA CGAAATCAAC ACTCCCATCG GCATCGGAGT GACCAGCACA TCCTATCTGC AGGAAAAAAT GCAGCAGCTT GAAGAGCTGT ACAGATCCGG CGGCATGAAA CGTTCGGATA TTGAAAACTT TCTGCGCGTG GGCAACGAGT CCCTGACCGC CACCCGCATG AATCTTTCCC GCGCGGCGGA TCTGGTACGC AGCTTCAAGC AGGTTGCCGT TGATCAGTCC GATGAAGATA CCCGCCGCTT CAACGTGCTG GACTATCTTG AAGAAGTGCT TGTCAGTCTG CGTCCGCGCT ACAAACGCAC TTCGCACCGC GTCGAACTGT CCGGCGACAA GGACCTTGTT ATCACCAGCT ATCCCGGCGT GTTCATGCAG ATAGTCACCA ATATTCTGAC AAACGCCCTG CTGCATGCCT TTGACGGCAT GGAAAACGGC GTGCTGAGCA TACACGCCGT CCGTCAGGGA AATGACCTGA CACTCACCCT TGCCGACAAC GGCAAAGGCA TGACGCCGGA AATTCTTTCA CGCGTGTTCG AACCGTTTTT TTCCACACGC AGAGGCAACG GCGGCACCGG ACTGGGCATG CACATCGTAT ACAACCTTGT AACACGCAGA CTCAAAGGTA CCGTGCAGTG CCGCAGCACT CCGGGCGACG GCACCACATT CACCATAACC GTGCCCATGC AGGATTCCCC CTCTGCAGTA CAGACATAA
|
Protein sequence | MPLPLSAPQT TRTVQCPPCA PDNAMHPRAV LPALFLLAAA VFFLCWAPPR NASAQPPPGT GRTHFLLVHS YHPNMLWVQD INSGVQEVLG PAIDASNGTM RLSVEFMDTK RHPDAAYRAD ILALLRDKYA QDPPDVIITA DNIALNTLLD EHEAIAPGSR VVFCGYNNYT PDALRGHDDV TGIAEKVSIR ETIQAAAAML PQTSRFIVIS DRSETGRAMT HELALQTAGI PEHGRMELWD AYTFADLGRQ LAALQGTETI VLLSALQDTA GNVQSYRHSL RHILAATDRP VFAVFGFYAD KGIVGGKLTD GVMQGREAAR MALRIAQGTP PSRIPVITEN INRFVFNHEL LARYGISSDS IPAESTVLNR PPSFFTRYRQ VLLPAGVLVA LLVLALALES HHLVRQRAVE QSLREAKTRY RELVDNARSV IMHIDRNGQI EFINEYGLSF FGYEEHELTG KSVAATILPQ NADGTPYSEL LRRILENPET YACHENENIR KNGERVWISW LNRPLRDASG NVTGLLSAGQ DATERKKAQD ALAERERSYS VLLSNLPGMA FRRQTGGDGT FLFASEGCLP LTGFAPDDFV QHGRNLRGLA HPEDLPRITG AIDASPHRYA VEYRIIRSDG ETRWVWEGGM MISGSGTSPC RSSGDSICVI EGFMTDITAR MTARTELERL NEELEERVAA RTAELQTSLE HLRQAQHQLV ETEKMAALGG LVAGVAHEIN TPIGIGVTST SYLQEKMQQL EELYRSGGMK RSDIENFLRV GNESLTATRM NLSRAADLVR SFKQVAVDQS DEDTRRFNVL DYLEEVLVSL RPRYKRTSHR VELSGDKDLV ITSYPGVFMQ IVTNILTNAL LHAFDGMENG VLSIHAVRQG NDLTLTLADN GKGMTPEILS RVFEPFFSTR RGNGGTGLGM HIVYNLVTRR LKGTVQCRST PGDGTTFTIT VPMQDSPSAV QT
|
| |