Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_3272 |
Symbol | |
ID | 5696135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 3940172 |
End bp | 3943060 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641265892 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_001531152 |
Protein GI | 158523282 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTCTT TTTTTCATCA TTCATCATTC CGCATTCATC ATTCCTTCAA ACGGGAGAAC ACCATGAACA TCCTCATTGT CGATGACCGC GAAGAGAACC GCTACCTTCT GGAGCGCCTG CTTCAGGGAA ACGGGCACAC GGTCCGGCAG GCTGCCAACG GGGCCGAGGC CATGGAAATA CTGACGGCCG GCGGCATCGA CCTGGTCATC AGTGATATTC TCATGCCGGT GATGGACGGG TTCCAGTTGT GTCGCAAGGT GAAAACCGAC GAAACCTTGC ACGCTATTCC CTTTATCGTC TACACCGCCA CCTATACCGG CCCCCAGGAC GAGGCCTTTG CCCTGAAAAT CGGCGCGGAC CGTTTTATTC AAAAACCCTG CGAGGTCGAC GTGCTGTTAT CGGCCATCAA CGAGGTAATG GCCGTCGGTG GCGGCCGGGT GGCAGAGCCG GTACAGGAGG AGGAAGCCCT CAAACTTTAC AGTGAACGGC TGATAAGAAA GCTGGAACAG AAAATGCTTC AGGCCGAACA GGAACTCCAG GCTCGGCAGG AAGCCGAGCA GGCCCTGCGC GAAAGCGAGT CCCGGTTCCG GCTCTTGGCG GAGACCGCGC CCGTTGGTAT CATTATCGAA GACCGGGATC AGAACGTCCT GTATGTGAGC CCCACGTGTA TCTCTCTTTG CGGTTACGCG CCTGAAGAGA CACCAACAAT GGAGGCATGG TTTTCGCTTG TCTGCCCCGA CGAAACCCTG AGAAACCGGG TGCGCGCGGA ATGGGCCGCA GCGGTTGAAA CGGCAACAAA AACCGGGGTC GAAATTCAGC CCATGGAGTT TCCCGTCACC TGCAGGGACG GCACGGTTCG GGATATTGAG TTTCGCATGT CCGCCACTCA GGACCTGGAT TTTGTGGTGC TGTCCGATGT CACCAGCCGC AGGCGGGCCG AGCAGGCCCT GCGCGAAAGC GAACGCAAGT ACCGTCGGCT GGCGGAAAAT ACGTCGGACG TGGTCTGGAC CGCGGACATG AACCTGAACA CCACCTATGT CAGCCCCTCG GTGGAGCGAC TGGTGGGAGA ACCGGTCGGC CTGCACATTC AGCGAACCAT GGAGGAAAAA TTTCCCGCCG ATTCCCTGAA CCGGCTTTAT GCTGTTTTTG CCAAAGAGCT GGAAAACGAA AAAGACCCGG CCTGTGACAA AAACCGGTCC CGCCTGGTGG AGGTTCAGCA CTTTCGGGCC GACAGCAGCA CCGTCTGGGT TTCGATCAAC ATCTCGTTTA TTCGGGATAC CACCGGCCGG CCCATCGGAC TACAGGGCAT CACCCGGGAC ATCACCGAAC GCAAACAGGC TGAACAGGCG CTGCGGGAGA GCGAGGAACG GTACCGGACC ATTCTGGAAA ACATCGAGGC CGGTTACTAT GAAGTGGACC TGGCCGGCAA CTTCACCTTT TTCAACCAGG CCATGTGCCA GATTCTGGGA TATGCGGAAG ATGAGCTGCT GGGCATGAAC AACAGATCCT ACATGGACGA CGAAAACGCC AAAAAGGTGT TTCACACCTT CAACCAGGTG TTCACAAGCG GAAAAACAGC CAAAGCCTTT GACTGGGAAC TGATTCGAAA AGACAACACC CGCTGTTTTG TGGACACGTC GATTGCACTG ATGCGGGATG CCGAGAACAA CCCCGTGGGG TTCCGGGGAA TCGCCCGGGA CATTTCGGAG TGGAAACAGG CCGAGGCGGA GCGGGAAAGA CTTTCAACCC AGCTGCTTCA GGCCCAGAAA ATGGAGTCCG TGGGCCGGCT GGCCGGCGGC GTGGCCCATG ATTTTAACAA CATGCTTTCC GTGATTCTGG GATACACGGA ACTGGCCATG CACCGGGTGC CGCCCCACGA CCCCCTTTAC GAGGATTTAA GAGAAATTCT GTCCGCAGCC AAACGCTCCT CGGAAATCAC CCGGCAGCTG CTGGCCTTTG CCCGCAAACA GACCAGCAGC CCGAAAGTGA TCGACTTAAG CGACACCGTG GAGAACATGT TAAAGATGCT GCGGCGGCTC ATCGGTGAAG ATATCGACCT TGCGTGGAAC CCCGGTCCGG GTCTCTGGAC CGTGAACATG GATCCGGCCC AGATAAGCCA GATTCTGGCC AACCTGCTGG TCAATGCCAG AGACGCCATC GGCGGGGTGG GCAAAGTCAC CATCGAGACC GGCAACGTCA GCTTTGATCA GGACTACTGC GAAGATCACA ACGAATTTTT GCCCGGAGAC TATGTGGTGC TGGCCGTGAG CGACGACGGT TGCGGCATGG ACAGAAAAAC CCGGGACCGC CTGTTCGAGC CCTTTTTCAC CACCAAAGAG GTGGGGAAGG GCACGGGCCT GGGCCTGGCC ACGGTCTATG GCATTGTCAG CCAGAACAAC GGGTTTATAA ATGTTTACAG CGAGCCGGGC CAGGGCACCA CCTTTCGTAT CTACCTGCCC CGCCACGGCG GGGCCCTGTC CACCGCCCAT CAACCCGGAC CGGCCACGGC GCCCCGCGGC ACCGGAGAAA CCATTCTGGT GGTGGAGGAC GAGGCCGCCA TTCTCAAGCT GACCCAGCGG GTCCTGTCCG GGCTGGGTTA TACCGTTCTG ACCGCGGAGA CCCCGGCCCA GGCACTGAAG CTGGCCATGG AACACAGTGA CCGGATCGAC CTGCTGATCA CGGATGTGAT CATGCCGGAG ATGAACGGCC GGGACCTGGC CGACCGGCTG AACGCCCTGC AACCGGACCT CAAAATCCTC TATATGTCAG GCTACACCGC CGATGTCATT GCCCACAGAG GCATACTGGA GCCCGGCGTC CACTTTATTC AGAAACCCTT TTCCAACCAG GACCTGGCAA AAAAGGTAAA GGCCGTGCTG GGAGGATAG
|
Protein sequence | MRSFFHHSSF RIHHSFKREN TMNILIVDDR EENRYLLERL LQGNGHTVRQ AANGAEAMEI LTAGGIDLVI SDILMPVMDG FQLCRKVKTD ETLHAIPFIV YTATYTGPQD EAFALKIGAD RFIQKPCEVD VLLSAINEVM AVGGGRVAEP VQEEEALKLY SERLIRKLEQ KMLQAEQELQ ARQEAEQALR ESESRFRLLA ETAPVGIIIE DRDQNVLYVS PTCISLCGYA PEETPTMEAW FSLVCPDETL RNRVRAEWAA AVETATKTGV EIQPMEFPVT CRDGTVRDIE FRMSATQDLD FVVLSDVTSR RRAEQALRES ERKYRRLAEN TSDVVWTADM NLNTTYVSPS VERLVGEPVG LHIQRTMEEK FPADSLNRLY AVFAKELENE KDPACDKNRS RLVEVQHFRA DSSTVWVSIN ISFIRDTTGR PIGLQGITRD ITERKQAEQA LRESEERYRT ILENIEAGYY EVDLAGNFTF FNQAMCQILG YAEDELLGMN NRSYMDDENA KKVFHTFNQV FTSGKTAKAF DWELIRKDNT RCFVDTSIAL MRDAENNPVG FRGIARDISE WKQAEAERER LSTQLLQAQK MESVGRLAGG VAHDFNNMLS VILGYTELAM HRVPPHDPLY EDLREILSAA KRSSEITRQL LAFARKQTSS PKVIDLSDTV ENMLKMLRRL IGEDIDLAWN PGPGLWTVNM DPAQISQILA NLLVNARDAI GGVGKVTIET GNVSFDQDYC EDHNEFLPGD YVVLAVSDDG CGMDRKTRDR LFEPFFTTKE VGKGTGLGLA TVYGIVSQNN GFINVYSEPG QGTTFRIYLP RHGGALSTAH QPGPATAPRG TGETILVVED EAAILKLTQR VLSGLGYTVL TAETPAQALK LAMEHSDRID LLITDVIMPE MNGRDLADRL NALQPDLKIL YMSGYTADVI AHRGILEPGV HFIQKPFSNQ DLAKKVKAVL GG
|
| |