Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2651 |
Symbol | |
ID | 4663196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 3091432 |
End bp | 3094353 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639820898 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_968090 |
Protein GI | 120603690 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCG TTACCGCCAC GCACATCGGA CAGCAGTCCG GCAAGGCTGC CTCACGCGGC CTCTCAGGTC CGGCCTGCCT CAACGAGATG CTTCTGTCGC TGGTTTCGTT GCGTGATGCC GACAACGCCG AACAGATCGC CGACAAGATG CGCACGGTGG CATGGCTACT GCCCCCGGCC CTCGGCGAGG GGTGCTATGC CGCGCGCATC GTGTGGCTGG GCACGGCCTA CGACCCGCCG GGCCTCCCGG CGTTTCCTCC GCACATCATC CACCCGTTCG ACGTGGCCTC GCGCGAGACC GGATCCATCG AACTGTGGCG CACGCCCGCC GCGCCGCTAC CGCCACGGGC CTATGCGCTC CTTGAGGATT GCGGGCAGAT CATAAGCTCA TGCCTGCAAC AGATCGTAAC GGTCGCCGCC ACCCGCGAAC GCGCCGAACA GTACCGCTCC ATCTTCGACT ACGTCACCGA AGCCCTTGTA CTGTACGACC TTGAAGGCAT CATCCTCGAC ATCAACGCCG CGGCCCTGCG CCTGCTCGGT GCCGACCATG AGCGACTGCG CGGCAGGTGG TTCGGAGACC TGCTCCCCCC CGACGACGCC GCAGAACTTG CAGCGCACAT CAACCGCATC CGTCTCCGGG GAATGACGTT CGCCGAGTCG CATCTACGCC GGATGGACGG CAGACCGGTG CCCGTGGAGA TGACCGACAG GCTGCTTCTG CTTCAGGGCA GGCAGGTCGT GCTCAGCAAC GCCCGCGACA TCAGCGAACG GCATTCGGCG CGTGAAGCCC TCGAGCGACG GATGGAGCGC GAGCACCTTG TGGGTGACAT CGCCACGCTG CTTGCAGGCT CCACACCGGA GGCGACGCAG GCGGCACTGA CAGCGGTGAC GGAACGGCTG GGCAACTTTC TCGGCGTCAG GCACGTATGG TTCATCGAAA TGCCCCCCGG CGAGACGACA GTGTCCTTGC GCCACGAATG GCGTGCTGCC GGGGCGACGT CGCGGAAGGG CACACTCGAT GCCACCCCGC TCAACATGCT GCCGTGGAGC CTGTGGCGTC TGACCGACTT CGAAACGGTC GCCATCGAAG ACGCAGGTTC CCTCGGCAAG GGCCATGAGC ACGAACGGGA CAGGCACAGG GAGATGGGCC TCGTCTCCAC CCTCGCCGTG CCCTTGCGCC GTCGGGCCGA ACTCGCCGGC TGCCTCGCCA TGGACGATAT CGGCAACCGC AAATGGAGTG CCGAGGACAT CGCCCTCGCG GAAACCGTGG GCGGAATGCT CGGCGCGGCC CTCGACCGCA CGGCGACCCT GCTCCAGCAC CAGCGGGCAC ATGCCCACAT CGCGGCCATT CTCGACACGC TCCCGGCGCA GGTCGCCGTG GTGGACGGAA CCGGACGCAT CACCCACGTG AACGCCTCGT GGCTTCATGC CGCGGCGGAC ATGTCCCTGC CCGAACCCAT GCGCTGCCTG CCGGGGGCCG ACTATCTCTC CGCCCTTGAC GCCAAGACGC CGGACATCCC CTCGGCGGGC GATGCGGCGA CGCTGCTGCG CGAGATTCTG GCAGGACGAC GCGAGGGTGG GTCGCTCGAA TACGAGGTCA TGGCCGACGG ACGGCGTCAC TTCATGTTGC AGCTGGCACC CCTGATGCCA CCCCTGACCG GTGCGGTGCT CATGCGTAGC GACGTCACCG CGTTGCGCCG TGCAGAGGCC GACCTCGCCC GCAGCGAGGT TCGTTACAGG ATGCTGCTGG ACACCATGCA GGAAGGACTG CTCTTCACCG ACGCCGCAGG CCGCATGACC TACATCAACG GCCCGTTCTG CGCCATGGTG GAGCATGATG CCGATGCGCT GGCCGGACGC GAGGCACTCG ACCTTGTGGC CCCCGAAAGT CGTCAGGCGT TTCAGAACCT TCTCTTCCCC GGAGAAGCCC CTCACTCACT GCAAGAGATA ACATGGCTCA CTTCCAAGGG CGGACGGGCG TTCTCCCTTG TCTCTCCCTC CATCCTGCGA GACAGCCACG GGCATTTCAT CGGGCTCACG GCCATGATCA CCAACATCAC GCAACGCCGC ATCCTCGAAA GCCAGCTGGC GCAGACCCAG AAGCTGGAGG CCGTGGGCAG TCTCGCCACG GGCATCGCCC ATGAAATCAA CTCCCCTGTC CAGTACCTCG GCAGCAATCT GACGTTCCTG CAACATGCGT TCGACGAAAT CATGAAGGCC TATACGACCT GCGACACGGC CCTCCGCACA GCGCGAGACG GTGGCAGCGA CGTGCCCTCC ATCGACGCGG CGCTCGACGC CATGCAGCAC CTGGACACCG CCTACCTGCG CGACGAAGCC CCGCGGGCAC TGCGCGAATG TCTTGAGGGG GTCGAGCATA TCGCCGCCAT CGTGCGTTCC GTCCGGCAAT TCGCCCACCC CGGCAATGGC ACCGTGGTCC CCGTCGACAT CAACTTCAAC ATCGAAAGCA CCGTCAACGT CGCCCGAAGT TCATGGCGCC GCGTGGCCTC GCTTCGACTC GACCTCGCCC CCGGACTGCC TCCCGTCCCC TGTGTGCCGT CCGAGTTCAA CCAGACGGTG CTCAACCTGC TCATCAACGC CGTACATGCC ATCGAGGACC GCAAGGCCGA AGACCCGGCG CATGAGGGGA GCATCGTCAT CACATCCCGA CTCAAGGCGG GCTGGGCCGA GGTCAGCGTC AGCGACAACG GCGCGGGCAT CAAACCCGAA CACGCAGCAC ACGTCTTCGA CCCCTTCTTC ACGACCAAGC CCATGGAACG GGGTACGGGA CAGGGGCTGG CCATCGCCCA CGCCTGCATC GTCGGCAGAC TGGGGGGCCA ACTGTTCTTC CGAAGCGAAC CGGGCCTCGG TTCCACGTTC TTCATCCGTC TGCCCTTCAC TTCGCCCTCG CAGGAGGCAT GA
|
Protein sequence | MTTVTATHIG QQSGKAASRG LSGPACLNEM LLSLVSLRDA DNAEQIADKM RTVAWLLPPA LGEGCYAARI VWLGTAYDPP GLPAFPPHII HPFDVASRET GSIELWRTPA APLPPRAYAL LEDCGQIISS CLQQIVTVAA TRERAEQYRS IFDYVTEALV LYDLEGIILD INAAALRLLG ADHERLRGRW FGDLLPPDDA AELAAHINRI RLRGMTFAES HLRRMDGRPV PVEMTDRLLL LQGRQVVLSN ARDISERHSA REALERRMER EHLVGDIATL LAGSTPEATQ AALTAVTERL GNFLGVRHVW FIEMPPGETT VSLRHEWRAA GATSRKGTLD ATPLNMLPWS LWRLTDFETV AIEDAGSLGK GHEHERDRHR EMGLVSTLAV PLRRRAELAG CLAMDDIGNR KWSAEDIALA ETVGGMLGAA LDRTATLLQH QRAHAHIAAI LDTLPAQVAV VDGTGRITHV NASWLHAAAD MSLPEPMRCL PGADYLSALD AKTPDIPSAG DAATLLREIL AGRREGGSLE YEVMADGRRH FMLQLAPLMP PLTGAVLMRS DVTALRRAEA DLARSEVRYR MLLDTMQEGL LFTDAAGRMT YINGPFCAMV EHDADALAGR EALDLVAPES RQAFQNLLFP GEAPHSLQEI TWLTSKGGRA FSLVSPSILR DSHGHFIGLT AMITNITQRR ILESQLAQTQ KLEAVGSLAT GIAHEINSPV QYLGSNLTFL QHAFDEIMKA YTTCDTALRT ARDGGSDVPS IDAALDAMQH LDTAYLRDEA PRALRECLEG VEHIAAIVRS VRQFAHPGNG TVVPVDINFN IESTVNVARS SWRRVASLRL DLAPGLPPVP CVPSEFNQTV LNLLINAVHA IEDRKAEDPA HEGSIVITSR LKAGWAEVSV SDNGAGIKPE HAAHVFDPFF TTKPMERGTG QGLAIAHACI VGRLGGQLFF RSEPGLGSTF FIRLPFTSPS QEA
|
| |