Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0820 |
Symbol | |
ID | 4664351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1005247 |
End bp | 1008252 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639819041 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_966268 |
Protein GI | 120601868 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGATG AAGCCGCAGG TTCCGCAGAG GTACAGGCCC TGCGCGAACG CATCGCACGC CTTGAGGATG AGAACCGGTT GCTGCGCCTG TGCGCGCGTT TCGAGGGCCC TGCACCCGGT GAGGCCCTTG GCAGCCTGCT ACCCGAAATC ATCGCACCCG CTTCCGACGC CCCTCTCGTC TACGCCTATG TGAGCGATAT GCAGACGCAT GAACTGCTCT TCGTCAACCG CTCGCTGACG ATGGCCGTGG GCGCATGGCA GGGCAGAAAA TGCTATGAAC TGCTGCAGGG GCGGGACACG CCATGTCCGT TCTGCACGTC GGGCAGACTG CAACGCAATC CCGAACGCCC GGTCGTATGG GAGTGGCGCA ACCCGCGGCT TGGACGGTGG TTCCGCTGCA TTGACCGCTG CATCCACTGG CCCGACGGCA GGCCCGTGCG CTACGAACTC GCCGTCGACA TCACGGACAT GCGTGAGGCA CAGGAGGACC TTCTGCGCTT TCGTGCCGCA CTGGACGCCT CGGCCGAGGC CATTTTTCTC GTCGACCTCG AAGAAGGCCG TTTCCTTGAT GTGAACAGTG GTGCCTGCAC CATGCTGGGC TACGACCGTG ACACACTGCT CGGCCTTGGC CTGTGCGGCA TACGTCGCAG CATGGCAGCG GACGGCTGCC GCCAGATTCT GGACAGCATC GCCGCCGGGG CCGTTCTCGA GAACGTCGAG ACGGTCTACC TGCACCGGGA TGGGACCTTC GTGCCCGTGG AAGTGGGTGC GCGCCTTGTC GACCGCGCGG GCGGGCCGCG CCTCGCCATC ATGGTGGCAC GTGACGTGTC GGAGAAGCGC AAGGCTCGCA GGGCCATGGA GGTGCGTTAC CTCTATGAGC ACGTGCTGTC GTCCTGTGCA CGCGAGTTGC TCTCCCGGTC ATGTTCCGAG TCGACCCTCG TGGGGGTGCT GGGCGCATTG CGTCAGGGGG CGGGGGCTTG CAGGGCCTAT ATCTTCGAGA ACTATGAGGA TGCCGAAGGC CGGCTGTGCT GCTCGCAGCG CTATGAGAGT AGCGCGCCGG GTGTGCGCCC CGAACTGGAC AACCCGGCGC TGCAGAACAT CATCTATGCG TCGGAGGCTC CCAACTGGCT GCATGAACTC CGCAAGGGGC GCGCCGTGGT GGGCCCTGTC GCAGACCAGC CGCTGCCGGA GCGTGACGTG CTTCAGGCGC AGGGCATACG TTCGCTGCTT GTCCTCCCCA TCTTTGCCCG TGGGGTATGG TGCGGCTTCG TCGGCTTCGA CGACACCCGC ACGGAGAGGA CATGGCAGGG GGGCGACATC CTCTTTCTCC AGACCGCCAG CGAGATTCTC GGCGCCGCGC TCGAACGCCA CAGGGCAGAG GCCGAACTCG CCGCGTCGCA CCAGCGGGCC GAAGAGGCGA GTCGGGCCAA GAGCGTGTTT CTCGCCAACA TGAGCCATGA GATACGCACG CCGCTCAATG CCATCATCGG TCTCACCGAA CTGACCCTTC AGGAACCCAT ATCCCAAGGC GTGGGCGAGA ACCTGCGGGG CGTGCTGCAT AGTGCCGAGG CGCTGCTGGC TGTGGTCAAC GACCTGCTCG ACCTGTCGCG GGTCGAGTCC GGGAGGTTGC ACCTCGAGAG CGTGGAGTTC TCTCCCTCGC GGCTTGCGCG GGGGGTTGTG CGGCTCATGA CCCATGTCGC CGACCGCAAG GGGCTCGACT TCGAACTCTA CATCGCCCGC GATGTGCCCC CCACGGTGAC CGGCGACCCG GCAAGGCTTC GTCAGGTGCT GCTCAACCTT GTGGGCAACG CGCTGAAGTT CACCGACGAA GGGGGCGTGA GCCTCACGGT CACCCCTTGC ATCTGTTCCA CCGACACAGG GGGGGGTATC CTCAAGGGGG GCAATGGCGG TGTGCGCGGA TTGCGGTTCA CGGTGACGGA CACGGGCATC GGCATTCCGC CGGACAGGCA GGCGTGCATC TTCGAATCGT TCGTGCAGGC CGATGAAAGC ACGGCGCGTC GCTTCGGAGG AACAGGGCTG GGGCTCGCCA TCTCTCGCAG GCTGGTGGAG ATGATGGGCG GCAGGCTTGA ACTGCGAAGC GAACCGGGCA AGGGCAGCGA ATTCTGGTGT GATGTGCCTT TCGCCTCAGA GGGTGCTGCG CGGGATGGCA TCTGCGAGGT CGCTTGCGTT GCCACTCATG ATGCCCCTTC TGGTACCGGA AGACGCCCGG AAGGGCGCGG GGCGGGTGAC AGACGGCTAG AGGCGGCACC CTTGCGGATT CTCGTCGTCG AGTCGGACCC CCTGTGCCGC AGGGCCATGG TCAAGAGCCT CGGCAGACGC GGGCATGCAG TGACGGCCCT GTCGGCATTG TCAGAGGCCG CTGAAGTGCT GGTGGCAGAA CGCCATGATG CTGTCGTGGT CGAGGCGACT CCCGGTGCGT GGGCTTTCTG CGAGTCCCTC CGTCAGGGGG GATATGGCGC GAACAGGGCG GCCTTGCCCC TGGTGTTCAT CTGCGATGGC GATACGTTCC TGCCCGAATC GGTGGAACCT CTGGCCGCTC CCCATGCGTT GCTGTCACGG CCTGTCAAGG GGCGAGCCCT GTGTGAAGCC GTCGAGCATC TTGCAGGTGC CGATGCCGGG CAGCCTGCAT CCGAGCGTGT GGCATTGCCG GAAGACGTTC CCCTGTTGCA GCCGGGGGCA TTGCTGCTGA CCGCCCCCGA TGCCCAGATG CGGTTCGTGC GGCAGGTTCC CAGTCTGCGT GAGAGCCTGT GGTCGGCCAT GGACAGGGGA AGCCGCGTCG AACTTGCCTC GCTGGCGCAT CTTCTGCGTC ATGAGGCCGA GGGAATCGGC GCCTTGCGCC TGCAGGTGCT GGCTGAACGG CTCGAGGACA GGGTTCGCAT CGGGGCGACC GAAGACGCGC GCCCGGTCTT CATGCTTCTC GCCGATGCCC TCAACCAGCT TGAGCACGAC CTGCGCAAAC TCGTCCCCGC ACTATCCGAG GACTGA
|
Protein sequence | MKDEAAGSAE VQALRERIAR LEDENRLLRL CARFEGPAPG EALGSLLPEI IAPASDAPLV YAYVSDMQTH ELLFVNRSLT MAVGAWQGRK CYELLQGRDT PCPFCTSGRL QRNPERPVVW EWRNPRLGRW FRCIDRCIHW PDGRPVRYEL AVDITDMREA QEDLLRFRAA LDASAEAIFL VDLEEGRFLD VNSGACTMLG YDRDTLLGLG LCGIRRSMAA DGCRQILDSI AAGAVLENVE TVYLHRDGTF VPVEVGARLV DRAGGPRLAI MVARDVSEKR KARRAMEVRY LYEHVLSSCA RELLSRSCSE STLVGVLGAL RQGAGACRAY IFENYEDAEG RLCCSQRYES SAPGVRPELD NPALQNIIYA SEAPNWLHEL RKGRAVVGPV ADQPLPERDV LQAQGIRSLL VLPIFARGVW CGFVGFDDTR TERTWQGGDI LFLQTASEIL GAALERHRAE AELAASHQRA EEASRAKSVF LANMSHEIRT PLNAIIGLTE LTLQEPISQG VGENLRGVLH SAEALLAVVN DLLDLSRVES GRLHLESVEF SPSRLARGVV RLMTHVADRK GLDFELYIAR DVPPTVTGDP ARLRQVLLNL VGNALKFTDE GGVSLTVTPC ICSTDTGGGI LKGGNGGVRG LRFTVTDTGI GIPPDRQACI FESFVQADES TARRFGGTGL GLAISRRLVE MMGGRLELRS EPGKGSEFWC DVPFASEGAA RDGICEVACV ATHDAPSGTG RRPEGRGAGD RRLEAAPLRI LVVESDPLCR RAMVKSLGRR GHAVTALSAL SEAAEVLVAE RHDAVVVEAT PGAWAFCESL RQGGYGANRA ALPLVFICDG DTFLPESVEP LAAPHALLSR PVKGRALCEA VEHLAGADAG QPASERVALP EDVPLLQPGA LLLTAPDAQM RFVRQVPSLR ESLWSAMDRG SRVELASLAH LLRHEAEGIG ALRLQVLAER LEDRVRIGAT EDARPVFMLL ADALNQLEHD LRKLVPALSE D
|
| |