Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_1156 |
Symbol | |
ID | 4663056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1409329 |
End bp | 1412598 |
Gene Length | 3270 bp |
Protein Length | 1089 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639819386 |
Product | CheA signal transduction histidine kinases |
Protein accession | YP_966603 |
Protein GI | 120602203 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.254693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.262443 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAGG AATATATGGA TCCGGAAATA TTCGCCGATT TCATCGTCGA AGCCAAAGAG CATCTCGAAA CCATCGAGCC GAACCTGCTC GAACTCGAGA AGGCTCCTGA CAACCTCGCG CTGCTGAACG AGATCTTCAG GCCGATGCAC TCGCTCAAGG GGGCATCGGG CTTTCTGGGC CTCAACCGCA TGAATCAGCT GGCCCACAGG GCGGAGAACA TCCTCGATGA ACTCCGCAAG GGTGCCATGG TCGTCACATC CGAGATCATG GACGTCATCC TTGCCGTGAC AGACGCCTTG CGCCAGATGA TCGACAACCT CGAAAGCTCC GGGCAGGAAG GCGACGTTGC CATCGAGTCG CTCATCACCA CCATTGATGC CATCATGGCC GGTGGCGGCG CGGTTCCTGC TGCCCCCGCA GCCCCCGCTT CACCTGTTGC TCCTGTCGAG CCGGTGGCGG CGGTCACTCC GGCTGCCGAT GAAGGGGCGG TCATGCCTGA CGGCATCGTC GCCACGACTG TCGACGAGCC TTCAGACGGT ATGTCGGCGT CGGATGCAGC GGGAGGCGTC GGCGACAACG TGCAGGGGCC GTCCGTGCAG GATGCCGCCG CACCCGTCGC TTCCGGAGAC AGCGGAAACG AATCGGGAAT GACCGAAGGT GAGAGTATCG CGGCATGGAT TGCCGCCCTG CCCGACACGG AGTCGTATGC CCTCACTGCC TTCGGTGAGG CGCATCTCAA GGACTTCATC GATGAAGCCC GCGAGATGAC GGAGCAACTC GGTACCGGGC TTCTCGAACT CGAACGGCAC CCCGACGAGA AGGATGACCG CGTCAACGAC CTGTTCCGCT TTTTCCATAA TCTCAAGGGA AACAGCGGCA TCATCGGCTT CCGCGAGCTG AACGGCCTCA CCCATGAGGC AGAGACGCTC CTGAACAACA TCCGGCGCGG CGAGATGCAG TCTTCGGCAG AACTCGTCGA CCTGCTCCTG TTCGTGGTCG ATGTCATCGA CGCGCTCGTG AATCGCATCG AGACGGCATC GGGCATGGTG CACGGATTCG ATATCTCCGA AGTGCTCGAA CCGCTTCAGC AGGCCGTGGC TGGCGGCGAG GCCCGCCTGC CCGTGGGTAT CGTGGGGATG GCGAAGCAGG CCGCATCACA ACCAGCGGCT GAAAGCACCG CCACCACTGA AGGCAGCATG CAGGGTGGTG CCGATTCAGC CGATGAGTCC GCCGCCCCCG CCAAGGGCGA GCAGCAGAAG GCAGCGGCCT GCGACCCCTC GGCGACCTAT GATGCTGAAG ACCTCGATGT CTTCAAGGCG ACCGTCGCCC AGCAGATGGA CAATATCCGG GTCGGTCTGG ATGAACTGGC ACGTGATGCC AGCCAGAAGG AATATGTCGA CGCGCTCTAC CGTTTTCTGG TCACCGTGCA GAACTCGTGT GCCTACATGG GGGCCGAGGA TGTTCGCCTC TATGCGGAAC GGACAGCCGG GCTCGTCGAT CAGGCGCGCA ATGCCGATCT TGATTTCGGG CTGATGGTCG ACCTGCTCCG GCAGGAGACC TCGATCATCG GCGATATGCT CGGCAAAGAG ATAGCCCGCA TGGAGGACTG CCTCAACGGT GGAGACGCCG AATCCGAGCA GGGCGAGGCC CCTGCTGCGG CTACAGTCGG TACACCGTCT TCGGCAGCGA CGACTTCTGC ATCGGCTCCC GAGAGCGGCA GCGTCGCGCA GGGGGCACCT TCATCGGAAA AACCGGCCCA GAAGCCGGAG CCCGTCCCGG CTGCGCCGGC ATCCCAGAGC GCGGCATCTT CTGGTGCTCA GGCCGCCGCT TCGGCTCCTG CCGGGCAGGC TGCGGCCGCC GCATCGGCAA AACCCGCAGC CCCTGCCACA TCCGGGACAG AAGCGAAGCC CGTAGCGGCG GCGACAACTC CGCCCAAAGC GGCTGCTGCG TCATCTGCCG CGCCTGCCAA GGCCGTAGCG ACTCCGCCCC GTCCCGCCGC CCCGGCCCCC GCAGCGGCGT CGGCCCCGGC AGCCGCTCCT GCAGGCGGCC CGCCCGCAGG TGGCAACAAG GCGTCTTCCA CCATTCGCGT CGACCACGAG AAGCTCGACC ACCTGATGAA TCTCATCGGA GAGCTCATCA TCAACCGCAA CCGCTATTCC ATGCTTGCGC GGCACCTTGA AGAGGGTGGT GCGTCGGTCA ATGTGGCGGA GGTGGCACAG AACCTCTCCG AGACGACGTA TGCCATGGCG CGCATATCCG ACGATCTTCA GGACACCATC ATGAAGGTCC GCATGGTGCC TGTCTCTTCG GTGTTCTCCC GCTTCCCGCG CCTTGTACGC GACCTTTCCC GCAAGAGCGG CAAGGAGGTC GACCTCGTCC TTGAAGGCGA GGAGACTGAA CTCGACAAGA GCGTCGTAGA AGTCATCGGC GACCCGCTGG TGCACCTCAT CCGTAACTCG GTGGACCATG GCGTCGAACC CGAAGAGGAA CGTATCGCCA AGGGCAAGAA GCCGAAGGGC GTCGTCACCT TGCGCGCATT CCACAAGGGC AACTCGGTCG CCATCGAAAT CGAGGACGAC GGCAAGGGCA TCGACCCTGA AAAGATGCGC GAAGTGGCTG TCCGCAAAGG CATCGTCACG CCTGAAGAAG CCAAGGCCAT GGACGACCGC GAGGCGATGG AACTCATCTT CGCTCCCGGT TTCTCGTCGG CTGAGAAGAT TACCGACATC TCCGGACGCG GCGTCGGCAT GGACGTGGTG CGTACCAACA TCAAGAACCT CAAGGGCAGC GTCAGCATCC ATTCAGAGGT GGGCAAAGGG ACTCGGTTCA CGCTCTCCCT GCCCCTCACC CTTGCCATCA TCGACGCACT CATGGTCAAT GTCGCCGGGC AGATGTATGC CATCCCGCTC GATGCCGTCT CGGAGACCAC CAAGATAGAG GCGCGGCGCC TCACCGATGT GAAGGGCCGC AAGGCGGTCA CCTTGCGTGG CGAGGTGCTT GGCATCGTCG ACCTCTCCGA GTTGCTTGCC CTGCCAAGGA GTGACGCGCA GGACGTGCTA TCGGTCGTGG TCATCCACGA CAACGACAGA CGCCTGGGCC TCGTGGTGGA CAGGCTGCTT GAACGGCAGG AGATCGTCAT CAAGCCCCTC GGAGCCTATC TGGGCGACCT GAAAGGCATC TCCGGTGCCA CCATCATGGG CGACGGTAGC GTGATACTCA TCCTCGACCC GCACGAAATC TACATGATGG CCACTTCCAA GGCCATATGA
|
Protein sequence | MTQEYMDPEI FADFIVEAKE HLETIEPNLL ELEKAPDNLA LLNEIFRPMH SLKGASGFLG LNRMNQLAHR AENILDELRK GAMVVTSEIM DVILAVTDAL RQMIDNLESS GQEGDVAIES LITTIDAIMA GGGAVPAAPA APASPVAPVE PVAAVTPAAD EGAVMPDGIV ATTVDEPSDG MSASDAAGGV GDNVQGPSVQ DAAAPVASGD SGNESGMTEG ESIAAWIAAL PDTESYALTA FGEAHLKDFI DEAREMTEQL GTGLLELERH PDEKDDRVND LFRFFHNLKG NSGIIGFREL NGLTHEAETL LNNIRRGEMQ SSAELVDLLL FVVDVIDALV NRIETASGMV HGFDISEVLE PLQQAVAGGE ARLPVGIVGM AKQAASQPAA ESTATTEGSM QGGADSADES AAPAKGEQQK AAACDPSATY DAEDLDVFKA TVAQQMDNIR VGLDELARDA SQKEYVDALY RFLVTVQNSC AYMGAEDVRL YAERTAGLVD QARNADLDFG LMVDLLRQET SIIGDMLGKE IARMEDCLNG GDAESEQGEA PAAATVGTPS SAATTSASAP ESGSVAQGAP SSEKPAQKPE PVPAAPASQS AASSGAQAAA SAPAGQAAAA ASAKPAAPAT SGTEAKPVAA ATTPPKAAAA SSAAPAKAVA TPPRPAAPAP AAASAPAAAP AGGPPAGGNK ASSTIRVDHE KLDHLMNLIG ELIINRNRYS MLARHLEEGG ASVNVAEVAQ NLSETTYAMA RISDDLQDTI MKVRMVPVSS VFSRFPRLVR DLSRKSGKEV DLVLEGEETE LDKSVVEVIG DPLVHLIRNS VDHGVEPEEE RIAKGKKPKG VVTLRAFHKG NSVAIEIEDD GKGIDPEKMR EVAVRKGIVT PEEAKAMDDR EAMELIFAPG FSSAEKITDI SGRGVGMDVV RTNIKNLKGS VSIHSEVGKG TRFTLSLPLT LAIIDALMVN VAGQMYAIPL DAVSETTKIE ARRLTDVKGR KAVTLRGEVL GIVDLSELLA LPRSDAQDVL SVVVIHDNDR RLGLVVDRLL ERQEIVIKPL GAYLGDLKGI SGATIMGDGS VILILDPHEI YMMATSKAI
|
| |