Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0991 |
Symbol | |
ID | 7172886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 1204387 |
End bp | 1206210 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643539497 |
Product | signal transduction histidine kinase, nitrogen specific, NtrB |
Protein accession | YP_002435414 |
Protein GI | 218886093 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3852] Signal transduction histidine kinase, nitrogen specific |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 0.964598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAGC GTTCTGACGA CCTGCGGGCA AAGCTCATCG GGCTCGGGCA GGCCTCCACG CGCAAGAGCT ACTACCCGGA ACTGAAAGAG CGTCTCCAGC AACTGGAACT GTTCCGCGCC CTGCTGGAAA GCGCAGGCGA CGGCGTGCTG CTGCTGGCCG CGGAAACCCT GCTGGTGCTG GACGCCAACG GCGCTGCCTG CCGCTTCGCG GGGCTGCCAT GGGAAGCGCT GACCGGTCGC CCGCTGGCGA CGGCGTTTCC CGAACTGGCC GGGTGGAAGC CACATCTCGC AGGCAACGCG GAAGGCTTTG TGCCGCAGGA CGGACCGCGC ACCGGCGGCT GGCACATGGC CGATGCCGCC CCCCCTGCCG GGGGCGAGGA ACGCCGCCTG ATCCGCCAAC CCTCGCACGG GGCCGACCTG TGGCTGGAAC TGGCCCTGAA ACGCCACGAC ATCGGCGGCA TGCAGTATGT CTCGTGCATC GCGCGCGACG TGTCGGAACG GCTGCGCGCC GAAGAAGAAC TGCGCGAGAG CGAGCGCAAG TACCGCGCGC TGTTCGAGGC GGCCAACGAC GCCATTTTCC TGATCCGCGA CGGCCTTGTG CGCGACGCCA ACCGCAGGGC CTGCGAGCTG TTCGGCAGAT CCCTGGACCA GTTGACCTCC ATGGCCCCGC GCGACCTTTC GCCGCCAGAG CAGCCCCGCT CCGGGGCCAC CCACCTTGCC GAGGACATCC ACCTGGCCAA GGCCCAGGCC GGAGAGCCGC AGTTCTTCGA ATGGACGCAT CTGCGCGCGG ACGGCACCCC CTTCGACTGC GAAATTTCCC TCAGCCGCTT CACCCTGCGG GGCGAGCCAT GGCTCATCGC CATCATCCGC GACGTGACCG AGCAGAAGAA ACTGCGCGAA ATCATGGTCC AGACGGAAAA GATGATGTCC GTGGGGGGCG TGGCCGCCGG CATGGCCCAC GAGATCAACA ACCCCCTCTC GGCCATCGCC CAGTCCGCCC AGAACCTGCA ACGCCGCCTG ACGCCCGGCG TGCCGGACAA CGAAGCCGCC GCCCTGATAT CCGGAGTGGA CCTGGAACGG GTGCAGCACT ACTTCGAGTT GCGCGGACTG GCCAAGCTGG TGAGCAACAT CCGCGAGGCC TGCGCCCGCG CGGCGGACAT CGTGCGTCAC ATGCTGGACT TTGCCCGGCG CAGCGACGCA GAACGCGCCG CATGCGACCT TGCGGTACTG CTGGACCGGG CCGTGGAACT GGCGGAGAAC GATTACAACC TCAAGAAGCG CCACGATTTC CGCAACATCC GCATCGTCCG CGACTACGCG CCCGGCGCTC CCATGGCGGA ATGCGCTCCC ACCGAAATCG AGCAGGTGTT GTTCAACCTG CTGAAAAACG CCTCGCAGGC CATTTCGCAA AAGGCCTACC CGCAGGGCGA GGCGCCGTGC ATCACCTTGC GCGTCGGCAC GCGGTGCGGC GCGCCGGGCC TGCCCGGCGG CCTCAGCGGC CTCGGCAGCG CGGACGGTGT CATGGGGATG GGGGAAACCG AAGACCCGGG CGAACCCGAG GCGGCGGACG CCGCACCGGC CAACACGGAA TACGTCTGCG TGCAGGTGGA GGACAACGGC CCGGGCATGG ACGAGCGCAC GCGGGTGCGG GTGTTCGAGC CGTTCTACAC CACCAAGCCG CCGGGCGAAG GCACGGGGCT GGGCCTTTCG GTGTCGTACT TCATCATCAG CCGCAACCAT GGCGGGCACA TCATGGTGGA ATCGCAGCCG GGGCAGTGGT GCCGCTTCAC GGTGCTGCTG CCCGCCCTAC GGCGTGCGGC GTAA
|
Protein sequence | MPKRSDDLRA KLIGLGQAST RKSYYPELKE RLQQLELFRA LLESAGDGVL LLAAETLLVL DANGAACRFA GLPWEALTGR PLATAFPELA GWKPHLAGNA EGFVPQDGPR TGGWHMADAA PPAGGEERRL IRQPSHGADL WLELALKRHD IGGMQYVSCI ARDVSERLRA EEELRESERK YRALFEAAND AIFLIRDGLV RDANRRACEL FGRSLDQLTS MAPRDLSPPE QPRSGATHLA EDIHLAKAQA GEPQFFEWTH LRADGTPFDC EISLSRFTLR GEPWLIAIIR DVTEQKKLRE IMVQTEKMMS VGGVAAGMAH EINNPLSAIA QSAQNLQRRL TPGVPDNEAA ALISGVDLER VQHYFELRGL AKLVSNIREA CARAADIVRH MLDFARRSDA ERAACDLAVL LDRAVELAEN DYNLKKRHDF RNIRIVRDYA PGAPMAECAP TEIEQVLFNL LKNASQAISQ KAYPQGEAPC ITLRVGTRCG APGLPGGLSG LGSADGVMGM GETEDPGEPE AADAAPANTE YVCVQVEDNG PGMDERTRVR VFEPFYTTKP PGEGTGLGLS VSYFIISRNH GGHIMVESQP GQWCRFTVLL PALRRAA
|
| |