Gene Dvul_0016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0016 
Symbol 
ID4662653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp24392 
End bp26431 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content67% 
IMG OID639818209 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_965467 
Protein GI120601067 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.225702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCAT CCACCGTCCT CGAACGTCTG CGTAGCCGCC TTGCCTCGCC ATGGTCGCTG 
GTGCTTCTCG CCGCCTTGCT GGTGGCTGGC ATCGTGGCCG TCCTGCTGGG GTCGTATCAT
GGTGCGGCAT CGACCATGAC GCGCATCCTC ACTGAAAAGG GCGCGTCGCT CATCCGTTCG
TTCGAAGCGG GAGCACGTAC GGGCATGAGG CACAGTGCCG GGTTGCGCCT GCAGATACTG
CTTGAGGAGA TGGCAGACCA GCCTGACATC CGTTTCATCG CCGTGGTCTC GCCCGAGGGC
GAGGTGCTGG CGCATAGCGA CATGACCCGC GTGGGCACAC GCCTGTTCAA CCCTGACGAA
CTTCAGAGGC TGGATGCCGG GACAGAACCC CGCTGGACGC TCCTGCCCGG TGACGGTGGG
GACGATTTCG TGGTCTTTCG CCTGTTCGAC CCGGTGCGCC GCCGTGCCAT GTTCACACCC
GAAGGCGGGC GCATGGCCCC GCCTCCGCCA GACGATGCAA GGCTGGAGCA TCCCGACGAC
CAGCGGCCGC GTTCACGGGC GGAGCGTTTC GCCAGATGGC GCGAACGGCA AGAAGGCCTC
TCCCCCTTTC CGGGAAGCCT CCCCTTGCCG GAAGATGGTC GCGGAGACAC CGGCAGGCGT
GAACATGGCG GCCACGGGAT GCACGGCGGG TTCGACGCCT CGTGCATGCG AAGCCCGCAG
GGCCCTCCTC CGGTCATCCT CGTGGGGCTG GATATGCAGC CCTTTGCCGA GGCGCGCGCG
CAGGACCGCA ACCATGCCCT GCTCATGCTG GCACTGGCTG GTGGCGTAGC TGTCGCCGGG
GTCCTGTCGC TGGTGTGGTC ACGACGTGAA CAGGCGGCAC GGCGTCGGCT GATGGCGTCG
CAGGCGTTCG CGGCCAAGGT GGTGTCGAGC CTGCCTGACG GGCTTGTGGC CTTCGACCGC
GAAGGGCGCA TCGACGAATG CAACGCAGCC GGGGCCGACC TGCTGGGTGT CGGCGCTTCG
TCCGCACTGG GACAGGGCGC GTCATGGCTT CCTGCCCCGC TTGATGCCAT GGCCGCGTAT
CTTCTCGGTG GCGGTGTTCT CGCCCCCACC GAAGTGGAGT GCCGCCGTGC CGACGGAACG
AGCGTACCAC TGGGCGTGCG GGGCGCGCGC ATCGTGGACG ATGAAGGCCG TGCTGTGGGT
GTCATCCTGC TGTTGCGCGA CCTCAGCGAG GTGCGGCACC TTGAGGCGGA AGTGCGCAGG
CGCGAGAAGC TTGCAGCCGT GGGTAACCTT GCCGCTGGCG TCGCGCACGA GATACGCAAC
CCGCTCAGTT CGATAAAGGG CTACGCCACC TACTTCGGTA CGCGCTTCCC TGAAGGCAGC
GACGACCGTG AGGCCGCCCG CGTCATGGTG CAGGAGGTCG ACAGGCTCAA CCGCGTCATC
TCCGACCTCA TCGGCCTGTC GCGTCCATCC GACATCCGGC CCCGGCCCAC CCGTGTGGCG
GACATCGTCG ACCACGCCCT GCGACTGGTG CGTCCGGATG CCGAGGCCCG CAAAGTCGCC
GTGCGTTTCG AGGCCGCGCC GGATGTGCCC GAGGCCCTCG TGGACCCCGA CCGCTTCGCG
CAGGCGTTGC TCAATGTCTG CCTCAACGGC ATGGAGGCCA TGGGAGACGG GGGTGAGCTT
ATGGTGACGG CGTATCGCCA CGGTGATGGA AGGGTCGCGG TGTCGGTGCG CGACACCGGG
CCGGGCATCT CACCGGAGAA CCTGTCGCGG GTCTTCGACC CCTATTTCAC GACCAAGGGA
CAGGGCACGG GCCTTGGCCT CGCCATCGTC CACAAGATCA TCGAAGCCCA TGGGGGCGAG
GTGGGCTTCC GCTCCGAACC GGGGCACGGC ACCGAGTGCA CCTTCATACT GCCCGCTGTC
CATGCAGGGG CGACACCGGA CATCCGGGCC GATGCGGCAG GCGCGGGTAA GCCCCGTGCG
GCGACATCCG CGGGCATATT AGCGGGCACA CCCGATCAAG ACACGGACAA GGGGGAGTAG
 
Protein sequence
MTPSTVLERL RSRLASPWSL VLLAALLVAG IVAVLLGSYH GAASTMTRIL TEKGASLIRS 
FEAGARTGMR HSAGLRLQIL LEEMADQPDI RFIAVVSPEG EVLAHSDMTR VGTRLFNPDE
LQRLDAGTEP RWTLLPGDGG DDFVVFRLFD PVRRRAMFTP EGGRMAPPPP DDARLEHPDD
QRPRSRAERF ARWRERQEGL SPFPGSLPLP EDGRGDTGRR EHGGHGMHGG FDASCMRSPQ
GPPPVILVGL DMQPFAEARA QDRNHALLML ALAGGVAVAG VLSLVWSRRE QAARRRLMAS
QAFAAKVVSS LPDGLVAFDR EGRIDECNAA GADLLGVGAS SALGQGASWL PAPLDAMAAY
LLGGGVLAPT EVECRRADGT SVPLGVRGAR IVDDEGRAVG VILLLRDLSE VRHLEAEVRR
REKLAAVGNL AAGVAHEIRN PLSSIKGYAT YFGTRFPEGS DDREAARVMV QEVDRLNRVI
SDLIGLSRPS DIRPRPTRVA DIVDHALRLV RPDAEARKVA VRFEAAPDVP EALVDPDRFA
QALLNVCLNG MEAMGDGGEL MVTAYRHGDG RVAVSVRDTG PGISPENLSR VFDPYFTTKG
QGTGLGLAIV HKIIEAHGGE VGFRSEPGHG TECTFILPAV HAGATPDIRA DAAGAGKPRA
ATSAGILAGT PDQDTDKGE