Gene Dvul_1658 details


Gene Information

Locus tag: Dvul_1658
Symbol:
ID: 4663842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 1966142
End bp: 1967881
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 63%
IMG OID: 639819897
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_967102
Protein GI: 120602702
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID: [TIGR00229] PAS domain S-box
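
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 1966142
end_bp = 1967881
gene_length_bp = 1740
protein_length_aa = 579

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, all three checks agree with the values listed in the record.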


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0362267
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.337398
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAATCG CCGACAAGAA TGCCGAACGC AGCACCATCA TCATCGCCGT GCTCACGCTC 
GTTACCATCG GGCTTTCACT CATCGTCACC ACCGGGCAGA CCCTGCGCCA GCAGGAGGAG
GCCGGGACAC AGTACCTGAC CATGACCTCG CGCTCCGTGC TACAGGCTGT CGAGAGTTCA
CTCCGCCGGG GGCTGTTCGT TCGTCCCACG GGGCCCGGCC TCTTCAGTCC GGGAACGGCA
GAGTTCTTCA GGGAACTTGA GCAGACCGGA GACGTACTCT TCGTCGGGGT CATCGACCGT
AACGGAGGCC GGGTGCTGTC GTCGCGTCCC GCACAGGAGG CCGGGCCTCT TGCCTTCCCC
CCGGAAGCGT TGCAGCAGCT TGCCCAGAAG GGCGAGTGGT ATGGCAGGGC GACGTTCGGT
GCACGTCAGA CCTATGTTTA CGGCAAGCGT ATCGTCCCCG GCAGGGGACT CGCGCAGGAT
GAGGACGAGT TGCCGACCTT TCTGGTGGTC GGGCTGGACA TGACCAAGCA CCTCGGCGTC
TACAATCGCT TCCGCCAGAA TGCCCTTTTT CAGGCTGCCT ACATCCTCGC TGCTGCCGTG
TTCATCTGGA CGCTGGCCAT GAGCTTCCTC AAGCGACGTG AACAGGCCGG GCGCGCGGCC
GTTCTCGAAC GCTTTCAGGC GCGTCTTCTC GACAACCTTC CCGATGGGCT CATCACCGTT
TCACGGTCTG ACACCATCAG TGCTGCCAAT CCCGCTGCCC ACGGCATTCT CGGCATCGCC
CCCGGAAGGC TGGCGGGCAT GTCCATAGCC GAACTGCCGG AAGGTATCGT CGCCCCTGTC
GCGCGGGTTG CGGAAGGGGG CTCTTCCGCT GAACCGGAGG CGGCAGCCGG GCGATGGTTC
ACCCGTTCAG TCGGGGGGGC GCACCTTGAA GTCCTTACCC TGCCCATCCG GCAGGGCGAT
GATGATCACG ACAGGCTGGT CATCATCCGT GACCGTACGC AGATTCGGGA ACTTGAGAAG
AACCTCAGTG AGGCCGAAAA ATTGGCGGCG CTCGGGACTC TGGCGGCAGG CGTCGCCCAC
GAGATTCGCA ACCCGCTGAG TGCCCTGCGC GGCTTCGCCC AGTATTTCGC GAAGAAGCTC
GTGGGGCGTC AGCCCGATGA AACCTACGCC CAGACCATGG TGCGCGAGGC AGACAGGCTC
AATCGCGTCA TCACCGACCT GTTGTACCTC TCGCGGCCGC GGGCCATGGA TGCGCGGGCC
GTGAGCCTCG GTCAGCTTGT GGCAGACATC GGCAACCTTG TCCGTTTCGA CCTCGAGAAG
CGCAATATCG CCCTGAAGGT GAGTCTCGGA CCGGATATCG TATGGGCCGA TGAGGACGCC
CTCAAGCAGG CGGTACTCAA TCTCGTTCTC AACAGCATCG ACGCGCTGGA GGGACTGGCG
ACCGGCGAAC GCGAAATCCA TGTCTCCTCT GCACAGGGCG ATGGCGGCAC GTGGGTTTTC
GTTGGCGACT CCGGACCGGG GATGAGTGCC CTGCAACGCG AACAGGCTTT CGAGCCCTTC
TTCACCACCA AGAAGAAAGG GACGGGGCTT GGGCTGGCGC TCGTCCACAA GACCATGCGC
GAGCATGATG GTCGCGCCCA GATAGATTCG GAGATGGGAC GCGGGACCAC CGTTTCACTC
TTTTTTCCGG GCAATGAGGG CGTCGTGCCC TGTTCCCCTG ACGTGGAGGT TCGCAAGTGA
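
The gene sequence above is printed in space-separated groups of 10 bases, so the whitespace has to be removed before any computation. As a hedged illustration, the sketch below shows one way to compute a GC fraction of the kind reported in the GC content field. The function name `gc_content` is hypothetical, and the exact method the database uses to derive its percentage is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    # Remove spaces and line breaks left over from the grouped display format.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Short demo on the first two 10-base groups of the gene sequence only.
# Paste the full sequence here to compare against the reported GC content.
demo = "ATGGAAATCG CCGACAAGAA"
print(f"GC fraction of demo fragment: {gc_content(demo):.2f}")
```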
 
Protein sequence
MEIADKNAER STIIIAVLTL VTIGLSLIVT TGQTLRQQEE AGTQYLTMTS RSVLQAVESS 
LRRGLFVRPT GPGLFSPGTA EFFRELEQTG DVLFVGVIDR NGGRVLSSRP AQEAGPLAFP
PEALQQLAQK GEWYGRATFG ARQTYVYGKR IVPGRGLAQD EDELPTFLVV GLDMTKHLGV
YNRFRQNALF QAAYILAAAV FIWTLAMSFL KRREQAGRAA VLERFQARLL DNLPDGLITV
SRSDTISAAN PAAHGILGIA PGRLAGMSIA ELPEGIVAPV ARVAEGGSSA EPEAAAGRWF
TRSVGGAHLE VLTLPIRQGD DDHDRLVIIR DRTQIRELEK NLSEAEKLAA LGTLAAGVAH
EIRNPLSALR GFAQYFAKKL VGRQPDETYA QTMVREADRL NRVITDLLYL SRPRAMDARA
VSLGQLVADI GNLVRFDLEK RNIALKVSLG PDIVWADEDA LKQAVLNLVL NSIDALEGLA
TGEREIHVSS AQGDGGTWVF VGDSGPGMSA LQREQAFEPF FTTKKKGTGL GLALVHKTMR
EHDGRAQIDS EMGRGTTVSL FFPGNEGVVP CSPDVEVRK