Gene Dvul_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2233 
Symbol 
ID4664255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2590048 
End bp2592291 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content63% 
IMG OID639820478 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_967676 
Protein GI120603276 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCGG TCAATACCTC CGCAACGCCG ACCCTCACCG GCTTCCTGCA ACGCAGGCTC 
ACGCTGTGGC TCATGCTGCT GGCGCTCTTT TCGCTCGCCT CGACCAGCGC CTTCCTCTAC
CTCGTGCAGA AGCGCAACTT CGAAGAGCGC AGCGTGCGAA CCACCACACT GCTCACGCGA
CAGATCAATG CCCACATAGC CCTTGCCCAG GAATCGTTAC GCGAGATGGC GCTGCGCCTA
CCACGCACTC CCAGCGAAGA CCTTCTTCGC ACCCTCACCG CCACCGATGC CCACGCCCTT
GAACTGTTCG CCCGCGTCAT CGTCCTCGAT GCCAGCGGCA CCATCGTGGC ACGTACACCA
CGCGGCCCGG TGCAGGTCGA TTTCCCGCTC CATCAGCCAC GGGGAGAGGA GTCGGCAAGC
CCCATCATCG GCAAACCCAT CCCCCTGGGG GATACAGGTG AAGTCGTGGT GTGGATCGGA
ACAGACCTGT GGACAAAAGG GCGCCTGCTC GGTGCCCTGC GGCTCGATGC CTTCAGCCGT
TCCATTCAGG AACTGCTCCC CGAAGAAAGC GAAGCCCTCA TCATCACCGA TCGCTTCGGC
AACCTCATCG CACACCCCGA CCCCACAGAA ATACAGCGTC AGGGCAACAT CGGCGACCTG
CCGTTCTTCA GGCGCCACGA CCCGCCAACC CAGGGTACGC TCAATCTCGG CGGCACCAGC
TGGGTAGCCA CCGTCGCCAC AGTGGAGCCC CACGACTGGA AGATCCTGTC GCTGAAGCGC
CGTTCAGCCC TGCTGAGCGA GGTGGCGGGA GGCATCGGGC TTCTGGTCGT CCTCCTTACG
GCCCTGTATG TCGTCTTCGC CTTCAGGCTG ATACGCGACC TCCGCCTGCG GGTGGGCGAC
CCTCTCAAGG CACTGGCTGC GTCGCTCAGG CGGGTGGCAG CGGGCGAATA CGGCCCGCCA
CTGCCGCATA GGCACGACTT CACCGAACTG GACACCATGA ACGACACCTT CATGCTCATG
TCCGACAGGG TGCGGCAACG TGAGACTGAC CTCAAACGCG AACGGGCCTT CGTCGGAACC
GTGATCGACG CCATCCCCTC CGCCCTGTTC GCCCTCGACA GGGAAGGACG CGTATCGCTC
ATGAACGCTG CCGCAGGGGA GCGCGTGGGC ATAGGTAGCG AAGAGGGCCG TGGACGCGCC
GTGGCGGAAC TGCTGCCCTT CCTCTCCCCG GTCACGGATG AGATGCTGGA TGCCATACAG
AACGGTGCAA CGTTCCGCCA TGACCGTCTT GCCCACCAGC ATGAAGGTCG TACCCTCTTC
GAAGACGTCG CCCTGTTCCC CATCGCCGAT TCGGAGACGG CCCACGCGCT GCTACGGGTG
GATGATGTGA CGGCGCGGGT CCATATGGAA GAGTTGATGG TGCAGACTGA AAAGATGATG
TCGGTGGGTG GGCTTGCCGC TGGCATGGCT CACGAGATAA ACAACCCTCT CGGGGCCATT
CTCCTCGGTG CCCAGAACAT CCAGCGCAGG CTGGACCCTG CCCTGCCCGC CAACACCGAT
ACCGCCAACA GGGTGGGCTG CCCGCTGGAA GCCATCAACG CCTATCTTGC CGAACGCAGG
ATTCCCGCCT TTCTTGAAGG CATCCGCGAG GCGGGGGCAC GCGCGGCCAA CATCGTGGCC
AACATGCTGG AATTCAGCCG CAGAAGCGAG ACACGCCATT CGACTGTTGA CTTGCGCGAC
GCGCTGGACA GGACAGTGGA CCTTGCCGCC AACGATTACG ACCTGAAAAA GAAGTACGAT
TTCAGGCACA TCATGATCGT GCGCGACTAT GACGCCGACC TGCCGCCGCT GGTCTGTTCC
GTGACGGAGA TTGAGCAGGT CATCCTCAAC CTGCTGCGCA ATGCGGCGCA GGCACTCGCT
GAGAAGCCAC CGTCAGAAGA GAGGCCACGC GTCACCCTGC GTACCCGCCG AGAAGGGGAC
ATGGCCCGCA TAGATGTGGA GGACAACGGG CCGGGGATGA CGCCGGACAT CCGCAAGCGC
GTCTTCGAAC CCTTCTTCAC CACGAAGGAT GTCGGGGTCG GCACCGGGCT TGGCCTGTCG
GTCTCCTATT TCATCGTCAC CCGCAACCAC AAGGGCACGT TCGACGTGAC GTCCGACCCG
GGGCACGGCA CATGCTTCAC CCTGCGGTTG CCGTACCGGA ACACCGGCGC ACTGATGCAG
CCGGAAGCCG TGGACACCCC CTGA
 
Protein sequence
MMPVNTSATP TLTGFLQRRL TLWLMLLALF SLASTSAFLY LVQKRNFEER SVRTTTLLTR 
QINAHIALAQ ESLREMALRL PRTPSEDLLR TLTATDAHAL ELFARVIVLD ASGTIVARTP
RGPVQVDFPL HQPRGEESAS PIIGKPIPLG DTGEVVVWIG TDLWTKGRLL GALRLDAFSR
SIQELLPEES EALIITDRFG NLIAHPDPTE IQRQGNIGDL PFFRRHDPPT QGTLNLGGTS
WVATVATVEP HDWKILSLKR RSALLSEVAG GIGLLVVLLT ALYVVFAFRL IRDLRLRVGD
PLKALAASLR RVAAGEYGPP LPHRHDFTEL DTMNDTFMLM SDRVRQRETD LKRERAFVGT
VIDAIPSALF ALDREGRVSL MNAAAGERVG IGSEEGRGRA VAELLPFLSP VTDEMLDAIQ
NGATFRHDRL AHQHEGRTLF EDVALFPIAD SETAHALLRV DDVTARVHME ELMVQTEKMM
SVGGLAAGMA HEINNPLGAI LLGAQNIQRR LDPALPANTD TANRVGCPLE AINAYLAERR
IPAFLEGIRE AGARAANIVA NMLEFSRRSE TRHSTVDLRD ALDRTVDLAA NDYDLKKKYD
FRHIMIVRDY DADLPPLVCS VTEIEQVILN LLRNAAQALA EKPPSEERPR VTLRTRREGD
MARIDVEDNG PGMTPDIRKR VFEPFFTTKD VGVGTGLGLS VSYFIVTRNH KGTFDVTSDP
GHGTCFTLRL PYRNTGALMQ PEAVDTP