Gene Dvul_2299 details


Gene Information

Locus tag: Dvul_2299
Symbol:
ID: 4663249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 2681476
End bp: 2682474
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 69%
IMG OID: 639820545
Product: dihydrouridine synthase, DuS
Protein accession: YP_967742
Protein GI: 120603342
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
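
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the reported 999 bp) and that the final codon is a stop codon that adds no residue. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2681476
end_bp = 2682474

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is a stop codon with no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```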


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.755764
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGGTCG GCGCGCGCAC GGCCCCCGGC AGTCTCTGGC TTGCCCCGCT GGCGGGGGTG 
GGCCATGTGG CCTTTCGTGA GGTGATCGAC AACCTCGGAG GTTGCGGGCT GCTGTTCACC
GGCATGTGCA ACGCGCGGGC CGTCCCCACG GAGAACCCGG CACGTTCCAA TGCCTTCACG
TGGCGTGCGG AAGAACTGGA GCGGTGCGTC TGTCAGTTGT TCGGTGCCGA CCCATCCGAG
ATGGCCGAGG CGGCCCGACG GGTCGAAGCC GAGGGCTTCT TCGGGGTGGA CATCAACATG
GGGTGTTCCG TGGCTGCCAT CGTGCGGCGC GGCTGCGGTG CCGACCTGCT GCGGGACGAG
GAACGTGCCG TGCGCATGGT CGAGTCCGTC CGTCGGGCCG TGGATTGCCC CGTGCTGGTC
AAGTTCCGCA CAGGCTGGAG TCCCGACCCG CAGGGTGCGG TGGCGCTCGC CCGCCGTTTC
GAGGACGCAG GGGCCGACGC GCTGGTCTTC CATCCCCGCG TGGCCCCCGA CAGGCGCACC
CGCCCGCCGT TGCGCCACCA CATCCGTCTC GTCAAGGAGG CGGTCGCCAT CCCCGTGCTC
GGCAATGGCG AGGTGTTCAC ACCGGGGGAC GCCGCATCCA TGCTCGAGAC CACGGGGTGC
GACGGCATCT CGCTGGGGCG CATCGCACTG GGGCGACCGT GGGTGTTCGC GGGGTGGACG
GGCCTTGTGG ATGACAATCC CGCCAGCAAC CCCGACCTGT GGCGCGACAT TCCGCTGGCC
CTGCTGGACG CGCTGGAACG CCGCCATGCC GACCGCAAGT ATGCGGCGCG CCTCTTCAAG
AAGTTCCTTC TCTACTTCAT CGCCAACTTC ACCTATGGCA ACAGCCTTCG CGGCCCCATG
CTCAAAGGGG ACGACCCCGC CGACCTGCGT ACCGCACTCG TGGATACGCT GGCGACACTG
CCTGCCATCA CCCGCCGTCC CAGTGCGCTG ATGTTCTGA
 
Protein sequence
MQVGARTAPG SLWLAPLAGV GHVAFREVID NLGGCGLLFT GMCNARAVPT ENPARSNAFT 
WRAEELERCV CQLFGADPSE MAEAARRVEA EGFFGVDINM GCSVAAIVRR GCGADLLRDE
ERAVRMVESV RRAVDCPVLV KFRTGWSPDP QGAVALARRF EDAGADALVF HPRVAPDRRT
RPPLRHHIRL VKEAVAIPVL GNGEVFTPGD AASMLETTGC DGISLGRIAL GRPWVFAGWT
GLVDDNPASN PDLWRDIPLA LLDALERRHA DRKYAARLFK KFLLYFIANF TYGNSLRGPM
LKGDDPADLR TALVDTLATL PAITRRPSAL MF
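
The sequences above are printed in space-separated blocks of ten, and the gene begins with GTG while the protein begins with M. A minimal Python sketch of how the two relate is given below, assuming translation table 11 uses the standard codon assignments and differs only in allowing alternative start codons that are read as methionine. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and were chosen for this illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 is assumed to share them.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: initiation codons permitted by table 11; each is read as M
# when it is the first codon of the CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "GTGCAGGTCG GCGCGCGCAC GGCCCCCGGC"
print(translate(first_blocks))
```

Applied to the full gene sequence, the same functions can be used to compare the result against the listed protein sequence and the reported GC content.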