Gene DvMF_2019 details


Gene Information

Locus tag: DvMF_2019
Symbol:
ID: 7173938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 2500863
End bp: 2501918
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 71%
IMG OID: 643540536
Product: dihydrouridine synthase DuS
Protein accession: YP_002436430
Protein GI: 218887109
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
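
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2500863
end_bp = 2501918
gene_length_bp = 1056
protein_length_aa = 351

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under these assumptions, all three checks agree with the reported values.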


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 99
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACA CCGCATCCGC CGCCCCCCAC CTTTCCCTGG CCCCGCCCCT GCGGACCGAC 
GCCCCGTGGC TGGCCCCCCT GGCCGGGTAT TCCGACCTGC CCTTCCGCCT GCTGTGCCGC
GAGCACGGCG CGGCGGCCTG CTGCACAGAA ATGGTCAGCG CCAAGGGGTT GCTGTACCAC
AGCCCCGGCA CCCGCGACCT GCTGGCCTCC ACGCCTGAAG ACGCGCCACT TGTCCTGCAA
CTGTTCGGAG CCGACGCGGA CATCATGCGC ACGGTCATGC CGGGGCTGCT GGAGCAGGGC
TTTCGCTGGT TCGACCTGAA CATGGGATGT TCGGTGCCCA AGGTGGTCAA GACGGGTTGC
GGCTCGGCCA TGTCGCGCGA CATGGACAAC GCCCTGTCCG TTGCCCGCGC CATGGTGGAG
GTGGCCGGTG AAGGCCGGGT GGGCTTCAAG ATGCGGCTGG GCTGGCAGGC GGGCGAGGAA
ACCTGGCGCG AAATGGCCCT GCGGCTGCAA GACGCGGGCG CGGGGTGGAT CACCCTGCAC
CCCCGCTTCG CGCGGCAGGG CTTTGGCGGC GAAGCGCGCT GGAGTGCCCT GCGCGAGCTT
GCGGCCACGC TGACCATCCC GGTCATTGCC AGCGGCGACC TGTTCACGGC GGCGGACGCG
GTGCGCTGCG TGCGCGAGAC GGGCGTGGCC ACGGTGATGT TCGCGCGCGG GGCCATGAAC
AACCCCGCCG TCTTCGACGA ATACCGGGTG CTGCTGGCCG GGGGCCAGCC CCCGCCGCCC
GACGCGGACC GGCTGAAGGC GCTCATCCGC CGCCATCTTG AACTGGCCCT GGCCCACTCC
GGCGAACGCA CCGCCCTGCT CAAGATGCGC ACCTTCGTGC CGCGCTACGT GCGCCACATT
CCCGGCGTGC GGGCGCTGCG CAACCGGCTG GCCTCGTGCC TGGACCGCGA CCTGCTGGAA
GAACTGCTTG AAACCCACCT GACCCCGCAA GCGTTCGCGG AAGACGGCGG CGCCGACCAA
GCCACCACCA ACACCGATGG AGAGGCCCGG CCATGA
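
The gene sequence above is laid out in blocks of ten bases separated by spaces. As a minimal sketch, assuming the sequence has been copied into a Python string, the spacing can be stripped before the length or GC fraction is computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool. Only the first line of the sequence is used here, to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only, for illustration.
first_line = "ATGACCGACA CCGCATCCGC CGCCCCCCAC CTTTCCCTGG CCCCGCCCCT GCGGACCGAC"

seq = clean_sequence(first_line)
print(len(seq))                        # number of bases on this line
print(round(gc_fraction(seq) * 100))   # GC percentage for this line only
```

Passing the full gene sequence through the same helpers gives values that can be compared with the Gene Length and GC content fields.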
 
Protein sequence
MTDTASAAPH LSLAPPLRTD APWLAPLAGY SDLPFRLLCR EHGAAACCTE MVSAKGLLYH 
SPGTRDLLAS TPEDAPLVLQ LFGADADIMR TVMPGLLEQG FRWFDLNMGC SVPKVVKTGC
GSAMSRDMDN ALSVARAMVE VAGEGRVGFK MRLGWQAGEE TWREMALRLQ DAGAGWITLH
PRFARQGFGG EARWSALREL AATLTIPVIA SGDLFTAADA VRCVRETGVA TVMFARGAMN
NPAVFDEYRV LLAGGQPPPP DADRLKALIR RHLELALAHS GERTALLKMR TFVPRYVRHI
PGVRALRNRL ASCLDRDLLE ELLETHLTPQ AFAEDGGADQ ATTNTDGEAR P