Gene DvMF_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_1749 
Symbol 
ID7173664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp2137049 
End bp2138305 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content67% 
IMG OID643540264 
Productprephenate dehydratase 
Protein accessionYP_002436162 
Protein GI218886841 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones113 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCA CCGACGACAA GGAACCCCAC TGGCGGGGGG ATCACGTTGC CGCCAGCGCC 
CCCAAGGTCA CGGACGACGG CGCGCCCCAG CCTGCGCATT CCGCAGTCAC ATCCGCAGTC
ACATCCGCAG TCACATCCGG GGGCGCAGCC GACCCGGAAG CGGCGCTGGC CGGGCGGCTT
GGCCAGATCC GCCACGAGAT CGACGGGCTG GATTCCGACC TGCTGAACCT GCTGAACCGC
CGCGCCTCGC TGAGTCTGGA GGTGGGGCGC ATCAAGGCCG ACGACGCGGG CATCGTGTTC
AAGCCCTTTC GCGAGCGCGA GGTGCTGGAA AATCTCATGG CCGCCAATGG CGGGCCGCTG
CCCAACGAGC ACCTGCGTTC CATCTGGCGC GAGATTCTTT CGTCATCGCG CAGCCTGCAA
CGGCCCCAGA AGGTGGCCTA CCTTGGGCCG GAGGGCACCT TTTCGTACTT CGCGGGCGTG
GAATTTCTCG GCAAGGCCGT GGAATACATG CCGCAAAAGG ATCTGGACGG GGTGTTCCGC
GCCGTGCACG ACAGGCAGTG CGAGCTTGGC GTGGTGCCGC TGGAAAATTC GCTGCACGGC
ACCGTGGGCC AGAGTTTGGA CCTGTTTCTG TCGCATGAGG TATTCATCCA GTCCGAGCTG
TTCTGCCGCA TCAGCCATTG CCTGCTGACC ACGGAAACCA GCCTGGCCGA CGTGACCACG
GTGTATTCGC ATCCGCAGCC GCTGGCCCAG TGCGGCGGCT GGCTGCGCCA GGCCCTGCCC
GGGGCGCGGA TCATCCCGGC GGATTCCACC GCCTCCGCCG CGCGCCGCGT GGGGGGCGAA
AAGGGCGCGG CGGCCATCGG GCACCGCAGT CTGGCCGCGC TGCTGGGGCT GAACATCCTG
GCGCGCGGCA TAGAGGACCA GCCGGACAAC TGGACGCGCT TCGTGGTCAT CGGCCCCGCC
CCGGCGGGCC AGCCCGGCAC GGACAAGACC TCCATGCTGT TCTCTGTGCC GGACAGGCCC
GGCGCGCTGG CAGAGGTGCT GAACCTGCTG GCCCGCGAGG GCATCAACAT GAAGAAGCTG
GAGTCGCGGC CCCTGCGCGG CGAAAAGTGG AAATACGTGT TTTTCGTGGA CGTGGAATGC
GACCTTGGCA ACGAGGACTA TGGCCGGGTG GTGCATGAAC TGCGCAGGCT GTGTCATACG
TTGCGCATCC TCGGGAGCTA CCCCGCCGGG CCGCAGTTGG ACATGAGTCG AGATTGA
 
Protein sequence
MSLTDDKEPH WRGDHVAASA PKVTDDGAPQ PAHSAVTSAV TSAVTSGGAA DPEAALAGRL 
GQIRHEIDGL DSDLLNLLNR RASLSLEVGR IKADDAGIVF KPFREREVLE NLMAANGGPL
PNEHLRSIWR EILSSSRSLQ RPQKVAYLGP EGTFSYFAGV EFLGKAVEYM PQKDLDGVFR
AVHDRQCELG VVPLENSLHG TVGQSLDLFL SHEVFIQSEL FCRISHCLLT TETSLADVTT
VYSHPQPLAQ CGGWLRQALP GARIIPADST ASAARRVGGE KGAAAIGHRS LAALLGLNIL
ARGIEDQPDN WTRFVVIGPA PAGQPGTDKT SMLFSVPDRP GALAEVLNLL AREGINMKKL
ESRPLRGEKW KYVFFVDVEC DLGNEDYGRV VHELRRLCHT LRILGSYPAG PQLDMSRD