Gene Dvul_2094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2094 
Symbol 
ID4662915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2433970 
End bp2435244 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content68% 
IMG OID639820337 
Producthomoserine dehydrogenase 
Protein accessionYP_967537 
Protein GI120603137 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.547464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0709018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCGC TGGTCATCGG CATGGCCGGT TGCGGCACCG TGGGCAGCGG CCTCTTGCGC 
GTCCTCGAAG AGAACCGCCA GTGGATCGTC GAGCGCACGG GGCGTGCCGT CCAGGTGAAG
CATGTGCTGG TACGCGACCT TTCGAAGCCG CGCGACCTGC CTGACGGGGC CAGCCTCACG
GACGACCCCG CCGTCCTTAC GGACGACCCT GAAGTCGACG TGCTCGTCGA ACTCATGGGG
GGCATCGAGA AGCCGCGCGA ACTCATCCGG CGCGCCATCG AGAACGGCAA GCATGTGGTC
ACCGCCAACA AGGCCCTGCT GGCCGAAGAC GGCTTCGGCC TCTTCCGCCT TGCAGAGGAG
AAGGGGGTGG GGCTGTACTA CGAGGCCAGC GTCGCCGGGG GCATACCCAT CGTGCAGACC
CTGAAGGAGA GCCTCGCGGG CAACCGCATC ACCTCGCTGG TGGGCATCCT CAACGGCACG
GCCAACCACA TCCTCTCCGA GATGACGAGT GCCGGGCTCG ACTTCGAGAC GGCCCTCGCG
CAGGCGCAGG AACTCGGCTA CGCCGAGGCC GACCCCACGC TCGACATCGA CGGGCACGAC
ACGGCCCACA AGCTGGTGCT GCTCATCCGT CTCGCCTACG GGCTCGAATA CCCCTACGCC
GAGATGCCCG TGCAGGGCAT TCGCGGCATA GACCGCATGG ATATCGAGTT CGCTCGCGAG
TTCGGCTTCC GCATCAAGCT GCTTGGGCAG GTGCGCGAGG TGGACGGCAG GCTCGAGGCG
GGGGTATTCC CCACCCTCGT GCGCCACACC TACCTCATTG CCCGTGTGGG CGGCGCGTAC
AACGCCATCC GCATCGAAGG CAACGCCGTC GGGCCGGTCT TCCTGCACGG GCAGGGCGCG
GGCAGCCTGC CCACGGCCAG CAGCGTGCTT GCCGACCTTA TGGCGGTGGC ACGGTCGACC
CCGCCGCACA ACACCGGCTT CCAGCGTCAG GTGCCGCCCA AGGCCAGCAT CCTGCCGCCC
GATGACGCCG TGAGCGCGTA CTACGTTCGC GTCATGGTGC CCGACCACCC CGGTGTTCTT
CGCGACCTTG CCGGGGCCAT GGCCGACCAC GGCATCAGCA TCGCACAGGC CATCCAGAAG
GGGCAGGACA AGCGCGGCGT GCCGCTGGTG TTCATGACGC ATGAGGCAGG GGCACGCGCC
ATCCGCGACG CCATCGAACA GATTCGCCAA GCTGGTCTGC TCACGGCCGA CCCGGTCTGC
TACCGCGTGC TGTGA
 
Protein sequence
MKPLVIGMAG CGTVGSGLLR VLEENRQWIV ERTGRAVQVK HVLVRDLSKP RDLPDGASLT 
DDPAVLTDDP EVDVLVELMG GIEKPRELIR RAIENGKHVV TANKALLAED GFGLFRLAEE
KGVGLYYEAS VAGGIPIVQT LKESLAGNRI TSLVGILNGT ANHILSEMTS AGLDFETALA
QAQELGYAEA DPTLDIDGHD TAHKLVLLIR LAYGLEYPYA EMPVQGIRGI DRMDIEFARE
FGFRIKLLGQ VREVDGRLEA GVFPTLVRHT YLIARVGGAY NAIRIEGNAV GPVFLHGQGA
GSLPTASSVL ADLMAVARST PPHNTGFQRQ VPPKASILPP DDAVSAYYVR VMVPDHPGVL
RDLAGAMADH GISIAQAIQK GQDKRGVPLV FMTHEAGARA IRDAIEQIRQ AGLLTADPVC
YRVL