Gene Dvul_2031 details

Gene Information

Locus tag: Dvul_2031
Symbol:
ID: 4662479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 2365033
End bp: 2366163
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 65%
IMG OID: 639820274
Product: alanine racemase
Protein accession: YP_967474
Protein GI: 120603074
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0787] Alanine racemase
TIGRFAM ID: [TIGR00492] alanine racemase
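
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2365033
END_BP = 2366163


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1131 376
```

Both results agree with the Gene Length (1131 bp) and Protein Length (376 aa) fields listed in the record.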


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCATAT CCTACAACAA GGCGAGTGTT GTCGTCAGTC TCCAATCCAT TATCGCCAAC 
TATCGCCGGA TTCGTACGGT GGCACAGCGG CCCATGCCCG TCATCAAATC CGATGCGTAC
GGCCACGGCC TTGAAGCTGT GGGCATGGCC CTTGAGGCCG AAGGTGCACG TGAATGCGCC
GTCGGTACGG TGGGGGAGGG GGCGAAGCTG CGCAAGGCCG GTTTCGGGGC GGATATCGTC
GCCCTGCTTG GGGCCCTCGA CAGAGAGGAT GCCCAGTTGG CGGCCTCTTC CGGCATCATC
CCCACCGTCC TTGACATCGC CGGGCTTGAA CGCCTTGCCG CGCAGGGTAC CACCGAAAGG
CCGGTGCGCG TGGCGCTCAA GTTCGATACG GGCATGGCCC GCCTCGGCTT CACGGAACAT
GATGTGTCTG CCCTTTGCGA GCGTCTGCGC ACCCTGCCTT CGGTGCGGCC CGTCATGGCG
GTATCGCATC TCGCCGTTGC CGACGACCCC ACCCAGTCGG CCTTCACGAT GGCGCAAGGG
GCCGCATTCG CGCGTATCAT GGCGGGGCTT CGAAGCAACT TCCCCGATAT CATGGGGTCG
CTGTCAAACT CCGCAGCCAC GCTGGCGCAC CCGCAACTGC ACTGGGACGT GCAGCGTCCC
GGCATCGCCC TCTATGGTTC GAATCCCCTT CGCGGGACGG CCCTTGCACG GCATGGCGAA
GGGCTGTTGC CCGCCATGTC CGTCTCCGTG CCGGTGTTGC AGGTCCATCC GCTGCCCGCG
GGGCGTAGTA TCAGCTACGG GCGGACGTAC ACCGCCACCA AAGATGCGAC CGTGGCCATC
ATAGCCGCCG GATACGCCGA CAACTACAGC CGCGCCCTGT CAGGGCGTGG TGTGGCTGTA
GCTGGCGGGC GGCGTGTGCC TGTTCTTGGT CGCGTGTGCA TGCAGACCAC GGCCATCGAC
GTCACCGACG TGCCCGGCAT CGCCACGGGA GACCGTGTAT GGCTGCTTGG TGGCCCCGGC
CCCGCCACGG TCTCGGCTGA TGAACTGGCC GACCTGTGGG GGACGATATC CTATGAAGTG
CTGTGTCTGC TTGGCATGAA CCCGCGCAGG CATGACGACT CTGTGGAGTA G
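
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence above: 1 G and 3 C out of 10 bases.
print(gc_content("ATGCCCATAT"))  # 0.4
```

Passing the full gene sequence (with its spaces and line breaks left in) works the same way, because whitespace is removed before counting.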
 
Protein sequence
MPISYNKASV VVSLQSIIAN YRRIRTVAQR PMPVIKSDAY GHGLEAVGMA LEAEGARECA 
VGTVGEGAKL RKAGFGADIV ALLGALDRED AQLAASSGII PTVLDIAGLE RLAAQGTTER
PVRVALKFDT GMARLGFTEH DVSALCERLR TLPSVRPVMA VSHLAVADDP TQSAFTMAQG
AAFARIMAGL RSNFPDIMGS LSNSAATLAH PQLHWDVQRP GIALYGSNPL RGTALARHGE
GLLPAMSVSV PVLQVHPLPA GRSISYGRTY TATKDATVAI IAAGYADNYS RALSGRGVAV
AGGRRVPVLG RVCMQTTAID VTDVPGIATG DRVWLLGGPG PATVSADELA DLWGTISYEV
LCLLGMNPRR HDDSVE
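
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce that in plain Python, without external libraries. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table; the two differ in which codons may act as start codons, which this sketch does not model (this gene begins with ATG, so it makes no difference here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored, and trailing bases that do not fill a codon are dropped.
    """
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two 10-base blocks of the gene sequence.
print(translate("ATGCCCATAT CCTACAACAA"))  # MPISYN
# Last 21 bases of the gene sequence, ending in the TAG stop codon.
print(translate("CATGACGACT CTGTGGAGTA G"))  # HDDSVE
```

The two outputs match the first six and last six residues of the protein sequence above.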