Gene Dvul_2090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2090 
Symbol 
ID4663204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2430358 
End bp2431422 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content68% 
IMG OID639820333 
Productchorismate synthase 
Protein accessionYP_967533 
Protein GI120603133 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0281395 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0647066 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCA ACACACTGGG ACGTCTTTTC AGGCTGACGA CCTACGGCGA ATCGCACGGC 
GCAGGGCTTG GTGGCGTCAT CGACGGCTGC CCGGCAGGCA TTGCGCTGGA CGAGGCCGTC
ATCCAGCGTG AACTCGACCT TCGCCGCCCC GGTGGCAACT CCGCCTCGAC CACCCGGCAG
GAACCCGACA GGGTGCGTCT GCTTTCGGGC GTGTTCGAGG GGGTGACCAC CGGAACGCCC
ATCGCCTTCC ACGTGGAGAA CGTCGACCAG CGTTCGCGCG ACTATGGCGA GATAGCCCGG
TTGTACAGGC CGGGTCATGC CGATTTCACC TACGACGCCA AGTTCGGCGT ACGCGACTAT
CGCGGCGGCG GTCGCGCCTC CGGGCGTGAG ACCCTCTCGC GCGTGGCGGG CGGTGCCATC
GCGCAGGCGC TGCTGGCCCG CCATGGCATC GCGGTGCGGG CCTTCACCGT GGAACTTGGC
GGCGTACCCG CCGACCTCGT GGACGTGGCG GGGGCGCAGC TACGTCCGTT CTTCTCGCCC
GACCCCGATG TGGTGGAGGC GTGGGAGGAC ATGGTGCGCA CGGTGAAGGG CGAAGGCGAT
ACCCTCGGCG GCATCGTGCA GGTCGAGGCC ACGGGAGTCC CCGCCGGTCT GGGCGAACCC
GTGTTCGACA AGCTGGACGC CGTGCTTGCC TATGCGCTCA TGTCCGTAGG GGCGGTGAAG
GGCGTCGAGG TCGGCGCCGG GTTCGAGGCC GCGCGGATGC ACGGCAGCGA CAACAACGAC
CCCATCGTGC CCAGCGGTTT CTTCACCAAC CATGCGGGCG GCATTCTCGG CGGCATCTCC
AACGGAGAGA CCATCGTCCT GCGCGCGGCG GTGAAGCCCA TCCCCTCCAT CGCGCAAGAG
CAGATAACCA TCGACCGCGA CGGCAAGCCC TCGGCCCTGT TCATCGCCGG ACGGCACGAC
ATTAGCGCGA TTCCGCGCAT CGTGCCTGTG CTCAAGGCCA TGACCGCACT CGTGCTGGCC
GACATGCTGC TCATGCAGCG CCGCATGGCA ACGCCGCAGC CCTAG
 
Protein sequence
MSGNTLGRLF RLTTYGESHG AGLGGVIDGC PAGIALDEAV IQRELDLRRP GGNSASTTRQ 
EPDRVRLLSG VFEGVTTGTP IAFHVENVDQ RSRDYGEIAR LYRPGHADFT YDAKFGVRDY
RGGGRASGRE TLSRVAGGAI AQALLARHGI AVRAFTVELG GVPADLVDVA GAQLRPFFSP
DPDVVEAWED MVRTVKGEGD TLGGIVQVEA TGVPAGLGEP VFDKLDAVLA YALMSVGAVK
GVEVGAGFEA ARMHGSDNND PIVPSGFFTN HAGGILGGIS NGETIVLRAA VKPIPSIAQE
QITIDRDGKP SALFIAGRHD ISAIPRIVPV LKAMTALVLA DMLLMQRRMA TPQP