Gene Information |
Locus tag | DvMF_1408 |
Symbol | |
ID | 7173312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 1721788 |
End bp | 1722849 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643539915 |
Product | chorismate synthase |
Protein accession | YP_002435824 |
Protein GI | 218886503 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
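The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions and that the Gene Length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1721788
END_BP = 1722849


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


gene_len = gene_length(START_BP, END_BP)
print(gene_len)                  # 1062
print(protein_length(gene_len))  # 353
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) fields listed in the record.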
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 0.422248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCA ACACCTTCGG TCGCATCTTC CGGCTGACCA CCTACGGCGA ATCGCACGGC CCCGGCCTTG GCGGCGTGGT GGACGGCTGC CCCGCCGGGG TGCCGCTGGA CGAATCGGTC ATCCAGCGCG AACTGGACCT GCGCCGCCCC GGCAGCGCGT CCGCCGGTCT TGCGGGTACG GCCCGCAAGG AACCGGATAC CGTGCGCCTG CTTTCCGGCG TGTTCGAAGG GGTCACCACC GGCACCCCCA TCGGATTCCA CATCGCCAAC GAAGACCAGC GCTCGCGCGA CTACGGCGAC CTGGCCAAGC TGTACCGCCC CGGCCACGCG GACATCACCT ACGACGCCAA GTACGGCCTG CGCGATTTTC GCGGCGGCGG CCGGGCATCG GGCCGCGAGA CCGTGTCGCG CGTGGCGGGC GGCGCCGTGG CGCTGGCCTT GCTGGCCATG CACGACATCG AGGTGCGCGC CTACACCGTG GAGATCGGCG GCGTGCCCGC CGACGTCGTG GACCCGGCGG GCGCGCAGGG GCGGCTGTTC TTTTCGCCCG ACCCGGACGT GGTGCCCGCA TGGGAATCGC TGGTGCACGA CGTGCGGGCC GAGGGCGACA CCCTGGGCGG CATCGTGCAG GTGGAGGCCA CCGGCGTACC CGCCGGGCTT GGCGAGCCGG TGTTCGACAA GCTGGACGCC CTGCTGGCCC ACGCCATGAT GTCCGTGGGC GCGGTAAAGG CCGTGGAGGT GGGCGCGGGC CTTGAGGCGG CGCGCCTGCG CGGCAGCGAG AACAACGACC CCATCATTCC CGGCGGCTTC CACACCAACC ATGCCGGGGG CATTCTGGGC GGCATCTCCA ACGGCCAGCC CATCGTGGTG CGCGCGACGG TGAAGCCCAT TCCCTCCATC GCGCAGGAGC AGATCACCAT CGACACCAAC GGCAGGCCCG CGCCGCTGCG CGTGGGCGGT CGCCACGACA TCTGCGCCAT CCCGCGCGTG GTGCCCGTGC TGAAGGCCAT GGCGGCGCTG GTGTTGGCGG ACAGTCTTCT CCTCCAACGT CGCATGGGCT AG
|
Protein sequence | MSGNTFGRIF RLTTYGESHG PGLGGVVDGC PAGVPLDESV IQRELDLRRP GSASAGLAGT ARKEPDTVRL LSGVFEGVTT GTPIGFHIAN EDQRSRDYGD LAKLYRPGHA DITYDAKYGL RDFRGGGRAS GRETVSRVAG GAVALALLAM HDIEVRAYTV EIGGVPADVV DPAGAQGRLF FSPDPDVVPA WESLVHDVRA EGDTLGGIVQ VEATGVPAGL GEPVFDKLDA LLAHAMMSVG AVKAVEVGAG LEAARLRGSE NNDPIIPGGF HTNHAGGILG GISNGQPIVV RATVKPIPSI AQEQITIDTN GRPAPLRVGG RHDICAIPRV VPVLKAMAAL VLADSLLLQR RMG
|
| |
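The relationship between the gene sequence and the protein sequence above can be checked by translating the DNA codon by codon. The sketch below assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAG even though the gene is on the minus strand). Translation table 11 assigns the same amino acids to sense codons as the standard code and differs in its set of permitted start codons, so a standard codon table is used here. The helper names are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 24 nucleotides of the gene sequence in the record.
head = "ATGAGCGGCA ACACCTTCGG TCGCATCTTC"
tail = "CTCCTCCAAC GTCGCATGGG CTAG"
print(translate(head))  # MSGNTFGRIF
print(translate(tail))  # LLQRRMG
```

The two printed fragments match the start and end of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.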