Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1947 |
Symbol | |
ID | 5694787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 2356890 |
End bp | 2357954 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641264545 |
Product | chorismate synthase |
Protein accession | YP_001529828 |
Protein GI | 158521958 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGTA ATACTTTTGG AGAGCTGTTT CGCGTTACCA CATGGGGAGA GTCCCACGGG CCGGGCATCG GTGTGGTCAT CGACGGGTGC CCGCCCGGCC TTGCCCTGGA CGAAGCCGGG GTGCAGAAAA TGCTGGACCG CCGCAAGCCC GGCGGCGGGT CCATTGCCAG CACCGCCAGA AAAGAGGCGG ACCGGGCCGT TATCCTGTCC GGCGTGTTTG AAGGCAAAAC CACGGGCACC CCGATCCTGA TCATGGCCCA TAACAGGGAT GCCCGGTCAT CCGCCTACAC CGACATCGCC GGCCTGTTCC GGCCCGGGCA TGGTGACATC ACCTACACGG CCAAGTACGG CATTCGGGAC TGGCGGGGCG GGGGCCGGGC CTCGGCCCGG GAGACCTTTG GCCGGGTGGC GGCCGGGGCC GTGGCCGCTG AACTGCTTCG GCTTTCCGGT ATTTCAGTTG CGGCCTACAC CCTGGAACTG GGCGGCATCC GCGCAACAAC CATTGATGTC GGGCAGGTTG ATCAGAACAT GTTCGGCTGC CCGGACAGCA CTGTTATGGC GGCCATGACT GACCGTGTGA CCCAGGTAAA GCGGCGGGGT GACTCTGTCG GCGGCATCGT CGAGGTCCGT GCCGATGGCG TGCCCGCCGG CCTGGGAGAG CCGGTGTTTG ACAAACTGGA TGCCGACATT GCCAAAGCCC TGATGAGTAT CGGCGCGGTA AAGGGAGTTG AGATCGGCGC CGGGTTTGAA GCATCGGGTA TGACCGGCTC CCGGAGCAAC GATGAAATCA CGCCCCAGGG GTTTGCCACC AATAATGCCG GCGGCATTCT GGCCGGCATT TCCAACGGGG ACCGGATCGT GGCCAGGGCC GCGGTCAAGC CGATTCCCTC CATCGGCATT ACCCAGCAAA CCGTGGATAC AAACGGCAAA CCGGCCTCCA TTTCCATCAA GGGCCGGCAC GATATTTCCG CCATTCCCCG GATCAACGTG GTGTGTGAGG CCATGGTGTG CCTGGTGCTG GCCGATCATC TTCTTAGACA GAAAGCGATT TCATGGACCC GGTAA
|
Protein sequence | MAGNTFGELF RVTTWGESHG PGIGVVIDGC PPGLALDEAG VQKMLDRRKP GGGSIASTAR KEADRAVILS GVFEGKTTGT PILIMAHNRD ARSSAYTDIA GLFRPGHGDI TYTAKYGIRD WRGGGRASAR ETFGRVAAGA VAAELLRLSG ISVAAYTLEL GGIRATTIDV GQVDQNMFGC PDSTVMAAMT DRVTQVKRRG DSVGGIVEVR ADGVPAGLGE PVFDKLDADI AKALMSIGAV KGVEIGAGFE ASGMTGSRSN DEITPQGFAT NNAGGILAGI SNGDRIVARA AVKPIPSIGI TQQTVDTNGK PASISIKGRH DISAIPRINV VCEAMVCLVL ADHLLRQKAI SWTR
|
| |