Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_4064 |
Symbol | |
ID | 8227662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 4911922 |
End bp | 4913001 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644931907 |
Product | chorismate synthase |
Protein accession | YP_003088432 |
Protein GI | 255037811 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000000659407 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000000405427 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCAGTA CATACGGAAA GATATTCAAA ATAGCCACTT TCGGCGAATC GCACGGGGTA GGAATAGGCG TGGTGATCGA GGGCTGCCCG GCGGGTGTGA CTTTCGATTC AGATTTTATT CAAAGCGAAC TCACGCGCAG GAAGCCCGGA CAGTCCCGGA TCACGACCCA ACGCAAGGAG GCCGACGAAT TTGAAGTATT ATCGGGGGTT TTCGAAGGAA AAACGACCGG CACGCCCATT GCGATGGTGA TCCGGAACGA AGACCAGCGG AGCAAGGATT ACTCGCACAT AGCGGCGCAA TTCAGGCCTT CTCATGCCGA TTATACCTAT CAGGTCAAAT ACGGCGTGCG CGACTATCGT GGCGGAGGGC GCAGTTCTGC ACGCGAAACG GCAGCACGCG TAGCCGCGGG AGCGCTCGCG AAGCTGATGC TGGCCGAATT GGGGATCAGC ATCCAGGCTT ACGTATCACA GGTGGGCACT ATGAAGCTGG AAAAAAGCTA CCAGGAACTG GACCTTTCCG AAACGGAAAA CAATGCAGTG CGCTGCCCGG ACCCTGAAAT GGCCCAGCAA ATGTTCGATT ACATCGACGG CATCCGCAAG CAAGGCGATT CGATCGGTGG TGTGGTGAAT TGTGTGGTAA AAGGAACGCC GGCCGGCTGG GGTGAACCCG TGTTCGACAA ACTCCATGCG GAGCTCGGGA AAGCTATGCT AAGCATTAAT GCCGTAAAGG GCTTTGAATA CGGCAGCGGT TTCGACGGCG TGCTGCTACC CGGCTCACAG CACAATGACG CTTTCTATAC CGATGAAAAC GGCAATGTCC ACACGCGTAC CAACCATTCA GGAGGCATTC AAGGGGGTAT TTCAAATGGA GAGGATATCT ATTTCCGCAC TGCTTTCAAA CCGGTAGCAA CTATTATGCA AGACCAGGAG AGCGTCGATC AGTTCGGTAA CGTGGCCGTT GTGCAAGGGA AAGGACGCCA TGATCCGTGC GTAGTGCCGC GCGCAGTGCC CATCGTAGAG GCGATGACGG CATTGGTTCT GGCGGACTTT TATCTTAGAA ACAGATCCAG CAAGCTGTAA
|
Protein sequence | MGSTYGKIFK IATFGESHGV GIGVVIEGCP AGVTFDSDFI QSELTRRKPG QSRITTQRKE ADEFEVLSGV FEGKTTGTPI AMVIRNEDQR SKDYSHIAAQ FRPSHADYTY QVKYGVRDYR GGGRSSARET AARVAAGALA KLMLAELGIS IQAYVSQVGT MKLEKSYQEL DLSETENNAV RCPDPEMAQQ MFDYIDGIRK QGDSIGGVVN CVVKGTPAGW GEPVFDKLHA ELGKAMLSIN AVKGFEYGSG FDGVLLPGSQ HNDAFYTDEN GNVHTRTNHS GGIQGGISNG EDIYFRTAFK PVATIMQDQE SVDQFGNVAV VQGKGRHDPC VVPRAVPIVE AMTALVLADF YLRNRSSKL
|
| |