Gene Information |
Locus tag | Franean1_1833 |
Symbol | |
ID | 5670235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2199164 |
End bp | 2200522 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240754 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001506177 |
Protein GI | 158313669 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
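The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state this, but it is consistent with the listed Gene Length) and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 2199164
END_BP = 2200522


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1359 452
```

Under these assumptions the results match the Gene Length (1359 bp) and Protein Length (452 aa) fields.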
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0830687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.174475 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAACG ACCTGGACAT CTGGCGGAGC CTCCCAGCCA GGCAGCAGCC CTCCTGGCCC GACGGGGAGG AGCTGGCGGC GGCGTTCGCC GAGCTCTCGG CCCTGCCGCC GCTGGTGACC GCGCCGGAGG TGCGTTCGCT GACCGACCGT CTCGCGATGG TGGCCCGCGG CGAGGCGTTT CTGCTCCAGG GCGGTGACTG CGCGGAGACC TTCGCGGCCA ACACCGCAGA CAAGATCCGC GACAAGGTCA AGACCCTGCT GCAGATGGCG GTCGCCCTGA CCTACGGCGC CAGCACGCCG GTGGTCAAGG TGGCCCGCAT CGCCGGCCAG TACGCCAAGC CGCGCTCGGC CGACATCGAG GCCTCGACCG GGCTGCCCTC CTACCGGGGC GACGCGGTGA ACGACATCGC GCCGAACGCG CAGGCCCGCC GTCCGAACCC GCGGCGGATG GTCGACGCCT ACCACCAGAG CGCCGTCGCG CTGAACCTGG TGCGGGCGTT CGCGACCGGC GGCTTCGCCG ACCTGTCCAA GGTCCACGAG TGGAACAAGG CGTTCGTGCG CGACTCGGCC GCCGGCCGCC GCTACGAGCT GATGGCCGTC GACATCGAGC GCGCCCTCGC GTTCATGGCC GCCTGCGGCA TCGACCTCGA CCGGACGGCC GCGCTGACCG GCGTCGAGAT GTTCACCAGC CACGAGGGCC TGCTGATGGA GTACGAGCGG GCGCTCACCC GCACCGAGGA GTCGACCGGC GAGGTCTACG ACCTGTCGGC GCACATGATC TGGATCGGCG AGCGCACCCG TGACCTCGAC GGCGCCCACG TCGACTTCCT CTCCCGGGTC GGCAACCCGA TCGGCTGCAA GATCGGCCCG ACGGCAACGC CGGACGAGGT CGTGGCGCTC ACCGAGCGGC TCAACCCCGA CCACATCCCC GGCCGGCTGA CGCTGATCGC GCGGATGGGC GCCAAGCGGG TGCGCGACGC CCTCCCGCCG ATCATCGACA AGGTGAACGC GGCCGGGCAC CCCGTGGTGT GGTCGTGCGA CCCGATGCAC GGCAACACCC GCGACGTCGG CGGCGTGAAG ACCCGGCACT TCGATGACGT CCTCGACGAG GTCTTCGGGT TCTTCGAGGT GCACAAGGGG CTCGGGACGC ACCCCGGTGG CCTGCACATC GAGCTGACCG GCGAGAACGT CACCGAGTGC CTCGGCGGCG CGGAGATGAT CGGCGAGGCC GACCTCGGTG GCCGCTACGA GACGGCCTGC GATCCGCGGC TGAACACCGG CCAGGCGCTC GAGCTGGCCT TCCTGGTGGC CGAGTCGCTG CAGCAGGCCC GCGCGGAGCG CGACACACAC ACCCGCTGA
|
Protein sequence | MSNDLDIWRS LPARQQPSWP DGEELAAAFA ELSALPPLVT APEVRSLTDR LAMVARGEAF LLQGGDCAET FAANTADKIR DKVKTLLQMA VALTYGASTP VVKVARIAGQ YAKPRSADIE ASTGLPSYRG DAVNDIAPNA QARRPNPRRM VDAYHQSAVA LNLVRAFATG GFADLSKVHE WNKAFVRDSA AGRRYELMAV DIERALAFMA ACGIDLDRTA ALTGVEMFTS HEGLLMEYER ALTRTEESTG EVYDLSAHMI WIGERTRDLD GAHVDFLSRV GNPIGCKIGP TATPDEVVAL TERLNPDHIP GRLTLIARMG AKRVRDALPP IIDKVNAAGH PVVWSCDPMH GNTRDVGGVK TRHFDDVLDE VFGFFEVHKG LGTHPGGLHI ELTGENVTEC LGGAEMIGEA DLGGRYETAC DPRLNTGQAL ELAFLVAESL QQARAERDTH TR
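The relationship between the two sequences above, and the GC content field, can be sketched in plain Python. This is a minimal illustration, not the database's own method: it hard-codes the amino acid assignments of translation table 11, assumes the first codon is an initiator and reads it as M (which is why the GTG start here corresponds to M rather than V), and assumes the sequence is a complete CDS with an optional trailing stop codon. The helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in the order TTT, TTC, TTA, TTG, TCT, ...
# (translation table 11; '*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, reading the first codon as M (assumed initiator)."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    residues[0] = "M"  # initiator codon (e.g. GTG) is read as methionine
    if residues[-1] == "*":
        residues.pop()  # drop the terminal stop codon
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First seven codons of the gene sequence above.
print(translate_cds("GTGAGCAACG ACCTGGACAT C"))  # MSNDLDI
```

The first seven codons of the gene sequence (GTG AGC AAC GAC CTG GAC ATC) give MSNDLDI, matching the start of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.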