Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1701 |
Symbol | aroB |
ID | 5670103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2033332 |
End bp | 2034447 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240619 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001506045 |
Protein GI | 158313537 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.16166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.109076 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCGCGA TGAAGGTTTC CGACATGGTC CGCATCCAGG TCACCCCGTC CGGTGACCGG GCCTACGACG TGGTGCTGGG CGAGGGCGCC CTCTGCGAGC TACCGGCGCT GGCCGCGGGG CGCACCCGGG TCATGGTCAT TCACCCGCGA GCGCTGCGGG CCACCGCCGC GGCGGTCATC GCCGAGCTGC GGGCCGGCGC CGGTGCCGGC GTCGAGACGC ACGCGTTCGA GGTGCCGGAC GGCGAGGAGG CCAAGCAGCT GCGGGTCGCG GGCGCCTGCT GGGACGCCCT CGGCCGTGTC GGCTTCACCC GCGACGACCT GGTGGTCGGG CTCGGCGGCG GGACGACGAC CGACCTGGCC GGGTTCGCCG CGGCGGGCTG GCTGCGTGGT GTCGACGTCA TCCAGGTGCC CACCACCGTG CTCGGGATGG TCGATGCGGC GGTCGGTGGC AAGACGGGCA TCGACATCGA GGCGGGCAAG AACCTCGTCG GCGCCTTCCA CCAGCCGCTG GCGGTGCTCT GTGACCTGTC GACCCTGGCC AGCCTGCCGG CGGTCGAGGT CCGGGCCGGG CTCGCCGAGG TCGTCAAGGC CGGGTTCATC GCCGATCCGC GCATCCTCGA GCTGCTGGAG GCCGATCCGA CCGGGTCGGC GCGGCTGCCC GAGCTCGTCG AGCGGTCGAT CCGGGTCAAG GCGGCGGTGG TGTCCGGCGA CCCGCGCGAG GCCGGCCGGC GCGAGATCCT GAACTACGGG CACACCCTCG CCCACGCGAT CGAGAAGGTC GAGAACTTCT CCTGGCGGCA CGGCGCGGCG GTCTCGGTCG GCATGGTCTT CGCCGCCGAG CTCTCCCGGC TCGTCGCCGG GCTCGACCGC GTGACCGCCG ATCGCCACCG CGAGCTGCTG CGGGCCATCG GGCTGCCGGT GGAGTACCGG GGGGACCGCT GGCCGGCGCT GCTCGACGCG ATGCGGGTGG ACAAGAAGAC CCGGGGCCGG CGGTTGCGTT TCGTTGTGCT CGAAGCGCTC GGCCGGCCGC GCGGATTCGA CGATCCCGAG CCCGGCCTGC TGCTGGCCGC GTACGGCTCG GTCGCCGCGG GCGGTGTGAG CGCTACCGGG AACTGA
|
Protein sequence | MCAMKVSDMV RIQVTPSGDR AYDVVLGEGA LCELPALAAG RTRVMVIHPR ALRATAAAVI AELRAGAGAG VETHAFEVPD GEEAKQLRVA GACWDALGRV GFTRDDLVVG LGGGTTTDLA GFAAAGWLRG VDVIQVPTTV LGMVDAAVGG KTGIDIEAGK NLVGAFHQPL AVLCDLSTLA SLPAVEVRAG LAEVVKAGFI ADPRILELLE ADPTGSARLP ELVERSIRVK AAVVSGDPRE AGRREILNYG HTLAHAIEKV ENFSWRHGAA VSVGMVFAAE LSRLVAGLDR VTADRHRELL RAIGLPVEYR GDRWPALLDA MRVDKKTRGR RLRFVVLEAL GRPRGFDDPE PGLLLAAYGS VAAGGVSATG N
|
| |