Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1661 |
Symbol | |
ID | 5670063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1986055 |
End bp | 1986699 |
Gene Length | 645 bp |
Protein Length | 214 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240579 |
Product | AHBA synthesis associated protein |
Protein accession | YP_001506005 |
Protein GI | 158313497 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01454] 3-amino-5-hydroxybenoic acid synthesis related protein [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.49792 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCACCC ACGCGATCGT TTTCGACCTT GACGGTGTGA TCGTCGACAG CCACGCCGTC ATGCGCGAGG CGTTCATGAT CGCCTATGCC GAGGTGGTCG GGCCAGGCCC CGCGCCGTTC GAGGAGTACA ACCGCCACCT CGGACGGTAC TTCCCGGACA TCATGCGGAT CATGGGCCTG CCGCTCGAGA TGGAGGAGCC CTTCGTCCGG GAGAGCTACC GGCTCTCCCG CAAGGTCCTG CTTTTCGAGG GCGTCCGGGA GCTGCTCGGC GACCTGCGCG AGCGCGGCCT GCGGCTCGCG GTGGCGACGG GCAAGAGCGG CCCGCGGGCC CGCGCGCTGC TCACCGAGCT GGAGATCGTT GACTACTTCG AACGGGTTAT CGGCTCCGAC GAGGTGGCCC ACCCCAAGCC GGCCCCGGAC ATCGTCCTGC TCGCCCTCGA CGTGCTCGGT GCCGCGCCCG GCGAGGCGAT GATGATCGGC GACGCGGTGA CCGACATCCA GAGCGCCCGC GGCGCCGGGG TGCGGGCGGT CGCCGCGATG TGGGGGGAGA CCGACGAGGC CGAGCTGTTG GCGGCCGGCC CCGACTCGGT GCTGCGCTCC CCGCGTGAGC TCCTCGGCCT GCTCGGTGAC CACCGCGCGG TGTGA
|
Protein sequence | MGTHAIVFDL DGVIVDSHAV MREAFMIAYA EVVGPGPAPF EEYNRHLGRY FPDIMRIMGL PLEMEEPFVR ESYRLSRKVL LFEGVRELLG DLRERGLRLA VATGKSGPRA RALLTELEIV DYFERVIGSD EVAHPKPAPD IVLLALDVLG AAPGEAMMIG DAVTDIQSAR GAGVRAVAAM WGETDEAELL AAGPDSVLRS PRELLGLLGD HRAV
|
| |