Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4839 |
Symbol | |
ID | 5673180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5802412 |
End bp | 5803125 |
Gene Length | 714 bp |
Protein Length | 237 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243695 |
Product | AHBA synthesis associated protein |
Protein accession | YP_001509111 |
Protein GI | 158316603 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01454] 3-amino-5-hydroxybenoic acid synthesis related protein [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.098143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0397379 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCACG AGTCAGGGGC TCTCGATCCC GTTCGCTCCG GCGCCGGCGG AAACGCTCGC CCTGATTTCG CCCATGCTGT CGTCTTCGAT CTGGACGGCG TTGTCGTCGA CAGTTTCGCC GTGATGCGGG AGGCATTCCG GATCGCCTAT TCGGAGGTGG TCGGCGCAGG TCCCGCGCCT TTCGGTGAAT ACCATCGTCA CCAGGGGAGA TATTTCCGCG ACATTATGCG GATCATGGGA CTTCCGGCGG AGATGGAGGA GCCGTTCGTC CGGGAGAGCC ATCGGCTGCG GGCCGAGGTT CTCGTCTACG ACGGTGTGGT CGACCTGCTC GACACCCTGC GCCACCGTGG GTACGGCCTC GCCGTCGCCA CCGGGAAGAC CGGCTCGCGG GCACGCGACC TGCTGGCGCA TCTCGGCCTG CTCCGCTTCT TCCACCATGT CGTCGGCTCC GACGAGGTCG CGCGCCCGAA ACCGGCGCCC GACATCGTGC TGCGGGCGCT GGATCTGCTC GGCGCACGCC CGGAACGGGC CCTCATGATC GGTGACGCGG TGACCGACCT GCGGAGTGCC CACGACGCGG GGGTTCGCGC GGCGGCCGCG TTGTGGGGGT CGCCGGACGA AGCCGAGCTC CTCGCGGGCG GCCCGGACCT CGTGCTGCGG CGGCCGGCGG ACCTGCTGAC CGTCCTGCCG GCACAGCCGG CGAGCGTCGG CTGA
|
Protein sequence | MGHESGALDP VRSGAGGNAR PDFAHAVVFD LDGVVVDSFA VMREAFRIAY SEVVGAGPAP FGEYHRHQGR YFRDIMRIMG LPAEMEEPFV RESHRLRAEV LVYDGVVDLL DTLRHRGYGL AVATGKTGSR ARDLLAHLGL LRFFHHVVGS DEVARPKPAP DIVLRALDLL GARPERALMI GDAVTDLRSA HDAGVRAAAA LWGSPDEAEL LAGGPDLVLR RPADLLTVLP AQPASVG
|
| |