Gene Information |
Locus tag | Franean1_0972 |
Symbol | |
ID | 5669386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1135770 |
End bp | 1136729 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239900 |
Product | glycerophosphoryl diester phosphodiesterase |
Protein accession | YP_001505334 |
Protein GI | 158312826 |
COG category | [C] Energy production and conversion |
COG ID | [COG0584] Glycerophosphoryl diester phosphodiesterase |
TIGRFAM ID | |
|
|
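The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, and the function names are hypothetical, chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1135770, 1136729)
print(length)                     # 960
print(protein_length_aa(length))  # 319
```

Under those assumptions the values agree with the Gene Length (960 bp) and Protein Length (319 aa) fields listed in the record.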
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.701633 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGTTC TGGGCCATCG TGGAAGCCGG ACACCCGGTC CGGAGAACAC GCTCGAGGCG GTAGACGCCG CGTTACGGGC AGGTGCGGAC GGCGTCGAGC TCGACGTGCG CCGCAGCGCC GACGGCGACC TGGTGTGTGT GCACGACGCG CGGCTGCCGC GGTTGGGCGG TCGGGCCGTC ATCCGCCGGT CGACCAGTGA GCTCGCCAGT CGTGGCATCC CACTGCTGAC CGAGATGCTC GACGTCTGGG ACGGCCGTGG CCGCCTCATC CTGGAGATCA AGAACCAGCC GGGCCAGCCG GATTTCGACG CTCCACGCGA GCGGACGGCC CGCGCGCTCA TCGAGCTGCT GCGGGCTCGC GGGCTGCCGG GCTCCTCCGT CGCCGGCGCG CACCTCGATC AGGCGACAGC GCCGGCCTCC GGTGCGCCAT CCGACGTCGG CCCACGGCTC GCCGTCGGTG CGCCGCCCGG CCCCGGTGCG CTTCCCGCGA ACGGAGCGCC TCCCGCGAAC GGAGCGTCAC TCGCGAACGG AGCGCCTCCT GCGCCCGGCT CGGCGGGTGG GCCGGCCGGC GAATCGAGTG AGTCGAGCTC CACCGGGAAC TCGACGCCGG CAAGCCAATC TGTGTCGGGA ATCACAGTCT CGTCCTTCGA CTGGTTCGCA ATCGAGGCGA TCCGCGACGC CGGGCTCGGC GTCGCGACCG CGTTCCTCAC GATGCCGCGG ATGTCGGTGA GCGGCGGGGT CGCCTACGCC CGCTCGGCAG GCCACACGGA GCTGCACGCG CACGTGTCCG CCGTGCTGGG CGTGGCCGAC GCGGTCCCGC GCGCGCGGCG GGCCGGCCTC CGCCTCGTCA CCTGGACGGT CACCGACCCG GCGACGGCGA TCGAGCTGCG CGACGCCGGT GTGGACGGCG TGATCTGTGA CGACCCCGTG GGCGTCGGCC AGGCGCTGCG GCGCCCCTGA
|
Protein sequence | MEVLGHRGSR TPGPENTLEA VDAALRAGAD GVELDVRRSA DGDLVCVHDA RLPRLGGRAV IRRSTSELAS RGIPLLTEML DVWDGRGRLI LEIKNQPGQP DFDAPRERTA RALIELLRAR GLPGSSVAGA HLDQATAPAS GAPSDVGPRL AVGAPPGPGA LPANGAPPAN GASLANGAPP APGSAGGPAG ESSESSSTGN STPASQSVSG ITVSSFDWFA IEAIRDAGLG VATAFLTMPR MSVSGGVAYA RSAGHTELHA HVSAVLGVAD AVPRARRAGL RLVTWTVTDP ATAIELRDAG VDGVICDDPV GVGQALRRP
|
| |
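The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. A minimal sketch of that cleanup, together with a GC-content calculation, follows. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three ten-base blocks of the gene sequence shown in the record.
first_blocks = "ATGGAGGTTC TGGGCCATCG TGGAAGCCGG"
print(len(clean_sequence(first_blocks)))   # 30
print(round(gc_percent(first_blocks), 1))  # 63.3
```

Passing the complete gene sequence to `gc_percent` gives a value that can be compared against the GC content field of the record.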