Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5201 |
Symbol | |
ID | 5673535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6243695 |
End bp | 6244609 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244055 |
Product | inositol-phosphate phosphatase |
Protein accession | YP_001509465 |
Protein GI | 158316957 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.289429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCG TGGACGCGCC CCCCAGCACC CTCCGCCCGG CCCCGGCCGC CCTGCTCGAC CTCGGCCTGG ACGTCGCCCG CGAGGCCGGG GCCCTGCTCG TCACCGGCCG CGCCGGCACG GTCGCCGCCG AGGCGACGAA ATCCTCGCCG ACCGACGTCG TCACCGCGCT GGACCGGGCG TCGGAGGCCC TCGTCGCCCG CCGCCTGCGC GAAGCCCGCC CGGACGACGG CCTGCTCGGC GAGGAGGGCT CCGACACAGC GGGCACCAGC GGCGTCCGCT GGATCGTCGA CCCCCTCGAC GGGACGGTCA ACTTCCTCTA CCGCCTGCCC AACTGGGCGG TGTCGATCGC GGCCGAGCTG GACGGCGAGA TCGTGGCGGG CGTGGTGCAC GCGCCCGCGA TGGGAGTCAC ATACACCGCT GTCCGCGGCG GCGGCGCCTT CCGCTGGGAG ACACCGGTGG GCACGGATCG CGGGGGCGAC CAGGGCGCTG GCACCGGCGC GGGCGCGCCG CTCGGGCCGA CGGCCGGGGA GCCGACGAAG CTGACCGGTT CGGCGGTGAC CGAGCTGGGC GGCGCACTCG TCGCCACCGG CTTCGGATAC ACCGAGCGCC GCCGGACGAC CCAGGCCGCG GTGCTGACCC GGGTCGTGCC CAGGGTCCGC GACATCCGCC GGATGGGCGC GGCCTCCCTC GACCTCTGCG CCGCCGCGGC GGGCATCGTC GACGCCTACT ACGAACGCGG ACTACACCCC TGGGACCACG CGGCGGGCGC ACTGATCGCC GCCGAGGCGG GCCTGCGGGT CGGCGGCCTG GACGGCCGGG AAGTCAGCGA GGACCTCGTC ATAGCCGCTC CCCCCTCCCT GTTCGCCAAC CTCACCGCCC TGCTGGCCGA ACACCCCCGC GCCGACACCG ACTAG
|
Protein sequence | MSTVDAPPST LRPAPAALLD LGLDVAREAG ALLVTGRAGT VAAEATKSSP TDVVTALDRA SEALVARRLR EARPDDGLLG EEGSDTAGTS GVRWIVDPLD GTVNFLYRLP NWAVSIAAEL DGEIVAGVVH APAMGVTYTA VRGGGAFRWE TPVGTDRGGD QGAGTGAGAP LGPTAGEPTK LTGSAVTELG GALVATGFGY TERRRTTQAA VLTRVVPRVR DIRRMGAASL DLCAAAAGIV DAYYERGLHP WDHAAGALIA AEAGLRVGGL DGREVSEDLV IAAPPSLFAN LTALLAEHPR ADTD
|
| |