Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0955 |
Symbol | |
ID | 5669369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1119068 |
End bp | 1119910 |
Gene Length | 843 bp |
Protein Length | 280 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239883 |
Product | histidinol-phosphate phosphatase, putative |
Protein accession | YP_001505317 |
Protein GI | 158312809 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0961586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTCCGC CAAACCCGAG CCCCGTCAGC GCCAACCCGG TCGAGGCGGT CGACCCTGCC GGCTGGCTCG CTGACGACCT CGCACTCGCT CTGAGCCTCG CCGACGCCGC CGACCGGATC ACCCTCTCCC GCTTCCAGGC GGTGGACCTC CACGTCGAGT CGAAGCCCGA CAACACTCCG GTCTCGGACG CGGACACCGC CGTCGAGTCC ATGATCAGGG AACGGCTCGC CACCGCGCGC CCCGGTGACG GAGTCCTCGG CGAGGAGGAG GGCCTCGTCG GCGAGAACAC CCGCCGGCGC TGGATCCTCG ACCCGGTCGA CGGCACCAAG AACTTCGTGC GCGGCGTTCC CGTCTGGGGC ACCCTGCTCG GCCTCGAGGT GGACGGCGAG ATGGTCGTCG GCGTGGCGAG CGCCCCGGCC ATGAGCCGCC GCTGGTGGGC CGCCCGCGGC ACCGGGGCGT TCACCCGGGA CGCCACCGGC GACACCCGTT CACTGCAGGT CTCCTCCGTC GCCCGGCTGT CGGACGCCTT CCTCTCGTTC GCCTCGCTGG AGGGATGGCG CACCGCCGAC CGGCTGCCCC AGTTCCTGAG CCTGGCCGAG CAGATCTGGC GCAGCCGCGC CTACGGCGAC TTCTGGTCGC ACATGATGGT CGCCGAGGGC GCCGTCGACC TCGCCTGCGA GCCGGAGGTC TCACTGTGGG ACCTCGCCGC CCTGCAGGTC ATCGTCGAGG AGGCCGGCGG CCGGTTCACC GACCTGCGCG GGCGGCGCGG GCCCGGCTGG GGCACGATCC TCACGACCAA CGGGCACCTG CACGACGTCG CCCTGAGCCA CTTCCGGGAC TGA
|
Protein sequence | MTPPNPSPVS ANPVEAVDPA GWLADDLALA LSLADAADRI TLSRFQAVDL HVESKPDNTP VSDADTAVES MIRERLATAR PGDGVLGEEE GLVGENTRRR WILDPVDGTK NFVRGVPVWG TLLGLEVDGE MVVGVASAPA MSRRWWAARG TGAFTRDATG DTRSLQVSSV ARLSDAFLSF ASLEGWRTAD RLPQFLSLAE QIWRSRAYGD FWSHMMVAEG AVDLACEPEV SLWDLAALQV IVEEAGGRFT DLRGRRGPGW GTILTTNGHL HDVALSHFRD
|
| |