Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3773 |
Symbol | |
ID | 3906057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4521983 |
End bp | 4522933 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881099 |
Product | histidinol-phosphate phosphatase |
Protein accession | YP_482853 |
Protein GI | 86742453 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0738114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATCGCAA TAGATCGGGA GATTGGTCAC ACTAACCCCG CCACCTGGTG TCCGGCGCCC GGGACCACCG ACGCGCGGTG GGGCTCGGCT CGTAGTCTCG CAGGCGTGAC CCCGCCTAAC CCCGCATCCG TGAGCATCGA ACCGACCGCG GACTCCGACA CCGTCGCCAT CGCCGACGAT CTGGCCCTCG CGCTCAGCCT CGCGGACGCC GCCGACCGGA TCACTCTTTC CCGATTCCAG GCAGTAGACC TGCACGTCGA GTCGAAGCCG GACAACACCC CGGTCTCCGA CGCCGACACC GCCGTCGAGT CCATGATCCG CAAACGGCTC GCCGTGGCCC GGCCGGGCGA CGCCGTGCTC GGCGAGGAGG AGGGTCTCGT CGGATCGGGC GCCCGCCGGC GCTGGATCCT GGACCCCGTT GACGGCACCA AGAACTTCGT GCGCGGCGTG CCCGTCTGGG GCACCCTGCT CGGACTCGAG GTGGACGGCG AGATGGTCGT CGGGGTCGCG AGCGCCCCGG CGATGAGCCG ACGGTGGTGG GGAGCCCGGG GCACCGGCGC CTTCACTCGC GACGCCACCG GCGACACCCG TGCCCTGAAG GTCTCCTCGG TCACCAGCCT GGGTGACGCC TTCCTGTCCT TCGCCTCCGT CGAGGGCTGG CAGACCGCCG ACCGGCTGGA GGAGTTCCTC CACCTCGCCG GCCAGGTCTG GCGGACCCGC GCCTACGGCG ACTTCTGGTC GCACATGATG GTCGCCGAGG GCGCCGTGGA CCTCGCCTGC GAGCCCGAGG TGTCGCTCTG GGACATGGCC GCGCTCCAGG TCATCGTCGA GGAGGCCGGT GGCCGGTTCA CCGACCTGGG CGGGCGCCGG GGGCCGGGAC ACGGCACGGT GCTCACCACC AACGGGCATC TCCACAACGT CGCGCTGCAG TCGTTCAACG GCTCTTCATG A
|
Protein sequence | MIAIDREIGH TNPATWCPAP GTTDARWGSA RSLAGVTPPN PASVSIEPTA DSDTVAIADD LALALSLADA ADRITLSRFQ AVDLHVESKP DNTPVSDADT AVESMIRKRL AVARPGDAVL GEEEGLVGSG ARRRWILDPV DGTKNFVRGV PVWGTLLGLE VDGEMVVGVA SAPAMSRRWW GARGTGAFTR DATGDTRALK VSSVTSLGDA FLSFASVEGW QTADRLEEFL HLAGQVWRTR AYGDFWSHMM VAEGAVDLAC EPEVSLWDMA ALQVIVEEAG GRFTDLGGRR GPGHGTVLTT NGHLHNVALQ SFNGSS
|
| |