Gene Information |
Locus tag | Franean1_4265 |
Symbol | |
ID | 5672620 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5092216 |
End bp | 5093166 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243138 |
Product | hypothetical protein |
Protein accession | YP_001508555 |
Protein GI | 158316047 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0631] Serine/threonine protein phosphatase |
TIGRFAM ID | |
|
|
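The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length counts the stop codon, so the protein is one residue shorter than the codon count. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5092216
END_BP = 5093166
GENE_LENGTH_BP = 951
PROTEIN_LENGTH_AA = 316


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5093166 - 5092216 + 1 gives 951 bp, and 951 / 3 - 1 gives 316 aa, which agrees with the listed fields.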
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00390671 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.266903 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCGC TGCCCGCCAC GGCGACGTCC GCGCGCCCGG CGACCGCCTC GCCGGCACCG TCGCAGCCGG CACTGTCGCA GCCGGCACTG TCGCAGCCGG CACTGTCGCA GCCGGCACTG TCGCAGCCGG CACTGTCGCA GCCGGCACTG TCGCAGCCGG CACTGTCGCA GCCGGCACTG TCGCAGCCGG CACTGTCGCA GCCGGCACCG CCGCCCGCGC CGAAACCCAC GCCCATGCCC TCGGGAGCGG CCGTTGCTCT GGCGGCCACG GCCCACTCGG TGGCGAGGCT CGGCCGGTCG CCGGCGGAGA ACGAGGACAG CTGCGCGATC CGGCCGGAGC TCGGCCGGTT CGCCGTGGCC GACGGGGCGT CGACCTCGGC CCGACCGGAG GTGTGGAGCC GGCTGCTGGT CGACGCCTAC GCCCACGACG GGCTCGACCC GCTCGCTCCC GACGTCCTGC GTGCGCTGCG CGAGCGCTGG TGGGCGCAGG TGAGCCGGCC GGGGCTGCCC TGGTTCGCGC GGGCGAAGCT GCAGTCGGGC GCAGACGCGT CTTTCCTCGG GCTTTCCGTC GACGTCGCGA ACCAGAGTTG GACAGCGACG TGCGTCGGTG ACTCGTGCGT GTTCCACCTA CGCGACGGGG AGACGCGCTC GGTGGGCCCG GTCGGGCGGT CCAGCGACTT CACCCGGTTC GCGGAGCTGG TCGGCAGCCA CGGCCCGTCC GCCCCGGAGC CGACCCTGCT GACCGGTGAG CTGCGGCCTG GCGACGTGCT CGTCCTCGCC ACCGACGCGC TCGCCCGGCT CCTGCTGCAC GCGGCCGAGA CCCGCGGCCG GATGCCGTCG CCCGGCTGGC TCGCGCAGAC CGCCGGCCGG TTCTCCCGAG CCGTCGCCGC CTACCGGCAC CACGGGTACC TCGCCGACGA CGACACGACG ATCTGCGTGG TGCAAGCGTG A
|
Protein sequence | MTPLPATATS ARPATASPAP SQPALSQPAL SQPALSQPAL SQPALSQPAL SQPALSQPAL SQPALSQPAP PPAPKPTPMP SGAAVALAAT AHSVARLGRS PAENEDSCAI RPELGRFAVA DGASTSARPE VWSRLLVDAY AHDGLDPLAP DVLRALRERW WAQVSRPGLP WFARAKLQSG ADASFLGLSV DVANQSWTAT CVGDSCVFHL RDGETRSVGP VGRSSDFTRF AELVGSHGPS APEPTLLTGE LRPGDVLVLA TDALARLLLH AAETRGRMPS PGWLAQTAGR FSRAVAAYRH HGYLADDDTT ICVVQA
|
| |
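The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, assuming the spacing is purely cosmetic, the helpers below strip that whitespace and compute a GC fraction. The function names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result describes those 20 bases and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence in this record.
    prefix = clean_sequence("ATGACCCCGC TGCCCGCCAC")
    print(len(prefix), gc_fraction(prefix))
```

Applying the same two calls to the full Gene sequence field would give a value to compare against the listed GC content.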