Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2045 |
Symbol | |
ID | 5670446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2462719 |
End bp | 2463780 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240967 |
Product | peptidase M4 thermolysin |
Protein accession | YP_001506388 |
Protein GI | 158313880 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.912259 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGTC ACCGACGCTG CGCGCCCTGT GCCGTGCCGC CGCACATCCT CGAGCGCATC GCCCGGAACG GGAACGACGA GCAGCGCGCC CGCGCGCTGT CCACCATGGC CCAGGACTCC TCGCACCGCA GCGTGCGGGT GCACAACGCC CTGATGCGCT CCGTACAGCG CCCGGCGACC CCGCCGCGGC TGGCCCCGAC CAGCGGCCCG CGCCGCACGA TCGGCGACGC GAAGGAGTCC GAGGACCTGC CGGGTACGAC CGTGCGCGCC GAGGGCGACG CCCCGGTGGC CGACGTCGCC GTGAACGAGG CGTACGACGG CCTCGGCGCC ACGTTCGCGT TCTTCCTCGA CGCCTACGGC CGGGACTCCG TCGACGACGA CGGGATGGGC CTGCTCGCGA CCGTGCACTA CGGCGATCAT TACGAAAATG CCTTCTGGAA CGGCCGGCAG ATGGTCTTCG GCGACGGTGA CGGCGAGCTG TTCAACCGGT TCACGGTCTC GCTGGACATC ATCGGCCACG AGCTCGCGCA CGGCGTCACC GAGGACGAGG CGCAGCTGAT GTACCTGAAC CAGTCGGGCG CGCTGAACGA GTCGCTGAGC GACGTCTTCG GCTCGCTGGT GAAGCAGCAC CTGCGCGGCC AGACCGCCGA GGATGCCGAC TGGCTGATCG GCGAGGGCCT GCTCACCGAC GCCGTGCAGG GCGTGGCGCT GCGGTCGATG AAGGAGCCCG GCACGGCCTA CGACGACCCG GTGCTCGGCG ACGACATTCA GCCCGCGCAC ATGGACGGCT ACGTCCGCAC GACGACCGAC AACGGCGGGG TGCACATCAA CTCGGGCATC CCGAACAAGG CTTTCTACAC GCTCGCGGTG GCCCTCGGCG GCCACGCCTG GGAGCGCGCG GGCCGGATCT GGTACGAGGC GCTGCGCGCG CCCCAGCTCC GGCCGAACGC GACGTTCCGG TCGTTCGCCG CCGCCACCTC GCGCCAGGCC CAGGTGCTGT TCGGGCCCGA GGAGACCCAG GCCGTGCGGG ACGCCTGGGC CTCGGTCGGC GTTCCGGTCT GA
|
Protein sequence | MTGHRRCAPC AVPPHILERI ARNGNDEQRA RALSTMAQDS SHRSVRVHNA LMRSVQRPAT PPRLAPTSGP RRTIGDAKES EDLPGTTVRA EGDAPVADVA VNEAYDGLGA TFAFFLDAYG RDSVDDDGMG LLATVHYGDH YENAFWNGRQ MVFGDGDGEL FNRFTVSLDI IGHELAHGVT EDEAQLMYLN QSGALNESLS DVFGSLVKQH LRGQTAEDAD WLIGEGLLTD AVQGVALRSM KEPGTAYDDP VLGDDIQPAH MDGYVRTTTD NGGVHINSGI PNKAFYTLAV ALGGHAWERA GRIWYEALRA PQLRPNATFR SFAAATSRQA QVLFGPEETQ AVRDAWASVG VPV
|
| |