Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0289 |
Symbol | |
ID | 5668713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 339741 |
End bp | 341312 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641239219 |
Product | peptidase U62 modulator of DNA gyrase |
Protein accession | YP_001504661 |
Protein GI | 158312153 |
COG category | [R] General function prediction only |
COG ID | [COG0312] Predicted Zn-dependent proteases and their inactivated homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACCT CCGTTCCCCG GGATCCGGCC GAGCCGGCCC TGCCGCCGCA CGAGATCGAC GAGGAGTTCC GCGCGCTGCC CACCGCGGCC CTCACCGACG CCGCGCTCCA GGCCGCCCGA GACCTGGGTG CCGCCCACGC CGACATCCGG ATCGAACGTC TCAAGGAGTC GTCGCTGTCG TTCCGCGACG CCGGCCTGGA GAGCCGTTCG GACGGTGTCA CCGCGGGCTT CGCCGTCCGC GTCGTCCACG ACGGGACCTG GGGCTTCGCC GCCGGTGTGG ATCTGACCGT GGATGAGGCC GTGCGGGTCG CCCGGGAGGC CGTCGCCATG GCGAAGGTGG CGCGGCCGCT GAACTCCGAG CCGGTCGAGC TGGCCGACGA GCCCGTTCAC GCCGGCGCGA CCTGGGTCTC CAGCTACGCC ACGGACCCGT TCTCGGTCGA TCCGCGCGAC CAGGTCGAGC GGATCGGCGG GCTGTGCCGG TCGCTGTACT CCGCCGAGGA CGTCGATCAC GTCGACGGAC GGTTCGCCGC CGTCATGGAG AACAAGTTCT ACGCCGACAC CGCCGGGACG GACACCACCC AGCAGCGGGT CCGCGTGCTC TGCCAGCTCG AGGCGACCCG GGTCGACCCG GCGGGTGGCT TCGAGTCCAT GCGCACCATC GCGCCTCCGG CCGGCCGCGG CTGGGAGTGG ATGGTCGGCG GCTCGGCGGC CGGCTGCTGG GACTGGGAGT CCGAGACCGA GCTCATCCCC TCGCTGCTGG CCGAGAAGGC GAAGGCGTCG TCGGTGGAAC CGGGGCGCTA CGACCTGGTT ATCGACCCGT CGAACCTGTG GCTGACGATC CACGAGTCGG TCGGGCACGC CACCGAGCTC GACCGGGCTC TCGGCTACGA GGCCGCCTAC GCCGGGACGT CCTTCGCCAC GCTCGACAAG CTCGGGTCGC TGCGCTACGG ATCACCGGCG ATGACCGTGA CCGGCGACCG GACGGCACCG CACGGCCTGG CCACCATCGG CTACGACGAC GAGGGCGTGC AGACCCGCCG CTGGGACATC GTCCGCGACG GTGTCCTGGT CGGCTACCAG CTCGACCGGC GGATGGCGGC GCAGAACGCG TCCACGCTCG GCGTGGACCG CTCCAACGGC TGCGCCTTCG CGGACTCCCC CGGGCATGTA CCGATCCAGC GGATGGCGAA CGTCTCACTG CTCCCCGCGC CCGGCGGGCC GTCCACGGAG GACCTGATCG GCCGGGTCGA CCGGGGGATC TACGTGGTCG GCGACCGGAG CTGGTCGATC GACATGCAGC GCTACAACTT CCAGTTCACC GGGCAGCGGT TCTACGAGAT CCGCAAGGGC CGGATCGTCG GCCAGCTGCG CGACGTCGCC TACCAGGCGA CCACCACGGA CTTCTGGGGC TCGCTGGACG CCGTCGGCGG CCCCGAGACC TACGTGCTGG GCGGGGCGTT CAACTGCGGC AAGGGCCAGC CCGGCCAGGT CGCGGCGGTC AGCCACGGCT GCCCGTCGGC GCTGTTCCGC GACGTCAACA TCCTGAACAC GCGCCGCGAG GGCGGCCGAT GA
|
Protein sequence | MATSVPRDPA EPALPPHEID EEFRALPTAA LTDAALQAAR DLGAAHADIR IERLKESSLS FRDAGLESRS DGVTAGFAVR VVHDGTWGFA AGVDLTVDEA VRVAREAVAM AKVARPLNSE PVELADEPVH AGATWVSSYA TDPFSVDPRD QVERIGGLCR SLYSAEDVDH VDGRFAAVME NKFYADTAGT DTTQQRVRVL CQLEATRVDP AGGFESMRTI APPAGRGWEW MVGGSAAGCW DWESETELIP SLLAEKAKAS SVEPGRYDLV IDPSNLWLTI HESVGHATEL DRALGYEAAY AGTSFATLDK LGSLRYGSPA MTVTGDRTAP HGLATIGYDD EGVQTRRWDI VRDGVLVGYQ LDRRMAAQNA STLGVDRSNG CAFADSPGHV PIQRMANVSL LPAPGGPSTE DLIGRVDRGI YVVGDRSWSI DMQRYNFQFT GQRFYEIRKG RIVGQLRDVA YQATTTDFWG SLDAVGGPET YVLGGAFNCG KGQPGQVAAV SHGCPSALFR DVNILNTRRE GGR
|
| |