Gene Information |
Locus tag | Franean1_1781 |
Symbol | |
ID | 5670183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2140258 |
End bp | 2141871 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240702 |
Product | leucyl aminopeptidase |
Protein accession | YP_001506125 |
Protein GI | 158313617 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
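The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the database record: it assumes Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in Protein Length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 2140258
end_bp = 2141871


def gene_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1614 537
```

Under these assumptions the computed values agree with the Gene Length (1614 bp) and Protein Length (537 aa) fields of the record.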
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.228538 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.195294 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCA CCTCTGCCGC CTCCGCAGCC CTCACCGACC TCGACGTCGA CGCCGTCGTG ATCGGAATGG CCAAGGGCGA CGACGGTCCG GTGCCCCTCG GCGGGACCGG CGCCCTCGAC GCGGCGCTCG GCGGCCGGCT GGCCCGGATC CTCGCCGACC TCGGCGCGAC CGGAGCCACC GGCGACATCG TCCGCTTCGC CACGCTCGGG ACGCTGAAGG CCCCGACCGT GCTCGCGGTC GGCGTCGGGG AGCACGCCGC GTTCACCGCC GGTACCGCCG GAGCGGGTGA GGAAGCGGGC GGTGCCAGCA CCAGCACCAG CGCCGACACC GACACCGACA CCGAGACGCT GCGCCGCGCG GCCGGCGCCG CCGTCCGCGC GCTCGCCGGC ACCGCCCGGG TCGCCTCGAC ACTCGCGCTC GCCGCCGGCC GGCCGACCGC GTCCACGCTG CGCGCGGCGG CCGAGGGCTC GCTGCTGGGC GCCTACGCGT TCGACAGCCT GCGCACCTCA CCGAACGGGC CGGCGCCCGT GCGCGAGCTG GTCCTGACGG TCGAGGACCC GTCCGACCTG GCGGCGGCGG AGGCCGCCCT GGAGCGCGCG ACGATCATCG GTGAGGCCGT CACGCTGGTG CGGGACCTGG TGAACACTCC GCCGAGCCAC CTCTCGCCGG CCCGGCTGGC CGAGATCGCG GTGGAGCGTG GCGGCGCGGC CGGCGTCGAG GTGACCGTTC TCGACGAGGA GGCGCTCGCC GAGGGCGGTT ACGGCGGCCT GCTCGGTGTC GGGCAGGGCT CGGCCAACCC GCCGCGACTG ATCCGGCTGC GCTACCAGGG CGCCGCCGGC ACCAGGGCAG CCGAGGGCGA CCCCGCCTGC ACCGACCTGG CCCTGGTCGG CAAGGGCATC ACGTTCGACT CCGGCGGCCT GTCACTGAAG CCGCCGGTGT CCATGGAGTG GATGAAGAGC GACATGGCCG GCGCGGCCTC CGTGCTCGCG GCCGTCGTCG CGGCCGCACG GCTGGGCCTG GCGCTCAACG TGACCGGCTG GATGCCGTGC GCGGAGAACA TGCCCTCCGG CGACGCCATC CGGCCGTCCG ACGTGATCAC GCTGCGCGGC GGCAAGCGGG TCGAGGTGCT CAACACCGAC GCCGAGGGGC GGCTCGTGCT CGGCGACGCG CTGGCCCGCG CGTCCGAGGA CACTCCGGCG CTCGTCGTTG ACGTCGCCAC CCTCACCGGC GCGCAGATCG TCGCGCTGGG CACCCGCACG AGCGGGGTGA TGGGCCGCTC CGAGGCGACC GACGCGGTGG TCGCCGCCGC CGGGCGCGCC GGCGAGACGG TCTGGCCGAT GCCGATGCCG CCGGAGCTGC GCAAGAGCCT CGACTCGGTG GTCGCCGACC TGGCGAACGT GGCCCCGGGC GGGAACCGGG ACGCGGGCAT GCTGCTCGCC GCCCATTTCC TCGCCGCTTT CGTGCCCGAG GAGATCCCCT GGGCGCACGT CGACATCGCC GGCCCGTCCT GGAACGGCGG CGAGCCCTAC GGCTACACGC CCAGAGGCGG CACCGGGGCC ATCGTGCGCA CCCTCGTCCA GCTCGCCGAG GACAGGGCCG CCACCACCCC GTGA
Protein sequence | MTVTSAASAA LTDLDVDAVV IGMAKGDDGP VPLGGTGALD AALGGRLARI LADLGATGAT GDIVRFATLG TLKAPTVLAV GVGEHAAFTA GTAGAGEEAG GASTSTSADT DTDTETLRRA AGAAVRALAG TARVASTLAL AAGRPTASTL RAAAEGSLLG AYAFDSLRTS PNGPAPVREL VLTVEDPSDL AAAEAALERA TIIGEAVTLV RDLVNTPPSH LSPARLAEIA VERGGAAGVE VTVLDEEALA EGGYGGLLGV GQGSANPPRL IRLRYQGAAG TRAAEGDPAC TDLALVGKGI TFDSGGLSLK PPVSMEWMKS DMAGAASVLA AVVAAARLGL ALNVTGWMPC AENMPSGDAI RPSDVITLRG GKRVEVLNTD AEGRLVLGDA LARASEDTPA LVVDVATLTG AQIVALGTRT SGVMGRSEAT DAVVAAAGRA GETVWPMPMP PELRKSLDSV VADLANVAPG GNRDAGMLLA AHFLAAFVPE EIPWAHVDIA GPSWNGGEPY GYTPRGGTGA IVRTLVQLAE DRAATTP