Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6755 |
Symbol | |
ID | 5675068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8215996 |
End bp | 8218743 |
Gene Length | 2748 bp |
Protein Length | 915 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245604 |
Product | lantibiotic dehydratase domain-containing protein |
Protein accession | YP_001510995 |
Protein GI | 158318487 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCAAG CTGCCCACGC GGCGTTGATC AGGGCTGCTT CCTACCCCCG AGATCTGACG CTGCCCACGT GGCCCGATCT GACCGCGAAC CAGGCCGACG AGTGGCTGGA GTGGCTGCGG GAGGTATGGG CGTTGCCGGA GTTCGCGGCG GCTGTCGGGC AGGCAGCCCC TGACCTGGCC GATCAGATCA CCCACGTGCT TGCCCAGGAG TCGATGCCAG CGCGCAGGGT GCGGCGTCTG GTGGAGACGA CCGTGCGGTA TCTGCTGCGG TGGACGACCC GGCCCACGCC GTTCGGCAGG TTCGCCGGGG TGGCTCCCCT CGCGTTCGGC CCCCGCGCGG CGGTCTGGTG GGGCGACCAG CATCACGAGG TGGTCCGACT GGACGACCGC TCCGTCGCCG AATACACGGC GGTGACGGAG CGGGACCTGG CGGTGCTGCG CGGGGTCACG GTCATGACGA ACACGCTGGG GTATCGGCGT GGCGGGGTGT GGGTGCTGCC CTGCGCCCGC GTCGAAGGTG ACCGGGTGTG GGATGTCGAG ATCAACCTGA CTGCCCCGGT GCTGGTGGCG GTGGAGAAGG CCCGTGCCCC GATCCCGTTC CGGGAGCTGG CCGCGACGGT CGCCGAGGAC CAGGCCATCG GAACCGCGAA GGCGGAGCGA CTACTCGGCG CCCTGGTGGG CGCGGGGGTG TTGCTGTCGG CGGTCCGGCC GCCGATGACC GTGACCGACC CGGCCGCGCA CCTGGCCCGC CACATCGCTC TCCCCAACCC AGGCGAACGG AGCGCGGTCG ATCTGCGGGT TGACTGCTCG GTGACGCTGC CGCCCGCGGT GGTCCGCGAG GCGCAGGAAG CCGCCGCGGC GCTCGTCGCA GTTGCGCCAC GCCTGCCTGG CTGGGCCGCC TATCATTCCG CGTTCAGTGA GCGGTGGGGG CCGGGCGCGG CTGTGCCGCT GCGGGAGGTC GTGGGCATTC TCGGGTTCCC GGCCGGTTAC CGGGGCTCGC TGCGCCGTGA TGCGGCGACG TTCACTGCCC GGGATGCTCT GCTCGCGACG CTCGCCCAGC GCTCTGCCCT GGACGGATGC GCCGAAGTCC TCCTGGACGA CGATCTAATC GGGCAGCTTC GCAGCGAGGA CGACCGGCCG CCGATCCCGC ACACCGAACT GCGGTTCACT CTCGCCGCAG GAACGCTTCA GGACCTCGAC CGCGGCGCGT TCACCCTGAC GGTCGTCAGT GGAGCCCGCC ACGCCGGCGT GGCAGGTGGC CGCTTCCTGC ACCTGCTCAC CCCCACCGAG CTGGACCAGT TCCGCAGCAT CTACACCAGC CTGCCGACTG CCTTACCCGG CGCGGACGCC GTACAGCTGT CCGGGCCTCC GCTCGACCCC AGGCTGGCCA CCGTCGCCCG CACGCCCGAG CTCCTACCGG TGCTGCCCGT CGGCGACCTC CATGCCGACC CGGTGTGCAC GGTGGACGAC CTGGCGGTGG CCGCCGATGG GCAGCGGCTC TGGTTGGTGT CGCGCCTAAC TGGTCGACCG GTCGAGCCGC TGCTGTTCAA CTGCGTGCTC CTGGCTACCC ACCAGCAGCC ACTCGTTCGG TTCCTCACCG AGATCTGGAC GGCCTGGACG GCGCCGTGCG CCCGGTTCGA CTGGGGACAC GCCCGCACAT TGCCGTTCCT CCCGCGGGTC CGCCGGGGCC GTTCGATCCT GCACCCGGCC CGCTGGACCA TCCCCGCCGA GGCGTTACCC GCCCGCACCG CGACGTGGCC GCAGTGGCGG GCCGCCTGGC ACCAGCACCA CGAACGCCGC CAGCTGCCAC AGGAGGTGCT GATCGGCGGC GACGACGTAC GGCTACGCCT CGACCTGGAC GAGAACGCCC ATCTCGCGGT CCTGCGTAGC CACCTCGACC GGCACGGACG CGCTGTCCTC ACCGAGACGG ACGGGCCCTC AGGGTGGATC GATGGCAGAC CCGCCGAACT CCTGCTCACC CTCACCCGCA CCCCACCAGC CCACCGCCTG GCCGCCCGCC GGGCCCGTCC TGTCAGCGTT CCTGCCCACC GGCCCGGCCG GTCGCGCTGG TTGGACGCCC GCCTGGTCGG ACAAGCCGAT CACGTCCTCG CCCGCCTGTC CGAGTTTCCT GGTCTGCCCG CAGGTTGGTG GTTCCTGCGC TACCCGCACC CCGAACCCCA CCTGCGGCTG CGCATCCCGC TGCGGGGCAT TGCCCAGTTC GCCGACGTCG CCCGCGGCCT CGCCGGCTGG GCGGAGCAGC TGCACGACGA CGGACTGCTG GCCGACTACA CCCTGGCCAC CTACCGGCCA GAGACCCGCT GGGGCTGCGG GCAGACCCTC GCCGCGGCCG AGGCAGTGTT CGCCGCCGAC TCCCGCGCCG CCCTCACCTG GACGTCCGGC GACCGCCAGG CTGGCACCGC AGCAGGGATG ATCGCTATCG CCGGCGGGTT CACCGATGAC GGAGCGCGTT GGCTCGTCGA GCACGCACCC CACGGCGGCG GGTCACGCCT GGAGCCCGCC CAGATCGCCG GGGCCCGCCT GACCTACGGA GACGAGGCTC TGACCGCGAC GCTGGCCACC TACCGGACCC TCGCCGCCCG AGACGGTCTC GACCTGGACC AGGTGCTGGC CGACCTGCTG CACCTGCACC ACGCCCGCAT GATCGGCCCC GATCTGGCGT CCGAACGGCA CTGTCTGCGC CTGGCCCGCG CCCTCGCCCA GACCACCCTG GCCAGGAGGC CATCGTGA
|
Protein sequence | MYQAAHAALI RAASYPRDLT LPTWPDLTAN QADEWLEWLR EVWALPEFAA AVGQAAPDLA DQITHVLAQE SMPARRVRRL VETTVRYLLR WTTRPTPFGR FAGVAPLAFG PRAAVWWGDQ HHEVVRLDDR SVAEYTAVTE RDLAVLRGVT VMTNTLGYRR GGVWVLPCAR VEGDRVWDVE INLTAPVLVA VEKARAPIPF RELAATVAED QAIGTAKAER LLGALVGAGV LLSAVRPPMT VTDPAAHLAR HIALPNPGER SAVDLRVDCS VTLPPAVVRE AQEAAAALVA VAPRLPGWAA YHSAFSERWG PGAAVPLREV VGILGFPAGY RGSLRRDAAT FTARDALLAT LAQRSALDGC AEVLLDDDLI GQLRSEDDRP PIPHTELRFT LAAGTLQDLD RGAFTLTVVS GARHAGVAGG RFLHLLTPTE LDQFRSIYTS LPTALPGADA VQLSGPPLDP RLATVARTPE LLPVLPVGDL HADPVCTVDD LAVAADGQRL WLVSRLTGRP VEPLLFNCVL LATHQQPLVR FLTEIWTAWT APCARFDWGH ARTLPFLPRV RRGRSILHPA RWTIPAEALP ARTATWPQWR AAWHQHHERR QLPQEVLIGG DDVRLRLDLD ENAHLAVLRS HLDRHGRAVL TETDGPSGWI DGRPAELLLT LTRTPPAHRL AARRARPVSV PAHRPGRSRW LDARLVGQAD HVLARLSEFP GLPAGWWFLR YPHPEPHLRL RIPLRGIAQF ADVARGLAGW AEQLHDDGLL ADYTLATYRP ETRWGCGQTL AAAEAVFAAD SRAALTWTSG DRQAGTAAGM IAIAGGFTDD GARWLVEHAP HGGGSRLEPA QIAGARLTYG DEALTATLAT YRTLAARDGL DLDQVLADLL HLHHARMIGP DLASERHCLR LARALAQTTL ARRPS
|
| |