Gene Information |
Locus tag | Franean1_5335 |
Symbol | |
ID | 5673669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6424617 |
End bp | 6430769 |
Gene Length | 6153 bp |
Protein Length | 2050 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641244193 |
Product | peptidase C14 caspase catalytic subunit p20 |
Protein accession | YP_001509599 |
Protein GI | 158317091 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0326] Molecular chaperone, HSP90 family |
TIGRFAM ID | |
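The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 6424617, End bp 6430769.
length = gene_length(6424617, 6430769)
print(length)                  # 6153, matching "Gene Length"
print(protein_length(length))  # 2050, matching "Protein Length"
```

The strand ("-") does not change this arithmetic; it only determines which strand of the replicon is read.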
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.103248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGCC GTTTCCGCGC CCTGCTGATC GGGGTGCCTT CCTACCGCGA CCCGGACATT GACAACCTCC TCTTCATCGA AGAGGACCTG GCCGAGCTAT CCGCCACGCT TACCCGCACT GGTTACGAGG TAACGGTCCA TAACGTGGCC CATACCGACT TCGCAAGTAT CGACACTGCG ATAGAATTCT TCATTGGCGA CGCCGAATCG GCGGATACGC TACTTATATT CCTCAGCGGA CATGGCATCC ATCACGCCAA CATGGACTAT CTTGTCCCGT CCGGGGCGAT GATGCGATCT AGCCCTTTCT ACACCCGTTG CGTCCCGATC GATGTCGGCC GCTACGTCGA GGCCAGCCAT GCCGGAAATG TCGTATTGAT GGTCGATGCC TGCCGGGAAG GAATCCACTT CAACGAGAAA TCCGGGCCAT CGACGCTGGC GTGGTCGGAC CGCGAGGTTA CACGGACGGC ATCTCGACAC TTCGCCTACG TCTACGCGTG CTCCCCTGGC GAAAAAGCCC GCTTTGCGAC AGTTGGCAAC TCGTCATTCA GCTTCTTTTC GCGGGCGGTC AGCACCGTCG CCGACGACGA CGCCGGTCCC GCCACCCTTG CCGACATCGA GCTGGCCGTC CAAGCGGAGA TAGACACCCT CACTAGCGCC CACGGGTATC CGCGGCAGCA GGTCCGGATT CTGACCGAAA CCGGCAAAGA TTTCATCCTG CTACCGAGAC CGAGGAGAAG ACGGACCGGA GCGGCCGGCG AACATAGCTG GGTTACCGCG GCCCGGACCC ATACGGCCTG GAACCTTGCT GATGGCCGAC CTGGCCAGGA ACTGCTGAGG GAGGCGACCA CCACCCTCGT CGCATACCTG GCACGGTGCT GGGAGCGGGA CAACCGGCTG GTTCAGAGCG ACCCGTGGCA CGCCCCCGGC TGGGCCGAAC GGATGAACGG CAAGGTCCGC TGGCTGCTAT CTACGCTGAA TCCGGAGAAG TTGAGCCTGT CGCCAGCAGA GGCGGCCCTG CTCACAGCTG TCCCGTTCCT GCACACCGCC TACTGGACCC GGCTGGCCGC CTCTGCCTAC CCCCAGGTCA ACCCGGCGGA TCTAACAATA CGCGGCACTT CGGACGACCG AGCCGCTTTC GAACGGTTCA CCGCTGGTTA TGGACGTCTC CTCCGCCGGG CGCAGCGCAT TGACCTGCCC GACGGGCAGC GGCCGGCGGC CGGTATTGGA TGGTGGCTGT TCCATCGCTG GCTCATCCGC CGGCCGGCGT CCTATCAGCT CGACCTGCTC GTCGATCTGC TGCTTCCGTC CACGCCCCTG CCAGAGGACG GCGACAGCAG ACTGGTACCC GAGGTTTTCG CCGTCGACCG ACTACTTGAA TTGCTTCGGA CGCTGCGCAC CGACACGGCC TCACTTGCCC AAGGTGACCG GGACACGGCA CCACGGCCGA TTCGGCCGGT CGCGAACTCC AGCGAGATGG AGCAGAGCAT CCGCGAACAA TTGGTGGCGT TCCTGCTCTC CGCCGCGGAA CGGTTCACCC TTGACCCGGT CGGGATGTCA GAAGTGGTGG TCGACCATCT CGGCATCAGC GGCGGAGTTA GCCTGCCACA GCTTCACGAG ACGCTCGCAG AGGCCGCCTG GGAGGCGCGT GGACGTACAC GGGTACTCGC CGCCCGCTGT CACCATCCCG CCGTCGATCT GGCACTGCGC GAGCAGGCGA GTGCCGTGGA TAGTCTGCTA CGCCATATCG ATACCGCCGC CGCGGACGGC GGGCTGTTGG CTCCGTTGGC GGACCTACCG 
ACCCGGGCGA CCGCTGACCG GGTGGTGGCC GCGACCGGAG CCGACGGTCA GCGGGCCTAC GACTCGACTG GGTTTCGGTT CCGACTCGCC GACGACCGCA TCCAGGAATT ACTCATGGGT GAGCAACTCT ATGGCGATCC GGCGTTGGCG GTCCGTGAAT TGTATCAGAA TGCGCTCGAT GCCTGTCGCT ATCGTCAGGC GAGGATCGAG TACCTTGAAG TCGCCGGACG ACACCCTACG CCCTGGATCG GAAGTATCCG GTTTACGCAG GGCGTAATGG ACGGTCGACC TTTCTTGGAC TGCATCGATA ACGGCGTTGG TATGGGTCTA CGGGAACTCG TCGAGGTATT CTCCCACGCT GGTATGCGTT TCGCCGACCT ACCCGAATAC GTCGAGGAGC GGGCCACTTG GAAACAGGAG GGCATCGAGC TTTATCCAAA CAGCCGGTTC GGTATCGGCG TGTTGAGCTA TTTCATGCTT GCCGACGAGA TTACCGTGAC AACCTGCAGG CTCGACCGTT CCGGCCGTCC AGGGACCGTG TTCGAGGTGC ACATCGCCGG GCCGGGCGCG CTATTTCGCG TGCACGACCG CGGCCCCGGA GAGGAAGCCG GCACTACAGT CCGGCTGCAT CTGCGTCCCA CCGACACACC GCTTTCCTGC GTCGATCTGC TTCGCCGGAT CCTGTGGATC TCCGAATTCG CGGTGGAGGC CATCGACGCC ACTGGTCGAC AGAGCTGGGC CGCTGGGCAG TTGTCGTCAG TCGCGCCGAT TGGCGCGGAG GACCCGCTAG CCGACGACGC CCGACGCACC GCCTGCCGGA TCGATGCCAC CTCCGCTCCC ACGGTCTGGT GGTGCGACAC GAACGGTGGT GTGGTCGCCG ACGGAGTGTG GGTCGGCACC GACCTGTTCG GGGCGGTGGT GAACCTGACC GGCCCCTGCT ATCCCCGTCT GACGGTGGAC CGCCGACGGA TCCTCGCTCA CGACGACGCC GAGGTGACCC GACTCCAATA CCGGGAGATC CCGGTGTTGC TTGCCGAGGA TGCGACCGTG TTGACGTTCG ACTGGCTCTG CGCGCTGGCC GGACACCATC CAGATCTTGC CGACGAAATC TGCACGCAGG CGGCAGCACA TCACCGGCGG TGGAAGGTGG CCGGGGATAC CGCGGATATT GCTGTAGTGG GATGTTTTCC GGCAGATGGC ATGCTTTTTC GACGGCGGGA CGGTTCACCA CTACGCGCAG TACAGCCGGT GCTGCCGGAG GTGATTCTTC GATGGCGGTT GCACGCGTGG ATCGAAGCCG GTCTTGTTAA CGGGCTAACC GTCGCAGCGC CGGCTGGCGA ACGTCTTCTG TCTCTTCCGA CCGATGACGT GGTCCTCGCG GCCAGGGCAG TCGTCGAAAC TGAAACGTCG AACCTGGCCG ATTTTCGTGT GATCCCGGCG CCTCGCCTCG GCTCGGATCG GCCGGTGCCG GTGGGGCATC TGGCTGCGGC TGCGGTGAGG ACGGGTCGCA GTGTCGGCCA GGTGGCTGCC CGGCTAGCTG CTCTGGGGTT CGCCGTCCCG GACCTGGCTA CCCTGCCCGA CCAGCTCGAC CGCGACGACC GCACCCTCCT CAGCCGTGGT CTCGACGGCG AGGAACCGTG GCTCGGCTCG GATCGGCCGG TGCCGGTGGG GCATCTGGCT GCGGCTGCGG TGAGGACGGG TCGCAGTGTC GGCCAGGTGG CTGCCCGGCT AGCTGCTCTG GGGTTCGCCG TCCCGGACCT GGCTACCCTG CCCGACCAGC TCGACCGCGA CGACCGCACC CTCCTCAGCC 
GTGGTCTCGA CGGCGAGGAA CCGTGGCTCG GCTCGGATCG GCCGGTGCCG GTGGGGCATC TGGCTGCGGC TGCGGTGAGG ACGGGTCGCA GTGTCGGCCA GGTGGCTGCC CGGCTAGCTG CTCTGGGGTT CGCCGTCCCG GACCCGGCTA CCCTGCCCGA CCAGCTCGAC CGCGACGACC GCACCCTCCT CAGCCGTGGT CTGAATAGCT GGATGCCGTG GCTCGATTCC GACCGGCCAG TGCCGGTAGG ACATCTCGCG CGGGCTGCGG CGAGGACGAG CCATAGCCCT CGCCAGGTGG CTGCCCGGCT AGCTGCCTTT GGGTTCGCCG TCCCGGACCC GGCTACCCTG CCCGACCAGC TCGACCGCGG CGACCGCACC CTCCTCAGCC GTGGTCTCGA CGGCGAGGAA CCGTGGCTCG GCTCGGATCG GCCGGTGCCG GTGGGGCATC TGGCTGCGGC TGCGGTGAGG ACGGGTCGCA GTGTCGGCCA GGTGGCTGCC CGGCTAGCTG CTCTGGGGTT CGCCGTCCCG GACCTGGCTA CCCTGCCCGA CCAGCTCGAC CGCGACGACC GCACCCTCCT CAGCTGCAAT CTTGACGGCG AGTGGCCATG GCTGGACGAT CTCGACGGCC GTTCGCTGTG GCTCGATTCG GATCGGCCAG TCCCGGCGGT GCATCTCGCG CGGGCTGCCG CGAAGACGGG TCGCAGCCCC TTCCAGGTGG CTGCCCGGCT AGCTGCTCTG GGGTTCGCCG TCCCGGACCC GGCTACCCTG CCCGACCAGC TCGACCGCGA CGATCTCATC CTCCTCAGCC GTGGTCTCGA CGGCCGTTCG CCGTGGCTCG ATTCGGATCG GCCAGTCCCG GCGGTGCATC TCGCGCGGGC TGCCGCGAAG ACGGGTCGCA GCCCCTTCCA GGTGGCTGCC CGGCTAGCTG CTCTGGGGCT CACCGTCCCG GACCCGGCTA CCCTGCCCGA CCAGCTCGAC CGCGACGATC TTCACCTTCT CGGCGACGAA CGCTACGGCT GGACGTCGTG GCTCGGCTCG GATCGGCCGG TGCCGGTGGG GCATCTGGCT GCGGCTGCGG TGAGGACGGG TCGCAGTGTC GGCCAGGTGG CTGCCCGGCT AGCTGCTCTG GGGTTCACCG TCGCGGACCC GGCTACCCTG CCCGACCAGC TCGACCGCGG CGATCTCATC CTCCTTAGCC GTGGTCTCGA CGGCCGGTCG CCGTGGCTCG GCTCGGATCG ACCAGTGCCA GTGGGGCATC TGGCTGCGGC TGCGGTGAGT ACGGGTCGCA GCGTCGACCA GGTGGCTACC CGGCTAGCTG CCTTGGGGTT CGCCGTCGCG GACCTGGCTA CTGTGCCTGA CCAGCTCGAC CGCGACGATC TCATCCTCCT CAGCGGCGGT CTCGACGGCC GGTCGCCGTG GCTCGACTCG GATCGACCAG TATCAGCGGG ATATCTGGCC GTAGCTGCGG TGAGGACGGG TCGTAGCCCT CGCCAGGTGG CTTCTTGGCT TGCTGCCTTG GGGTTCGCCG TCGCGGACCC GGCTACCCTG CCCGACCAGT TCGACCGTGG CGATCTCATC CTCCTCAGCG ACGATCGCCA CGGCCGGTGG CAGTCGCTCG GCTCGGACCG GCCGATCCCG ACGGTGCATC TCGCGGAGGC TGCGGTGAGG ACGGGTCGTA GCCCTCGCCA GGTGGCTTCT CGGCTGGTTG CCTTGGGGTT CACCGTCCCG GACCCGGCTA CCCTGCCCGA CCAGCTCGAC CGCGACGACC GGATCCTCCT TAGTCGCGAT CTCGATGGCG AGGAACCGTG 
GCTCGATTCG GATCGGCCGG TGCCGGTGGG GCATCTGGCC GTGACTGCGG TGAGGACGGG TCGCAGTGTC GGCCAGGTGG CTGCCCGGCT GGCTGCCTTT GGGCTCACGG TCGCGGACCG TGTGCCCGAC CAGCTCGACC CCGACGACGC CATTCTTCTC AGCCGCGATC TCGACAGCCG GTGGCCGTGG CTCGACCCGG ATCGGCCGAT CCCGCCGGCG CATCTGGCCG TGACTGCGGT GAGGACGGGT CGCAGCGTCG GCCAGGTGGC TGCCCGGCTG GCTGCCTTGG GGTTCACGGT CGCGGACCTG GCTACTGTGC CTGACCAGCT CGACCGTGGC GATCTCATCC TCCTCAGCCG CGGTCTCGAC GGCCGGTCGC CGTGGCTCGA CTCGGATCGG CCAGTCCAGG CGGTGCATCT CGCGGAGGCT GCGGTGAGGA CGGGTCGTAG CCCTCGCCAG GTGGCTTCTC GGCTGGCTGC CTTTGGGTTC GCCGTCCCGG ACCCGGCTAC CCTGCCCGAC CAGCTCGACC GCGACGACCG CACCCTCCTT AGCCGCAATC TCGACGGCGA GGAACCGTGG CTTCACCCAG GCCAGCTTGT CCCCGCAGGG CACCTGATCG CCGCGTTCGC CTCCGGACGG AAGATCACCG AGGTTGCCAG CCGGTTGACG AAATATGGGT TTCGCTTGCC TGCGTCCATC AACATGGACA ACTTGGTTTC CGACGGATCT TGA
|
Protein sequence | MDGRFRALLI GVPSYRDPDI DNLLFIEEDL AELSATLTRT GYEVTVHNVA HTDFASIDTA IEFFIGDAES ADTLLIFLSG HGIHHANMDY LVPSGAMMRS SPFYTRCVPI DVGRYVEASH AGNVVLMVDA CREGIHFNEK SGPSTLAWSD REVTRTASRH FAYVYACSPG EKARFATVGN SSFSFFSRAV STVADDDAGP ATLADIELAV QAEIDTLTSA HGYPRQQVRI LTETGKDFIL LPRPRRRRTG AAGEHSWVTA ARTHTAWNLA DGRPGQELLR EATTTLVAYL ARCWERDNRL VQSDPWHAPG WAERMNGKVR WLLSTLNPEK LSLSPAEAAL LTAVPFLHTA YWTRLAASAY PQVNPADLTI RGTSDDRAAF ERFTAGYGRL LRRAQRIDLP DGQRPAAGIG WWLFHRWLIR RPASYQLDLL VDLLLPSTPL PEDGDSRLVP EVFAVDRLLE LLRTLRTDTA SLAQGDRDTA PRPIRPVANS SEMEQSIREQ LVAFLLSAAE RFTLDPVGMS EVVVDHLGIS GGVSLPQLHE TLAEAAWEAR GRTRVLAARC HHPAVDLALR EQASAVDSLL RHIDTAAADG GLLAPLADLP TRATADRVVA ATGADGQRAY DSTGFRFRLA DDRIQELLMG EQLYGDPALA VRELYQNALD ACRYRQARIE YLEVAGRHPT PWIGSIRFTQ GVMDGRPFLD CIDNGVGMGL RELVEVFSHA GMRFADLPEY VEERATWKQE GIELYPNSRF GIGVLSYFML ADEITVTTCR LDRSGRPGTV FEVHIAGPGA LFRVHDRGPG EEAGTTVRLH LRPTDTPLSC VDLLRRILWI SEFAVEAIDA TGRQSWAAGQ LSSVAPIGAE DPLADDARRT ACRIDATSAP TVWWCDTNGG VVADGVWVGT DLFGAVVNLT GPCYPRLTVD RRRILAHDDA EVTRLQYREI PVLLAEDATV LTFDWLCALA GHHPDLADEI CTQAAAHHRR WKVAGDTADI AVVGCFPADG MLFRRRDGSP LRAVQPVLPE VILRWRLHAW IEAGLVNGLT VAAPAGERLL SLPTDDVVLA ARAVVETETS NLADFRVIPA PRLGSDRPVP VGHLAAAAVR TGRSVGQVAA RLAALGFAVP DLATLPDQLD RDDRTLLSRG LDGEEPWLGS DRPVPVGHLA AAAVRTGRSV GQVAARLAAL GFAVPDLATL PDQLDRDDRT LLSRGLDGEE PWLGSDRPVP VGHLAAAAVR TGRSVGQVAA RLAALGFAVP DPATLPDQLD RDDRTLLSRG LNSWMPWLDS DRPVPVGHLA RAAARTSHSP RQVAARLAAF GFAVPDPATL PDQLDRGDRT LLSRGLDGEE PWLGSDRPVP VGHLAAAAVR TGRSVGQVAA RLAALGFAVP DLATLPDQLD RDDRTLLSCN LDGEWPWLDD LDGRSLWLDS DRPVPAVHLA RAAAKTGRSP FQVAARLAAL GFAVPDPATL PDQLDRDDLI LLSRGLDGRS PWLDSDRPVP AVHLARAAAK TGRSPFQVAA RLAALGLTVP DPATLPDQLD RDDLHLLGDE RYGWTSWLGS DRPVPVGHLA AAAVRTGRSV GQVAARLAAL GFTVADPATL PDQLDRGDLI LLSRGLDGRS PWLGSDRPVP VGHLAAAAVS TGRSVDQVAT RLAALGFAVA DLATVPDQLD RDDLILLSGG LDGRSPWLDS DRPVSAGYLA VAAVRTGRSP RQVASWLAAL GFAVADPATL PDQFDRGDLI LLSDDRHGRW QSLGSDRPIP TVHLAEAAVR TGRSPRQVAS RLVALGFTVP DPATLPDQLD RDDRILLSRD 
LDGEEPWLDS DRPVPVGHLA VTAVRTGRSV GQVAARLAAF GLTVADRVPD QLDPDDAILL SRDLDSRWPW LDPDRPIPPA HLAVTAVRTG RSVGQVAARL AALGFTVADL ATVPDQLDRG DLILLSRGLD GRSPWLDSDR PVQAVHLAEA AVRTGRSPRQ VASRLAAFGF AVPDPATLPD QLDRDDRTLL SRNLDGEEPW LHPGQLVPAG HLIAAFASGR KITEVASRLT KYGFRLPASI NMDNLVSDGS
|
| |
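The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. A minimal sketch of that cleanup, with a simple GC-content calculation, is given below. The helper names are hypothetical, and the short example string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
blocks = "ATGGACGGCC GTTTCCGCGC"
seq = clean_sequence(blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 70.0 for these 20 bases only
```

Applied to the complete gene sequence, the same two functions would give the full length and overall GC content for comparison with the "Gene Length" and "GC content" fields.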