Gene Franean1_5335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5335 
Symbol 
ID5673669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6424617 
End bp6430769 
Gene Length6153 bp 
Protein Length2050 aa 
Translation table11 
GC content67% 
IMG OID641244193 
Productpeptidase C14 caspase catalytic subunit p20 
Protein accessionYP_001509599 
Protein GI158317091 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0326] Molecular chaperone, HSP90 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.103248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCC GTTTCCGCGC CCTGCTGATC GGGGTGCCTT CCTACCGCGA CCCGGACATT 
GACAACCTCC TCTTCATCGA AGAGGACCTG GCCGAGCTAT CCGCCACGCT TACCCGCACT
GGTTACGAGG TAACGGTCCA TAACGTGGCC CATACCGACT TCGCAAGTAT CGACACTGCG
ATAGAATTCT TCATTGGCGA CGCCGAATCG GCGGATACGC TACTTATATT CCTCAGCGGA
CATGGCATCC ATCACGCCAA CATGGACTAT CTTGTCCCGT CCGGGGCGAT GATGCGATCT
AGCCCTTTCT ACACCCGTTG CGTCCCGATC GATGTCGGCC GCTACGTCGA GGCCAGCCAT
GCCGGAAATG TCGTATTGAT GGTCGATGCC TGCCGGGAAG GAATCCACTT CAACGAGAAA
TCCGGGCCAT CGACGCTGGC GTGGTCGGAC CGCGAGGTTA CACGGACGGC ATCTCGACAC
TTCGCCTACG TCTACGCGTG CTCCCCTGGC GAAAAAGCCC GCTTTGCGAC AGTTGGCAAC
TCGTCATTCA GCTTCTTTTC GCGGGCGGTC AGCACCGTCG CCGACGACGA CGCCGGTCCC
GCCACCCTTG CCGACATCGA GCTGGCCGTC CAAGCGGAGA TAGACACCCT CACTAGCGCC
CACGGGTATC CGCGGCAGCA GGTCCGGATT CTGACCGAAA CCGGCAAAGA TTTCATCCTG
CTACCGAGAC CGAGGAGAAG ACGGACCGGA GCGGCCGGCG AACATAGCTG GGTTACCGCG
GCCCGGACCC ATACGGCCTG GAACCTTGCT GATGGCCGAC CTGGCCAGGA ACTGCTGAGG
GAGGCGACCA CCACCCTCGT CGCATACCTG GCACGGTGCT GGGAGCGGGA CAACCGGCTG
GTTCAGAGCG ACCCGTGGCA CGCCCCCGGC TGGGCCGAAC GGATGAACGG CAAGGTCCGC
TGGCTGCTAT CTACGCTGAA TCCGGAGAAG TTGAGCCTGT CGCCAGCAGA GGCGGCCCTG
CTCACAGCTG TCCCGTTCCT GCACACCGCC TACTGGACCC GGCTGGCCGC CTCTGCCTAC
CCCCAGGTCA ACCCGGCGGA TCTAACAATA CGCGGCACTT CGGACGACCG AGCCGCTTTC
GAACGGTTCA CCGCTGGTTA TGGACGTCTC CTCCGCCGGG CGCAGCGCAT TGACCTGCCC
GACGGGCAGC GGCCGGCGGC CGGTATTGGA TGGTGGCTGT TCCATCGCTG GCTCATCCGC
CGGCCGGCGT CCTATCAGCT CGACCTGCTC GTCGATCTGC TGCTTCCGTC CACGCCCCTG
CCAGAGGACG GCGACAGCAG ACTGGTACCC GAGGTTTTCG CCGTCGACCG ACTACTTGAA
TTGCTTCGGA CGCTGCGCAC CGACACGGCC TCACTTGCCC AAGGTGACCG GGACACGGCA
CCACGGCCGA TTCGGCCGGT CGCGAACTCC AGCGAGATGG AGCAGAGCAT CCGCGAACAA
TTGGTGGCGT TCCTGCTCTC CGCCGCGGAA CGGTTCACCC TTGACCCGGT CGGGATGTCA
GAAGTGGTGG TCGACCATCT CGGCATCAGC GGCGGAGTTA GCCTGCCACA GCTTCACGAG
ACGCTCGCAG AGGCCGCCTG GGAGGCGCGT GGACGTACAC GGGTACTCGC CGCCCGCTGT
CACCATCCCG CCGTCGATCT GGCACTGCGC GAGCAGGCGA GTGCCGTGGA TAGTCTGCTA
CGCCATATCG ATACCGCCGC CGCGGACGGC GGGCTGTTGG CTCCGTTGGC GGACCTACCG
ACCCGGGCGA CCGCTGACCG GGTGGTGGCC GCGACCGGAG CCGACGGTCA GCGGGCCTAC
GACTCGACTG GGTTTCGGTT CCGACTCGCC GACGACCGCA TCCAGGAATT ACTCATGGGT
GAGCAACTCT ATGGCGATCC GGCGTTGGCG GTCCGTGAAT TGTATCAGAA TGCGCTCGAT
GCCTGTCGCT ATCGTCAGGC GAGGATCGAG TACCTTGAAG TCGCCGGACG ACACCCTACG
CCCTGGATCG GAAGTATCCG GTTTACGCAG GGCGTAATGG ACGGTCGACC TTTCTTGGAC
TGCATCGATA ACGGCGTTGG TATGGGTCTA CGGGAACTCG TCGAGGTATT CTCCCACGCT
GGTATGCGTT TCGCCGACCT ACCCGAATAC GTCGAGGAGC GGGCCACTTG GAAACAGGAG
GGCATCGAGC TTTATCCAAA CAGCCGGTTC GGTATCGGCG TGTTGAGCTA TTTCATGCTT
GCCGACGAGA TTACCGTGAC AACCTGCAGG CTCGACCGTT CCGGCCGTCC AGGGACCGTG
TTCGAGGTGC ACATCGCCGG GCCGGGCGCG CTATTTCGCG TGCACGACCG CGGCCCCGGA
GAGGAAGCCG GCACTACAGT CCGGCTGCAT CTGCGTCCCA CCGACACACC GCTTTCCTGC
GTCGATCTGC TTCGCCGGAT CCTGTGGATC TCCGAATTCG CGGTGGAGGC CATCGACGCC
ACTGGTCGAC AGAGCTGGGC CGCTGGGCAG TTGTCGTCAG TCGCGCCGAT TGGCGCGGAG
GACCCGCTAG CCGACGACGC CCGACGCACC GCCTGCCGGA TCGATGCCAC CTCCGCTCCC
ACGGTCTGGT GGTGCGACAC GAACGGTGGT GTGGTCGCCG ACGGAGTGTG GGTCGGCACC
GACCTGTTCG GGGCGGTGGT GAACCTGACC GGCCCCTGCT ATCCCCGTCT GACGGTGGAC
CGCCGACGGA TCCTCGCTCA CGACGACGCC GAGGTGACCC GACTCCAATA CCGGGAGATC
CCGGTGTTGC TTGCCGAGGA TGCGACCGTG TTGACGTTCG ACTGGCTCTG CGCGCTGGCC
GGACACCATC CAGATCTTGC CGACGAAATC TGCACGCAGG CGGCAGCACA TCACCGGCGG
TGGAAGGTGG CCGGGGATAC CGCGGATATT GCTGTAGTGG GATGTTTTCC GGCAGATGGC
ATGCTTTTTC GACGGCGGGA CGGTTCACCA CTACGCGCAG TACAGCCGGT GCTGCCGGAG
GTGATTCTTC GATGGCGGTT GCACGCGTGG ATCGAAGCCG GTCTTGTTAA CGGGCTAACC
GTCGCAGCGC CGGCTGGCGA ACGTCTTCTG TCTCTTCCGA CCGATGACGT GGTCCTCGCG
GCCAGGGCAG TCGTCGAAAC TGAAACGTCG AACCTGGCCG ATTTTCGTGT GATCCCGGCG
CCTCGCCTCG GCTCGGATCG GCCGGTGCCG GTGGGGCATC TGGCTGCGGC TGCGGTGAGG
ACGGGTCGCA GTGTCGGCCA GGTGGCTGCC CGGCTAGCTG CTCTGGGGTT CGCCGTCCCG
GACCTGGCTA CCCTGCCCGA CCAGCTCGAC CGCGACGACC GCACCCTCCT CAGCCGTGGT
CTCGACGGCG AGGAACCGTG GCTCGGCTCG GATCGGCCGG TGCCGGTGGG GCATCTGGCT
GCGGCTGCGG TGAGGACGGG TCGCAGTGTC GGCCAGGTGG CTGCCCGGCT AGCTGCTCTG
GGGTTCGCCG TCCCGGACCT GGCTACCCTG CCCGACCAGC TCGACCGCGA CGACCGCACC
CTCCTCAGCC GTGGTCTCGA CGGCGAGGAA CCGTGGCTCG GCTCGGATCG GCCGGTGCCG
GTGGGGCATC TGGCTGCGGC TGCGGTGAGG ACGGGTCGCA GTGTCGGCCA GGTGGCTGCC
CGGCTAGCTG CTCTGGGGTT CGCCGTCCCG GACCCGGCTA CCCTGCCCGA CCAGCTCGAC
CGCGACGACC GCACCCTCCT CAGCCGTGGT CTGAATAGCT GGATGCCGTG GCTCGATTCC
GACCGGCCAG TGCCGGTAGG ACATCTCGCG CGGGCTGCGG CGAGGACGAG CCATAGCCCT
CGCCAGGTGG CTGCCCGGCT AGCTGCCTTT GGGTTCGCCG TCCCGGACCC GGCTACCCTG
CCCGACCAGC TCGACCGCGG CGACCGCACC CTCCTCAGCC GTGGTCTCGA CGGCGAGGAA
CCGTGGCTCG GCTCGGATCG GCCGGTGCCG GTGGGGCATC TGGCTGCGGC TGCGGTGAGG
ACGGGTCGCA GTGTCGGCCA GGTGGCTGCC CGGCTAGCTG CTCTGGGGTT CGCCGTCCCG
GACCTGGCTA CCCTGCCCGA CCAGCTCGAC CGCGACGACC GCACCCTCCT CAGCTGCAAT
CTTGACGGCG AGTGGCCATG GCTGGACGAT CTCGACGGCC GTTCGCTGTG GCTCGATTCG
GATCGGCCAG TCCCGGCGGT GCATCTCGCG CGGGCTGCCG CGAAGACGGG TCGCAGCCCC
TTCCAGGTGG CTGCCCGGCT AGCTGCTCTG GGGTTCGCCG TCCCGGACCC GGCTACCCTG
CCCGACCAGC TCGACCGCGA CGATCTCATC CTCCTCAGCC GTGGTCTCGA CGGCCGTTCG
CCGTGGCTCG ATTCGGATCG GCCAGTCCCG GCGGTGCATC TCGCGCGGGC TGCCGCGAAG
ACGGGTCGCA GCCCCTTCCA GGTGGCTGCC CGGCTAGCTG CTCTGGGGCT CACCGTCCCG
GACCCGGCTA CCCTGCCCGA CCAGCTCGAC CGCGACGATC TTCACCTTCT CGGCGACGAA
CGCTACGGCT GGACGTCGTG GCTCGGCTCG GATCGGCCGG TGCCGGTGGG GCATCTGGCT
GCGGCTGCGG TGAGGACGGG TCGCAGTGTC GGCCAGGTGG CTGCCCGGCT AGCTGCTCTG
GGGTTCACCG TCGCGGACCC GGCTACCCTG CCCGACCAGC TCGACCGCGG CGATCTCATC
CTCCTTAGCC GTGGTCTCGA CGGCCGGTCG CCGTGGCTCG GCTCGGATCG ACCAGTGCCA
GTGGGGCATC TGGCTGCGGC TGCGGTGAGT ACGGGTCGCA GCGTCGACCA GGTGGCTACC
CGGCTAGCTG CCTTGGGGTT CGCCGTCGCG GACCTGGCTA CTGTGCCTGA CCAGCTCGAC
CGCGACGATC TCATCCTCCT CAGCGGCGGT CTCGACGGCC GGTCGCCGTG GCTCGACTCG
GATCGACCAG TATCAGCGGG ATATCTGGCC GTAGCTGCGG TGAGGACGGG TCGTAGCCCT
CGCCAGGTGG CTTCTTGGCT TGCTGCCTTG GGGTTCGCCG TCGCGGACCC GGCTACCCTG
CCCGACCAGT TCGACCGTGG CGATCTCATC CTCCTCAGCG ACGATCGCCA CGGCCGGTGG
CAGTCGCTCG GCTCGGACCG GCCGATCCCG ACGGTGCATC TCGCGGAGGC TGCGGTGAGG
ACGGGTCGTA GCCCTCGCCA GGTGGCTTCT CGGCTGGTTG CCTTGGGGTT CACCGTCCCG
GACCCGGCTA CCCTGCCCGA CCAGCTCGAC CGCGACGACC GGATCCTCCT TAGTCGCGAT
CTCGATGGCG AGGAACCGTG GCTCGATTCG GATCGGCCGG TGCCGGTGGG GCATCTGGCC
GTGACTGCGG TGAGGACGGG TCGCAGTGTC GGCCAGGTGG CTGCCCGGCT GGCTGCCTTT
GGGCTCACGG TCGCGGACCG TGTGCCCGAC CAGCTCGACC CCGACGACGC CATTCTTCTC
AGCCGCGATC TCGACAGCCG GTGGCCGTGG CTCGACCCGG ATCGGCCGAT CCCGCCGGCG
CATCTGGCCG TGACTGCGGT GAGGACGGGT CGCAGCGTCG GCCAGGTGGC TGCCCGGCTG
GCTGCCTTGG GGTTCACGGT CGCGGACCTG GCTACTGTGC CTGACCAGCT CGACCGTGGC
GATCTCATCC TCCTCAGCCG CGGTCTCGAC GGCCGGTCGC CGTGGCTCGA CTCGGATCGG
CCAGTCCAGG CGGTGCATCT CGCGGAGGCT GCGGTGAGGA CGGGTCGTAG CCCTCGCCAG
GTGGCTTCTC GGCTGGCTGC CTTTGGGTTC GCCGTCCCGG ACCCGGCTAC CCTGCCCGAC
CAGCTCGACC GCGACGACCG CACCCTCCTT AGCCGCAATC TCGACGGCGA GGAACCGTGG
CTTCACCCAG GCCAGCTTGT CCCCGCAGGG CACCTGATCG CCGCGTTCGC CTCCGGACGG
AAGATCACCG AGGTTGCCAG CCGGTTGACG AAATATGGGT TTCGCTTGCC TGCGTCCATC
AACATGGACA ACTTGGTTTC CGACGGATCT TGA
 
Protein sequence
MDGRFRALLI GVPSYRDPDI DNLLFIEEDL AELSATLTRT GYEVTVHNVA HTDFASIDTA 
IEFFIGDAES ADTLLIFLSG HGIHHANMDY LVPSGAMMRS SPFYTRCVPI DVGRYVEASH
AGNVVLMVDA CREGIHFNEK SGPSTLAWSD REVTRTASRH FAYVYACSPG EKARFATVGN
SSFSFFSRAV STVADDDAGP ATLADIELAV QAEIDTLTSA HGYPRQQVRI LTETGKDFIL
LPRPRRRRTG AAGEHSWVTA ARTHTAWNLA DGRPGQELLR EATTTLVAYL ARCWERDNRL
VQSDPWHAPG WAERMNGKVR WLLSTLNPEK LSLSPAEAAL LTAVPFLHTA YWTRLAASAY
PQVNPADLTI RGTSDDRAAF ERFTAGYGRL LRRAQRIDLP DGQRPAAGIG WWLFHRWLIR
RPASYQLDLL VDLLLPSTPL PEDGDSRLVP EVFAVDRLLE LLRTLRTDTA SLAQGDRDTA
PRPIRPVANS SEMEQSIREQ LVAFLLSAAE RFTLDPVGMS EVVVDHLGIS GGVSLPQLHE
TLAEAAWEAR GRTRVLAARC HHPAVDLALR EQASAVDSLL RHIDTAAADG GLLAPLADLP
TRATADRVVA ATGADGQRAY DSTGFRFRLA DDRIQELLMG EQLYGDPALA VRELYQNALD
ACRYRQARIE YLEVAGRHPT PWIGSIRFTQ GVMDGRPFLD CIDNGVGMGL RELVEVFSHA
GMRFADLPEY VEERATWKQE GIELYPNSRF GIGVLSYFML ADEITVTTCR LDRSGRPGTV
FEVHIAGPGA LFRVHDRGPG EEAGTTVRLH LRPTDTPLSC VDLLRRILWI SEFAVEAIDA
TGRQSWAAGQ LSSVAPIGAE DPLADDARRT ACRIDATSAP TVWWCDTNGG VVADGVWVGT
DLFGAVVNLT GPCYPRLTVD RRRILAHDDA EVTRLQYREI PVLLAEDATV LTFDWLCALA
GHHPDLADEI CTQAAAHHRR WKVAGDTADI AVVGCFPADG MLFRRRDGSP LRAVQPVLPE
VILRWRLHAW IEAGLVNGLT VAAPAGERLL SLPTDDVVLA ARAVVETETS NLADFRVIPA
PRLGSDRPVP VGHLAAAAVR TGRSVGQVAA RLAALGFAVP DLATLPDQLD RDDRTLLSRG
LDGEEPWLGS DRPVPVGHLA AAAVRTGRSV GQVAARLAAL GFAVPDLATL PDQLDRDDRT
LLSRGLDGEE PWLGSDRPVP VGHLAAAAVR TGRSVGQVAA RLAALGFAVP DPATLPDQLD
RDDRTLLSRG LNSWMPWLDS DRPVPVGHLA RAAARTSHSP RQVAARLAAF GFAVPDPATL
PDQLDRGDRT LLSRGLDGEE PWLGSDRPVP VGHLAAAAVR TGRSVGQVAA RLAALGFAVP
DLATLPDQLD RDDRTLLSCN LDGEWPWLDD LDGRSLWLDS DRPVPAVHLA RAAAKTGRSP
FQVAARLAAL GFAVPDPATL PDQLDRDDLI LLSRGLDGRS PWLDSDRPVP AVHLARAAAK
TGRSPFQVAA RLAALGLTVP DPATLPDQLD RDDLHLLGDE RYGWTSWLGS DRPVPVGHLA
AAAVRTGRSV GQVAARLAAL GFTVADPATL PDQLDRGDLI LLSRGLDGRS PWLGSDRPVP
VGHLAAAAVS TGRSVDQVAT RLAALGFAVA DLATVPDQLD RDDLILLSGG LDGRSPWLDS
DRPVSAGYLA VAAVRTGRSP RQVASWLAAL GFAVADPATL PDQFDRGDLI LLSDDRHGRW
QSLGSDRPIP TVHLAEAAVR TGRSPRQVAS RLVALGFTVP DPATLPDQLD RDDRILLSRD
LDGEEPWLDS DRPVPVGHLA VTAVRTGRSV GQVAARLAAF GLTVADRVPD QLDPDDAILL
SRDLDSRWPW LDPDRPIPPA HLAVTAVRTG RSVGQVAARL AALGFTVADL ATVPDQLDRG
DLILLSRGLD GRSPWLDSDR PVQAVHLAEA AVRTGRSPRQ VASRLAAFGF AVPDPATLPD
QLDRDDRTLL SRNLDGEEPW LHPGQLVPAG HLIAAFASGR KITEVASRLT KYGFRLPASI
NMDNLVSDGS