Gene Franean1_3051 details

Gene Information

Locus tag: Franean1_3051
Symbol:
ID: 5671430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3586774
End bp: 3591147
Gene Length: 4374 bp
Protein Length: 1457 aa
Translation table: 11
GC content: 65%
IMG OID: 641241949
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001507369
Protein GI: 158314861
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
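
The coordinates and lengths in the fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions made for illustration; the record itself does not state them. The variable names are likewise illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 3586774
end_bp = 3591147
gene_length_bp = 4374
protein_length_aa = 1457

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so N codons encode N - 1 residues.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
```

Under these assumptions the span equals the listed gene length, the gene length is a whole number of codons, and the codon count minus one matches the listed protein length.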


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.690626
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGGG GCGGGGATTC CCGTGGGGAT GCGGGGAGTG CATCGGCTGC GGATGAAGTG 
CCCTCTGATC AGGGGGTGAG CGGTGGGAAG CGGCTGCCTC TCTCGTTTGC TCAGCGCGGG
AACTGGGCGG CGCAGCGGTT GTTCCCCGGT TCTGCTGCGT TTTGTGTGTG CGACCTGGTG
TGGCTTGACG GGGCCATCAA TGCCGGGGCG TTCGCCGACG CGGTGAGTGG GGCGTTCGCG
GAGACGGAGG CGCTGCGGGC GGTCATCTAT GACGATGACG GGGCTCCTTC GCAGTCGGTC
GGTGGGAAGC TGTCTCTGCC GACGGTGGTT CCCGAGGAGG TGCTCACAGA CGAGGAGATC
CGGTCTGTGG TTCGCGCCGG GGTGAGTGCG CGGGAGTCTT CGGCGGCTGA GGATCTCACG
TCGTCGACGC TGTTCAAGCG TGTCGGTGGT GGGTGGGTGT GGTCGTTCAC GACGAATAAT
CTCCTGCTTG ACGGGTATAG CACGTCTCTC TATATCCGCC GGGTCGCGGA GCTGTACTCG
GCTGCGGTCG ATGCAATCCT CGCGCCGCCG AGGTGGTTTG GTCGTCTGGA GGATCTCGTC
GCGGGGTCGG CATCGACGCC GGGTGGGTCC GGCATCGTCG GTCACTGGCG TGATGTGCTT
GCTATCGACG CGTTTTCCGA GCCCATGGGA AGCGCGCCCG CTGATCTGTT TTCGTTCTCG
TACCGGCCTG TTCCCGTGGT GCTTCCGGCG GGGGCCGATG ATCGCCTGCG TATGCTGGCG
CGCCAGGCGA GGAGCACGTG GACGGATCTC GTCATCACCG CGTGGGGGCT GTATACGGCC
CTTGCCGGGA ACCAGGACTA CCTCGCTGTG CGGGTACCGT CGATGATGCG CGGCGAGCCG
GAGTCGCTGC GCGTGCCGGG CGCGGTCGCC CGCGCGCTGC CCGTCGCGAC CGCCCTGCGC
CCGGGCGCCA CATTCGCCGA GGTGCTGTCG GTGGTGGGAT CGCAGGTCCG GGGTCTGCGG
GACAACTCCG CGATCGAGGA TCATCAGCTC GCCCGTCTCT GGAAGGGAGG AGAGCTCTCC
TACCTGTCGC TGCCCTCGGT CAACATCAAG ATGTTCAGGA CGACACCGGT CTTCGGAAAG
GTCGCGGGGG TCACGGAGCT GATCAACCCC GGCCCGACCG GCGCGCTGGA TCTCTCCGTC
TACGGGAGCC CGGGACGGGG CCTGCGGATG GATATCTCCG GCCGGTCTCC GCTCGTGCCG
ACAGACGCGG CCGGCCAGCA CGCAGCGGCG TTCACGGCGT TCCTCGGCCA CCTCCTCGGC
GGCCCTGCGG GCATGACACT CCACGAACTC GCGGATCTCA CGATCACTCC CTCTTCGGTC
GATGCCGGGT CGGTGGGTGT GTGGTCGGTG GGTGCGGGGG CGGTGGTTCC CGCGGTGACG
GTGGATGCGC TGATCCGGGA TCAGGTAGCA CGTACTCCCG GGGCGGTCGC GGTCGTGGAT
GACGCGGATG GTGCGGAGTT GGTGTATGCG CAGTTCGACG CGCGGGTGAA CGCTCTGGCC
CATCTCCTGA CCGAGCGGGG CGTGAGGGTG GGTGGCCGGG TCGCGGTGGC CCTGCCGCGC
TCGGCGGACC TGGTGACCTC CCTCGCCGCG GTACTCCGCG CCGGCGCCGC GTATGTCCCC
GTCGACCCGG GATACCCCGC CGAGCGCATT GCCGCGATCC TCCAGGACTC CGGTGCCCGT
GTGGCCATCA CCGATAGTGC GACGGCGGTG GCCCATGCGG GCGTGCTCAC GGCCGCGGGC
GTTGTCACCG TGGTCCTCGA TGAGGACGCT GTTCGTGGCC AGATAGAACA CGGGGCGCCC
GACGCACCCG TGCTGCCGCG TCCCCTCACC CCCGACGATA CCGCGTATGT GATCTTTACT
TCCGGTACCA CCGGACGACC CAAAGGCATC GCCCTGTCCC ATGCCGCAGT CGTGAACCGG
CTTGTCTGGG GCCGGGAAGC GTTGGGGTTC TCGTCGTCTG ACCGGGTGCT ACTGAAGACG
CCATTCACGT TCGACGTGTC GGTACCGGAG TTCTTCCTCC CGCTGATCAC CGGAGCGGTG
GTCGTGGTCG CCCGTGACAA CGCACATGGG GATCCGGGCT ATATCGCCGG TGTGGTGCGG
AAAAGGCGCG TCACGAGCGT GCATTTCGTG CCATCGATGC TTCAGGCATT CCTGGACTCG
GGAGTAGAGG CAGGGTTTTT CCCGGATGTG CGGCTGGTGT CGTTCACGGG GGAAGCGCTG
CCGGTGGCGG CGGCTATCAG GGCCCGGGAG GTGTTCGATC GAGCGGAACT GTTCAACCTG
TACGGGCCGA CGGAAGCGGC GGTCGAGATC GCCAGCTACG ACATCGCCGC TCTGAACGCA
GACGCGGATT CGACGCCGAT CGGTCGCCCG GTGTCGAATT CTTATGTGCG GGTGCTGGAC
GGGTGGCTGC GCCCGGTGCC GGTCGGGGTG ACCGGTGAGC TGTATCTGGG CGGGGTGCAG
CTGGCGGAAG GGTATGTGGG CCGGGCGGGG CTGACAGCGG AGAGGTTCGT CGCCGACCCT
CTCGGTGCCC CGGACGAGCG GTTGTATCGG ACCGGGGACC TGGCCCGGTG GAACGACCAG
GGCGAGTTGG AGTATCTGGG CCGGTCCGAC GACCAGGTCA AGGTACGCGG GTTCCGGATC
GAACTCGACG AGATCCGCGC TGTCCTCGAA CGACACCCCG CCGTCTCGGG TGCCGCGGTC
ACCGCTCTCG ATCACCCCGC CGGAGGGAAG TTCCTCGCCG CCTATGTCAC TACCACCCCG
TCCGCACCGG CCGACCAGGC CGTACTGGCC GACGCACTGC GGGAACACAC GAACGCGCTC
CTGCCGGAAT ACATGGTTCC CGCATCGTTC ACCCGCCTGG CCACACTCCC CACGACCCCG
AGCGGGAAAC TCGACCGCAA AGCACTACCC GCCCCCGACC TCACCGCCGG ATCCGGCAGC
GGCCGCCCCC CGGAAACCGA CACCGAACTG TCCCTGGCCG GGGTGTTCCG CGACGTACTC
GCCCTCCCCG AGGGCACACC CCTGTCGGTG GACGACGACT TCTTCCGACT CGGCGGTGAC
AGCATCCTCG CGATACGTCT TGTTGCGCGC GCGTCACAAC GACAGCACAC TTTCACGTTG
CGGGACGTCT TCGAGCAGCG AACCGTCGCG AAGCTGTCCC AGAAGATCAT CAAAGAAGTC
GAGGCCAAGG CGATTTCGAC AAGCATGATC ACTGTGCCCG CCTCTCCGAC TCTCGAGAGA
CTACGTGAGT CGCGCGACGA TCCGAACTCA TGGATCTTGA CCGAGACCGC CATTCTCCCC
GTCTCGTTAT CGCACGACGC GCTACTCGCC GCATACGCTT CGCTCGTTCA AGAACATGAC
CTGCTTCGCA TGTCGGTTCA GACGGTAAGT CGTCGACTGT GGCTCACTTG GGTAACCCCT
ATGACAACCA TCGCACCAAC TCTCTCCAGA GTGCATGTCG CAGGGGCCTC GCCCTCCACG
AAGCTGACAG ATCTTCGGAC GATGGCCAGC CAAATGATTG ATGTCACCAA TGCACGGCCG
TCGGGCCTCG CATACGCAAG CAGCTCGACA CAGACCTTAA TTGCTCTTGC GGTACATGCT
GCCGTGGCCG ATCGATACAC CGTGCACCAG CTACTGGAGA CCCTTCGCGC GCTCACTGAT
GAGAACGCCA ACAAGAGCAT GCTATCAACG CCCTCCGTCG CAGCGACCTT GACTGAAGTC
GCCGATATCG CTGCGGCTCT CAAAATGGAT CGGATCGAGA ACCCGATTGA GCTGATCGAG
CGCACGGATT CACTCGACGA GGGACTGTAC TCAGCCGACA GAACTCAGGT CATTCACTGG
GACGGCTCTC GCACGGATGC CACCGTGCGC GAGACCATCC GACGAGCACT GCACGCGACG
GGGTACGGAT CGCGACTCGG CGGTGTCGTA GATCACGAAG CTCCTCTTCT GCCAGATGCT
GCCCTGGGGC CGCTTGGCCC GATGACGGTA ACCGTTCCCG TGCCTATCGA GCAGGAAACT
TGGCACCCAG ATCCTGAGTT CGCATTGGCA CGCTACGGCA GTAGCTCGGG TCGGCGTCTC
CTAGCTGGCA TACCGATCGC GCCGATACTT ATCAGCCGAT CCTATTCCGC CGATGCAGAA
TCCTTGGCGA TCGAACCGAC CGAGGCAGCG TACCGCGCGG TGATCCGATA CAGGGTGGAG
GCCTGCAGCA CTATTCTGAC GGTCATCGGA TTTACCCGTG CCGTGATCCT GGCGTTCGAG
AACGTGCTGC GCAAAGTCGG CGACAATGCC GATCGGCGAG TCTGGCGCCC ATAG
 
Protein sequence
MSRGGDSRGD AGSASAADEV PSDQGVSGGK RLPLSFAQRG NWAAQRLFPG SAAFCVCDLV 
WLDGAINAGA FADAVSGAFA ETEALRAVIY DDDGAPSQSV GGKLSLPTVV PEEVLTDEEI
RSVVRAGVSA RESSAAEDLT SSTLFKRVGG GWVWSFTTNN LLLDGYSTSL YIRRVAELYS
AAVDAILAPP RWFGRLEDLV AGSASTPGGS GIVGHWRDVL AIDAFSEPMG SAPADLFSFS
YRPVPVVLPA GADDRLRMLA RQARSTWTDL VITAWGLYTA LAGNQDYLAV RVPSMMRGEP
ESLRVPGAVA RALPVATALR PGATFAEVLS VVGSQVRGLR DNSAIEDHQL ARLWKGGELS
YLSLPSVNIK MFRTTPVFGK VAGVTELINP GPTGALDLSV YGSPGRGLRM DISGRSPLVP
TDAAGQHAAA FTAFLGHLLG GPAGMTLHEL ADLTITPSSV DAGSVGVWSV GAGAVVPAVT
VDALIRDQVA RTPGAVAVVD DADGAELVYA QFDARVNALA HLLTERGVRV GGRVAVALPR
SADLVTSLAA VLRAGAAYVP VDPGYPAERI AAILQDSGAR VAITDSATAV AHAGVLTAAG
VVTVVLDEDA VRGQIEHGAP DAPVLPRPLT PDDTAYVIFT SGTTGRPKGI ALSHAAVVNR
LVWGREALGF SSSDRVLLKT PFTFDVSVPE FFLPLITGAV VVVARDNAHG DPGYIAGVVR
KRRVTSVHFV PSMLQAFLDS GVEAGFFPDV RLVSFTGEAL PVAAAIRARE VFDRAELFNL
YGPTEAAVEI ASYDIAALNA DADSTPIGRP VSNSYVRVLD GWLRPVPVGV TGELYLGGVQ
LAEGYVGRAG LTAERFVADP LGAPDERLYR TGDLARWNDQ GELEYLGRSD DQVKVRGFRI
ELDEIRAVLE RHPAVSGAAV TALDHPAGGK FLAAYVTTTP SAPADQAVLA DALREHTNAL
LPEYMVPASF TRLATLPTTP SGKLDRKALP APDLTAGSGS GRPPETDTEL SLAGVFRDVL
ALPEGTPLSV DDDFFRLGGD SILAIRLVAR ASQRQHTFTL RDVFEQRTVA KLSQKIIKEV
EAKAISTSMI TVPASPTLER LRESRDDPNS WILTETAILP VSLSHDALLA AYASLVQEHD
LLRMSVQTVS RRLWLTWVTP MTTIAPTLSR VHVAGASPST KLTDLRTMAS QMIDVTNARP
SGLAYASSST QTLIALAVHA AVADRYTVHQ LLETLRALTD ENANKSMLST PSVAATLTEV
ADIAAALKMD RIENPIELIE RTDSLDEGLY SADRTQVIHW DGSRTDATVR ETIRRALHAT
GYGSRLGGVV DHEAPLLPDA ALGPLGPMTV TVPVPIEQET WHPDPEFALA RYGSSSGRRL
LAGIPIAPIL ISRSYSADAE SLAIEPTEAA YRAVIRYRVE ACSTILTVIG FTRAVILAFE
NVLRKVGDNA DRRVWRP
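
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last rows of the gene sequence. This is a minimal sketch, not the method used to produce the record: it builds the codon table by hand, and the helper name `translate` is hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are allowed), so the standard assignments are used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
# Amino acid assignments are those shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    cleaned = "".join(dna.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[cleaned[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGAGCCGGG GCGGGGATTC CCGTGGGGAT GCGGGGAGTG CATCGGCTGC GGATGAAGTG"
last_row = "AACGTGCTGC GCAAAGTCGG CGACAATGCC GATCGGCGAG TCTGGCGCCC ATAG"

print(translate(first_row))  # MSRGGDSRGDAGSASAADEV
print(translate(last_row))   # NVLRKVGDNADRRVWRP*
```

The two outputs correspond to the first 20 and the last 17 residues of the protein sequence, with the trailing `*` marking the TAG stop codon. Each full row of the gene sequence holds 60 bases (20 codons), so rows can be translated independently without shifting the reading frame.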