Gene Information |
Locus tag | Franean1_3051 |
Symbol | |
ID | 5671430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3586774 |
End bp | 3591147 |
Gene Length | 4374 bp |
Protein Length | 1457 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641241949 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001507369 |
Protein GI | 158314861 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
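
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record for locus tag Franean1_3051.
length = gene_length_bp(3586774, 3591147)
print(length)                     # 4374
print(protein_length_aa(length))  # 1457
```

Under these assumptions the results match the listed Gene Length (4374 bp) and Protein Length (1457 aa).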
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.690626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGGG GCGGGGATTC CCGTGGGGAT GCGGGGAGTG CATCGGCTGC GGATGAAGTG CCCTCTGATC AGGGGGTGAG CGGTGGGAAG CGGCTGCCTC TCTCGTTTGC TCAGCGCGGG AACTGGGCGG CGCAGCGGTT GTTCCCCGGT TCTGCTGCGT TTTGTGTGTG CGACCTGGTG TGGCTTGACG GGGCCATCAA TGCCGGGGCG TTCGCCGACG CGGTGAGTGG GGCGTTCGCG GAGACGGAGG CGCTGCGGGC GGTCATCTAT GACGATGACG GGGCTCCTTC GCAGTCGGTC GGTGGGAAGC TGTCTCTGCC GACGGTGGTT CCCGAGGAGG TGCTCACAGA CGAGGAGATC CGGTCTGTGG TTCGCGCCGG GGTGAGTGCG CGGGAGTCTT CGGCGGCTGA GGATCTCACG TCGTCGACGC TGTTCAAGCG TGTCGGTGGT GGGTGGGTGT GGTCGTTCAC GACGAATAAT CTCCTGCTTG ACGGGTATAG CACGTCTCTC TATATCCGCC GGGTCGCGGA GCTGTACTCG GCTGCGGTCG ATGCAATCCT CGCGCCGCCG AGGTGGTTTG GTCGTCTGGA GGATCTCGTC GCGGGGTCGG CATCGACGCC GGGTGGGTCC GGCATCGTCG GTCACTGGCG TGATGTGCTT GCTATCGACG CGTTTTCCGA GCCCATGGGA AGCGCGCCCG CTGATCTGTT TTCGTTCTCG TACCGGCCTG TTCCCGTGGT GCTTCCGGCG GGGGCCGATG ATCGCCTGCG TATGCTGGCG CGCCAGGCGA GGAGCACGTG GACGGATCTC GTCATCACCG CGTGGGGGCT GTATACGGCC CTTGCCGGGA ACCAGGACTA CCTCGCTGTG CGGGTACCGT CGATGATGCG CGGCGAGCCG GAGTCGCTGC GCGTGCCGGG CGCGGTCGCC CGCGCGCTGC CCGTCGCGAC CGCCCTGCGC CCGGGCGCCA CATTCGCCGA GGTGCTGTCG GTGGTGGGAT CGCAGGTCCG GGGTCTGCGG GACAACTCCG CGATCGAGGA TCATCAGCTC GCCCGTCTCT GGAAGGGAGG AGAGCTCTCC TACCTGTCGC TGCCCTCGGT CAACATCAAG ATGTTCAGGA CGACACCGGT CTTCGGAAAG GTCGCGGGGG TCACGGAGCT GATCAACCCC GGCCCGACCG GCGCGCTGGA TCTCTCCGTC TACGGGAGCC CGGGACGGGG CCTGCGGATG GATATCTCCG GCCGGTCTCC GCTCGTGCCG ACAGACGCGG CCGGCCAGCA CGCAGCGGCG TTCACGGCGT TCCTCGGCCA CCTCCTCGGC GGCCCTGCGG GCATGACACT CCACGAACTC GCGGATCTCA CGATCACTCC CTCTTCGGTC GATGCCGGGT CGGTGGGTGT GTGGTCGGTG GGTGCGGGGG CGGTGGTTCC CGCGGTGACG GTGGATGCGC TGATCCGGGA TCAGGTAGCA CGTACTCCCG GGGCGGTCGC GGTCGTGGAT GACGCGGATG GTGCGGAGTT GGTGTATGCG CAGTTCGACG CGCGGGTGAA CGCTCTGGCC CATCTCCTGA CCGAGCGGGG CGTGAGGGTG GGTGGCCGGG TCGCGGTGGC CCTGCCGCGC TCGGCGGACC TGGTGACCTC CCTCGCCGCG GTACTCCGCG CCGGCGCCGC GTATGTCCCC GTCGACCCGG GATACCCCGC CGAGCGCATT GCCGCGATCC TCCAGGACTC CGGTGCCCGT GTGGCCATCA CCGATAGTGC GACGGCGGTG GCCCATGCGG GCGTGCTCAC GGCCGCGGGC 
GTTGTCACCG TGGTCCTCGA TGAGGACGCT GTTCGTGGCC AGATAGAACA CGGGGCGCCC GACGCACCCG TGCTGCCGCG TCCCCTCACC CCCGACGATA CCGCGTATGT GATCTTTACT TCCGGTACCA CCGGACGACC CAAAGGCATC GCCCTGTCCC ATGCCGCAGT CGTGAACCGG CTTGTCTGGG GCCGGGAAGC GTTGGGGTTC TCGTCGTCTG ACCGGGTGCT ACTGAAGACG CCATTCACGT TCGACGTGTC GGTACCGGAG TTCTTCCTCC CGCTGATCAC CGGAGCGGTG GTCGTGGTCG CCCGTGACAA CGCACATGGG GATCCGGGCT ATATCGCCGG TGTGGTGCGG AAAAGGCGCG TCACGAGCGT GCATTTCGTG CCATCGATGC TTCAGGCATT CCTGGACTCG GGAGTAGAGG CAGGGTTTTT CCCGGATGTG CGGCTGGTGT CGTTCACGGG GGAAGCGCTG CCGGTGGCGG CGGCTATCAG GGCCCGGGAG GTGTTCGATC GAGCGGAACT GTTCAACCTG TACGGGCCGA CGGAAGCGGC GGTCGAGATC GCCAGCTACG ACATCGCCGC TCTGAACGCA GACGCGGATT CGACGCCGAT CGGTCGCCCG GTGTCGAATT CTTATGTGCG GGTGCTGGAC GGGTGGCTGC GCCCGGTGCC GGTCGGGGTG ACCGGTGAGC TGTATCTGGG CGGGGTGCAG CTGGCGGAAG GGTATGTGGG CCGGGCGGGG CTGACAGCGG AGAGGTTCGT CGCCGACCCT CTCGGTGCCC CGGACGAGCG GTTGTATCGG ACCGGGGACC TGGCCCGGTG GAACGACCAG GGCGAGTTGG AGTATCTGGG CCGGTCCGAC GACCAGGTCA AGGTACGCGG GTTCCGGATC GAACTCGACG AGATCCGCGC TGTCCTCGAA CGACACCCCG CCGTCTCGGG TGCCGCGGTC ACCGCTCTCG ATCACCCCGC CGGAGGGAAG TTCCTCGCCG CCTATGTCAC TACCACCCCG TCCGCACCGG CCGACCAGGC CGTACTGGCC GACGCACTGC GGGAACACAC GAACGCGCTC CTGCCGGAAT ACATGGTTCC CGCATCGTTC ACCCGCCTGG CCACACTCCC CACGACCCCG AGCGGGAAAC TCGACCGCAA AGCACTACCC GCCCCCGACC TCACCGCCGG ATCCGGCAGC GGCCGCCCCC CGGAAACCGA CACCGAACTG TCCCTGGCCG GGGTGTTCCG CGACGTACTC GCCCTCCCCG AGGGCACACC CCTGTCGGTG GACGACGACT TCTTCCGACT CGGCGGTGAC AGCATCCTCG CGATACGTCT TGTTGCGCGC GCGTCACAAC GACAGCACAC TTTCACGTTG CGGGACGTCT TCGAGCAGCG AACCGTCGCG AAGCTGTCCC AGAAGATCAT CAAAGAAGTC GAGGCCAAGG CGATTTCGAC AAGCATGATC ACTGTGCCCG CCTCTCCGAC TCTCGAGAGA CTACGTGAGT CGCGCGACGA TCCGAACTCA TGGATCTTGA CCGAGACCGC CATTCTCCCC GTCTCGTTAT CGCACGACGC GCTACTCGCC GCATACGCTT CGCTCGTTCA AGAACATGAC CTGCTTCGCA TGTCGGTTCA GACGGTAAGT CGTCGACTGT GGCTCACTTG GGTAACCCCT ATGACAACCA TCGCACCAAC TCTCTCCAGA GTGCATGTCG CAGGGGCCTC GCCCTCCACG AAGCTGACAG ATCTTCGGAC GATGGCCAGC CAAATGATTG ATGTCACCAA TGCACGGCCG TCGGGCCTCG 
CATACGCAAG CAGCTCGACA CAGACCTTAA TTGCTCTTGC GGTACATGCT GCCGTGGCCG ATCGATACAC CGTGCACCAG CTACTGGAGA CCCTTCGCGC GCTCACTGAT GAGAACGCCA ACAAGAGCAT GCTATCAACG CCCTCCGTCG CAGCGACCTT GACTGAAGTC GCCGATATCG CTGCGGCTCT CAAAATGGAT CGGATCGAGA ACCCGATTGA GCTGATCGAG CGCACGGATT CACTCGACGA GGGACTGTAC TCAGCCGACA GAACTCAGGT CATTCACTGG GACGGCTCTC GCACGGATGC CACCGTGCGC GAGACCATCC GACGAGCACT GCACGCGACG GGGTACGGAT CGCGACTCGG CGGTGTCGTA GATCACGAAG CTCCTCTTCT GCCAGATGCT GCCCTGGGGC CGCTTGGCCC GATGACGGTA ACCGTTCCCG TGCCTATCGA GCAGGAAACT TGGCACCCAG ATCCTGAGTT CGCATTGGCA CGCTACGGCA GTAGCTCGGG TCGGCGTCTC CTAGCTGGCA TACCGATCGC GCCGATACTT ATCAGCCGAT CCTATTCCGC CGATGCAGAA TCCTTGGCGA TCGAACCGAC CGAGGCAGCG TACCGCGCGG TGATCCGATA CAGGGTGGAG GCCTGCAGCA CTATTCTGAC GGTCATCGGA TTTACCCGTG CCGTGATCCT GGCGTTCGAG AACGTGCTGC GCAAAGTCGG CGACAATGCC GATCGGCGAG TCTGGCGCCC ATAG
|
Protein sequence | MSRGGDSRGD AGSASAADEV PSDQGVSGGK RLPLSFAQRG NWAAQRLFPG SAAFCVCDLV WLDGAINAGA FADAVSGAFA ETEALRAVIY DDDGAPSQSV GGKLSLPTVV PEEVLTDEEI RSVVRAGVSA RESSAAEDLT SSTLFKRVGG GWVWSFTTNN LLLDGYSTSL YIRRVAELYS AAVDAILAPP RWFGRLEDLV AGSASTPGGS GIVGHWRDVL AIDAFSEPMG SAPADLFSFS YRPVPVVLPA GADDRLRMLA RQARSTWTDL VITAWGLYTA LAGNQDYLAV RVPSMMRGEP ESLRVPGAVA RALPVATALR PGATFAEVLS VVGSQVRGLR DNSAIEDHQL ARLWKGGELS YLSLPSVNIK MFRTTPVFGK VAGVTELINP GPTGALDLSV YGSPGRGLRM DISGRSPLVP TDAAGQHAAA FTAFLGHLLG GPAGMTLHEL ADLTITPSSV DAGSVGVWSV GAGAVVPAVT VDALIRDQVA RTPGAVAVVD DADGAELVYA QFDARVNALA HLLTERGVRV GGRVAVALPR SADLVTSLAA VLRAGAAYVP VDPGYPAERI AAILQDSGAR VAITDSATAV AHAGVLTAAG VVTVVLDEDA VRGQIEHGAP DAPVLPRPLT PDDTAYVIFT SGTTGRPKGI ALSHAAVVNR LVWGREALGF SSSDRVLLKT PFTFDVSVPE FFLPLITGAV VVVARDNAHG DPGYIAGVVR KRRVTSVHFV PSMLQAFLDS GVEAGFFPDV RLVSFTGEAL PVAAAIRARE VFDRAELFNL YGPTEAAVEI ASYDIAALNA DADSTPIGRP VSNSYVRVLD GWLRPVPVGV TGELYLGGVQ LAEGYVGRAG LTAERFVADP LGAPDERLYR TGDLARWNDQ GELEYLGRSD DQVKVRGFRI ELDEIRAVLE RHPAVSGAAV TALDHPAGGK FLAAYVTTTP SAPADQAVLA DALREHTNAL LPEYMVPASF TRLATLPTTP SGKLDRKALP APDLTAGSGS GRPPETDTEL SLAGVFRDVL ALPEGTPLSV DDDFFRLGGD SILAIRLVAR ASQRQHTFTL RDVFEQRTVA KLSQKIIKEV EAKAISTSMI TVPASPTLER LRESRDDPNS WILTETAILP VSLSHDALLA AYASLVQEHD LLRMSVQTVS RRLWLTWVTP MTTIAPTLSR VHVAGASPST KLTDLRTMAS QMIDVTNARP SGLAYASSST QTLIALAVHA AVADRYTVHQ LLETLRALTD ENANKSMLST PSVAATLTEV ADIAAALKMD RIENPIELIE RTDSLDEGLY SADRTQVIHW DGSRTDATVR ETIRRALHAT GYGSRLGGVV DHEAPLLPDA ALGPLGPMTV TVPVPIEQET WHPDPEFALA RYGSSSGRRL LAGIPIAPIL ISRSYSADAE SLAIEPTEAA YRAVIRYRVE ACSTILTVIG FTRAVILAFE NVLRKVGDNA DRRVWRP