Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5942 |
Symbol | |
ID | 5674263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7240173 |
End bp | 7245209 |
Gene Length | 5037 bp |
Protein Length | 1678 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244790 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001510192 |
Protein GI | 158317684 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I [COG1020] Non-ribosomal peptide synthetase modules and related proteins [COG3433] Aryl carrier domain |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain [TIGR03494] salicylate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGAGCGC AAGGGAGTTA CCGCAGCAGG TCGGTCCCGC TCGACCAGGC GCCGCAGGCC GCCGCCGCGG GTCTCGCGCG CGGGCTGGCC GCGCCGTTCG TGCTCTACGA GCGCGACGGA GAGTGGTCCT GCGGCTCGGG GGTGCTCGCC GAGGTCGTGC TGAGCGCCAC CGAGGTGAGG CACCGGACGG GCAGCGGCGA CTGGCGGTCC GAGCCGACCG GAGCCACCCC GCTGCGCCAG GTCGCGAATG TCCTCGGCGG CCTCGAGGCG ACCGGGTGGC GGGCCTACGG CTGGGCGGCG TTCGAGCTCA GCCTGCTCCT GCAGGGCCAG CCGGTGCCGG CCGGCGACGA TCCGCTGCTG CACGTGATCG TGCCACGGCG GGAGGTGCGG CTGATCGACG GGGTCGCCGT CCTGCGCGCC GTCGACACCG CCGATCTGGC GGAGCTGGCC AAGCTGCTCG CCGCCGTGCC GAGCGGCCCG CCGGCGGCCA GCGCCCCGCG CGCCGTCGTC GACCTCGAGC ACGGGGTCGA CGAGTACCGC CGGATCGTCG CGGCGGCCGT CGCCGACATC CGCGCCGAAC GGCTGCGCAA GGTGATCCTC TCCCGCGTCG TGCCCGTGGC GGGCGACGTC GACCTGGTGG CCACCTACGA GGTCGGCCGG GCCGGGAACA ACCCGGCGCG GTCGTTCATC GTGGACACGG GCCACCTGCG GGCCACCGGG TTCAGCCCGG AGACCGTCGT CGAGGTGTCC GCGGACGGCC TGGTGTCGAC CCAGCCGCTG GCCGGCACCC GGGCGCTGAC CGGCGATCCC GCCGTCGACC GCGATCTGCG CGAGGTGCTG CTCTCCGATC CCAAGGAGAT CTACGAGCAC GCTGTCTCCG TACAGGCCTG CCAGGACGAG CTGCTCCAGG TCTGCCGGCC CGGGTCGGTG GTCGTGGACC AGTTCATGAA CGTGCTGCCC CGCGGCAGCG TCCAGCACCT TGCCTCGCGG GTGGCCGGCC GGCTCGCCGC CGGCCGCGAC GCCTGGGACG CACTGGGCGC GGTCTTCCCC TCCATCACCG CCTCCGGCGT GCCCAAGCCG GTCGCGTGCG CGGTGATCGG CGACTACGAG GGGGAGCTCC GCGGCCTGTA CAGCGGCGCG GTGCTGATCG CTGACTCCGA CGGCGGGCTC GACGCGGCGC TGGTGCTGCG CACGGTGTTC CAGCAGGACG GCCGGACCTG GCTGCGCGCC GGGGCCGGCA TCGTCGGCCA CTCCGACCCC GACCGCGAGG CGGAGGAGAC CCGGGAGAAG CTGCGCAGCG TCAGCCGTTT CCTGGTCGCG GCGCCCGCCG CGCCGGCCGC CCCGGCTGTT CCCGCCGCGC GGGCGGGCAC CCCCGGGTAC CAGCTCGAGG ATGTCCGCCG CATCGTGGCC GAGCTGATCG ACGAGGATCC GTCGGCGATC GGCAACGAGG CGAACCTGTT CGAGCTGGGC CTGGAGTCCA TCGCCCTGAT GAAGGCCGTC GGCCGCTGGC GCCGGGCCGG GATGGCGGTC TCGTTCGCCG AGCTCGCGGA GAACCCGACC GTCGACGGCT GGTGCAAGCT CCTGTCGATG CGGGCGCCCA CCGAGCCCGC CGGCGCGGAC GACGGCGCGC AGGCCCGCGC GGAGGCCGGT GAGTTCCCGC TCGCGCTCAT GCAGCATGCC TACTGGGTCG GCCGGGACGG CGGGCAGCCG CTCGGCGAGG TCGCGGCGCA CCTCTACACC GAGTTCGACG GCGCGGGAGT CGACGTCGAC CGGCTGCGCA CGGCTATCGA GGCGCTCGTC GCGCGGCACG ACATGCTGCG CGTGCGGATC ACGGACAACG GGAGCCAGGT CGTCGAGGCG ACCTCGGGCT GGCGCGGTCT CACCGTGCAC GACCTGCGCG AGCTCGGCCA GGCCGAGACC CAGGCCCGGC TCGCCGCGGT GCGGGACCGG ATGTCGCACC AGATGCTGGA CATCGAGCAC GGCGAGGTGT TCGCCACCGC GCTGAGCCTG CTGCCGCAGG GCCGGACCAG GCTCCACCTG GACGTCGACA TGGTCGCGGC CGACGCCGTC AGCTACCGGG TGCTGCTCAA CGACCTCGCG CGGTTCTACG ACCGTCCCGG CGAGGAGCAC CCGCCGCTGG GCTACACCTA TCGCGAGTAC CGCGCCGCGC GGGTCCCGGC CAGGCGCGCC GCCGCGCGCG CGGCCGCCGA GTGGTGGCAG GGCCGGCTGC CGGGCCTGCC CGGCGCGCCC GGCCTGCCCC GCCTGGCCTC GCCCGACGAC GCGGGCACCG CCTCGAGCGC CGGCGCCGCG AGCACGGCGG GCGCTACCGG CTCCACCGGC GAGCCGCCGC GGGTGACGCG GCGTCACTTC GTCCTGGGGC CGTCGGCGCG CCAGGCGTTG CAGCGCGCCG CCCACAGCCG GGGCGTCACC CCGGCGATGG CGGTGGCGAC CGCGTTCGCC GAGGTGCTCG CCGGGTGGAG CACCGAGTCC CGGTTCGTCC TCAACGTGCC GATGTTCGAC CGCGACCAGG TGCACGCGGA CGTCAACCAG GTGGTCGGGG ACTTCACGAG CTCGGTGCTG CTCGAGGTCG ACCTGGCCGA GCCGCAGCCG TTCGCGACCC GGGTGCGCCA GGTCCAGGCC CGGCTGCACG CCGACGCGGC GCACGCGGAC TACTCCGGTG TGGAGGTGCT GCGCGACCTG ACCCGCCGCA CCGGCGAGCA GGTGCTGGCG CCCGTGGTGT TCACCAGCGC GCTCGGGCTC GGGGAGCTGT TCGGCCCCGG GGTCCGCCAG CACTTCGGCG ACCCGGTGTG GATCATCTCG CAGGGCCCCC AGGTGCTCCT GGACGCGCAG GTCACCGAGC TGGACGGCGG CCTGCTCGTC AACTGGGACG TCCGCGACGG CGAGTTCGCC CCGGGCGTGG TCGACGCGAT GTTCGGCGCC TTCGAACGGC TGGTGCGCGG CCTCGCCGAC ACCGCGGGCA CCTGGGACAC GGCGGTCGAC GGCCTGGTGC CGGAACCGGC CCGTGCCATC CGCGCCGCGG CCAACGACAC CGCCGGGCCG GTGCCGACCC GGCTGCTGCA CGAGGGGTTC TTCGAGAACG CCGTGCTGGC CCCCGACGCC CCCGCGCTGC TGTGGGACAC CGCCGGCGGC CCGGGTTCGC TGGCCTACGG CGAGCTGCGG CGCCGGGCGC TGGGGCTGGC CGGCGCGCTG GCCGGCCACG GCGTGCGCCG CGGCGACCTG GTCGGAGTCA GCCTGCCCAA GGGGCCGTCC CAGGTGGTCG CCGTCCTCGG CGTCCTGGCG GCGGGCGCGA CGTACGTGCC GGTCGGCATC GAGCAGCCCG CCGCCCGGGT GGAGCGGATC GCGGCCGCCG CCGGGTTCGG GGTGCTCATC ACCGAGTCGC ACCGCGACGG CGTGCCCGCC GGGGTCGTCC AGCTGGCGCC GGACCAGCCC GCCGAGCCGG CGCCGGTACC GGATCTCGCC GCCCCCGGCG AGCTCGGGGG GCTGGACCGG CCGGCCTACG TGCTCTTCAC CTCCGGATCG ACCGGTCAGC CCAAGGGCGT GGAGGTCGGG CACCGGGCGG CGATGAACAC GATCGCCGAC CTGATCGACC GGCTGGGCCT GGGCACCGAC GACCGGACGC TCGCCGTGTC GGCGCTGGAC TTCGACCTCT CGGTGTTCGA CATCTTCGCC CCGCTGTCGG CCGGCGGCGC CGTGGCGCTC GTCGACGAGG ACTCCCGCCG GGAGGCCAGC CGGTGGGCGG AGCTGATCCG CGACCACCGG GTGACCGTCC TCAACTGCGT CCCCACGGTG CTGGACCTCG TCCTCGCCGC CGGGGTGGCG CTCGGGGACA GCCTGCGGGC GGTCCTGCTC GGCGGCGACA AGGTGGGCGT GGACCTGCCG GGCCGGCTCG CCGCGGCGGT GCCGGGCTGC CGGTTCCTGG GCCTGGGCGG CACGACTGAG ACGGCCATCC ACTCGACGAT CTGCGAGGTG GAGGGGGCGT CCCCGCTGCC GCCGCAGTGG CGCTTGGTCC CCTACGGGAC GCCGCTGCGC AACGTCCGGC TGCGGGTGGT CGACCCGCTC GGCCGAGACT GCCCCGACCA CGTGGCCGGG GAGCTCTGGA TCGGCGGTGA CGGCGTCGCC CGCGGCTACC TGGGCGATCC CGAGCGCACC GCGGACCGCT TCGTCGAGCA CACCGGCATC CGGTGGTACC GGACGGGTGA CATCGCCCGG TACCTGCCGG ACGGCACGGT GGACTTCCTC GGCCGGCGCG ACGACCAGGT GAAGATCCGC GGCTTCCGGG TCGAGCTGGG CGAGGTGGAG GCAGCCCTGA CCACGCTGCC GGAGGTCCGG GCGGGGGTGG CGGTGCTCGT GCGCGGCGCG TCCGGGCGGT CCGCCGTGCT CGGCGGGGGC GTCGTGCTCG GCGCCGGCGT CGTGCCGGCC ACCCCCGCCA CGGGCGAGGC CGACGACGGC GGGGGCGGCG GCGAGAGCTC CGGCGACGGC GCGGGCATCG CCGGCGCCGT GCGCGAGGGG CTGCGCCGCG CGCTGCCGCC GCACATGGTC CCGGACCTGG TGGTCGCGCT CGACAGCCTG CCGCTCACCG CCAACGGGAA GATCGACCGC CGGGCGGTGA CGGCCGCCGT CGAGCGGGCG GTGGCCGGCC GGGCGGCCGA CCACGCACCG CCGCGCACCG ACCTGGAGCG GGTCGTGCAC AACGTCTGGC GCGAGGTGCT CGGGGTGGCC GAGTTCGGGA TCACCGACGA GTTCTTCGCG CTGGGTGGTG ACTCCGTCCT CGCCACCGCG CTCGTCACGC GGCTGCGCGA CGAGCTGGAC ACCGCCGCCG TCACCGTGCG CTCGGTGTTC GGGGCGCCGA CCGTCGCCGC GCTCGCCGAG CGGATCCGCG CCGCCGACAC CGTCCCCGGC CGGGCGGAAC GGGTCGCCGC GATCGCACTG GAGATCGCGG CGATGTCGGA CGACGAGGTC GCGGCCGAGC TGGTCGACCC GGACACGCTC CCCGCCGACT CCGGCGGTGC GTCGTGA
|
Protein sequence | MRAQGSYRSR SVPLDQAPQA AAAGLARGLA APFVLYERDG EWSCGSGVLA EVVLSATEVR HRTGSGDWRS EPTGATPLRQ VANVLGGLEA TGWRAYGWAA FELSLLLQGQ PVPAGDDPLL HVIVPRREVR LIDGVAVLRA VDTADLAELA KLLAAVPSGP PAASAPRAVV DLEHGVDEYR RIVAAAVADI RAERLRKVIL SRVVPVAGDV DLVATYEVGR AGNNPARSFI VDTGHLRATG FSPETVVEVS ADGLVSTQPL AGTRALTGDP AVDRDLREVL LSDPKEIYEH AVSVQACQDE LLQVCRPGSV VVDQFMNVLP RGSVQHLASR VAGRLAAGRD AWDALGAVFP SITASGVPKP VACAVIGDYE GELRGLYSGA VLIADSDGGL DAALVLRTVF QQDGRTWLRA GAGIVGHSDP DREAEETREK LRSVSRFLVA APAAPAAPAV PAARAGTPGY QLEDVRRIVA ELIDEDPSAI GNEANLFELG LESIALMKAV GRWRRAGMAV SFAELAENPT VDGWCKLLSM RAPTEPAGAD DGAQARAEAG EFPLALMQHA YWVGRDGGQP LGEVAAHLYT EFDGAGVDVD RLRTAIEALV ARHDMLRVRI TDNGSQVVEA TSGWRGLTVH DLRELGQAET QARLAAVRDR MSHQMLDIEH GEVFATALSL LPQGRTRLHL DVDMVAADAV SYRVLLNDLA RFYDRPGEEH PPLGYTYREY RAARVPARRA AARAAAEWWQ GRLPGLPGAP GLPRLASPDD AGTASSAGAA STAGATGSTG EPPRVTRRHF VLGPSARQAL QRAAHSRGVT PAMAVATAFA EVLAGWSTES RFVLNVPMFD RDQVHADVNQ VVGDFTSSVL LEVDLAEPQP FATRVRQVQA RLHADAAHAD YSGVEVLRDL TRRTGEQVLA PVVFTSALGL GELFGPGVRQ HFGDPVWIIS QGPQVLLDAQ VTELDGGLLV NWDVRDGEFA PGVVDAMFGA FERLVRGLAD TAGTWDTAVD GLVPEPARAI RAAANDTAGP VPTRLLHEGF FENAVLAPDA PALLWDTAGG PGSLAYGELR RRALGLAGAL AGHGVRRGDL VGVSLPKGPS QVVAVLGVLA AGATYVPVGI EQPAARVERI AAAAGFGVLI TESHRDGVPA GVVQLAPDQP AEPAPVPDLA APGELGGLDR PAYVLFTSGS TGQPKGVEVG HRAAMNTIAD LIDRLGLGTD DRTLAVSALD FDLSVFDIFA PLSAGGAVAL VDEDSRREAS RWAELIRDHR VTVLNCVPTV LDLVLAAGVA LGDSLRAVLL GGDKVGVDLP GRLAAAVPGC RFLGLGGTTE TAIHSTICEV EGASPLPPQW RLVPYGTPLR NVRLRVVDPL GRDCPDHVAG ELWIGGDGVA RGYLGDPERT ADRFVEHTGI RWYRTGDIAR YLPDGTVDFL GRRDDQVKIR GFRVELGEVE AALTTLPEVR AGVAVLVRGA SGRSAVLGGG VVLGAGVVPA TPATGEADDG GGGGESSGDG AGIAGAVREG LRRALPPHMV PDLVVALDSL PLTANGKIDR RAVTAAVERA VAGRAADHAP PRTDLERVVH NVWREVLGVA EFGITDEFFA LGGDSVLATA LVTRLRDELD TAAVTVRSVF GAPTVAALAE RIRAADTVPG RAERVAAIAL EIAAMSDDEV AAELVDPDTL PADSGGAS
|
| |