Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2984 |
Symbol | |
ID | 5671368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3509965 |
End bp | 3511854 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641241888 |
Product | AMP-binding domain protein |
Protein accession | YP_001507308 |
Protein GI | 158314800 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGATG ACCTGCTCTG GCCAGCCTAT AACGGGCCAG CAGACTTGGC CGCGGTCGAG GCTGTGCCAT TTGAGGCACG CGGCCTTCCT GACTCGACCT ACACACTGCT CAAATATGCG GCGGCGCAGT GGCCGGATCA CACGGCGCTC ATTGTCCTGC CCGAGGCCGC CCGCTGGCGG GAGCCACTGC ACCGCAGCTT CATCGAACTC CTGGCCGATG TCCACCGTTA CGCGAACCTG CTGTACAGTC TCGGCGTACG GCGCGGTGAC GCCGTCGCCC TGATGTCAGC CAACTGCGCC GAACTGGTCG GTGCCACCCT CGCCGCGCAG CTCGCCGGGA TCGCGGCGCC GCTCAATGGC AACTTGTCCT CGCCGCATCT CACCGAACTT CTCCGGCGCT CGGGTGCCCG AGTGCTGATC ACCGCCGGGC CCGACCTCGC CCCGACCACC TGGATTACCG CACAGGCCCT TGCCGCGGAC GGGATGCTCG ACGCCGTCCT TGCGCTCCGG CCTACAGCGG CGGTGGGCGC ACTCAAAGCC CTGCCCGCCA TCGAGGGTGT GCGCATCGGC TACCTCAGCG AACTTGCTAC CGGCATGGAA CCGTCCGCTT TCGACGGTGA GCCGCCCGGC TCGACGGACT TGGCCGCGCT GTTCCACACT GGCGGCACCA CCGGCGCGCC GAAACTGGCC GCTCAGACGC ATGCCAATGA GATTGCCAAC GCGTGGATGC TTGCCGCCGA CTCCCAGTTA GACCAGGACT GCGTGGTCTT CGCTGGGCTT CCACTGTTCC ACGTCAACGC GCTCGTGGTC ACGGTACTTG CTCCCCTGTT CAAGGGGCAG ACCGTGATGT GGGCTGGCCC GCTCGGCTAT CGCGACATCG AGCTGTACGG CGAGTTCTGG AAGATTGTCG AGCACTACCG GATCGGGTGC ATGAGCGCGG TGCCCACGGT GTACGCCGTG CTGGCGCAGT GCCCGGTCGA CGCCGACATC AGCAGCCTGC GGTTCGTGGC AACGGGCGCC TCGCCGCTTC CCTCCGCGGT TCGAGACGAC TTCCAAGCAC ACACCGGGAT AGCCCTGGTT GAGGGATACG GTTTGACCGA AGCGACTTGC GCGAGCGCAC GCAGCTTCGC GGATGGGCCG GGGCCGGGCT CAGTAGGGCA GCGCCTGCCC TACCAGCGGG TGAAGGTGGT GAAGGTCGGC CTGGACGGAG CGTGGGAAGA CCTACCCAAG GGCGAGATCG GCGTTCTGGC TATCAGCGGT CCGACCGTCT TCCCCGGCTA CGTCACCGGC CACAACGAAC ACGGCCACCT CTTGGACGGG CTCGGTAGGC TCAGCGACGG ATGGCTGGAC ACCGGAGACC TCGCGCGAGT TGACGAGGAC GGCCTTATCT ACCTTGCGGG AAGAGCCAAG GACCTCATCA TCCGCGGCGG CCACAACATC GACCCGACGA TCATCGAAGA TGCCCTGCTG GCTCACCCGC ACGTCACCGC TGCCGGAGCG GTCGGGCGCC CCGACGTCCA TTCCGGCGAG GTTCCTGTCG CCTACGTGAC ACTGGTGCCC GGTGCCGGGG TGACCGAGCA CGAGCTGCGG GATTGGGCAT GTAGGCAAGT ACTCGAGCGC GCCGCCCAGC CGAAGGCCGT GATCATCCTC GAGGCGCTTC CCATAACCGA TGTCGGCAAA CCGTACAAGC TCCCGCTGCG GGCAGATGCC ACCCGCAGAG AACTCCTGGC CGCGCTCAAC GAGGTCGCCG GCGTTCGCGA CGTCGAGGCC ACCATCGAAG ACGGATCGAT CGTGGCGACC GTCAAGGTCT CCCCATCTGC CGGCGAGGCA GCCGTCAAGG CGATCCTTGG CCGCTACGCG ATCCGATGGC ACATGGTCAC GACATCATGA
|
Protein sequence | MSDDLLWPAY NGPADLAAVE AVPFEARGLP DSTYTLLKYA AAQWPDHTAL IVLPEAARWR EPLHRSFIEL LADVHRYANL LYSLGVRRGD AVALMSANCA ELVGATLAAQ LAGIAAPLNG NLSSPHLTEL LRRSGARVLI TAGPDLAPTT WITAQALAAD GMLDAVLALR PTAAVGALKA LPAIEGVRIG YLSELATGME PSAFDGEPPG STDLAALFHT GGTTGAPKLA AQTHANEIAN AWMLAADSQL DQDCVVFAGL PLFHVNALVV TVLAPLFKGQ TVMWAGPLGY RDIELYGEFW KIVEHYRIGC MSAVPTVYAV LAQCPVDADI SSLRFVATGA SPLPSAVRDD FQAHTGIALV EGYGLTEATC ASARSFADGP GPGSVGQRLP YQRVKVVKVG LDGAWEDLPK GEIGVLAISG PTVFPGYVTG HNEHGHLLDG LGRLSDGWLD TGDLARVDED GLIYLAGRAK DLIIRGGHNI DPTIIEDALL AHPHVTAAGA VGRPDVHSGE VPVAYVTLVP GAGVTEHELR DWACRQVLER AAQPKAVIIL EALPITDVGK PYKLPLRADA TRRELLAALN EVAGVRDVEA TIEDGSIVAT VKVSPSAGEA AVKAILGRYA IRWHMVTTS
|
| |