Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4529 |
Symbol | |
ID | 5672878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5402982 |
End bp | 5404571 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243394 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001508810 |
Protein GI | 158316302 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCC CGATCCCATC GTGGGGACGC GATGTCACTG TCGAACAGGT AGGCGGCGTT CCGTTCCGAA TGTACGAGCC GCGGGCGCGG AGGCTGGAGT CCCTGCTGGA CCATGCCGAC CGCTGGGCGG GCCGGGCCCA CGCCGTGCAG GGCGACGTGC GGCTCGACTT CGGCGACCTC GTGCCCGCGG TCGGTCACAA GGCGGCCCAG CTGAGGCAGC ACGGGGTGGG CCGTGGGGAC CGGGTCGCGC TGCTGGGCTG GAACAGCCCG GACTGGGTGG TGAACCTCTG GGCCACCTGG TGGCTCGGGG CCGTCCCGGT CCTGGTCAAC GCCTGGTGGA GCACCCGAGA GATGGAGCAC GCCTTCACGG CGCTCACACC GGTCGCCGTG CTCGCCGACC GCCGGCTGGA GCACAAGGTG CCAGCCGGGA GTCCCGTCGC GCCGTGGCTG ATGGCCGACG CGGTCGACGG TGCGCCGCCT GAGCGTCCCG ATAGCGACGA CGAGAACGAG CCCGCCCTGA TTCTGTTCAC CTCCGGCAGC ACCGGATTCC CCAAGGCGGT CGTGCTCTCG CACCGCTCGA TCATCTCCGG CCTGCACTCA CTGCTGAGAA TCACCAAGCG GCTTCCGCAG GAACTCGAGG GCGCGGCGCC CAGCGTCGCC CTGCACACGG GGCCGCTGTT CCACATCGGC GGCGTACAGA CCCTCGTGCG AGGGGTCGTG GTCGGCGAGA CCCTGGTGTT CCCGGAAGGC AAGTTCGACG CAGATGCGGC GATGGACCTC ATCTCCGCGC ATGGTGTCAC CCGCTGGAGC GCGGTCCCCA CCATGGTCAG CCGGCTGCTC GACGCCCAGG CACAGCGGCC GGTCGACCTG CACAGCCTGC GGTCACTGAC ACTGGGCGGC GCCCCCGCCC ACCCCAGTCT GTACCAGCGG ATCCGCGACG AGCTGCCGTC CGTCCAGGCC CGCGTGGCCA CGGGTTACGG CCTCACCGAG AACGGTGGGC AGGCCACCGC GGCCAGCGGC CGCGACACCC GTGACCGGCC CGGCTGCTGC GGCCTGCCGC TACCGACCGT CGAGATCTCC TTCGGTGACC GGACGCCGGG CGGGGACGGC GAGGTCCTGC TCCGCGCCGC GTCGCAGATG CTCGGCTACT ACGGGGAGGC GTCCGGCCCC ATCGACGCCG AGGGGTGGCT GCACACCGGT GACCTGGGCT ACCTCGACGA GGACGGCTAT CTCTGGGTGA CCGGTCGCAG CAAGGATCTC ATTCTGCGTG GGGGCGAGAA CATCGCGCCG CTGTCGGTCG AGCGCGCGCT GGTCGGCGTG CCCGGCGTTC TCGACGCGGC GGTGGTCGGC CTGCCGCACG TCGACCTGGG GGAGGAGGTC GCGGCGGTCG TGGTGGTCGA CGAGGCCACC GCGGGCCGGC CGGATCTCGC GGAGTACGTC ATCGAGGTCC TGCGCTCGGA CCTGGCGTCC TTCGCGATCC CGACCCGGTG GCGCTTCCAG ACCGAGGAAC TGCCGGTGCT CGGGTCGGAG AAGATCGACA AGCACGCGCT CGCCGCGGAG TTCGCCGCCG AGACCGCCGC TTCCCGGTGA
|
Protein sequence | MSGPIPSWGR DVTVEQVGGV PFRMYEPRAR RLESLLDHAD RWAGRAHAVQ GDVRLDFGDL VPAVGHKAAQ LRQHGVGRGD RVALLGWNSP DWVVNLWATW WLGAVPVLVN AWWSTREMEH AFTALTPVAV LADRRLEHKV PAGSPVAPWL MADAVDGAPP ERPDSDDENE PALILFTSGS TGFPKAVVLS HRSIISGLHS LLRITKRLPQ ELEGAAPSVA LHTGPLFHIG GVQTLVRGVV VGETLVFPEG KFDADAAMDL ISAHGVTRWS AVPTMVSRLL DAQAQRPVDL HSLRSLTLGG APAHPSLYQR IRDELPSVQA RVATGYGLTE NGGQATAASG RDTRDRPGCC GLPLPTVEIS FGDRTPGGDG EVLLRAASQM LGYYGEASGP IDAEGWLHTG DLGYLDEDGY LWVTGRSKDL ILRGGENIAP LSVERALVGV PGVLDAAVVG LPHVDLGEEV AAVVVVDEAT AGRPDLAEYV IEVLRSDLAS FAIPTRWRFQ TEELPVLGSE KIDKHALAAE FAAETAASR
|
| |