Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3702 |
Symbol | |
ID | 5672068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4383754 |
End bp | 4385454 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242585 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001508005 |
Protein GI | 158315497 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGGCC CGCCTCTGGG CGACGAGCCG GGTCTCGGGG CGCTGACGCT CCCCGGCTTC CTGCGTGAGG TCACGGCGAG GTTCGGCGAG CGCGAGGCGC TCGTCGCGCG CCCCGCGGGG GACGGGGCGG ACGCCGCGGC CGCCGTGCGC TGGACCTACA GCGAGCTGTG GGAACGCGTC GTCGAGGTGG CGAGCGCGCT GCGCGCCTGC GGGGTCGGCA AGGACACCCG GGTGGGCGTC CTGATGACGA ACCGCCCCGA GTGGATCTCG TCGGTGTTCG GCATCTCGCT CGCCGGGGGG GTCGCCGTCG CCCTCAGCAC GTTCTCGACC CAGTCCGAGC TGGACGACCT GCTGCGGATC TCCGGGGTCT CGGTGCTGCT GCTCGAGCGC AGCGTGCTGA AGAAGGACTT CGCGGCCGTG CTGACCGAGC TGGAGCCGGA GATCGCCAGC ACCGTGCCCG GCGCGTTGGA GTCGGCCCGG TTCCCCTTCC TGCGCCGCGT GGTCATGATC GGGGAAGGCG GGCCCGCCGG CGCGATCGAG ACCTGGGGCG ACTTCCTCGC CCGTGGCCGC GACGAGCCCC GCGAGCTGGT CGAGGCGACG GGCGCGGCGG TGCAGCCGAG CGACACCGCC CTGCTGTTCT TCTCCTCGGG CACCACGAGC CGGCCGAAGG GCATCCTGAA CTCGCACCGC GGTGTCGCCA TCCAGTTATG GCGCTTCCGG CGCATGTACC GGTTCGACCC GGAGGACCAC ATCCGCTGCT GGACGGCCAA CGGCTTCTTC TGGTCGGGGA ACTTCGGCAT GGCGCTCGGC GCGACCTTCG CCAGCGGCGG CTCCGTGGTT CTGCAGCCCA CGTTCCTGCC CGTCGAGGCG CTCGAGCTCA TGGCGACGGA GAAGGTCAAC TTCCCCTTCG CCTGGCCGCA CCAGTGGGCG CAGCTCGAGG CGGCGCCGAA CTGGAAGGAC GTCGACCTGA GCAGCATGCG CTTCGCGGAC GTCAACACCG CGATCGCGCG GCACCCGACG GTCTCGACGC GGTGGGCCGA GCCGGGCCAC GCCTACGGCA ACACCGAGAC CTTCACGCTC ACGACCGGCC TGCCGGCCAA CACGCCGCCG GAGCGGCACC GGGACAGCAG CGGCGAGGCG TTGCCGGGCG TCACGCTCAA GATCGTGGAC CCGCTGACCG GCGCCGTCGT CCCGCGCGGC GAGCAGGGTG AGATCTGCGT CAAGGGGCCC ACACTCATGC TGGGCTACGT CGGCATCCCG CTCGACGAGA CGCTCGACGC CGAGGGCTTC TTCCGCACCG GCGACGGCGG CTACCTCGAC GTCGACGACC TGCTGTTCTG GAAGGGCCGG CTCACCGACA TCATCAAGAC CGGCGGCGCG AACGTCTCGC CCCGGGAGGT CGACGAGACC CTCGCGACCT ATCCCGGCGT CAAGGTGGCT CAGACGGTCG GCGTCCCGCA CGAGACGCTC GGCGAGATGG TGGTCTCCTG CGTCGTCCCG CACGACGGCG TGCGCCTCGA CGCCGACGAG ATCCGCGGTT TCCTCCGGGA GCGGCTGGCG AGCTACAAGG TGCCGCGCCG GGTGCTCTTC TTCCGCGAGG AGGAGATCGC GGTCACCGGC AGCGCGAAGA TCAAGTCCGC CGATCTGCGT GAGCTGGCGG CCAGCCGGCT GGCGGGCGAG ACGGCCCCGG CCCCGGCCTG A
|
Protein sequence | MSGPPLGDEP GLGALTLPGF LREVTARFGE REALVARPAG DGADAAAAVR WTYSELWERV VEVASALRAC GVGKDTRVGV LMTNRPEWIS SVFGISLAGG VAVALSTFST QSELDDLLRI SGVSVLLLER SVLKKDFAAV LTELEPEIAS TVPGALESAR FPFLRRVVMI GEGGPAGAIE TWGDFLARGR DEPRELVEAT GAAVQPSDTA LLFFSSGTTS RPKGILNSHR GVAIQLWRFR RMYRFDPEDH IRCWTANGFF WSGNFGMALG ATFASGGSVV LQPTFLPVEA LELMATEKVN FPFAWPHQWA QLEAAPNWKD VDLSSMRFAD VNTAIARHPT VSTRWAEPGH AYGNTETFTL TTGLPANTPP ERHRDSSGEA LPGVTLKIVD PLTGAVVPRG EQGEICVKGP TLMLGYVGIP LDETLDAEGF FRTGDGGYLD VDDLLFWKGR LTDIIKTGGA NVSPREVDET LATYPGVKVA QTVGVPHETL GEMVVSCVVP HDGVRLDADE IRGFLRERLA SYKVPRRVLF FREEEIAVTG SAKIKSADLR ELAASRLAGE TAPAPA
|
| |