Gene Information |
Locus tag | Franean1_5341 |
Symbol | |
ID | 5673675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6438561 |
End bp | 6441308 |
Gene Length | 2748 bp |
Protein Length | 915 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244199 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001509605 |
Protein GI | 158317097 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
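The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp rows of this record.
gene_bp, protein_aa = cds_lengths(6438561, 6441308)
print(gene_bp, protein_aa)  # prints: 2748 915
```

Both results agree with the Gene Length (2748 bp) and Protein Length (915 aa) rows.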
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.795016 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTCTG TCGGACCGCC CCGCCCGGGC TCCCCCGCGT TTCTGGCGTC CTCGCTCGAG CGGTACGGAG AGCAGACGGC GATCATCACC GCCGAGGGCG AGCTGTCGTA CCGGAAGCTG GGGGCCAGGG TCACCGCCGC CGCCGAGCGA ATCGGTGGCG AGCGACGCCT GGTGCTGCTC GTGGCGGCCA ACACGGTCGA CGTCCTGGTC ATGTATCTGG CGGCACTGTC GGCCGGGCAC GCGGTCCTGC TCGTGCCCGG CGACACCTCG GCGCCGACCG GTGCCTCGAC GTCGGTCGAC GCCCTGATCG ACGCCTACGA CCCGGATGTC GTCATCCGGC CCGCGGGCCC GGCGACCCAT GTGGAGCAGC GACGCACAGG CACGCGGCAC GTTCTGCACC CTGAGCTGGC GCTGCTGCTG AGCACCTCCG GCTCGACGGG CTCACCAAAA CTGGTCCGGC TGTCGTACAC GAACCTGCAG GCCAATGCGG AGTCGATCGC CGAGTATCTC GACGTGCGGC CCAGCGACCG CGCCGCGACG ACCCTGCCCA TGTACTACTG CTACGGGCTG TCGGTCATCC ACAGCCACCT GTTACGCGGA GCCGGGCTGG TTCTGACGTC GCTGTCAGTC ATGGACGCCT GCTTCTGGGC GCTGTTCCGG AAGGCCCGCG GCACCTCCCT GGCCGCCGTG CCGTACACCT TCGACCTGCT CGACAGGATC GACTTCGACG CGATGTCGCT GCCGCACCTG CGTTACATCA CCCAGGCCGG GGGTCGGCTG GCACCCGACC GGGTCGCCCA CTACGCCCGG CTCGGGCAGC GCGACGGCTG GGACCTCGTC GTGATGTACG GGCAGACCGA GGCCACGGCC CGCATGGCCT ACCTGCCGCC ACACCTGGCC GCGACATACC CGCACGCCAT CGGTGTGCCG ATCCCGGGCG GGTCGTTCCG GCTCGCCCCG GTGACCGCCG GCGCCGGCAG CCCCGCAGGC GCTGGGAGTC CCGGGGACGT CGGCGAGCTT GTGTACTCCG GCCCCAACGT CATGATGGGC TACGCGCACA CCGTCGCGGA CCTCGCCCTC GGCAGGATGC TCGACGAGCT CCACACCGGA GATTTGGCAC GGTGCACCGA CGGCGGGCTC TACGAGGTCG TCGGCAGGCA GGCCAGGTTC GCCAAGCTCT TCGGGCTCCG GATCGATCTC CAGCGTGTGG AGGACGTGCT CGCGGGCGAA GGACTCAGCG CCTGCTGCGT GACCGACGAC GACGCACTCC ATGTGGTCGT GGAGGGCACC GGTGAGACGG CTCCCGTCCG CCGGCTGGTC GCCCGGGCCT GCCGGCTGCC GGAGAGCGCG GTCCACGTGC ACCGTCGGGA CGCGCTGCCA CGCCTGACCA CCGGGAAGGT CGACCTGGTC GCGGCTCGGG AGGTGGCCCA CGAGGCGCTC GGGGCCGCCC CGCGCACGAC CATGCCGTCC GGGCGGGGTG CGCCACTGGA CGGCACAGCC GCGCTGCGCC GGCTGTTCGC GGACTCGCTC GGCCGGGACG ACGTCACCGA CGACAGCACC TTCGTGAGCC TGGGCGGGGA CTCGCTGTCC TACGTGGACC TGTCGCTGCG CCTGCAGGAG CACCTCGGAC GGCTGCCCGC CGACTGGCAC ACCACGCCCC TGCGGGACCT GGAGCCGGCC GCGTGGCCGA GGCGCCGGGC AGGTGGCTCG ATTGAGACCA GTGTGCTGCT CCGCGCCATC GCGATCGTGT TGGTCGTCGG GACGCACGCC AACACCTTCG ACGCCGTCGG TGGCATCCCC 
GGCGGCGCGC ACCTGCTGAT CGCCGTCGCC GGCTACAACT TCGTCCGTTT CCACCTGACG CCGGCGCCGC GGTCACAACG CGTGCACGGA CTGCTGCGCA GCATCGCGCA CCTGGTGGTG CCCAGCGTGC TGTTCATCGC CACGCTCGTG CTCCTGATGG ACGAGCACGG ACTCGCCAAC ATCGTGCTCC TCAACAGCGT GTTAGGGCCG GCGCAGGCCG GCCCGCCGTG GGAGTTCTGG TTCATCGAGG CGGTGGTCCA GATCCTGCTG GTCCTCACCG TCCTGCTGGC GGTTCCGGCG GTCGACCGGC TGCAGCGGCG TTTCCCGTTC GGGTCCGCCC TCGGCGTTCT CGCGGGCGCG TTGCTGTTCC GGTTCGACGT GGTGGAGCTG CCGGCGGGGC CACGCGACAC GCAGACGTCG CTGGCCGTGC TCTGGCTGTT CGCGCTCGGC TGGGCCGCCG CCGCGGCGGG CACGGTGTGG CACCGCCTCG GCGTGACGGC GCTGGCACTG CTCACCGTCC CCGGCTACTT CGCCAACGCG CGCCGCGAGG ACATCATTCT CGGCGGTCTG GTCCTCCTGA TCTGGACCAC AGACCTGCGC TGCCCGCGGC TGCCGCGGCG TCTGGCCGGC GTGCTGGCCA GCAGCTCGCT CGCCATCTAC CTGACGCACT GGCAGGTGTT TCCCGCCCTG CAGGACGACC ATCCGGTGCT GGCACTGCTG GCCTCCCTCG CGGTCGGCGT CGCGGCCTGG AGGGCATGGT CCCGGGCCAC GACGCTGCTC CGCCGGCTCC CGGTGGGCGC CCTGGCGTTC CGTGCCGCGC GCCGCCCGCC CCGTGCCGCG GATGGCGGTA CTCCGGCTGC CGTTTCGGGC CCAGCGCCCC GCCGCGCCCC GTTCGCGCCT GATGTGCCCA TAGTCACAAT CTCTGGCTCC GAGGTAAGGA TGGGCTAA
|
Protein sequence | MASVGPPRPG SPAFLASSLE RYGEQTAIIT AEGELSYRKL GARVTAAAER IGGERRLVLL VAANTVDVLV MYLAALSAGH AVLLVPGDTS APTGASTSVD ALIDAYDPDV VIRPAGPATH VEQRRTGTRH VLHPELALLL STSGSTGSPK LVRLSYTNLQ ANAESIAEYL DVRPSDRAAT TLPMYYCYGL SVIHSHLLRG AGLVLTSLSV MDACFWALFR KARGTSLAAV PYTFDLLDRI DFDAMSLPHL RYITQAGGRL APDRVAHYAR LGQRDGWDLV VMYGQTEATA RMAYLPPHLA ATYPHAIGVP IPGGSFRLAP VTAGAGSPAG AGSPGDVGEL VYSGPNVMMG YAHTVADLAL GRMLDELHTG DLARCTDGGL YEVVGRQARF AKLFGLRIDL QRVEDVLAGE GLSACCVTDD DALHVVVEGT GETAPVRRLV ARACRLPESA VHVHRRDALP RLTTGKVDLV AAREVAHEAL GAAPRTTMPS GRGAPLDGTA ALRRLFADSL GRDDVTDDST FVSLGGDSLS YVDLSLRLQE HLGRLPADWH TTPLRDLEPA AWPRRRAGGS IETSVLLRAI AIVLVVGTHA NTFDAVGGIP GGAHLLIAVA GYNFVRFHLT PAPRSQRVHG LLRSIAHLVV PSVLFIATLV LLMDEHGLAN IVLLNSVLGP AQAGPPWEFW FIEAVVQILL VLTVLLAVPA VDRLQRRFPF GSALGVLAGA LLFRFDVVEL PAGPRDTQTS LAVLWLFALG WAAAAAGTVW HRLGVTALAL LTVPGYFANA RREDIILGGL VLLIWTTDLR CPRLPRRLAG VLASSSLAIY LTHWQVFPAL QDDHPVLALL ASLAVGVAAW RAWSRATTLL RRLPVGALAF RAARRPPRAA DGGTPAAVSG PAPRRAPFAP DVPIVTISGS EVRMG
|
| |
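The sequences in this record are displayed in space-separated blocks of 10 characters, so they need to be normalized before any downstream use. The following is a minimal sketch, assuming plain whitespace is the only formatting to strip. The helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the example string is just the first two blocks of the gene sequence, not the full record.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small demo.
example = "ATGGCCTCTG TCGGACCGCC"
clean = normalize_sequence(example)
print(clean, len(clean), gc_fraction(clean))
```

Applied to the full gene sequence, the same two calls would give a value to compare against the GC content row.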