Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1178 |
Symbol | |
ID | 3905289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1406876 |
End bp | 1409728 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637878510 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_480286 |
Protein GI | 86739886 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0144635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTTTGA ACGAAAGCGG TCGGCGCGCG TCCCTTGCGG TCGCGACGCC TGAAGGTGGC GCGTTCTTCC CGGCGTTGCA CTTCGACGAC CTGCCGGCCT CGGTGGTCAG CCGGTTCCGG GAGGTCGCCA CGCACCTGCC CGATACACCC GCGCTGGTCT CGCCCGGCGT CGTCATGACC TTCGCCGAGG CTGATCGCCG GACCGATGAC ATCGCGATGG CGGTCCTCGG CCGCCTCGAC GCGAACGAGG ACGGTCCGGT CGCGACCCTG CTGCCGCACA GCGTGGCCGG CCTGCTGGGC GTACTGGGCG TACTGAAAAC CGGACGCCCG GTCGTCCCGC TGGATCCGAT GGTGCCCGCC GAGCGGATGG CGCAGATCGT TCGGCAGGCC GGCTGCGTGG CCCTGCTGAC CGACCTGGGC GACAGTTCCG TGGCCCGCTC GACGGTGGGC CTGCGGCCGG TGGGGACCAA CGGTTCGAAC GCCACGGCCG AATCCAGCGT GAATCCGGAC GCCCTGCTCG CCGCGCTGGC CGGGGGCGGC CCCCGCCACG TCCTGGACCT CGCCGCGGCG GCCGCGGACG GCGCCCGGTG GATCGCGGCA AACGGCGCGG ACGCGGTCTG GTGGCCGCAG CCGCTGGTCG ACGATCCGGC CTGCATCGTG TTCACCTCCG GTTCGACCGG CGCGCCTAAG GGCGTGGTGT GGACAAACGG CACGTTCCTG TGCGACGCCT ACGCCGGCGC CGAACGTCTC GGTTTCGCGC CGGGCGACCG GCTGGCCCTC GTGCTGCCGT ACTCGTTCGC CGCCGGCATC ACCGTGGTGG TGTTCGGTCT ACTCAACGGG GCCGGGGTGT ATGCCTATGA TCCACGGGCG GCGGGCCTGA GCGGCCTCGC CGACTGGATC TCCTCGCAGC ACCTGACCGC GCTGCAGACC ACTCCGTCTC TGCTGCGCTC CCTCGTCGGC TCGCTCGAAC CGGACCAGGT CCTCGCCGAC CTGCGGATCG TGACCACGTG CGGGGAGGCC GTCTACGGAC GCGACATCAC GGCGCTGCGG CCGCACGTGC CACCGGCGTG TACCTACGTG AACTGGTCCG GTGCCTCCGA GATCGCATCC CTCGGCTTCT TCGAGGTCCC GCCGGGCACG CAGCCGCCCG CGGGCACGGT ACCCGCCGGC CTGCCGGCGA CCGGCAAGGA GGTGGTACTG CGTCGCGAGG ACGGCACGCC CGCCGATCCC GGCGAGGTAG GCGACGTGGA GGTCACCTCG GCCTACCTGT CGGCCGGGTA CTGGGGGAAC GCGGAGATGA CGACGTCCCG CTTCACCCCG CTCACCGATG GCCGGACCAC CTGCCGGACC GGCGACGTCG GCCGGTTCGA ACCCAACGGC ACGCTGATCC TGCTCGGCCG CCGGGACGCG GCGGTGAAGA TCCGCGGCTA TCTCGTCGAA CCGAGCGAGG TGGAGGCGGC GCTGCTGAGC TCCGCGGAGA TCGCCGAGGC GGTCGTCACC GCCGTCGCGC ATCCTTCGGC GCGGAACCGG CTGGTCGCCT ATGTCGTGCC GGCGGTACAC GGCAACACCC TGTCCCCGGT CCGAATCCGG CGGAGGCTAC GCGAGAAGCT GCCGGTCTGG ATGGTCCCGA CGACGATCAT CCCCCTGGCG GAACTGCCCC GCAACGAACG CGGCAAGGTG GACCGGGGCG CGCTGCCGCC CCCGCCGGAG GCCCCGGCCG TCTCCGCCCG GCCGAGAACG CAGTGGGAGA TCGTCGTCGC CGACATCTGG ACGCGGGTCC TCGATCTCGA GGAGGTCGGC ATCGGGGACG ACTTCATGGA GCTCGGCGGC GACTCCCTGG CCGCGAACGA GCTGCTCACC CTGGTCGCGG AGAAGCTCGG GATCACCATG CCGTCATCGG CGCTGGTCGA CGCGCCGACG CTGGGCGAGT TCGCCCGCGC GCTGTCACTC GCGCAGCAGT CGGGTCCGCG ACACCCCACC GTCGTCCCCC TGCGCACCAC CGGCTCGCGG CCGCCGCTGT TCTGCTTCGC CGGGGCCGGC GCGCTCGCCC TCGGCTTCCA CTCACTGGCC CGCCGCCTCG GCGACGACCA GCCGGTCTAC GCCTTCCAGG CGCATGGGCT GGAGCGGCGG GGCGTCCCGG ACTGGAGCGT CGCGCGGACC GCCCGCCGCC ACCTGGAGAT CATCCGCGTC CTGGCTCCCC GGGGTCCCTA CCTGCTCGCC GGGCACTCGC TGGGTGGCCT GATCGCCATG GAGATCGCCC AGCAGCTCGC CGCCGCGGGA GAGGAGGTCG GTTTCCTGTC CATCATGGAC ACCTACCTGC CGGCCTCGCT GCGGATCGAG TCGGCCGGGT CGGGTAGCGC CGCCGGGTCG GGCGGCGCCG CCGGGTCGTC GAAGGCAGCC GAGGACCACG CGCGCGGTTA CCGCGGCCGG CTGGCGCGGT TCACCCAGAA GCTGCTACCG GAACAGCGGG CGAACTTCAC CAGCAAGGCG ACCCTCAAGA AAATGGTGCA GATTCCGCTC ACCGGGGTCG TACAGTTCGG CGGGCTCGAG CAGTTCGACG TCTTCTTCAA CCACGGGCGG CTTCTCGAAC GTTTCTACCG ACCGCGGCCA TGGCCCGGCC GGACCCTCGT GTACCGATCG GCGGAGAATC CGGACCCGGA GGACGCATGG TCAGCATTCC TCACCGGGAG CCATGACACC CATTTCGTTC CGTGTGAGCA TTTCTACCTG CTCCGCGAAC CGCACATCAT CAAGATCTCG GAGCATTTCC GGGCCGAGAT CGACCGGGTG GTCGCCGGGC TGACGAAGAC AGGTCGCCGG GCTGACGAAG GCAGGCTGAC CGCGACGGAC TGA
|
Protein sequence | MALNESGRRA SLAVATPEGG AFFPALHFDD LPASVVSRFR EVATHLPDTP ALVSPGVVMT FAEADRRTDD IAMAVLGRLD ANEDGPVATL LPHSVAGLLG VLGVLKTGRP VVPLDPMVPA ERMAQIVRQA GCVALLTDLG DSSVARSTVG LRPVGTNGSN ATAESSVNPD ALLAALAGGG PRHVLDLAAA AADGARWIAA NGADAVWWPQ PLVDDPACIV FTSGSTGAPK GVVWTNGTFL CDAYAGAERL GFAPGDRLAL VLPYSFAAGI TVVVFGLLNG AGVYAYDPRA AGLSGLADWI SSQHLTALQT TPSLLRSLVG SLEPDQVLAD LRIVTTCGEA VYGRDITALR PHVPPACTYV NWSGASEIAS LGFFEVPPGT QPPAGTVPAG LPATGKEVVL RREDGTPADP GEVGDVEVTS AYLSAGYWGN AEMTTSRFTP LTDGRTTCRT GDVGRFEPNG TLILLGRRDA AVKIRGYLVE PSEVEAALLS SAEIAEAVVT AVAHPSARNR LVAYVVPAVH GNTLSPVRIR RRLREKLPVW MVPTTIIPLA ELPRNERGKV DRGALPPPPE APAVSARPRT QWEIVVADIW TRVLDLEEVG IGDDFMELGG DSLAANELLT LVAEKLGITM PSSALVDAPT LGEFARALSL AQQSGPRHPT VVPLRTTGSR PPLFCFAGAG ALALGFHSLA RRLGDDQPVY AFQAHGLERR GVPDWSVART ARRHLEIIRV LAPRGPYLLA GHSLGGLIAM EIAQQLAAAG EEVGFLSIMD TYLPASLRIE SAGSGSAAGS GGAAGSSKAA EDHARGYRGR LARFTQKLLP EQRANFTSKA TLKKMVQIPL TGVVQFGGLE QFDVFFNHGR LLERFYRPRP WPGRTLVYRS AENPDPEDAW SAFLTGSHDT HFVPCEHFYL LREPHIIKIS EHFRAEIDRV VAGLTKTGRR ADEGRLTATD
|
| |