Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0991 |
Symbol | |
ID | 3905847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1174730 |
End bp | 1177960 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637878324 |
Product | acyl transferase region |
Protein accession | YP_480103 |
Protein GI | 86739703 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGAAGC AGAGGAAGAG CGACGATCGA CTGCCCCGGG TCTCCCCGGA TCTTGCGGAC CCGATCCGGC CCGTCCTGGT CGAACAGCTG ACCAACAGTG ACATCACGGT CACTACAACC GACGACTCCG CCAACGACGC TCCGGTCAAG AAGGCGGATC AGACCATCCG CGCGGATCCG AGTAAGGGTT CGGGTGAACG GACGCGGGAT GCTGGGGGAA ACACCTCCAG CGCGGTTGCG GTGATTGGGA TCGGGTGCCG CCTTCCTGGC GGCGTTGGTT CCGCGGCGTC GCTGTGGGAT TTCCTCCGCG ACGGTCGTGA TGCGGTCACC GATGTGCCAT CAGAGCGATG GAGTGAGGCG TCGCGGGAAG CATCGCAGGC AGAGACCGAT CGGAACGTGC TGTGGCGTGG CGGGTTTTTG TCGGAGGACG TCGGCGCCTT CGATCCAGAG GCCTTCGGTA TCGACCCGAC CGAGGCAGAG TTGATCGATC CTCAACATCG GCTGTTGCTG GAGGTCGTGC AGGAGGGGTT CGAGCACGCT GGATTGCCAA CCGATGACCT GGTAGGGAGC AGCACTGCTG TGTTCGTCGG CATGTCCAAC TTGGACCATA TGCTCCACGC ACATCAGCTC CCGAGCGGCG GGGGCCCCTA CTTTGTGCCC GGTAACCAGG CCGGACCGGC ATCCGGTCGG ATATCGCACG TCTTCGGTTT GCGCGGTCCC AGCATGACCG TGGACACGTC GTGTTCGACA GGGTTGGCCA CCGTGTACCT GGCATGTAAC AGCCTTCGGC AGGACGAGTG TGACCTCGCG ATCACTGGCG CGGTCAACCT GCTACTCAGC CCCCGAACGT TCCTGGCCTA TAACGAGTTG GGAGTGCTAT CTCCCACCGG GCGATGCTTC AGCTTTGACG AACGAGCGGA CGGCTACGTT CGGGCCGAGG GCTGCGTGGT CCTCGTACTT AAACGTCTCG ACGACGCGAT GCGTGACCAG GACCGCGTGC TCGCCGTGCT ACGGGGGGTG GCGGTCAATC ATGACGGGAA GACGTCCCCG TTCACCGTTC CTTCCGAGCA AGCTCAGGAA GAGGTGTTCC GTACCGCCCT GAGTATTGCC GATGTCGACC CGGAAGAAAT TGGAATGATC GAAGCGCACG GCACCGGAAC CATCGTTGGC GACCCGATCG AGTTCCGTTC GCTGGCCGCC GTCTACGGAC GGGGACGAGG CCGGTGTGCA TTGGGTTCGG CTAAGACGAA CTTCGGCCAC GCCGAACCCG CCGCGGGCAT GGTGGGGCTA CTCAAAGCCA TATTGGCGGT GTACCACGGT GAGGTCCCGG CGTCCCTGCA TTTCCGACGG TGGAACCCCG CTATCAACCC GTCCGGCACC AGGTTGTTCG TTCCGACCGT AACGACACCA TGGCCAGTCA CCGGTGGTCC GCGCCTCGCG GCCGTGTCCT CCTACGGTGT GGGCGGGACC AACGCGCATG CCATCGTGGA GGAACCGCCG GATTCCAGCT CCCCGGTCGC GCCGAGGTCG TCCAGCAGCA GCGACGATTC CGTTCTCACG TTCCTGCTGT CGGAAGGATC AGAAACTGCG CTGCAGCATT CCGCGATCAG GCTCGCCGAC TGGCTCATCC GCTCCGGCGC GACGACACCC TTGAAAGATA TCGCGCACAC GCTCGCGGTA CGCCGCTCGC ACGGCCCCGA GAGACTGGCG GTCGTCGCTC GCTCCCGGGA CGATCTCGTC AACCGCCTAC GCGCATATGC CGACGGGACA CACCCCCGTC CTGAAGGCAT GGTCAACGAT TATGTCCACG CCGAGCACGC AACGGGACCG GTGTGGGTGT TCAGCGGCCA CGGCTCGCAG TGGCCCGGTA TGGGGCGAGA CCTGTTCGCC ACCGAACCCG TGTTTGCCGA CACGATCGCG GCGCTCGACC CCCTGATTCG CGCCGAGTCG GGTTTCTCGC CGGAGGAGGT GCTGCGGGCC GGTGACGAGG TGACCCGAAT CGACCAGGTA CAGCCCTTGA TTTTCGTGGT ACAGGTCGCT CTGGCGCGCA CCCTGCAATC CCACGGCATC CACCCCGCCG CCGTAGTAGG ACACTCCATG GGTGAAATCG CCGCCGCCGT GATCGCGGAG GCGTTGACCG TGGAAGACGG CATCCGCGTC ATCTGCCGTC GGTCTCGGCT GTGCGTTCCC ATCGCCGAAG CCCGCGTTGC GGCGATGGCG GTCGTCGAAC TGGATGCGGC CACGGTCCAG GCAGAGATCG ACCATCTTCC CGACGTGGCT GTCGCGTTCT TCGCCGCACC CCGGTCCACC GTGATCGGCG GTACTCGAGT CGAAGTCGAA CGCCTGGTCG AAAGCTGGAC ATCCCGCGAC GTTCCCGCGC ATATGATCAA TGTCGATGTC GCCTCGCACT GTCCCCTGAT CCATCCAGTC GCCGACGCGT TGACCGCTGA GCTCAGCGAT ATACGGCCTA GGCAGCCGAC GATCCGTTTC TATACCACGG TTCTGCCTGA TCCGCGGCAG ACACCCACAT TCGATGCTGC CTACTGGGGT GAGAACATGC GCTGCCCGGT TCGGGCGGTG GATGCCACGA CCGCCATCGT CAACGACGGA CACCAGCTTT TCCAGGAGAT CTCTCCACAT CCGGTGGCAA TCCACCCGCT TATTCTGACC CTCCAGGCGG CCGGGGCGCC GGAGGCCACG GTTGTACCGA CTACCGATAA CAGGCACGAT CAGGCCACAG CCCTACGAAC GAGCATCGCA GCGCTTCATT GCGCCGGGCT CGACATGAAT TGGCGGCGGT GGCACGGCGA TGGAGCCATC GCGGACGTGC CGCCTACGAC GTGGGACAGA CGCACCTATC TGATCAAGCT ACTCAGGAAC CTGGCGCCCC CGACCGACTC GGCCCGGTCA GCGCATACAG AGCCGACCGA GCAACGTCCG GACACTGGGA CGGACGCCGA CATCAGCGCC GAGATTTCAC AGGCTACCAG AGATGAACGC CTCAAGATCA TCAGAAAGAT GATCATTGAA ATCTTGCGTG AGATCCTCAG CTTGCGAGCG CGTCGACTGA GCCCTAGTGC CGCCTTTTCG GAACTCGGTC TGAACTCCCT ACGCGCCGTG GAATTCCGCG GACGAATCCA GCAAATATTC AAGGTCTCCA TCTCGCTCGC CGCAATCCGG GAGCATCCCA CAATCGCAGA ATTCAGTGAG TATATCGCCG AACTGCTGTA A
|
Protein sequence | MSKQRKSDDR LPRVSPDLAD PIRPVLVEQL TNSDITVTTT DDSANDAPVK KADQTIRADP SKGSGERTRD AGGNTSSAVA VIGIGCRLPG GVGSAASLWD FLRDGRDAVT DVPSERWSEA SREASQAETD RNVLWRGGFL SEDVGAFDPE AFGIDPTEAE LIDPQHRLLL EVVQEGFEHA GLPTDDLVGS STAVFVGMSN LDHMLHAHQL PSGGGPYFVP GNQAGPASGR ISHVFGLRGP SMTVDTSCST GLATVYLACN SLRQDECDLA ITGAVNLLLS PRTFLAYNEL GVLSPTGRCF SFDERADGYV RAEGCVVLVL KRLDDAMRDQ DRVLAVLRGV AVNHDGKTSP FTVPSEQAQE EVFRTALSIA DVDPEEIGMI EAHGTGTIVG DPIEFRSLAA VYGRGRGRCA LGSAKTNFGH AEPAAGMVGL LKAILAVYHG EVPASLHFRR WNPAINPSGT RLFVPTVTTP WPVTGGPRLA AVSSYGVGGT NAHAIVEEPP DSSSPVAPRS SSSSDDSVLT FLLSEGSETA LQHSAIRLAD WLIRSGATTP LKDIAHTLAV RRSHGPERLA VVARSRDDLV NRLRAYADGT HPRPEGMVND YVHAEHATGP VWVFSGHGSQ WPGMGRDLFA TEPVFADTIA ALDPLIRAES GFSPEEVLRA GDEVTRIDQV QPLIFVVQVA LARTLQSHGI HPAAVVGHSM GEIAAAVIAE ALTVEDGIRV ICRRSRLCVP IAEARVAAMA VVELDAATVQ AEIDHLPDVA VAFFAAPRST VIGGTRVEVE RLVESWTSRD VPAHMINVDV ASHCPLIHPV ADALTAELSD IRPRQPTIRF YTTVLPDPRQ TPTFDAAYWG ENMRCPVRAV DATTAIVNDG HQLFQEISPH PVAIHPLILT LQAAGAPEAT VVPTTDNRHD QATALRTSIA ALHCAGLDMN WRRWHGDGAI ADVPPTTWDR RTYLIKLLRN LAPPTDSARS AHTEPTEQRP DTGTDADISA EISQATRDER LKIIRKMIIE ILREILSLRA RRLSPSAAFS ELGLNSLRAV EFRGRIQQIF KVSISLAAIR EHPTIAEFSE YIAELL
|
| |