Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0945 |
Symbol | |
ID | 5669359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1104870 |
End bp | 1107977 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641239872 |
Product | hypothetical protein |
Protein accession | YP_001505307 |
Protein GI | 158312799 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.203937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.591607 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGGAA CCAGTCGCAG ACCGCTGCTG ACACGCACAC GCCTGCTCGT GCCGGTGCTG GTCATTCTGG TCCTGCTCGT CGTGTTTCTC GGCGTCTTCA CTCGCCTCTA CACAGATCTT CTGTTCTACC GCTCGGTGGA CTTCAGCAAC GTCTTCGGCA CCGTCGTATT CACACGAATA CTTCTGTTCG TGCTCTTCGG TGCGGTCATG GCAATTGCGG TGGGAACGAA CATAGTACTC GCCTACAAGT TGCGGCCGCC CATCCGGCCC CTCTCGACCG AGCAGCAGAA CCTCGAGCGG TACCGCGTTG CGATCGAACC GTACATGCTG CTCGTCCTGC TGGCGGTCTC GACCCTGTTC GGGCTGATCG CGGGGCTGTC GGCGTCCGGG CGCTGGCGGA CCTGGCTGCT GTGGATCAAC AGCGAGCCGT TCAATCAGAC CGACGCCCAG TTCGGCCGGG ACATCAGCTA CTACACCTTC AGCTATCCGT TCCAGCGCTT CCTGCTTGCC TTCCTGCTCA CCGCGGTCAT GCTCTCGCTG CTGGTCGCGG TGCTCACCCA CTACCTGTTC GGCGGAATCC GGCTGCAGTC CGCCGGCGAG CGGGTGGCCC CGGCGGCGAA GGCGCACATC TCCGTGCTGC TCGGGCTGGT CGCGCTACTC AAGGCGTGGG CCTACTACCT GGACCGCTTC GGCCTGGTGT TCTCGGCCCG GGGCGTGTCG ACCGGAGCGT CCTACACCGA CGTCCACGCG GTGCTGCCGG CGAAGCTCAT CCTGCTGTTC ATCTCGCTGG CCTGCGCCGT GCTGTTCATC TACAACATCT TCCAGCGGGG CTGGACGCTG CCGTTGCTCG GTGCCGGCAT CCTCGTGCTG TCCTCGGTGG TCATCGGCGG GATCTACCCG GCGATCGTCC AGCAGTTCCA GGTCCGGCCG AACGAGGCGT CGCGCGAGGA GCCCTACATC GCGCGCAACA TAGCCGCGAC CAGGTCCGCG TACGACATCC AGGACGTCAA ACCGGTCCCG TATCCGGCGA CCACGGGCGC GACGGCGCAG CAGATAGCCG ACGACAAGGG CACCGTCCCG AACATCCGCC TGCTCGACCC GAGCAAGCTG TCCACCACGT TCCAGCAGCT CCAGCAGATC CGTGGTTACT ACGGGTTCCC GCAGACCCTC GACGTCGACC GCTACACCAC GACCTCGGAC GGGAAGACCA CGACCCGGGA CTATGTGGTG TCCGTCCGTG AGCTCAACCA GGACGGCCTC GGTGAGGACC AGCGGAACTG GATCAACCAG CACCTGACCT ACACCCACGG GCGGGGCTTC GTCGCGGCGC CGTCCAACAC CGCCGATGAG GGCCGCCCCG CGTTCACCGA GCGCAATCTG CCGGAGACCG GTGACTTCGG CGTCACCGAG AACCGGATCT ACTTCGGTGA GATGTCCCCG CAGTACTCGA TCGTCGGCAC CCGCCAAGCG GAGATCGACG GCCCGGGGCC GAACGACACC CAGCTCACGA CCAGCTACAC CGGCGACGGC GGCGTCTCGG TCGGCTCGAC CTTCCGGCAG GCGCTGTTCG CGCTGCGCTT CGGGGAGCCG AACATCCTGC TCTCCGGCGA CATCACCGGC CAGTCGCGCA TCCTCTACGA GCGCAACCCG CGGGACCGGG TCAGCAAGGT GGCGCCCTGG CTCACCCTGG ACGGCGACCC CTACCCGGCG GTGGTCGACG GGCGGGTCAC CTGGATCCTC GACGGCTACA CCACCTCGGA CGGCTACCCC TACTCGGCCC GGCGGACCTT CGGTGACGTC ACGGCCGACG CCGTGACCAC CCAGAGCCGC AACCGCACCC AGCAGCCGGA GAACCAGGTC AACTACATCC GGAACTCGGT GAAGGCGACT GTCGACGCCT ACAACGGCAC GGTGACGCTC TACGCCTGGG ACGAGCAGGA CCCGGTGCTG CGGACCTGGA TGAAGGCCTT CCCCGACACG GTGCGCCCCA AGGCCGACAT CCCGCCGGCG CTGATGGAGC ACTTCCGCTA CCCGGAGGAC ATGTTCAAGG TCCAGCGGGA CCTCCTCGGG CAGTACCACG TCTCCAACCC GCGGGACTTC TACTCCCAGG AGGACTTCTG GACGGTCTCC GAGTCCCCGG ACGACACCCG CGAGCCGCAG CCGCCGTTCT ACGTCTACAG CCAGCTCCCC GGGCGCAAGG AACCGTCCTA CAACCTGACC TCGCCGCTGA TCTCGGCGCG GTCGTCCAAG CTCGCGGCCT ACATGGCCGT CTCGATGGAT TCGGACAACT ACGGCCAGTT CACCCTGCTG CAGCTGCCGC CGGGCGACAC GATCAACGGT CCTGTCCAGG TGCAGGCCGC CATCGAGTCG AACGGCGACG TCGTCCGGCA GCTCAACCTC TGGCGCGGCG CGGGCTCACA GACGATCGAG GGCAACCTGC TGACGCTGCC CGTGGCCGGC GGGCTGCTCT ACGTGGAGCC GTACTACGTC CAGGCCCGCG GATCGACGGG ATATCCGACC CTGCAGGGGG TGGCGGCAGC CTTCGGTGAA CGGATCGGGT TCGGTTCCTC GCTCGCGGAG GCGCTGAACG CCGTCTTCGG GGCGGGGGCC GGCGCGTCCG CGGCCGGGGC GGGCACCTCG GCCGCGCCCT CCACGGGCAC CGAACCGGGG ACAACGTCCC CACCGTCGAG CCCCGCGCCG TCGGCACCGC CGGCCGGGGA TCTGGCGGGC GCCGTCGCCG AAGCCGAACG CGCCTACCAG GCGGGCCAGG ACGCACTCGG CAAGAACCCG CCGGACTTCG CCGCCTACGG CAGGGCCCAG ACGGACCTCC AGGCGGCGTT GGACAGGCTG CGCCAGCTGT CACCGACGAC CCCGGCGCCG ACGACGGCAC CTCCGGCGAC GGCACCGACG AAACCGCCGG CGACGACGGC GCCCGCCTCC GCGCCGCCGG CCACCGCCGA GCCCGCCTCG ACGACCCGCG CGGCGCCAGC CGCCGCGCCC GCCGCCATGG ACGGGGCGGG GGATCCGGGC CCACCGGTGC GGCCGGCCGT CGCCGAGAAC CCACCGGCGG TCCGCCCGGC CGTCGCGCGG ACCGGGGATC CCGGCTGA
|
Protein sequence | MAGTSRRPLL TRTRLLVPVL VILVLLVVFL GVFTRLYTDL LFYRSVDFSN VFGTVVFTRI LLFVLFGAVM AIAVGTNIVL AYKLRPPIRP LSTEQQNLER YRVAIEPYML LVLLAVSTLF GLIAGLSASG RWRTWLLWIN SEPFNQTDAQ FGRDISYYTF SYPFQRFLLA FLLTAVMLSL LVAVLTHYLF GGIRLQSAGE RVAPAAKAHI SVLLGLVALL KAWAYYLDRF GLVFSARGVS TGASYTDVHA VLPAKLILLF ISLACAVLFI YNIFQRGWTL PLLGAGILVL SSVVIGGIYP AIVQQFQVRP NEASREEPYI ARNIAATRSA YDIQDVKPVP YPATTGATAQ QIADDKGTVP NIRLLDPSKL STTFQQLQQI RGYYGFPQTL DVDRYTTTSD GKTTTRDYVV SVRELNQDGL GEDQRNWINQ HLTYTHGRGF VAAPSNTADE GRPAFTERNL PETGDFGVTE NRIYFGEMSP QYSIVGTRQA EIDGPGPNDT QLTTSYTGDG GVSVGSTFRQ ALFALRFGEP NILLSGDITG QSRILYERNP RDRVSKVAPW LTLDGDPYPA VVDGRVTWIL DGYTTSDGYP YSARRTFGDV TADAVTTQSR NRTQQPENQV NYIRNSVKAT VDAYNGTVTL YAWDEQDPVL RTWMKAFPDT VRPKADIPPA LMEHFRYPED MFKVQRDLLG QYHVSNPRDF YSQEDFWTVS ESPDDTREPQ PPFYVYSQLP GRKEPSYNLT SPLISARSSK LAAYMAVSMD SDNYGQFTLL QLPPGDTING PVQVQAAIES NGDVVRQLNL WRGAGSQTIE GNLLTLPVAG GLLYVEPYYV QARGSTGYPT LQGVAAAFGE RIGFGSSLAE ALNAVFGAGA GASAAGAGTS AAPSTGTEPG TTSPPSSPAP SAPPAGDLAG AVAEAERAYQ AGQDALGKNP PDFAAYGRAQ TDLQAALDRL RQLSPTTPAP TTAPPATAPT KPPATTAPAS APPATAEPAS TTRAAPAAAP AAMDGAGDPG PPVRPAVAEN PPAVRPAVAR TGDPG
|
| |