Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7046 |
Symbol | |
ID | 5675357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8597399 |
End bp | 8598604 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641245892 |
Product | inulin fructotransferase (DFA-I-forming) |
Protein accession | YP_001511283 |
Protein GI | 158318775 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCTACCC TCTATGACGT GACCACCTGG ACCGTCCCCA GCAACCCCTC CATCACCTGC TACGTCGACG TCGGCGTGGT AATCAACGAC ATCATCCGCG ACATCAAGGC CCAGCAGCCC AACCAGTCGG CGAAGCCCGG AGCCGTCATC TACATCCCAC CGGGGGACTA CCCCCTGAAG ACACGGGTGA CCGTCGACAT CAGCTACCTG ACCATCAAGG GATCCGGCCA CGGCTTCACC TCGTCCAGCA TACGGTACAA CACGTCCAAC ACCTCGGCGT GGCACGAGTT ATGGCCGGGA AGCAGCCGTA TCAAGGTCGA GAACACCGAC GGCAACAGCG AGGCGTTTCT GGTGTCCCGG ACGGGAGACC CACGACTGAG CTCGGTCGTG TTCTCGAACT TCTGCCTCGA TGGCCTCAGC TTCGGCACCA ACCAGAACTC GTATGTCAAC GGAAAGACCG GAGTTCGGGT CGCCACAAGC AACGACGCTT TCAGGTTCGA GGGGATGGGC TTCGTCTACC TCGAGCACGC ACTGATAGTC ACGAACGCCG ACGCACTGAG CGTCAGCGAC AACTTCATCG CCGAATGCGG GAGCTGCATC GAGCTGACCG GCTCGGGACA GGCGTCCAAA ATCACCGACA ACCTGATCGG TGCGGGTTAC GTCGGCTACT CGGTGTTCGC CGAGGGACAC GAGGGTCTGC TTGTCTCCGG TAACAACATC TTCCCCCGGG GAAGGAGCGC GGTGCACTTC AAGAACACCA ACCGCTCAAC GATCACAGCC AACCGTCTGC ACGACTTCTA CCCGGGGATC ATCGACTTCG AGGGGCTGAA CAAGGAGAAC CTGATCAGCA GCAACCACTT CCGGCGCGAG GCTGAGCCGT GGCCTCCGAT GCAGTCCTAC AACAACGGCA AGGACGACCT GTACGGGCTG GTGCACCTGC GCGGGGACAA CAACATGGTC TCCACGAACC TCTTTGCCTT CTACGTCGAC CCCAGCAAGA TCACCCCCCT CGGCGCCACA CCAACGATCA TCCTGGTCGC GTCCGGAAAC GGGAACTTCA TCTCCAACAA CCACGTCACG GCCAACGTGG GCGTAAAGAA CGTCGTCCTC GACGCGACGA CGACCGGCAC GAAGGTTCTG GACAGCGGGA CGGCGTCCGA GTTCGTTTCG TACACCTCCA ACTACACCTT CCGCCCGACG CCGTGA
|
Protein sequence | MSTLYDVTTW TVPSNPSITC YVDVGVVIND IIRDIKAQQP NQSAKPGAVI YIPPGDYPLK TRVTVDISYL TIKGSGHGFT SSSIRYNTSN TSAWHELWPG SSRIKVENTD GNSEAFLVSR TGDPRLSSVV FSNFCLDGLS FGTNQNSYVN GKTGVRVATS NDAFRFEGMG FVYLEHALIV TNADALSVSD NFIAECGSCI ELTGSGQASK ITDNLIGAGY VGYSVFAEGH EGLLVSGNNI FPRGRSAVHF KNTNRSTITA NRLHDFYPGI IDFEGLNKEN LISSNHFRRE AEPWPPMQSY NNGKDDLYGL VHLRGDNNMV STNLFAFYVD PSKITPLGAT PTIILVASGN GNFISNNHVT ANVGVKNVVL DATTTGTKVL DSGTASEFVS YTSNYTFRPT P
|
| |