Gene Information |
Locus tag | Franean1_7047 |
Symbol | |
ID | 5675358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8598625 |
End bp | 8601081 |
Gene Length | 2457 bp |
Protein Length | 818 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641245893 |
Product | glycosyl hydrolase family 32 protein |
Protein accession | YP_001511284 |
Protein GI | 158318776 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.545796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATAC TTGTCAGACT CATGCGTAGC AGCCTCACGC GGGTCAGGTT CACCGCGGTG ACGGTCGCTC TCGCGGCGGT GTTCACCTCC CTTCTGCCGG CCGTGCCCGC CCTGGCCGGA CAGGTCAGCG ACTACCCCGA GTTCCCGTAC CCGACGACAG ACTACACCGA GCCCTTACGC GGCCAGTTCC ACTTCAGCTC GCGCGGCGGC TGGATGAACG ACATCAACGC CCCGCTGTAC CACAATGGCC TCTACCACGT CTTCTATCAG CACAATCCGC ACAGCCTCCT CTGGGAGACC ATGCACTGGG GACATGCCAC CAGCCCCGAC CTGGTGCACT GGACGCAGAA GCCGATCGCG CTGGAACCGG GTGTGCATCC GCATGACCTG TGGTCCGGGG CCGGGGTGGT CGACACCAAC AACACCTCGG GTCTGCAGAC CGGGAGCGAG GCACCGATCC TCGTGTTCAC CGCCACCAAC GGCGTGAGCA TCAACTACAG CAACGACGCC GCGAAAACGT TCCAGATCTA CAACCAAGGT CAGAAGGTCG TCACACCGGC CGGCATCAGT CGTGATCCCA AGGTGTTCTG GCATGCGCCT TCCAACCGGT GGGTGATGGT GGTCTGGTCC GACGCCGGGG GGAACGGCGT CAACATCTAC ACCTCACCCA ACCTGTTGAC CTGGACGTTT CGCAGCCGGT ATGCCGCCGA CTGGCTGTAC GAATGTCCGG ACCTGTTCTC CCTGGCCGTC GACGGCGACC CGGGGAACAC GAGGTGGGTC ATGACCGACG CCGGCGGCGA GTACGTCATC GGTTCCTTCG ACGGTGTCAC CTTCACCCCG GAGTGGACGT CGCCGCAACG GATGGACCAG GGGCACAACA CCTTCGAGGG CACCTTCTAT GCCGGGCTGA CCTTCAACCA CATGCCGGAC AACCGGATCG TGCAGATGGC CTGGATGAGA TCGAACCAGG GCAGCGTCTG GACCGGCAAT GCCTCCTTCC CCGCGGAACT GGGCCTGCGC GCCTATCCCG AGGGGATACG CCTGACCCGC AATCCCGTCG GCGAGATCGC GTCCCTGCGC GTCGATTCTC AATCATGGGT AAACCGCGAC ATCACCCCCG ATCCGGCCAG CGATCCCCTC ACCAGCACCT TCGCCGACAC CTACGAGATC ATCGCCGAGT TCGACACGGC CACCGCCACA GCGTCACGGT TCGGCTTCCG ATTACACACC CGCAGTGACG GAACCTTCGA CCGTGCCGTC ACCTACGACC GGACTGCGCA GACGCTCTAC GGCGCACCGC TGGCGCCGAT CAACGGACGG GTCAGGATGC GGCTACTGGT GGACCGCGGG CAACTGGAGA TCTTCGGCAA CGACGGCAAG CTGTCCTGGA CCGACAACGT CAACTTCAAC TCGGCACCGT CGAGCCAGGG TGTGCAGCTG TATGCCGAAG GCGGCAACGT CCAGCTGGTG TCGCTCCAGT TCCACCGGCT GCAGTCAGCG TGGGGTTCTG GGGAGTCCAC CCTGGAGAGC AACCTGGCCG GCCCCTGGCA CCCGGCCGGC GGGACGTGGG TCGACACCAC CACGGGCAAG CAGGGCACCG CGGGTGGGGA CGGTTTCTAC CTGAGCAACC AGACCGGAGC CGACTTCACC TACGAGGGTG ATCTCCGCCT CGACACCGCC GTGGCAGCCG GGATCACCTT CCGGGCCAAC AGCGACGCCA CCCAGCAGTA CACCGCCAAC GTCGACGCCA ACGGACTGGT GAAACTGTGG CGCCCCGGCC GGGACATCGG GATCTTCTAC 
ACCCCGATCT CCCAAGGCCG CACCTACCAC CTGAAGGTGG TGACCAGCGG CTCCATCATC AGGGTCTATC TGGACCACCG CCCCACCCCC GTGATCGACG CCGTCGACAC CGCCTACACC AGCGGGTACT TCGGGACCAA CGTCTTCGGC GGGACCGGCG TCGTGCAGAA TGCCAACGTC AACGCCACCG GGTTCGTCTC CAACCTGGGA GCAACCTGGC GGCCGGCGAC CGGGCTGTGG ACCGTCCCCG GTGCCGGTGT CAAGGGTCGG GTCGCCGGGG ACGGCTTCTA CCTCAGCGAC CAGACCGGGA CCAACTTCAC CTATGAGGGT GACGTCAAGG TGATCAACGG GGTCGCCGCC GCGCTGACCT TCCGGTCGAA CGCCGACGCG ACCGGGCACT ACACCGCCAA CGTCGACACC AACGGCCTGG TGAAACTGTG GCGCCCCGGC TCGGTGATAG GCGTCTTCAA CACACCGATC GTCGAAGGCC GGACGTACCA CCTGAAGGTG GTGGCCAACG GTCCCAACAT CAGAGTCTAC TTCGACGGAG GAGCGACGCC GGTCATAGAC GCCGTTGACA GCACTTACAG CAGTGGGTTC TTCGGTGTCA ACGTCTTCAG CGGGGTCGGC GTGATCCAGA ACGTCGTAAC AAGCTGA
|
Protein sequence | MSILVRLMRS SLTRVRFTAV TVALAAVFTS LLPAVPALAG QVSDYPEFPY PTTDYTEPLR GQFHFSSRGG WMNDINAPLY HNGLYHVFYQ HNPHSLLWET MHWGHATSPD LVHWTQKPIA LEPGVHPHDL WSGAGVVDTN NTSGLQTGSE APILVFTATN GVSINYSNDA AKTFQIYNQG QKVVTPAGIS RDPKVFWHAP SNRWVMVVWS DAGGNGVNIY TSPNLLTWTF RSRYAADWLY ECPDLFSLAV DGDPGNTRWV MTDAGGEYVI GSFDGVTFTP EWTSPQRMDQ GHNTFEGTFY AGLTFNHMPD NRIVQMAWMR SNQGSVWTGN ASFPAELGLR AYPEGIRLTR NPVGEIASLR VDSQSWVNRD ITPDPASDPL TSTFADTYEI IAEFDTATAT ASRFGFRLHT RSDGTFDRAV TYDRTAQTLY GAPLAPINGR VRMRLLVDRG QLEIFGNDGK LSWTDNVNFN SAPSSQGVQL YAEGGNVQLV SLQFHRLQSA WGSGESTLES NLAGPWHPAG GTWVDTTTGK QGTAGGDGFY LSNQTGADFT YEGDLRLDTA VAAGITFRAN SDATQQYTAN VDANGLVKLW RPGRDIGIFY TPISQGRTYH LKVVTSGSII RVYLDHRPTP VIDAVDTAYT SGYFGTNVFG GTGVVQNANV NATGFVSNLG ATWRPATGLW TVPGAGVKGR VAGDGFYLSD QTGTNFTYEG DVKVINGVAA ALTFRSNADA TGHYTANVDT NGLVKLWRPG SVIGVFNTPI VEGRTYHLKV VANGPNIRVY FDGGATPVID AVDSTYSSGF FGVNVFSGVG VIQNVVTS
|
| |
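The length fields in this record can be cross-checked against each other with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the Gene Length counts the terminal stop codon, which is consistent with the values shown (2457 bp, 818 aa, and a gene sequence ending in TGA). The function names gene_length, protein_length, and gc_percent are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS.

    Assumes the gene length includes exactly one terminal stop codon,
    which is not translated into an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string.

    Whitespace is ignored, so the 10-base blocks used in the
    Gene sequence field can be passed in unchanged.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Values taken from the Start bp and End bp fields of the record above.
length_bp = gene_length(8598625, 8601081)
print(length_bp)                  # 2457
print(protein_length(length_bp))  # 818
```

Passing the full Gene sequence field to gc_percent gives a value that can be compared with the GC content row. Small differences are possible because the record reports a rounded percentage.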