Gene Information |
Locus tag | Franean1_2027 |
Symbol | |
ID | 5670428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2435613 |
End bp | 2438243 |
Gene Length | 2631 bp |
Protein Length | 876 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240948 |
Product | alpha amylase catalytic region |
Protein accession | YP_001506370 |
Protein GI | 158313862 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | |
|
|
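The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2435613, 2438243)
print(length, protein_length_aa(length))
```

Under these assumptions the results agree with the Gene Length (2631 bp) and Protein Length (876 aa) fields.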
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.748072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.857591 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGCAGAC ATGGCAGGCC GCGAGCGCCG GCCGAGGTCT ACCGATGGGA CGACGTGACG ACAGAGCATG GCGCCGACCG GCATGTGGGC CCGTCACCCG CCACCGGCGC CGTCCGCGTG GACGTCCTCT ACCACACCGG CGTCGGCCGG CGGATCGCGA ACGGGGCCCG GCTGGTCGGG AGCTGGGACG AGCACGGGCT GCAGGCGGGG AGCTGGTCGT CCACGCCGAT GGAGGAGTTC ACCGCCGAGG ACGGCTGCGC CGCCTACCGG GCCACCGTCA CGGTGGACGC GAGCGTCTCC CGGACCTTCG ACTGGGGCGT CTGGCTGACC CGTCCCGACG GCTCGCAGGT GTGGGGGATC CCCGCCGAGG TCCCGGATCC CGGGGCGACG GAGCAGGTCC GGCGGTTCGA GGTCGGCCCG GGGTGCGGGC CGGTGGTGCG CGCGGAGTTC CGCCTGGCCG CCCACCGGCA CAACGGCGCC CGGCGCGTCC TGCCCGTCGG CTCGAGAGAC CTGGGCAGCC CGGTCAGCAC GGTCAGCACG GGCGGCGGCT CAGGCGGCCC CAGCGGCCCC AGCGGCTCAG GCGCCGCCGT CAGCTCGGAC GCCCGGGAAC GGATCCGGTT CCGGGTCTGG GCGCCGCACG CGCTCGCCGT CGAGGTGGCC TTCGCCGGGC CCGGCGGCTA CATCGCCGAC GACGGCCACG GTGAGGACGA GCTGATCACC AGGCTGCCGA TGCGCCAGGT CGGCGACGGC TGGTGGGAGG CGGCCGCACC CGGCTTCGCC GACTGGGTCG GACGGCGCTA CCTCTACCGC GTCACCCGCG ACGACGGGTC CGTCGCCTGG CGCTCGGACA TGTACTCGGC GCAGCAGTGC GGCACCGGGG ACATCGACCC CTGCGGCGCC CCCCACGACG GCCCGGCCGA GGACCTCGAC GGCTCGGTGA GCTGCTCGGT CGTCGTCGAC ACCCGGGACG ACGAGCGGTT CTGGGCCGAC GAGTTCGACC CGGCCCGGCC CGTCCCCCGG CGGGTCGAGG ACCTGGTCAT CTACGAGCTG CACGTGGGCG CGCTGGGCTT CGGGCACACC GGGGCCGGCA CGTTCGCCGA CGCGCTCGCG TTCGTCGACC ACCTCACCGA CCTCGGGGTG AACGCCGTCG AGCTGCTGCC CATGTTCGAG TTCGCCGGGA CGAGGTCCTG GGGTTACGGC AGCTCGCACT TCCTGGCGGT GAAGCAGAGC GCGGGCGGGC GGGCGGCGCT GCGCCGGTTC GTGCGGGCCT GCCACCAGCG GGGCGTGGCC GTCCTGATGG ACGTCGTCTA CAACCACTAC ACGCCGAACG CGCAGCGTTC CGCCTGGCAG TACGACAGCA CCGCGCCGAG CCGCAACATC TACTACTGGT ACGAGGGCGC CGAGGACGAC CACCCGCACC CGGACGGCGG ATACCTCGAC AACGTCTCCT CGGGCTGGGC GCCGCGCTAC TCCGACGAGA ACGTACGTGC GCTGTTCGTC GCGAGCGCGG TGGCGCTGCT CGACGAGTTC CACATCGACG GGCTGCGGGT GGACCAGACG ACGTCCATCC ACGCCTACAA CAGCCTGCAC GCTGACGGGC GGCCCGTTGC GGCGGCGAAC ATCGCCGGCC GCAAGTTCCT GCGGGAGCTG TGCCAGACGC TGCGCCTGGT CGACCCGGAC GTGATCCTCA TCGCCGAGGA CCACTCCGGC TGGGCGGAGG TCACCCGCCC CGCCGAGTCC GGCGGGCTCG GCTTCGACGC CCACTGGTAC GTCGACTTCT ACCACCACCT CGTCGGCGAC 
AAGGGCGAGG GACCGGAGTA CGCCAAGCTG CTGCACACCG CGGGGCGCGA CCCCGCCGGC CCGCTGGCCA TGAGCCTGTT CGCCAAGGCG TTCACCGCCG CCGCGGACCG CACGGTCGTC TACACCGAAA GCCACGACGA GGCCGGCAAC TCCGAGCACT CGGCCCGCAA CATCCTCGTC GCCGTCGACC ACGCGCCGCT GCACGGCGAC ACGGCCTGGT TCGCGTTCGC GCGGCTGCGC TGCGCCGCGG CGCTGACCCT GCTTTCGCCC GGCACGCCGA TGTTCCTCAT GGGCGACGAG GTGGGAGCCC GGCGGGCCTA CACCCACGAC GGGTTCGCCG AGGCCAAGGA GGACCTCGCC GGGCTGCGCG CGGGCGAGGG CGCCGAGCTG TTCGCCTGCT ATCGGGCGCT CGTCACGCTG CGGCTGGGCA GCCCGGCGCT GCGCTCGCGC GCGGTCGAAC TGGTCGGCGC CGACGACACC GCGCGGGTGC TGGCGTTCCG CCGCTGGGAC CGCGGCGAGG AGATCCTGGT CGTGGTAAGC CTGAACAACG ATCCGCTGCC CAGGTTCGGG CTATCGCATC CGTCGCTGGC CGGCCGGCGG TGGAAGCCGG TGCTGGACAC CGACGCGCCA CGGTTCGGCG GACGGGCGGG CGGCTCGCGC CGGTCGCTGT CCCCACGGGG CGACAGCGTG CGGGTCGATC TGCCGGCCGC CGGTGCCGTG GTGTTCCGCC GCCGCCGGCG CGGCGCCGGC CTGACCGACG GGCCGTCGGA CGTCCCGGCC CGCCCCCGCC GCCTGCGCCT GCCCGGCGTG CGCCGACGAG GCGGGCGCTG A
|
Protein sequence | MGRHGRPRAP AEVYRWDDVT TEHGADRHVG PSPATGAVRV DVLYHTGVGR RIANGARLVG SWDEHGLQAG SWSSTPMEEF TAEDGCAAYR ATVTVDASVS RTFDWGVWLT RPDGSQVWGI PAEVPDPGAT EQVRRFEVGP GCGPVVRAEF RLAAHRHNGA RRVLPVGSRD LGSPVSTVST GGGSGGPSGP SGSGAAVSSD ARERIRFRVW APHALAVEVA FAGPGGYIAD DGHGEDELIT RLPMRQVGDG WWEAAAPGFA DWVGRRYLYR VTRDDGSVAW RSDMYSAQQC GTGDIDPCGA PHDGPAEDLD GSVSCSVVVD TRDDERFWAD EFDPARPVPR RVEDLVIYEL HVGALGFGHT GAGTFADALA FVDHLTDLGV NAVELLPMFE FAGTRSWGYG SSHFLAVKQS AGGRAALRRF VRACHQRGVA VLMDVVYNHY TPNAQRSAWQ YDSTAPSRNI YYWYEGAEDD HPHPDGGYLD NVSSGWAPRY SDENVRALFV ASAVALLDEF HIDGLRVDQT TSIHAYNSLH ADGRPVAAAN IAGRKFLREL CQTLRLVDPD VILIAEDHSG WAEVTRPAES GGLGFDAHWY VDFYHHLVGD KGEGPEYAKL LHTAGRDPAG PLAMSLFAKA FTAAADRTVV YTESHDEAGN SEHSARNILV AVDHAPLHGD TAWFAFARLR CAAALTLLSP GTPMFLMGDE VGARRAYTHD GFAEAKEDLA GLRAGEGAEL FACYRALVTL RLGSPALRSR AVELVGADDT ARVLAFRRWD RGEEILVVVS LNNDPLPRFG LSHPSLAGRR WKPVLDTDAP RFGGRAGGSR RSLSPRGDSV RVDLPAAGAV VFRRRRRGAG LTDGPSDVPA RPRRLRLPGV RRRGGR
|
| |
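The gene sequence begins with TTG while the protein begins with M, which is expected under translation table 11, where TTG can act as an initiation codon. The sketch below shows one way to check this and to compute GC content from a sequence string. It assumes the gene sequence is shown in coding orientation (already reverse-complemented for the minus strand). The set of start codons is a simplified subset chosen for illustration, and the function names are hypothetical.

```python
from itertools import product

# Standard amino acid assignments (shared by translation table 11),
# listed with bases in T, C, A, G order at each codon position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified subset of the table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Drop the spaces used to group bases in tens and upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("TTGGGCAGAC ATGGCAGGCC GCGAGCGCCG"))
```

The first ten residues produced this way match the start of the protein sequence in the record (MGRHGRPRAP). Applying `gc_content` to the full gene sequence would give a value to compare against the GC content field.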