Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3237 |
Symbol | |
ID | 5671612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3825801 |
End bp | 3827015 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242130 |
Product | amidohydrolase |
Protein accession | YP_001507550 |
Protein GI | 158315042 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.136124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACAC TCAAGGCCGC CGGCCTGCTG GATGTGGACA CCGGGGAGAT CACCCGACCC GGGATCCTGA AGATCGAGAA CGATCGGATC GTCGGTCTCG GCGGTCAGCC CGAGGGCGAG GTCATGGACC TCGGTGACCT GATCCTGCTG CCCGGCCTGA TGGACATGGA GCTCAACCTC CTGATGGGCG GCCCCGGCGA GCACCAGCTC ACCGGGCAGA TGCGCGACGA CCCGCCGCTG CGGATGATGC GCGCCACCCA GAACGCCCGC CGCACCCTGC GAGCCGGCTT CACCACGGTG CGCAACCTCG GCCTGTTCTG CAAGACCGGC GGCTACCTGC TCGACGTCGC ACTCATGAAG GCGATCGACG CCGGCTGGGT CGACGGCCCG CGGGTCGTGC CCGCAGGTCA CGCGATCACG CCGACCGGCG GCCACCTCGA CCCGACCATG TTCGGGGCGT TCGCCCCGCA CGTGCTCGGC CTCAGTGTCG AGGAGGGCCT GGCCAACGGG CCCGACGAGA TCCGCCGGGC GGTCCGCCAC CAGATCAAGT ACGGCGCCCA GGTCATCAAG ATGTGCGGCT CGGGTGGAGC CATGTCCTAC AGCAGCGGCC CCTCGGGCCA GAAACACTAC TCCGACGCCG AGGTGCTGGC GATCACCGAC GAGGCGCACC GCCGCGGCCT GCGGGTGGCC GCGCACACCC ACGGCTCGGA AGCCGTCCAG CAGATGGTCG AATGCGGCGT CGACTGCATC GAACACGCGT TCATGATCGA CGACGACACC ATCAACCTGC TGGTCAAGAA CGGTGTCTGG GTCGTGGCGA CCCAGGCTCT GATCGACGAC ATGCCGGTGC TGCGAGACGC CGAGCCGCAG ATCCAGGCGA AGGCCGCCTA TATCTTCCCC CGTGCGAGGG CCTCCATCCG CAACGCGATC GAGGCCGGTG TCAAGATCGC GGTCGGCAGC GACGCGCCGG CGATCCCGCA CGGCAAGAAC GCCCTCGAAC TGGTCGCCCT GGTCGACCGG GGCATGACCC CGCTGCAGGC CATCCAGGCC GCGACCATCG TGGGCGCGGA CCTCATCGAC GTCACCGACC GCGGCCGACT CGCCGAGGGC CTGCTCGCCG ACGTCATCGG CGTGCGCGGC AACCCCCTCG AGGACATCCG CGTCCTGCAG CGGGTCCCGT TCGTCATGAA GGGCGGCAAG CAGTTTGTCT ACTGA
|
Protein sequence | MLTLKAAGLL DVDTGEITRP GILKIENDRI VGLGGQPEGE VMDLGDLILL PGLMDMELNL LMGGPGEHQL TGQMRDDPPL RMMRATQNAR RTLRAGFTTV RNLGLFCKTG GYLLDVALMK AIDAGWVDGP RVVPAGHAIT PTGGHLDPTM FGAFAPHVLG LSVEEGLANG PDEIRRAVRH QIKYGAQVIK MCGSGGAMSY SSGPSGQKHY SDAEVLAITD EAHRRGLRVA AHTHGSEAVQ QMVECGVDCI EHAFMIDDDT INLLVKNGVW VVATQALIDD MPVLRDAEPQ IQAKAAYIFP RARASIRNAI EAGVKIAVGS DAPAIPHGKN ALELVALVDR GMTPLQAIQA ATIVGADLID VTDRGRLAEG LLADVIGVRG NPLEDIRVLQ RVPFVMKGGK QFVY
|
| |