Gene Information |
Locus tag | Franean1_4855 |
Symbol | |
ID | 5673195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5823610 |
End bp | 5824821 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641243710 |
Product | amidohydrolase |
Protein accession | YP_001509126 |
Protein GI | 158316618 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
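The coordinates and lengths listed above are consistent with one another: the span from 5823610 to 5824821 covers 1212 bp when both endpoints are counted, and 1212 bp corresponds to 404 codons, the last of which is the stop codon, leaving 403 aa. A minimal Python sketch of that arithmetic follows. It assumes inclusive coordinates and a single terminal stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a coordinate span, assuming both endpoints are included."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5823610, 5824821)
print(length, protein_length(length))
```

The same check can be applied to any unspliced, non-pseudo CDS record; a spliced gene would need its exon spans summed instead.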
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.165345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.275978 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGACAC TCAAGGCCGC CGGCTTGCTC GATGTCGATA CCGGGGAGAT CACCCGGCCC GGGATCCTGA AGATCGAGAA CGATCGGATC GTCGGTCTCG GCGGCCAGCC CGAGGGCGAG GTCCTGGACC TCGGCGACCT GATCCTGCTG CCCGGCCTGA TGGACATGGA GCTCAACCTC CTGATGGGCG GCCCCGGCGA GCACCCGCTC ACCGGGCAGA TGCGCGACGA CCCGCCACTG CGGATGATGC GCGCCACCCA GAACGCCCGC CGCACCCTGC GAGCCGGCTT CACCACGGTG CGCAACCTCG GCCTGTTCTG CAAGACCGGC GGCTACCTGC TCGACGTCGC ACTCATGAAG GCGATCGACG CCGGCTGGGT CGACGGCCCG CGGATCGTCC CCGCCGGCCA CGCCATCACC CCGACCGGCG GCCACCTCGA CCCCACCATG TTCGGCGCGT TCGCCCCGCA CGTCCTGAAA CTCACCGTCG AGGAGGGCCT GGCCAACGGC GTCGACGAGA TCCGCCGAGC CGTCCGGCAC AACATCAAGC ACGGCGCCCA GGTCATCAAG ATGTGCGGCT CGGGCGGAGC CATGTCCCAC AGTGGGCCCT CGGGCGCGCG GCACTACTCC GACCAGGAGG TGCTGGCCCT CACCGACGAG GCACACCGAC GCGGCCTACG CGTGGCGGCG CACACCCACG GCTCGGAAGC CGTCCGGCAG ATGGTCGAAT GCGGCGTCGA CTGCATCGAA CACGCATTCA TGATCGACGA CGACACCATC AACCTGCTGA TCAAGCGCGG CACCTGGGTG GTCGCCACCC AGGCCCTGAT CGACAACATG CCGGTGCTGA ACGACGCCGA GCCGGAGATC CAGGCGAAGG CCGCCCTCAT CTTCCCCCGC GCGAAGGCAT CGATCCGCAA CGCCATCGAG GCAGGCGTCA AGATCGCCGT CGGGAGCGAC GCGCCGGCGA TCCCACATGG CAAGAACGCC CTCGAACTGG TCGCCCTGGT CGACCGGGGC ATGACCCCGC TGCAGGCCAT CCAGGCCGCG ACCATCGTGG GCGCGGACCT CATCGACGTC ACCGACCGCG GCCGGCTCGC CGAGGGCCTG CTCGCCGACG TCATCGGCGT GCGCGGCAAC CCCCTCGAGG ACATCCGCGT CCTGCAGCGG GTCCCGTTCG TCATGAAGGG CGGCAAGCAG TTTGTCTACT GA
|
Protein sequence | MLTLKAAGLL DVDTGEITRP GILKIENDRI VGLGGQPEGE VLDLGDLILL PGLMDMELNL LMGGPGEHPL TGQMRDDPPL RMMRATQNAR RTLRAGFTTV RNLGLFCKTG GYLLDVALMK AIDAGWVDGP RIVPAGHAIT PTGGHLDPTM FGAFAPHVLK LTVEEGLANG VDEIRRAVRH NIKHGAQVIK MCGSGGAMSH SGPSGARHYS DQEVLALTDE AHRRGLRVAA HTHGSEAVRQ MVECGVDCIE HAFMIDDDTI NLLIKRGTWV VATQALIDNM PVLNDAEPEI QAKAALIFPR AKASIRNAIE AGVKIAVGSD APAIPHGKNA LELVALVDRG MTPLQAIQAA TIVGADLIDV TDRGRLAEGL LADVIGVRGN PLEDIRVLQR VPFVMKGGKQ FVY
|
| |
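The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. The sketch below strips the whitespace and computes the GC fraction of a nucleotide string. The function names are hypothetical, and the short example string is only the first two blocks of the gene sequence, not the full record.

```python
def strip_blocks(seq: str) -> str:
    """Remove the display whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (display whitespace ignored)."""
    bases = strip_blocks(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two display blocks of the gene sequence, used only as a small example.
first_blocks = "GTGCTGACAC TCAAGGCCGC"
print(len(strip_blocks(first_blocks)), gc_fraction(first_blocks))
```

Running `gc_fraction` on the complete gene sequence gives a value that can be compared with the GC content field of the record.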