Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3368 |
Symbol | |
ID | 5671739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3993791 |
End bp | 3995035 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242256 |
Product | amidohydrolase |
Protein accession | YP_001507676 |
Protein GI | 158315168 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.929128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCGC CGGTCACGAT CCGGGCCGCC CGCTGGCTGG ACGTCGTCGC CGGCGAGGTC CGCTCGCCCG CGGTGATCGT GGTGGAGGGG AACCGCATCC TGGCGGTCAA CCCCGCCGAG ACCCCCGCCG GCGCGGTCGA GATCGAGCTG GGTGACGTGA CGCTGCTACC CGGCCTGATG GACATGGAGC TCAACCTCCT CATCGGCGGG CCCGACACCC CGACGGGCCT GCCGCTGCCG ATGCACGGTG TTCAGGATGA CCCGGCGTAC CGCACCATCC GCGGGACCGT CAACGCCCGC GCCACGCTGC TCGCCGGTTT CACCACCGTC CGCAACCTCG GCCTGATGGT CAAGAGCGGG GGCTACCTGC TCGATGTCGC GGTGCAACGT GCCGTCGAGC AGGGATGGGT GGAAGGGCCG CAGATCATCC CGGCGGGACA CGCGATCACC CCGTACGGCG GCCACCTCGA CCCGACGGTG TTCCAGCGCC TGGCACCCGG AATCATGCCG CTGAGCATCG GTGAGGGCAT CGCCAACGGC GTGGGCGAGG TACGGGCCTG TGTCCGCTAC CAGATCCGGC ACGGCGCCAA GGTGATCAAG GTGTCGGCCT CCGGCGGGGT GATGTCGCAC AGCACCGGCC CGGGCGCCCA GCAGTACTCC GACGAGGAGC TCGCGGCGAT CGCCGACGAG GCGCACCGGG CGGACATCCG CGTCGCCGCA CACGCGGTGG GCGACCGGGC GGTGCAGGCC TGTGTCCGTG CCGGTATCGA CTGCATCGAG CATGGTTTCC TCGCCAGCGA CGAAACACTG CGGATGATGG CCGACCACGG CACGTTCCTG GTGTCCACGA CCTATCTGAC CGATGCCATG GACATCGCGC GGGCAGCACC GGAGCTCCAG CGGAAGGCGG CTGACGTCTT CCCCCGGGCG AAGGCGATGC TGCCCAGGGC CATCGCCGCC GGGGTGAAAA TAGCCTGCGG CACCGACGCC CCGGCCGTTC CCCATGGCGA CAACGCCAAG GAGCTGGCCG CGTTGGTCTC GCGGGGCATG ACCCCGGTGC AGGCCCTGCG GGCCGCGACC GTCACCAGCG CGGAGCTGGT CGAGCTCGAC CACGAGCTGG GCCAGCTCAG GGACGGCTAC CTCGCCGACA TCATCGCCGT CCCCGGCGAT CCCTCCCGGG ACATCACCCT CACCCAGGAC GTGCGGTTCG TCATGAAGGA CGGCCGTATC CACAAGGGTG CCTGA
|
Protein sequence | MTAPVTIRAA RWLDVVAGEV RSPAVIVVEG NRILAVNPAE TPAGAVEIEL GDVTLLPGLM DMELNLLIGG PDTPTGLPLP MHGVQDDPAY RTIRGTVNAR ATLLAGFTTV RNLGLMVKSG GYLLDVAVQR AVEQGWVEGP QIIPAGHAIT PYGGHLDPTV FQRLAPGIMP LSIGEGIANG VGEVRACVRY QIRHGAKVIK VSASGGVMSH STGPGAQQYS DEELAAIADE AHRADIRVAA HAVGDRAVQA CVRAGIDCIE HGFLASDETL RMMADHGTFL VSTTYLTDAM DIARAAPELQ RKAADVFPRA KAMLPRAIAA GVKIACGTDA PAVPHGDNAK ELAALVSRGM TPVQALRAAT VTSAELVELD HELGQLRDGY LADIIAVPGD PSRDITLTQD VRFVMKDGRI HKGA
|
| |