Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3789 |
Symbol | |
ID | 5672153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4492556 |
End bp | 4493752 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641242668 |
Product | fumarylacetoacetase |
Protein accession | YP_001508088 |
Protein GI | 158315580 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR01266] fumarylacetoacetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGCCC GTTCGTGGGT GCCGGTACCC AAGGGCTCGG ACTTCCCGCT GCAGAACCTC CCCTACGGCG CCTTCTCCGC CGACGGTGGG AGCCCCCGGA TCGGCGTCGC CATCGGCGAC CACGTGCTGG ACCTCGCCGC CGCGCTGGGG GACCCGGTGT TCGCCCGGCC CCGGCTGAAC GAGTTCCTCA GCCGGGGCCG GCGCCACTGG ACGGCAGTCC GCGCACGGAT CACCGACCTG CTGACCGACC CGGCCCAGAA AGCGGCGGCG CGGCCCAGCC TGATCCCGCG TGACGCGGTC CGGCTGCACC TGCCGGTGGA CGTGGCCGAC TACGTCGACT TCTACGCCTC CGAGCACCAC GCCAGCAACG TCGGACGGAT CCTGCGCCCC GGCGGTGATC CGCTGAACCC GAACTGGCGG CACCTGCCTG TCGGCTACCA CGGCCGCTCC GGCACGGTCA TCGTCTCCGG AACCGAGATC GTCCGCCCGT GCGGGCAGCG CCGGCCCGCC GACGGCCAGC CGCCGGCGTT CGGCCCGACG ACCCGCCTCG ACATCGAGGC GGAGGTCGGC TTCGTCGTCG GGACGCCGTC GGCGCTCGGT GAACGGGTCA CCCCGGCGGC GTTCGCCGAC CACGTGTTCG GCGTGGTGCT GGTGAACGAC TGGTCGGCGC GGGACATCCA GGCGTGGGAG TACGTGCCGC TCGGGCCGTT CCTCGGGAAG TCCTTCGCGA CGTCGGTCTC GCCCTGGGTC GTCCCGCTCG ACGCGCTGTC GGCCGCGCGG TTCCAGCCGC CGCCGCGGGA GCCCGAACCG CTGCCCTACC TGCGGGACAA CGGCGCGTGG GGCCTCGACC TGCGGCTGGA GGTGAGCTGG AACGGCTCCG TGGTCAGCCG CCCGCCCTTC GCCGAGATGT ACTGGACGCC CGCGCAGCAG CTGGCGCATC TCACCGTCGG CGGCGCGGCG CTGCGCACCG GCGACCTCTT CGCCTCGGGC ACCGTCTCCG GCCCGCGCCG CGACGAGTGC GGGTCCTTCC TGGAGCTCAC CTGGAACGGC ACCGAGCCGC TGCGACTGCC CGACGGCACC GAGCGGACGT TCCTCGAGGA CGGCGACACC GTCACCATCC GGGCCACCGC CCCCAGCGAC AGCGGCGTGC GCATCGGCTT CGGCGAGGTG ACAGGAATGA TCCTCCCGGC CCGGTGA
|
Protein sequence | MTARSWVPVP KGSDFPLQNL PYGAFSADGG SPRIGVAIGD HVLDLAAALG DPVFARPRLN EFLSRGRRHW TAVRARITDL LTDPAQKAAA RPSLIPRDAV RLHLPVDVAD YVDFYASEHH ASNVGRILRP GGDPLNPNWR HLPVGYHGRS GTVIVSGTEI VRPCGQRRPA DGQPPAFGPT TRLDIEAEVG FVVGTPSALG ERVTPAAFAD HVFGVVLVND WSARDIQAWE YVPLGPFLGK SFATSVSPWV VPLDALSAAR FQPPPREPEP LPYLRDNGAW GLDLRLEVSW NGSVVSRPPF AEMYWTPAQQ LAHLTVGGAA LRTGDLFASG TVSGPRRDEC GSFLELTWNG TEPLRLPDGT ERTFLEDGDT VTIRATAPSD SGVRIGFGEV TGMILPAR
|
| |