Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3928 |
Symbol | |
ID | 5672289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4697316 |
End bp | 4698203 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242807 |
Product | fumarylacetoacetate (FAA) hydrolase |
Protein accession | YP_001508224 |
Protein GI | 158315716 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.813381 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0824427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTCG TGGGATTGCG GGACGGACGG CGGGTCGTGG TCGGCCTGGT GTTCGGTGCC GACGGGGCCG GCCCGCAGGG CGGGCCCATG GGCACGGCCG GGGACGAACG CCCCCGGGTC GCCCGCGTCG CCGAGGTCGA CGAGTTCTAC GGAGACCTCG CCGGCTGGAC GGCGAAGGCC CGCCGGATGA CCGCCGGTGA GCATGACCTC GCCGACGTCG AGCTCGTCCC GCCCGTACCG GCCGGAGCCC GGATCCTCGG CATGGGGCTG AATTATCATG CGCACGCCGC GGAGACCGGG CTGGAACTGC CCAGGCGGCC ACCGATCTTC GGCCGGTGGA CCGCGTCCCT GACGGTGGAC GGCACCCCCG TCCCCGTCCC GCCGGGCGAG CGGGGCCTGG ACTGGGAGGG CGAGCTCGCC GTCATCGTCG GGTCCAGGAT GACCGATGTC GACGAGGACG CCGCCCTGCG CGGCGTGTTC GGCTACGCGG TGTTCAACGA CCTCAGCGCC CGCCGCGCCC AGGGCGCCTC GGCGCAGTGG ACGCTGGGCA AGAACTCCGA CCGCAGCGGG CCGATGGGGC CCGTCGTGAC CGCCGACGAG GTCGGCGATC CGGCGGCGGG CCTGCGGCTG GTCACCCGCG TCAACGGCGA GGTGGTGCAG GACGGCGACA CCAGCGACAT GATCTTCTCG ATCGGCCGGA TCCTGTCGTT CGTGAGCCGC ACCCTGACCC TCAACCCGGG TGACATCCTG ATCACCGGAA CTCCCGCCGG GGTCGGCTAC ATCCGCAAGC CGCCCCGCTA CCTGGGTCCG GGTGATGTCG TCGAGGTGTG GATCGAGCGG GTGGGCACGA TCCGCAACCC GGTCGTGGAC GCGTCCGCCC GGCCGTGA
|
Protein sequence | MRFVGLRDGR RVVVGLVFGA DGAGPQGGPM GTAGDERPRV ARVAEVDEFY GDLAGWTAKA RRMTAGEHDL ADVELVPPVP AGARILGMGL NYHAHAAETG LELPRRPPIF GRWTASLTVD GTPVPVPPGE RGLDWEGELA VIVGSRMTDV DEDAALRGVF GYAVFNDLSA RRAQGASAQW TLGKNSDRSG PMGPVVTADE VGDPAAGLRL VTRVNGEVVQ DGDTSDMIFS IGRILSFVSR TLTLNPGDIL ITGTPAGVGY IRKPPRYLGP GDVVEVWIER VGTIRNPVVD ASARP
|
| |