Gene Franean1_3789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3789 
Symbol 
ID5672153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4492556 
End bp4493752 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content74% 
IMG OID641242668 
Productfumarylacetoacetase 
Protein accessionYP_001508088 
Protein GI158315580 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCCC GTTCGTGGGT GCCGGTACCC AAGGGCTCGG ACTTCCCGCT GCAGAACCTC 
CCCTACGGCG CCTTCTCCGC CGACGGTGGG AGCCCCCGGA TCGGCGTCGC CATCGGCGAC
CACGTGCTGG ACCTCGCCGC CGCGCTGGGG GACCCGGTGT TCGCCCGGCC CCGGCTGAAC
GAGTTCCTCA GCCGGGGCCG GCGCCACTGG ACGGCAGTCC GCGCACGGAT CACCGACCTG
CTGACCGACC CGGCCCAGAA AGCGGCGGCG CGGCCCAGCC TGATCCCGCG TGACGCGGTC
CGGCTGCACC TGCCGGTGGA CGTGGCCGAC TACGTCGACT TCTACGCCTC CGAGCACCAC
GCCAGCAACG TCGGACGGAT CCTGCGCCCC GGCGGTGATC CGCTGAACCC GAACTGGCGG
CACCTGCCTG TCGGCTACCA CGGCCGCTCC GGCACGGTCA TCGTCTCCGG AACCGAGATC
GTCCGCCCGT GCGGGCAGCG CCGGCCCGCC GACGGCCAGC CGCCGGCGTT CGGCCCGACG
ACCCGCCTCG ACATCGAGGC GGAGGTCGGC TTCGTCGTCG GGACGCCGTC GGCGCTCGGT
GAACGGGTCA CCCCGGCGGC GTTCGCCGAC CACGTGTTCG GCGTGGTGCT GGTGAACGAC
TGGTCGGCGC GGGACATCCA GGCGTGGGAG TACGTGCCGC TCGGGCCGTT CCTCGGGAAG
TCCTTCGCGA CGTCGGTCTC GCCCTGGGTC GTCCCGCTCG ACGCGCTGTC GGCCGCGCGG
TTCCAGCCGC CGCCGCGGGA GCCCGAACCG CTGCCCTACC TGCGGGACAA CGGCGCGTGG
GGCCTCGACC TGCGGCTGGA GGTGAGCTGG AACGGCTCCG TGGTCAGCCG CCCGCCCTTC
GCCGAGATGT ACTGGACGCC CGCGCAGCAG CTGGCGCATC TCACCGTCGG CGGCGCGGCG
CTGCGCACCG GCGACCTCTT CGCCTCGGGC ACCGTCTCCG GCCCGCGCCG CGACGAGTGC
GGGTCCTTCC TGGAGCTCAC CTGGAACGGC ACCGAGCCGC TGCGACTGCC CGACGGCACC
GAGCGGACGT TCCTCGAGGA CGGCGACACC GTCACCATCC GGGCCACCGC CCCCAGCGAC
AGCGGCGTGC GCATCGGCTT CGGCGAGGTG ACAGGAATGA TCCTCCCGGC CCGGTGA
 
Protein sequence
MTARSWVPVP KGSDFPLQNL PYGAFSADGG SPRIGVAIGD HVLDLAAALG DPVFARPRLN 
EFLSRGRRHW TAVRARITDL LTDPAQKAAA RPSLIPRDAV RLHLPVDVAD YVDFYASEHH
ASNVGRILRP GGDPLNPNWR HLPVGYHGRS GTVIVSGTEI VRPCGQRRPA DGQPPAFGPT
TRLDIEAEVG FVVGTPSALG ERVTPAAFAD HVFGVVLVND WSARDIQAWE YVPLGPFLGK
SFATSVSPWV VPLDALSAAR FQPPPREPEP LPYLRDNGAW GLDLRLEVSW NGSVVSRPPF
AEMYWTPAQQ LAHLTVGGAA LRTGDLFASG TVSGPRRDEC GSFLELTWNG TEPLRLPDGT
ERTFLEDGDT VTIRATAPSD SGVRIGFGEV TGMILPAR