Gene Franean1_3928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3928 
Symbol 
ID5672289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4697316 
End bp4698203 
Gene Length888 bp 
Protein Length295 aa 
Translation table11 
GC content73% 
IMG OID641242807 
Productfumarylacetoacetate (FAA) hydrolase 
Protein accessionYP_001508224 
Protein GI158315716 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.813381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0824427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTCG TGGGATTGCG GGACGGACGG CGGGTCGTGG TCGGCCTGGT GTTCGGTGCC 
GACGGGGCCG GCCCGCAGGG CGGGCCCATG GGCACGGCCG GGGACGAACG CCCCCGGGTC
GCCCGCGTCG CCGAGGTCGA CGAGTTCTAC GGAGACCTCG CCGGCTGGAC GGCGAAGGCC
CGCCGGATGA CCGCCGGTGA GCATGACCTC GCCGACGTCG AGCTCGTCCC GCCCGTACCG
GCCGGAGCCC GGATCCTCGG CATGGGGCTG AATTATCATG CGCACGCCGC GGAGACCGGG
CTGGAACTGC CCAGGCGGCC ACCGATCTTC GGCCGGTGGA CCGCGTCCCT GACGGTGGAC
GGCACCCCCG TCCCCGTCCC GCCGGGCGAG CGGGGCCTGG ACTGGGAGGG CGAGCTCGCC
GTCATCGTCG GGTCCAGGAT GACCGATGTC GACGAGGACG CCGCCCTGCG CGGCGTGTTC
GGCTACGCGG TGTTCAACGA CCTCAGCGCC CGCCGCGCCC AGGGCGCCTC GGCGCAGTGG
ACGCTGGGCA AGAACTCCGA CCGCAGCGGG CCGATGGGGC CCGTCGTGAC CGCCGACGAG
GTCGGCGATC CGGCGGCGGG CCTGCGGCTG GTCACCCGCG TCAACGGCGA GGTGGTGCAG
GACGGCGACA CCAGCGACAT GATCTTCTCG ATCGGCCGGA TCCTGTCGTT CGTGAGCCGC
ACCCTGACCC TCAACCCGGG TGACATCCTG ATCACCGGAA CTCCCGCCGG GGTCGGCTAC
ATCCGCAAGC CGCCCCGCTA CCTGGGTCCG GGTGATGTCG TCGAGGTGTG GATCGAGCGG
GTGGGCACGA TCCGCAACCC GGTCGTGGAC GCGTCCGCCC GGCCGTGA
 
Protein sequence
MRFVGLRDGR RVVVGLVFGA DGAGPQGGPM GTAGDERPRV ARVAEVDEFY GDLAGWTAKA 
RRMTAGEHDL ADVELVPPVP AGARILGMGL NYHAHAAETG LELPRRPPIF GRWTASLTVD
GTPVPVPPGE RGLDWEGELA VIVGSRMTDV DEDAALRGVF GYAVFNDLSA RRAQGASAQW
TLGKNSDRSG PMGPVVTADE VGDPAAGLRL VTRVNGEVVQ DGDTSDMIFS IGRILSFVSR
TLTLNPGDIL ITGTPAGVGY IRKPPRYLGP GDVVEVWIER VGTIRNPVVD ASARP