Gene Franean1_3237 details


Gene Information

Locus tag: Franean1_3237
Symbol:
ID: 5671612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3825801
End bp: 3827015
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 69%
IMG OID: 641242130
Product: amidohydrolase
Protein accession: YP_001507550
Protein GI: 158315042
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.136124
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACAC TCAAGGCCGC CGGCCTGCTG GATGTGGACA CCGGGGAGAT CACCCGACCC 
GGGATCCTGA AGATCGAGAA CGATCGGATC GTCGGTCTCG GCGGTCAGCC CGAGGGCGAG
GTCATGGACC TCGGTGACCT GATCCTGCTG CCCGGCCTGA TGGACATGGA GCTCAACCTC
CTGATGGGCG GCCCCGGCGA GCACCAGCTC ACCGGGCAGA TGCGCGACGA CCCGCCGCTG
CGGATGATGC GCGCCACCCA GAACGCCCGC CGCACCCTGC GAGCCGGCTT CACCACGGTG
CGCAACCTCG GCCTGTTCTG CAAGACCGGC GGCTACCTGC TCGACGTCGC ACTCATGAAG
GCGATCGACG CCGGCTGGGT CGACGGCCCG CGGGTCGTGC CCGCAGGTCA CGCGATCACG
CCGACCGGCG GCCACCTCGA CCCGACCATG TTCGGGGCGT TCGCCCCGCA CGTGCTCGGC
CTCAGTGTCG AGGAGGGCCT GGCCAACGGG CCCGACGAGA TCCGCCGGGC GGTCCGCCAC
CAGATCAAGT ACGGCGCCCA GGTCATCAAG ATGTGCGGCT CGGGTGGAGC CATGTCCTAC
AGCAGCGGCC CCTCGGGCCA GAAACACTAC TCCGACGCCG AGGTGCTGGC GATCACCGAC
GAGGCGCACC GCCGCGGCCT GCGGGTGGCC GCGCACACCC ACGGCTCGGA AGCCGTCCAG
CAGATGGTCG AATGCGGCGT CGACTGCATC GAACACGCGT TCATGATCGA CGACGACACC
ATCAACCTGC TGGTCAAGAA CGGTGTCTGG GTCGTGGCGA CCCAGGCTCT GATCGACGAC
ATGCCGGTGC TGCGAGACGC CGAGCCGCAG ATCCAGGCGA AGGCCGCCTA TATCTTCCCC
CGTGCGAGGG CCTCCATCCG CAACGCGATC GAGGCCGGTG TCAAGATCGC GGTCGGCAGC
GACGCGCCGG CGATCCCGCA CGGCAAGAAC GCCCTCGAAC TGGTCGCCCT GGTCGACCGG
GGCATGACCC CGCTGCAGGC CATCCAGGCC GCGACCATCG TGGGCGCGGA CCTCATCGAC
GTCACCGACC GCGGCCGACT CGCCGAGGGC CTGCTCGCCG ACGTCATCGG CGTGCGCGGC
AACCCCCTCG AGGACATCCG CGTCCTGCAG CGGGTCCCGT TCGTCATGAA GGGCGGCAAG
CAGTTTGTCT ACTGA
 
Protein sequence
MLTLKAAGLL DVDTGEITRP GILKIENDRI VGLGGQPEGE VMDLGDLILL PGLMDMELNL 
LMGGPGEHQL TGQMRDDPPL RMMRATQNAR RTLRAGFTTV RNLGLFCKTG GYLLDVALMK
AIDAGWVDGP RVVPAGHAIT PTGGHLDPTM FGAFAPHVLG LSVEEGLANG PDEIRRAVRH
QIKYGAQVIK MCGSGGAMSY SSGPSGQKHY SDAEVLAITD EAHRRGLRVA AHTHGSEAVQ
QMVECGVDCI EHAFMIDDDT INLLVKNGVW VVATQALIDD MPVLRDAEPQ IQAKAAYIFP
RARASIRNAI EAGVKIAVGS DAPAIPHGKN ALELVALVDR GMTPLQAIQA ATIVGADLID
VTDRGRLAEG LLADVIGVRG NPLEDIRVLQ RVPFVMKGGK QFVY
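
The fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database: the helper names (clean_sequence, gc_fraction, expected_protein_length) are hypothetical. It assumes 1-based inclusive coordinates and a single terminal stop codon, and shows how the 10-residue blocking can be stripped from a sequence before computing its length or GC fraction.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-residue blocking."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def expected_protein_length(start_bp: int, end_bp: int) -> int:
    """Protein length implied by 1-based inclusive coordinates,
    assuming an unspliced CDS that ends in one stop codon."""
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Coordinates taken from the Gene Information fields of this record.
print(3827015 - 3825801 + 1)                      # 1215
print(expected_protein_length(3825801, 3827015))  # 404
```

Both values agree with the Gene Length and Protein Length fields. Passing the full gene sequence to gc_fraction gives a figure that can be compared with the reported GC content.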