Gene Franean1_4855 details


Gene Information

Locus tag: Franean1_4855
Symbol:
ID: 5673195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5823610
End bp: 5824821
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 69%
IMG OID: 641243710
Product: amidohydrolase
Protein accession: YP_001509126
Protein GI: 158316618
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
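
The start, end, and length fields in this record are consistent with one another, and a short check makes the arithmetic explicit. The sketch below is illustrative Python and not part of the source record. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5823610
end_bp = 5824821

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1212 403
```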


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.165345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.275978
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGACAC TCAAGGCCGC CGGCTTGCTC GATGTCGATA CCGGGGAGAT CACCCGGCCC 
GGGATCCTGA AGATCGAGAA CGATCGGATC GTCGGTCTCG GCGGCCAGCC CGAGGGCGAG
GTCCTGGACC TCGGCGACCT GATCCTGCTG CCCGGCCTGA TGGACATGGA GCTCAACCTC
CTGATGGGCG GCCCCGGCGA GCACCCGCTC ACCGGGCAGA TGCGCGACGA CCCGCCACTG
CGGATGATGC GCGCCACCCA GAACGCCCGC CGCACCCTGC GAGCCGGCTT CACCACGGTG
CGCAACCTCG GCCTGTTCTG CAAGACCGGC GGCTACCTGC TCGACGTCGC ACTCATGAAG
GCGATCGACG CCGGCTGGGT CGACGGCCCG CGGATCGTCC CCGCCGGCCA CGCCATCACC
CCGACCGGCG GCCACCTCGA CCCCACCATG TTCGGCGCGT TCGCCCCGCA CGTCCTGAAA
CTCACCGTCG AGGAGGGCCT GGCCAACGGC GTCGACGAGA TCCGCCGAGC CGTCCGGCAC
AACATCAAGC ACGGCGCCCA GGTCATCAAG ATGTGCGGCT CGGGCGGAGC CATGTCCCAC
AGTGGGCCCT CGGGCGCGCG GCACTACTCC GACCAGGAGG TGCTGGCCCT CACCGACGAG
GCACACCGAC GCGGCCTACG CGTGGCGGCG CACACCCACG GCTCGGAAGC CGTCCGGCAG
ATGGTCGAAT GCGGCGTCGA CTGCATCGAA CACGCATTCA TGATCGACGA CGACACCATC
AACCTGCTGA TCAAGCGCGG CACCTGGGTG GTCGCCACCC AGGCCCTGAT CGACAACATG
CCGGTGCTGA ACGACGCCGA GCCGGAGATC CAGGCGAAGG CCGCCCTCAT CTTCCCCCGC
GCGAAGGCAT CGATCCGCAA CGCCATCGAG GCAGGCGTCA AGATCGCCGT CGGGAGCGAC
GCGCCGGCGA TCCCACATGG CAAGAACGCC CTCGAACTGG TCGCCCTGGT CGACCGGGGC
ATGACCCCGC TGCAGGCCAT CCAGGCCGCG ACCATCGTGG GCGCGGACCT CATCGACGTC
ACCGACCGCG GCCGGCTCGC CGAGGGCCTG CTCGCCGACG TCATCGGCGT GCGCGGCAAC
CCCCTCGAGG ACATCCGCGT CCTGCAGCGG GTCCCGTTCG TCATGAAGGG CGGCAAGCAG
TTTGTCTACT GA
 
Protein sequence
MLTLKAAGLL DVDTGEITRP GILKIENDRI VGLGGQPEGE VLDLGDLILL PGLMDMELNL 
LMGGPGEHPL TGQMRDDPPL RMMRATQNAR RTLRAGFTTV RNLGLFCKTG GYLLDVALMK
AIDAGWVDGP RIVPAGHAIT PTGGHLDPTM FGAFAPHVLK LTVEEGLANG VDEIRRAVRH
NIKHGAQVIK MCGSGGAMSH SGPSGARHYS DQEVLALTDE AHRRGLRVAA HTHGSEAVRQ
MVECGVDCIE HAFMIDDDTI NLLIKRGTWV VATQALIDNM PVLNDAEPEI QAKAALIFPR
AKASIRNAIE AGVKIAVGSD APAIPHGKNA LELVALVDRG MTPLQAIQAA TIVGADLIDV
TDRGRLAEGL LADVIGVRGN PLEDIRVLQR VPFVMKGGKQ FVY
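
The gene and protein sequences in this record are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal Python sketch of both steps, not the method used by the database. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon table is the standard one, which table 11 shares, and the start codon set is the table 11 set as listed by NCBI to the best of my knowledge; an initiating start codon is read as M, which is why the GTG at the beginning of the gene corresponds to M in the protein.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same amino acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating start codon is translated as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGCTGACAC TCAAGGCCGC CGGCTTGCTC"))  # prints: MLTLKAAGLL
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the Protein sequence and GC content fields of this record.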