Gene Franean1_4662 details

Gene Information

Locus tag: Franean1_4662
Symbol:
ID: 5673005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5563689
End bp: 5566070
Gene Length: 2382 bp
Protein Length: 793 aa
Translation table: 11
GC content: 73%
IMG OID: 641243520
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001508936
Protein GI: 158316428
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
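
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(5563689, 5566070)
print(length_bp)                  # 2382
print(protein_length(length_bp))  # 793
```

Both results agree with the listed Gene Length (2382 bp) and Protein Length (793 aa).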


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.576806
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCGCA GCCCCCTGAG CCCCGGCAAG GTCCCGGACC TGGTGTCCCG GCTGACGCTG 
GAGCAGAAGA TCGCCCAGCT GACGGGGTTC GCGGTCACCG ATCTCATCGT CCGCGGCGAG
GGACCGGGCC CGGCCAGTCC GGACATCGAC GTGAGCCGGG TGCCGTCACT GCGGCCGCAC
GGGGCGGGCC ACCTGTCCCT GTCCTGGTTC CTCGGCCACG ACGCGGCGAG CCTGCGGTCC
GCGCTCGACA GGATCCAGTC GGCGGTGCGT GAGGTCGCCC CCTTCGGCAT CGGGGCCCTC
GCGCACAACG AGGCCGTCAA CGGCTTCCTG CACGTCTCGG GTTCGCAGTT CCCGACCGCC
TGGGCGCAGG CGGCGACCTG GGACCCCGCG CTGGTGACCC GCGCGGCGGC CGTGAGCGCC
GCCCACATGC GCCAGACCGG CATCCACCTC GCGTTCTCCC CGGTCATGGA CCTGGCGCGG
GACCCACGGT GGGGCCGGGT GCACGAGACG TACGGTGAGG ACCCCGAGCT CGCCGCCCAG
TTCTCGGTCG CGTTCGTCCG GGGCATCCAG GGTTCCGAGG ACGACTCCGG CCTGCTCGCG
ACCGGGAAGC ACTTCGCCGG GTACGGCGCC TCCGAAGGCG GCCTCAACCA GGCGGTCACC
CAGCTGGGAC GCCGCGCCCT GGTCGACGAG TACGCCGAGC CCTTCCGCCG GGCCATCGCC
GAGGCCGGCC TGGTCGCCGT CATGAACTCG TACAACGAGC TCGACGGCGT GCCCTGCGCC
GCCGACCGCT GGCTGCTCAC CGACCTGCTC CGCGGCCAGC TCGGCTTCGA CGGCATCGTG
GTCAGCGACT ACAGCGCCGT GGACATGCTG CGGACGATCT ACCACACCGC ATCCTCGGCA
GGGCAGGCCG CCGTCCAGGC GATCAGCGCC GGCCTGGACG TCGAGCTCCC CAGCGACGTG
AACTTCTCCC ACCTGGCCGA CGAGGTCACC GGTGGCCGTC TCGACGAACA CGTCCTCGAC
ACGGCGGTCG CCCGGGTCCT CACCGTCAAG GCCCGCGTCG GGCTCATCCC GGGGATGGCG
GCGCCACGCG CCGCGTCCGC CGCGGCGCCC CCGGACCGGG CCGAGGCGGC CGAGGTGCGC
CGCGCCGTCG CCGCCCGCGG CCTGGTCCTG CTCGCCAACG ACGGCACCCT GCCCCTGGCG
CCCCGGAGTG GACGCATCGT CGTCGTGGGC CCGGCCGCCG ACGAGCTGCG GATCCACTTC
GGCGCCTACA CCTCCGTCTC CAACGCCGAG ATGCCGCTGG GGATGATGGC GGTGATGACG
GGGCAGGTGC CCGGAGTCGA TCCGGCCACC TTCGTGTTCA CCGACATCTT CCAGCCCCGG
ATGCCCGGGT TCGACGAGCG GTTCGAGGCA GAGGCCCGGC GGATCCACCC CGACGCGCCC
ACCGTGCTCG ACGCGCTGCG GGGCTTCGAT CCGACCGTCG GGTTCGTTCC GCTCGGCCGG
TTCGAGGCCG GCCCCGGGCC CGCGCTGGAC CAGGCGACGG TGACGGCGGC CGTCACCGAC
GCCGACCTCG TCATCGCCGT CGTCGGCGAG CGGACCGGAT GGGTCGGGAA CAACACCGCC
GGCGAAGGCC AGTCCACCGC GTCCCCCGAC CTGCCCGGCG ACCAGGACGA GCTCGTCGCT
CTGCTCGCCG CGACCGGCAG GCCCCTGGTG ACCGTGATGG TCTCCGGACG TCCCCTCCTC
CTCGACAGCG TGGCTCGGGC CTCGCGCGCG GTCCTTCTCG CGCCACTGCT GGGAGAAGAA
GCCGGCCCGG CGATCGCCGG GGCCGTCTTC GGGACGATCA ACCCGAGCGG GAAGCTCCCA
AGCACCTTCC CCCGGCACCT GGGCCAGGTC CCGATCTATC ACGGGCACCA CTTCGGCAGC
GGCTACGACC ACCCCACCGG CACCCGGCAC GGCTACAACG ACCTCGACGA CGACGCCCCG
CTCTACGCCT TCGGGCACGG CCTGTCCTAC AGCACCTTCG ACATCGCCCT CGACGAGTCG
GCCGAACCGG CGGTCGAGGA GATCGACGGG CTGCTGCGGG CGCGGCTCAT CGTGTCGAAC
ACCGGCACCG TCGACGGCGA GACCGTCGTA CAGCTCTACG CACGGGACGA AGCCGCGACG
ATCGTCCGCC CCGTCCGCCA GCTCCTCGGA TTCACCCGCC TGGCCCTGGC CGCGGGAGAG
ACCCGGCGCG TTCTCCTCGA AGCCCCGACC GAACGGCTGT TCTACACCAT GGCCGACGGC
ACCCGTGGCC TCGAGGCCGG CGACGTCACC GTGCTGGCCG CGCTCAGCAG TGACGACGTC
CGCTGCTCGC GGACCGTCCC GCTGTCCGCC CGCCGCGCCT GA
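
The gene sequence above is laid out in space-separated blocks of 10 bases, so the layout whitespace has to be removed before any analysis. The following is a minimal sketch with hypothetical helper names that cleans such a string and computes its GC fraction. The sample input is only the first line of the sequence, so its GC fraction will not necessarily equal the 73% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "GTGAACCGCA GCCCCCTGAG CCCCGGCAAG GTCCCGGACC TGGTGTCCCG GCTGACGCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Passing the full 2382 bp sequence through the same two functions would give the gene-wide GC fraction.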
 
Protein sequence
MNRSPLSPGK VPDLVSRLTL EQKIAQLTGF AVTDLIVRGE GPGPASPDID VSRVPSLRPH 
GAGHLSLSWF LGHDAASLRS ALDRIQSAVR EVAPFGIGAL AHNEAVNGFL HVSGSQFPTA
WAQAATWDPA LVTRAAAVSA AHMRQTGIHL AFSPVMDLAR DPRWGRVHET YGEDPELAAQ
FSVAFVRGIQ GSEDDSGLLA TGKHFAGYGA SEGGLNQAVT QLGRRALVDE YAEPFRRAIA
EAGLVAVMNS YNELDGVPCA ADRWLLTDLL RGQLGFDGIV VSDYSAVDML RTIYHTASSA
GQAAVQAISA GLDVELPSDV NFSHLADEVT GGRLDEHVLD TAVARVLTVK ARVGLIPGMA
APRAASAAAP PDRAEAAEVR RAVAARGLVL LANDGTLPLA PRSGRIVVVG PAADELRIHF
GAYTSVSNAE MPLGMMAVMT GQVPGVDPAT FVFTDIFQPR MPGFDERFEA EARRIHPDAP
TVLDALRGFD PTVGFVPLGR FEAGPGPALD QATVTAAVTD ADLVIAVVGE RTGWVGNNTA
GEGQSTASPD LPGDQDELVA LLAATGRPLV TVMVSGRPLL LDSVARASRA VLLAPLLGEE
AGPAIAGAVF GTINPSGKLP STFPRHLGQV PIYHGHHFGS GYDHPTGTRH GYNDLDDDAP
LYAFGHGLSY STFDIALDES AEPAVEEIDG LLRARLIVSN TGTVDGETVV QLYARDEAAT
IVRPVRQLLG FTRLALAAGE TRRVLLEAPT ERLFYTMADG TRGLEAGDVT VLAALSSDDV
RCSRTVPLSA RRA