Gene Franean1_3257 details


Gene Information

Locus tag: Franean1_3257
Symbol:
ID: 5671631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3854969
End bp: 3858022
Gene Length: 3054 bp
Protein Length: 1017 aa
Translation table: 11
GC content: 72%
IMG OID: 641242149
Product: glycoside hydrolase family protein
Protein accession: YP_001507569
Protein GI: 158315061
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
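
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 3854969
end_bp = 3858022
gene_length_bp = 3054
protein_length_aa = 1017


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS (the record lists 'Is gene spliced: No').
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)
```

Under these assumptions both checks hold: 3858022 - 3854969 + 1 = 3054 bp, and 3054 / 3 - 1 = 1017 aa.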


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGGGA CCGACTTCAC CGCAGTGCCG TGGGAGGACA GCGGCTTCCA GGCCCTGGGT 
CGGCTGCCGA TGCATTCGAT CAGGCGACCC GCCGAGGTCG AGCTGGACGG GGTGTGGGAC
TTCCAGTTGC TGGACAGCCC GACCGCGCCC CCCGGTGCGG AGTGGAAGCG CGCCGACGTC
CCCGGCCTCT GGACGATGAG CGAGGCCGGC GACCCGGCGC ACTACACGAA TGTCCGGATG
CCGTTCGACG AGACGCCCCC GCGCATCCCG GCGCGCAACC CCACCGGGGT CTACCGGCGT
TCCTTCGAGC TGCGCCCCGC CGCGGGCCGC CGGGCGATCC TGCATGTCGG GGCGGCGGAG
GGCCTGCTGC GGGTGTTCGT GAACGGGCGC GCGATCGGCG TCTCCACTGA CTCGCATCTC
GCCGCCGAGT TCGACATCAC CGAGGCCTGT GTCGCCGGCG GACGGGACCG GCACACGGTG
GAGCTGGTGA TCTCCAAGTG GTCGTCGGTC TCCTATCTCG AGGACCAGGA TCACTGGTGG
CAGTCGGGTA TCACCCGGTC GGTGTACGTC TACACGCTCC CGGAGATCCG GCTGGCCGAC
CTGGCTGTCG TCGCCGACTT CGATCCCGAG GCGCGTCGCG GGACGCTGCG CCTGGAGGTG
TCGACCGCCG GTCTCGACCA CCTGCCGGAA CTCGCGTGGA CCGTGCGGGT CGACGTCCTG
GGGCGCCGAT CGACGCTGCC GGTCACCCCC TGCTCCCCGG CTCCGGGACT CCCACCGCCC
TGGGACGACC GGTCGGTCCG GCCCGAGCCC CGCGTCCCGG CCGACTTCAT GAGCCTGGTC
TCCCAGATAG CCGCGGACGC CCCGATCCCG CCGAGGTGGT CGGCGGCGGT TCCGTCCCTG
AGACGCAGCC TCGCGCCCCG GCGGCCTGCC GGCACCGCCA CGCTCCACCT CGACGGCCTC
GAGGTGGAGC CGTGGTCGGC CGAGTCTCCC CATCTGGAGG ATCTCGTCGT CGCGCTCGTC
TCCCCCGGCG GCGAGGTCGT GGACGAGACC CGCACCAGGA TCGGCTTCCG GCGGGTCCGG
ATCGAGGGCC GGGATCTGCT GGTCAACGGC GGACGGATCC TCATCCAGGG GGTGAACCGC
CACGACACGG ACGCCCGCAC CGGCCGGGTT CTCTCGGCGC GGACGATGCT CGCGGAGCTC
TCCCTGCTCA AACGGTTCAA CGTGAACGCG ATCCGCACCT CGCACTATCC CAACGACCCG
CGCCTGCTGG AGCTGTGTGA CGAGCTCGGC TTCTACGTCG TCGACGAGGC GGACATCGAG
GCGCACGCCT TCGCCAACGC GATCTGCGAC GATCCGCGCT ACCTGCCGGC TTTCCTCGAC
CGGGTGTCGC GGATGACGCT GCGCGACCGC AACCATCCGA GCGTGATCGT CTGGTCGCTC
GGCAACGAGA CGGGCTACGG AGCGAACCAC GACGCCGCCG CCGGATGGCT GCGCCGCTTC
GACCCGACCC GCCCCCTGCA CTACGAGGGG GCGATCGCCC TCGACTGGCA CGGCGGCCGG
GCCGCGACCG ACATCGTCTG CCCGATGTAC CCCTCGTTCG AGGCGCTGGC CGCCTTCTCC
GCCGATCCGC GGGCCGACCG TCCGGTGATC CTGTGCGAGT ACGCCTACTC GCAGGGGAAC
TCGACCGGAG GGCTGGCCGA GTACTGGGAG ATGTTCGAGA CGCTGCCCGG CCTGCAGGGC
GGGTTCATCT GGGAGTTCAA GGACCATTCC CTCGATCCCG CGGGCGACGG CGGGTACCGC
TACGGCGGCG ACTTCGGGGA CGTCCCCCAC GACGGCGCGA CCCTGGTGAA CGGCATCGTC
TTTCCCGACC TCACCCCCCA GCCCGCGCTG TACGAGGCAC GCGGCCTGTT CAGCCCGGTA
CGGATCGTCT CGGACGCCGC GGCCGCGCTC GCGGGGGCGA TCGGCATCCG CAACCGCCAG
TCCTTCAACG ACCTGAGCGC CTACGTCCTG GAACTACAGG TGGAGAACGA CCGGTCGACG
GACCCGGTCA CCGTCGACGT GCCGGACGTC GCGCCCGGGG CGACCGGGAC CATCGAGCTG
CCCGACCCGA TCCGCAAGCT GCTCGGGTCC GCACCGCCGT TGGCCCTGAC CCTGACCGTG
CGGACCAGGC AGGGCGCGCG GTGGGCACCG GCGGGCACCG TCGTCGCCGC ACACCAGATC
ACCTTCCCGC GGCCGCCGGT CGCCCTGCTG TCCGCCCCGG TTCCCGACGC CCTCCGCGTC
GACCAGGACG GGTCCGTCGT ACACCCCCTG CTGCGGCGCG CCCCGGCACT GTGCCTGTGG
CGGGCAGTCA CCGACAACGA CAAGTCCTTC TCCCTCGACC AGCGCTTCGT CCGCTCCGGC
TTCTTCCGCC TCACCCCCGG GGCAGTCACG GCCGAAGCGG ACGGGGGCAG CCTGACGATC
ACCACCGCCT ACGCCACCGC GTTCGGCACC GAGGTGACGC ACCGCCGCGT CATCAGCGCG
CTGGCCGAGC ACGACTACCG GTTCGACGAG CACGTCCAGC TGCCCGCGGA CACCGAGGAC
GCGCTGCGGG TCGGCGTGGA GTTCGAGCTC ACGCCCGGTT TCGACGACGC CCGCTGGGTC
GGGCTCGGGC CGTGGGAGAA CTACCCCGAC CGCCGCTCCT CCGCGCTCCT CGGCTCCTGG
CGGGAACGCA TCGACGACCT GGCCGTGCCC TACCTCGTTC CGCAGGAGAA CGGCGGGCGC
GGGGCGGTGA GCGAGCTGTG CCTGAGCGGT CCGGCCGGCA CCGTGCGCAC CTTCCACCCG
ACGCCGGTGC AGATGGCCGT CGGCCGGCAT CGCGTCGATC AGCTCGAGGC CGCGGCCCAC
TGGTGGGAGC TCCCGCCCAG CGACGTCACC GTCGTCCACC TCGACGTGGC GCACCGGGGT
GTCGGCACCG CCCAGCTGGG CCCGGACACC CGTCCCCGTC ACCGCCTCAC CGACCACGAG
TACACATGGA CGTGGCGGCT ACGGCTCGAG TCGGCCGCCG GGGCCGCGGT CTGA
 
Protein sequence
MTGTDFTAVP WEDSGFQALG RLPMHSIRRP AEVELDGVWD FQLLDSPTAP PGAEWKRADV 
PGLWTMSEAG DPAHYTNVRM PFDETPPRIP ARNPTGVYRR SFELRPAAGR RAILHVGAAE
GLLRVFVNGR AIGVSTDSHL AAEFDITEAC VAGGRDRHTV ELVISKWSSV SYLEDQDHWW
QSGITRSVYV YTLPEIRLAD LAVVADFDPE ARRGTLRLEV STAGLDHLPE LAWTVRVDVL
GRRSTLPVTP CSPAPGLPPP WDDRSVRPEP RVPADFMSLV SQIAADAPIP PRWSAAVPSL
RRSLAPRRPA GTATLHLDGL EVEPWSAESP HLEDLVVALV SPGGEVVDET RTRIGFRRVR
IEGRDLLVNG GRILIQGVNR HDTDARTGRV LSARTMLAEL SLLKRFNVNA IRTSHYPNDP
RLLELCDELG FYVVDEADIE AHAFANAICD DPRYLPAFLD RVSRMTLRDR NHPSVIVWSL
GNETGYGANH DAAAGWLRRF DPTRPLHYEG AIALDWHGGR AATDIVCPMY PSFEALAAFS
ADPRADRPVI LCEYAYSQGN STGGLAEYWE MFETLPGLQG GFIWEFKDHS LDPAGDGGYR
YGGDFGDVPH DGATLVNGIV FPDLTPQPAL YEARGLFSPV RIVSDAAAAL AGAIGIRNRQ
SFNDLSAYVL ELQVENDRST DPVTVDVPDV APGATGTIEL PDPIRKLLGS APPLALTLTV
RTRQGARWAP AGTVVAAHQI TFPRPPVALL SAPVPDALRV DQDGSVVHPL LRRAPALCLW
RAVTDNDKSF SLDQRFVRSG FFRLTPGAVT AEADGGSLTI TTAYATAFGT EVTHRRVISA
LAEHDYRFDE HVQLPADTED ALRVGVEFEL TPGFDDARWV GLGPWENYPD RRSSALLGSW
RERIDDLAVP YLVPQENGGR GAVSELCLSG PAGTVRTFHP TPVQMAVGRH RVDQLEAAAH
WWELPPSDVT VVHLDVAHRG VGTAQLGPDT RPRHRLTDHE YTWTWRLRLE SAAGAAV
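
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustration under stated assumptions, not the method used to produce the record: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which this sketch ignores). The helpers `clean`, `translate`, and `gc_percent` are hypothetical names introduced here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display lines of the gene sequence above.
first_line = "ATGACTGGGA CCGACTTCAC CGCAGTGCCG TGGGAGGACA GCGGCTTCCA GGCCCTGGGT"
last_line = "TACACATGGA CGTGGCGGCT ACGGCTCGAG TCGGCCGCCG GGGCCGCGGT CTGA"

print(translate(first_line))
print(translate(last_line))
```

Translating the first line gives MTGTDFTAVPWEDSGFQALG, the first 20 residues of the protein sequence, and the last line gives YTWTWRLRLESAAGAAV followed by the TGA stop codon, matching the end of the protein. Passing the full gene sequence to `gc_percent` would allow the listed GC content to be checked in the same way.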