Gene Franean1_3292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3292 
Symbol 
ID5671664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3900100 
End bp3903213 
Gene Length3114 bp 
Protein Length1037 aa 
Translation table11 
GC content70% 
IMG OID641242181 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001507601 
Protein GI158315093 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAGA CTTCGCCCGG ATCTCTGCCG GTCTATCGTG ACCCGGCGCG GTCGACAGCG 
GCTCGGGTCG CCGATCTGAT CGAGCGGATG TCGCTCGAGG AGAAGGTCGC CCAGCTGCGG
TCGATCTGGA TCTCGAAGTA CAAGGTTGTG CTGCCCGACG GCACCTTCGA CCCGGCGCGG
GCCCACGTGC TCATACCTGA TGGCATCGGA TTCGTGGGAC GGCCGGTGGA CGCGATGGGG
ATGGCGGGCT TTCCGGCGAA CTGGCATCGG AGCCGGGAGG AGACCATCGC GTTCGTCGAC
GCGGTCCAGC GGTACCTGGT CGAGGAGACC CGCCTCGGCA TCCCGGCCCT GTTCCACGAC
GAGACGGCGC ACGGCTTCGT CGCCCGCGGT GCGACGATCT TCCCGATTCC CCCGGCTCTG
GCGAGCACCT GGGACGAGGA TCTGGTGGAG GAGGTCTTCA CCGTGGTGGC CCGGGAGGCC
CGGTCCGTCG GATCCACCGT GTCCCTCGGT CCTGTGCTCG ATCTGGCGCG GGATCCGCGC
TACGGCCGTG TGGAGGAGTT CTTCGGGGAG GACCCGTATC TGGTAGGCCG GATGGGAGTC
GCCGCCGTGC GGGGGTTGCA GGGCCGGTCC CGACCGCTCG CGGCCGACCG GATGTTCGCC
ACACTCAAGC ACTTCCTCCA CGCGTCGCCT GAGGGTGGGA TCAACGCGGC GCCCGCGCCG
GCTCATGAGC GGTCCCTGCG GGAGACCTAC CTGGCTCCGT TCGTCGACGT CGTCCGCGAG
GCCAATCCGG CCTTCATCAT GCCCTCGTAT AACGAGGTCG GCGGGCTGCC GTCGCACGCC
AGCCGCGACC TCCTGCAGCG GCTGGGGCGC GCTCTGCTCG GGTTCGAGGG TGTGTACCTG
AGTGACTACG ACGCGCTTGC GCGGCTGATC TCGGATCATC GGGTGGCCGC CGGTCTCGGG
GAGGCGGCGG CGATCGGGCT GACAGCAGGC GTCGACGTGG ACCTCCCGGA CGGCGAGGCA
TTCTCGATGC TCGCCCCGCT GGTCCGGGAG GGCCTCGTCG ACGAGACCCT CGTCGACGAG
GCCCTCGCCC GGGTGCTCGC GCTCAAGTTC GAGGCCGGTC TGTTCGAGCA GCCGTACGGG
CGCCTGGAAC AGGCGGAGTA CAACAGCGCC GAGGCGGTCC GGTTGGCACG CAACTCGGCG
ACGCGTGCGC TCACCCTCCT GACGAACGAC GGCATCCTGC CGCTCGATCC GAACGCCGAG
ATCCGGCTCG CCGTCGTGGG CCCGAATGCC GGCGAGCTCT ACTACGGCGG GTACTCGGGC
GAGAACGACG CTGGGGTGAG CGTGCTCGAC GGCCTGCGGG CGGCCATCGT CGGAAGCGCG
ATCACGGTCG AGCACGCGGA GGGCGTCCGG CTGGTGGGCG CCGAGGAGGA GGCGGCCATG
CCCGGCCCGG GCCGCGCGCC CGTGCTGCCG GTCGACGACG CGGAGAACCG GCGGCGGATC
AAGGACGCCG TCGCTGTCGT CGAGCGGGCG GACGTCGTGC TCCTCGTGGT CGGCGACCAC
CCGGCCATCG CCCGTGAGAC GACCCGGCCC CTCTTCCCGG GTGACCGCAA CGAGCTCGGT
CTCTACGGCC TGCAGGAGGA ACTCGTCGAG GCCGTTGTGC AGGTCGGGAA GCCGGTCATC
GCCCTGCTCG TCAACGGCCG GCCGATCGCC GCCACCCGGC TCGCGGCGGG CGCGAACGCT
CTGCTCGAGG GGTGGTACCT GGGCCAGGAG ACGGGGAACG CCGTCGCGGA CGTGCTCTTC
GGGCGGGCCG AACCAGGTGG TCGACTCCCG GTCTCCGTGC CCCGTGCCTC GGGTGCTGTG
CCCGTCTACT ACGACCGCCA CACCTCGGCC AACCTGTACC CGTACGTCGA GGTGGACCGG
ACGCCGCTGT TCCCGTTCGG TCACGGCCTG GGCTACACCA CGTTCGACAT CTCCGAGCCG
GTGCTCGACC GGTCCAGCAT CCACGTCGGG GAATCCGTCG GCATCAGCGT CGAGGTGAGC
AACACCGGGA ACCGCGCCGG GGACGAGGTG GTCCAGCTCT ATGTCCGGGA CGACGTCTCG
TCGGTACCGA GACCGGAACT GCAGTTGCGC GGCTTCCGCC GCATCACGCT GGAACCGGGG
CGCTCGACGA CGGTGCGGTT CGTCCTCGAG CCACACCAGC TCGCCTTCTG GAACATCGAC
CTCACCGAAC GGATCGTGGA GCCGGGAACA TTCACCATCT CGGTGGGCCG GAGCTCGACG
CAGCTCCGTT CCGTGACGCT GTCCGTGGCG GGGCCCGGTA GCGAGCGAGC CCGGCGGCGT
CGTGGGCGGC CCGGTGGAAT GTGCCGGTAC TTCCTCGGCG GCAGGGCTGT CCGCGTGATC
CGGCCCATGT CGTGGCAACG AGAAGAAGCC CGGTGTCAGC TGTCGTTACT GAAACGTTCC
TGTTCCCCTG GGGTGAGGCT GGTGTCGTTG AGTGCGCCGA CGCGTCTGCG CGTCGAACAT
CTGGACGAGG CGTTCGGGAC AGAGGTCCGT CGTCCGCGGC TGTCGTGGTG GCTGCCGGCG
GGCTCCGCCC GCCAGACGGC CCACCGCATC AGCACGGGGG AGTGGGACTC GGGGCGGATC
GAGAGCGACC GGGCCAGGTG GTGGTGTGGC GGGTCAAGGT GTGGACGGAT CTCGGCGAGG
GCAGTTGGTC CCAGACCTGT TCCTGGGAGG TGGGGATCGG GCCGGACGAG TGGGTGGCGC
GGTGGATCGA ACCGGTCGAG TACGAGACAC CCGTGCCGGG GCACCGCCCC GCCTATCTGC
TCCGGCACGG GTTCGACCTC GACGGGCCGC TGGCCCGCGC CCGGTTGTAC GCGACGGCGC
ACGGGTTCTA CGAGTTCTTC GTCAACGGCA TGCCCTGAGC GGCGCCGCCA GCAGCCAGTG
ATGATGTGCC GTACCCGATG GCGAATCCGG ACGACCTCGA GCAGACCGTG CAGGCGGTCG
CGTCGATCGG CGGGCGGATC GTCGCCCGGC AGGCTGACGT CCGTGACGTG TCCGCGCTGC
GGCGGGCCTT CGAGGAGAGC GTCGGCGGGC TGGGCCCGGT CGACATCGTG CTGA
 
Protein sequence
MQETSPGSLP VYRDPARSTA ARVADLIERM SLEEKVAQLR SIWISKYKVV LPDGTFDPAR 
AHVLIPDGIG FVGRPVDAMG MAGFPANWHR SREETIAFVD AVQRYLVEET RLGIPALFHD
ETAHGFVARG ATIFPIPPAL ASTWDEDLVE EVFTVVAREA RSVGSTVSLG PVLDLARDPR
YGRVEEFFGE DPYLVGRMGV AAVRGLQGRS RPLAADRMFA TLKHFLHASP EGGINAAPAP
AHERSLRETY LAPFVDVVRE ANPAFIMPSY NEVGGLPSHA SRDLLQRLGR ALLGFEGVYL
SDYDALARLI SDHRVAAGLG EAAAIGLTAG VDVDLPDGEA FSMLAPLVRE GLVDETLVDE
ALARVLALKF EAGLFEQPYG RLEQAEYNSA EAVRLARNSA TRALTLLTND GILPLDPNAE
IRLAVVGPNA GELYYGGYSG ENDAGVSVLD GLRAAIVGSA ITVEHAEGVR LVGAEEEAAM
PGPGRAPVLP VDDAENRRRI KDAVAVVERA DVVLLVVGDH PAIARETTRP LFPGDRNELG
LYGLQEELVE AVVQVGKPVI ALLVNGRPIA ATRLAAGANA LLEGWYLGQE TGNAVADVLF
GRAEPGGRLP VSVPRASGAV PVYYDRHTSA NLYPYVEVDR TPLFPFGHGL GYTTFDISEP
VLDRSSIHVG ESVGISVEVS NTGNRAGDEV VQLYVRDDVS SVPRPELQLR GFRRITLEPG
RSTTVRFVLE PHQLAFWNID LTERIVEPGT FTISVGRSST QLRSVTLSVA GPGSERARRR
RGRPGGMCRY FLGGRAVRVI RPMSWQREEA RCQLSLLKRS CSPGVRLVSL SAPTRLRVEH
LDEAFGTEVR RPRLSWWLPA GSARQTAHRI STGEWDSGRI ESDRARWWCG GSRCGRISAR
AVGPRPVPGR WGSGRTSGWR GGSNRSSTRH PCRGTAPPIC SGTGSTSTGR WPAPGCTRRR
TGSTSSSSTA CPERRRQQPV MMCRTRWRIR TTSSRPCRRS RRSAGGSSPG RLTSVTCPRC
GGPSRRASAG WARSTSC