Gene Franean1_0435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0435 
Symbol 
ID5668858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp514801 
End bp517170 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content72% 
IMG OID641239367 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001504806 
Protein GI158312298 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCCA CCTGGCGCGA CGAGCACCTG GAGGCCGGCC ACCGGGCCGA GCTGCTGCTG 
GCCGAGATGA CCACCGCCGA GAAATGCCAT CAGCTCACCG CCCGGATGGC GTGGTCGCTG
GTGAAGGCCG ACGGCACCGA CGCCGACGGC GCCGAGGACG TGCTGCGCCA TCCGCCCGGG
CATGTGGCGC AGCTGATCCT CGACGACCCG GCGCAGTTGG CCGCGATGGT CGGCACGATC
CAGAAACAGG TGCTGTCCCG TAGCCGGCTC GGAATTCCGG TGCTCTTCCA CGCCGAGGCG
CTCAGCGGCT TCCTCAACGG TGGGCACATG GTGTTCCCCT CCGGCACAGG GCTCGCCGCC
GGATGGAGCC CCGACCTCGT CGAAGAGATG ACCGACCTGA TCCGGCGGCA GATGCGCCGC
ACCGGTCTGA TGCACGCGCT GTCACCGGTG CTCGACGTGG CCCTCGACCC GCGGTGGGGA
CGCGTGCACG AGACGTTCGG TGAGGACCCG TACCTCTGTG CGGCCTTGGG GGTGGCCTAT
GTGCGCGGCC TGCAGGGGAC CGACGTCTCG CAGGGCGTGA TGGCCACCGG CAAGCACTTC
CTGGGATACG CGGCGCCGCC GGGCGGCCTG AACCTGTCGG CCTTCGAGTC CGGCGCCCGG
CGCACGCGTG ACCTGTTCGC CTATCCGTTC GAGGCGGCGA TTCGCCTGGC CGGGCTGCGG
TCGGTGATGA ACTCGTACGC CGACGTCGAC GGCGTACCCG CCGGGGCCTC GCCGGAGGTG
CTGCAGGACC TGCTGCGCGA CACGCTCGGC TTCGACGGCT TCGTCTCCTC CGACTACACC
ACCCTCGAGC AGCTCGTCGA CCGGCAGCGG ATCGCGAAGG ACGCGGCCGA GGCCGGCCGC
CTGGCCATCC GGGCCGGGCT CGACGTCGAG ATGCCCAATC CGTACGGGTA CGGCGACGTG
CTCGCCGCCG AGGTCGAGCG CGGGGTCGTC GACGTGCGGC ACGTCGACAA CTGCGTCCGC
CGGGTCCTGC GGGCGAAGTT CGAGGTCGGG CTCTTCGAGC ACCCGTACCC GGTGGAGAGC
ATCGACGTGG CCGCGGCGGC CCGCGAGGGC GCCGACCTGT CGTTGGAGCT GGCCCGGCGC
TCGATCGTGC TCGGCCAGAA CAACGGCATC CTGCCCCTGA CCCCCGGCGG GCTCGACATC
GCCGTCATCG GGCCGCACGC CGACGCGCCG ACGTTCCAGT TCCCGACCTA TACGTACGTG
TCGTGGCGCG AGGCCAACGA GGCCGTGATG CGCGGCGAGC TCGGCACCAT GAACGGCGCC
GAGGAGACCG TGTCGGCCTG GTATTCGGCC CTGTTCGGAG CGTGGGACGG CGACAGCGTG
GCTCCCTCGC TGGCGGCTGA GATCGGTGAT CACGCCCGCA CGGTCCGGGT CGAGCGCGGC
TGCACCCTGA CCGAGGATCT AGGTTCTCTC GACCCGGCGG TCGAGGCGGC GCGGGCGGCG
GACGTCGTCG TGCTGGCGCT GGGCGGGGCG AGCCTGTGGT TCACGGGGGA GCGGACCGAG
GGGGAGGCCA GCGACACCGC GGACATCACC CTGCCGGCGG CGCAGGTGCG GCTGGCCGAG
GCAGTGGCGG CCACCGGCAC CCCGCTGGCG GTCGTGCTCG TGCAGGGCCG GGCCTACGCG
CTGCCGAAGG TCATCCAGGA CGCCGCGGCC ATCGTCGTCG CCCCGTACGC CGGACCGTTC
GGCACCCAGG CGATCGCCGA CGTGCTCTTC GGCGTGGTCA ACCCCTGCGG CAAGATGCCG
TACAGCCTGC CCCGGCACAG CGGCCAGATC CCGGTCTACC ACCACCAGAA GGCCGGGTCG
GGCTACCGCA ATCCGTTGCC GCCCAGCGTC TCCCGCCACT ATCTCGACCT GGAGGCGACG
CCGTTGTGGC CGTTCGCCCA TGGGCTGAGC TACACCACGT TCGCGCTGAG CGACCTCGAC
TGCGGTGCCG ACATCGACGC GTACGGCACG GCCCACGTCA GCGCCACCGT CGCCAACACG
GGGCCACGGG ACGGGGCCAC GGTGATGCAG CTCTACCTGC GGGTCAACAC CTCACCGGTG
ACCCGGCCCG CCCAGCAGCT CGCCGGCTTC ACCCGGGTCG AGCTGGCCGC CGGCGAGAGC
CGCCGCGTCA CGTTCGAGAT CGACGCCTCG CAACTGGCGT ACACCAGTCT CGCCCGCCAG
GTGGCGGTCG AGCCGGCCCT GGTGGACGTC TTCGTCGGCC TCGACTCCGA CGACCGGTCG
CTGGCGGGCT CGTTCGCCGT GGTGGGGTCC GCGCGGGTGG TCGAGGGCGC CGAGCGGTCC
TTCCTCTCGC GAGCACGCCC GCACCCATAG
 
Protein sequence
MSPTWRDEHL EAGHRAELLL AEMTTAEKCH QLTARMAWSL VKADGTDADG AEDVLRHPPG 
HVAQLILDDP AQLAAMVGTI QKQVLSRSRL GIPVLFHAEA LSGFLNGGHM VFPSGTGLAA
GWSPDLVEEM TDLIRRQMRR TGLMHALSPV LDVALDPRWG RVHETFGEDP YLCAALGVAY
VRGLQGTDVS QGVMATGKHF LGYAAPPGGL NLSAFESGAR RTRDLFAYPF EAAIRLAGLR
SVMNSYADVD GVPAGASPEV LQDLLRDTLG FDGFVSSDYT TLEQLVDRQR IAKDAAEAGR
LAIRAGLDVE MPNPYGYGDV LAAEVERGVV DVRHVDNCVR RVLRAKFEVG LFEHPYPVES
IDVAAAAREG ADLSLELARR SIVLGQNNGI LPLTPGGLDI AVIGPHADAP TFQFPTYTYV
SWREANEAVM RGELGTMNGA EETVSAWYSA LFGAWDGDSV APSLAAEIGD HARTVRVERG
CTLTEDLGSL DPAVEAARAA DVVVLALGGA SLWFTGERTE GEASDTADIT LPAAQVRLAE
AVAATGTPLA VVLVQGRAYA LPKVIQDAAA IVVAPYAGPF GTQAIADVLF GVVNPCGKMP
YSLPRHSGQI PVYHHQKAGS GYRNPLPPSV SRHYLDLEAT PLWPFAHGLS YTTFALSDLD
CGADIDAYGT AHVSATVANT GPRDGATVMQ LYLRVNTSPV TRPAQQLAGF TRVELAAGES
RRVTFEIDAS QLAYTSLARQ VAVEPALVDV FVGLDSDDRS LAGSFAVVGS ARVVEGAERS
FLSRARPHP