Gene Franean1_3192 details

Gene Information

Locus tag: Franean1_3192
Symbol:
ID: 5671568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3766259
End bp: 3768694
Gene Length: 2436 bp
Protein Length: 811 aa
Translation table: 11
GC content: 72%
IMG OID: 641242086
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001507506
Protein GI: 158314998
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
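
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinate range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(3766259, 3768694)
print(gene_len, protein_len)  # 2436 811
```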


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTACG ACGCCACGCT GACACCTGAA GCCCGGGCGG ACGCGCTTCT CCGGACGATG 
ACGGTGGAGG AGAAGGCGCA GCAGATCACG GGTCTGATGC CGGTCGGGCT GCTCGGCGTG
GACGGCCTGG TCCAGACGGA GGCCGAACGC CGCCTGGGCA CCGGGATCGG CCACATCGCG
CCGCTCGGGA TGCTCAGCCA CCGGACGCCC GCGAACCTGG CGAAGGCTGT CAACGAGATC
CAGCGTTTCC TCGTCACCCG GACCCGCCTG GGCATTCCCG CGCTCTTCCA CGTGGAGGCG
CTGAACGGTG TGGTGTCCCC TGGTCTGACC ACGTTCCCGA CCGCGATCGG GCTCGCCGCC
ACGTGGAACC CCGCCGGGGT GACGGAGATG GCGGACGTCC TGCGCCGGCA GGCACGGGCC
ATCGGTCACC CGCTCGTCCT GTCCCCGGTG ATGGACGTGG CCCGCGACGC CCGCTGGGGC
CGGGTGCACG AGACCTACGG CGAGGACCCG TACCTGGTGT CCGCGATGAG CGTCGCGTTC
GTCCGCGGCA TCCAGGGCGA CGACCCGCGC GAGGGCGTCA TCGCGACCGG GAAGCACTTC
CTCGGCTATG CCCTCACCGA GGCCGGGCAG AACATGGCCC GCACCGCCGT CGGGGCCCGG
GAGCTGTACG AGGTGTACGC CCGCCCGTTC GAGGCCGCCA TCAAACTGGC CGGCCTCGCG
GCGGTCATGA ACTCCTACAG CACCGTCGAC GGCGTCCCGG TCGGTGCCAA CCGCGAGATC
CTCACCGGCC TGCTGCGGGA CCGCCTCGGC TTCGCCGGCA CCGTCGTCTC CGACTACGAG
ACCATCCGCC ACCTGTACAA GCGGCTCGGT GTGGCGCGGG ACGCCGAGGA GGCCGGCCGG
CTCGCCCTCG CCGCCGGGCT CGACGTCGAG CTTCCCGTCG CCGACGGCTA CGGCCCCACG
CTCGCGCGGG CGGTCCACGC CGGCACGGTT CCCGTCGATC AGCTGGACCA GGCCGTGTGG
CGGGTCCTGC GGGACAAGTT CGCGCTGGGC CTGTTCGACC GGCCCTACGC CGACGAGGAT
CCCGTCGTCG TCAACGAGGT CGCCCGCCAG GGTGTCGACC TCTCCTACCG GCTCGCCCAG
CAGTCCGTGA CCCTGCTGGC GAACGACGGC ACCCTGCCGC TGTCCCGGGA CCTGCGCCGG
ATCGCGGTCG TCGGCCCGCA CGCCGACGGG ATCTCCTTCG CGTTCCCGCC CTACACCTAC
CCCGCCGCGC TGGAGATGCT CCGGGCCCGG TTCACCGGCG AGCGGGCCCA CATACCCGGC
ACCGAGAACA TGGCCGGTGA CATCACCCCC GAGGCGGCCG CGCTGATGCG CCAGGAGCTC
GCCGGCCCCA TCGGGACACC GATCGACGAC TACATCCGCG ACGCCTACGG CGCGCTGTCC
CTCGCCGACG CCGTCCGGCG GGCCGTCCCC GGCGCGCAGG TGACCGTGGC CACCGGCTGC
GGGGTCCTCG ACGAGGAGCC GGCCGACATC CCGGCCGCGG TCGCCGCCGC CGCGGGCGCC
GACGTGGTGA TCCTCGCCCT CGGCGGGCGG GCCGGCTGGT TCACTCCCCG GATCACCGAG
GGCGAGGGCT GCGACACCGC CGACATCGAC CTTCCCGCGA ACCAGATCGC GCTCGTCCAG
GCCGTCGCAG GCACCGGGAC TCCCTGCGTG GGCATTGTGT ACACCGGCCG GCCGATGGCC
CTGACCCCGA TCGTCGGGCT GCTCCCGGCG CTGCTCTACG GCTACTACGG CGGCCAGCAC
GCCGCCACCG CCATGGCCGA CGTGCTGTTC GGCAGCGTCA ACCCGGCCGG CAGGCTCCCG
ATCTCCATCC CGCGGCACTC CGGGCAGGTG CCCGTCTACT CCGGCCAGCC CACCGGCACC
GGCTACCGGC GCACCGACCA GGACATGCAC CTGGGCTATC TCGACATGCC GTCAGGTCCC
CTGTTCCCGT TCGGCCACGG GCTGAGCTAC ACCACCTTCG ACTACACCGA CCTCACGGTC
AGCTTCCCCG AGGTCGACAG CGAGGGCGCC GTCACCGTCG GGCTGACCGT CCGCAACACC
GGCGCACGGG CCGGCGACGA GGTCGTGCAG CTCTACTTCT CCGACCAGGC CACCGGGGTC
ACCCGGCCGG CCCAGGAGCT GGTCGGCTTC ACCCGCCTCA GCCTGGACGC CGGCGCGGCC
GCGACCGTGG CGTTCACCGT GCCCATGAGC CAGCTCGGCT ACGTCGCCCT CGACGGCGGC
TTCGTTCTCG AACCCGGCCC CATCCAGATT CTCGCGGGCA GCTCGTCGGA CGACATCCGC
CTGCGCGGCA GCTTCGACGT CGCCGGAAAA GTCGCCGAAC TGGACAGCCG CCGTTCCTTC
CTCTCGGACG TCACCGTCAG CGACACACGC CCTTGA
 
Protein sequence
MTYDATLTPE ARADALLRTM TVEEKAQQIT GLMPVGLLGV DGLVQTEAER RLGTGIGHIA 
PLGMLSHRTP ANLAKAVNEI QRFLVTRTRL GIPALFHVEA LNGVVSPGLT TFPTAIGLAA
TWNPAGVTEM ADVLRRQARA IGHPLVLSPV MDVARDARWG RVHETYGEDP YLVSAMSVAF
VRGIQGDDPR EGVIATGKHF LGYALTEAGQ NMARTAVGAR ELYEVYARPF EAAIKLAGLA
AVMNSYSTVD GVPVGANREI LTGLLRDRLG FAGTVVSDYE TIRHLYKRLG VARDAEEAGR
LALAAGLDVE LPVADGYGPT LARAVHAGTV PVDQLDQAVW RVLRDKFALG LFDRPYADED
PVVVNEVARQ GVDLSYRLAQ QSVTLLANDG TLPLSRDLRR IAVVGPHADG ISFAFPPYTY
PAALEMLRAR FTGERAHIPG TENMAGDITP EAAALMRQEL AGPIGTPIDD YIRDAYGALS
LADAVRRAVP GAQVTVATGC GVLDEEPADI PAAVAAAAGA DVVILALGGR AGWFTPRITE
GEGCDTADID LPANQIALVQ AVAGTGTPCV GIVYTGRPMA LTPIVGLLPA LLYGYYGGQH
AATAMADVLF GSVNPAGRLP ISIPRHSGQV PVYSGQPTGT GYRRTDQDMH LGYLDMPSGP
LFPFGHGLSY TTFDYTDLTV SFPEVDSEGA VTVGLTVRNT GARAGDEVVQ LYFSDQATGV
TRPAQELVGF TRLSLDAGAA ATVAFTVPMS QLGYVALDGG FVLEPGPIQI LAGSSSDDIR
LRGSFDVAGK VAELDSRRSF LSDVTVSDTR P
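
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal sketch. It assumes the sequences are pasted as plain strings in the space-separated blocks of ten shown above, and it uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11's alternative start codons are not handled, which does not matter here because this gene begins with ATG. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGACGTACG ACGCCACGCT GACACCTGAA"))  # MTYDATLTPE
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare against the protein sequence and the GC content field of this record.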