Gene Franean1_4660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4660 
Symbol 
ID5673003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5561378 
End bp5562619 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content72% 
IMG OID641243518 
Productglycoside hydrolase family protein 
Protein accessionYP_001508934 
Protein GI158316426 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.503774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAT TCCCCGAAGG CTTCCTCTGG GGTGCGTCGA CCGCGGCGCA CCAGGTGGAG 
GGCGGCAACG TCAACTCCGA CATGTGGCAC AGCGAATGGG CGGAGAACTC GACGTTCGCC
GAGCCGTCGG GAGACGCCTG CGACCACTAC CACCGGTACC CCGAGGACAT CGCGACCCTG
GCCGGCCTCG GCCTGAACGC CTACCGGTTC GGGGTCGAGT GGGCACGGAT CGAACCGGAG
GAGGGCTATT TCTCCCGGGC CGCCCTCGAC CACTACCGCC GGATGGTCGG CAGCTGCCTC
GAGCACGGCG TCACCCCGGT CGTGACCTAC AGCCACTTCT CGACACCGCG GTGGTTCGCC
GACGCGGGAG GATGGGGCGA CCCGGCGGCG GCGGACCGGT TCGCCCGGTA CGCGGGCCGG
GTGACCGAGC ACATCGGTGA CCTCGTGCCC TGGGTGTGCA CGTTCAACGA GCCGAACGTC
ATCTCCCTGA TGGTGCATCT CGGTGTCATC CCGGCCGCGT CCCGCGACGA GGCCCTCGGC
CTGCCGACCG GCGACGAACG CCAGGATCCC GGCGGCGGGG CAGGGGCGGG CGGGGCACGG
TCGGGCGCGG CGTGGGCCGC CCCGAGCGTC GAGGTGATGG CGACCGCGCA CCGCAAGGCC
GTGGAGGCCA TCAAGTCCGG CCCGGGGAAC CCCGCCGTCG GCTGGACGCT GGCCCTCATC
GACCTCCAGC CCGCCGACGG CGGTGAGCAA CGCTGGCAGG CGGTACGCCA GGCGGCCCTG
CTCGACTGGC TCGACGTCTC CCGCGACGAC GACTTCGTCG GCGTCCAGAC CTACACCCGG
GAACGCGTCG GACCCGACGG TGTCCTGCCC GTTCCCACCG GAGCCCCCAC CACGCAGACC
GGCTGGGAGA TCTACCCGCA GGCGCTGGGC CACACCGTCC GCCTCGCCGC CGAACACGCC
GGTGTGCCGA TCCTGGTCAC CGAGAACGGC ATGGCCACCG ACGACGACGA CGCCCGGATC
GCCTACACCA CCGCCGCCCT CGACGGACTG GCCGGTGCCA TCGCCGACGG TGTCGACGTC
CGCGGGTACC TGCACTGGAC GCTGCTCGAC AACTTCGAGT GGACGTCCGG CTACCAGATG
ACCTTCGGGC TCGTCGCCGT CGACCGCACC ACCTTCGCCC GCACCGTCAA ACCCTCCGCC
CGCTGGCTCG GCAAGGTCGC CCGCGCCGGC GGACTCACCT GA
 
Protein sequence
MSTFPEGFLW GASTAAHQVE GGNVNSDMWH SEWAENSTFA EPSGDACDHY HRYPEDIATL 
AGLGLNAYRF GVEWARIEPE EGYFSRAALD HYRRMVGSCL EHGVTPVVTY SHFSTPRWFA
DAGGWGDPAA ADRFARYAGR VTEHIGDLVP WVCTFNEPNV ISLMVHLGVI PAASRDEALG
LPTGDERQDP GGGAGAGGAR SGAAWAAPSV EVMATAHRKA VEAIKSGPGN PAVGWTLALI
DLQPADGGEQ RWQAVRQAAL LDWLDVSRDD DFVGVQTYTR ERVGPDGVLP VPTGAPTTQT
GWEIYPQALG HTVRLAAEHA GVPILVTENG MATDDDDARI AYTTAALDGL AGAIADGVDV
RGYLHWTLLD NFEWTSGYQM TFGLVAVDRT TFARTVKPSA RWLGKVARAG GLT