Gene Franean1_3254 details


Gene Information

Locus tag: Franean1_3254
Symbol:
ID: 5671628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3850564
End bp: 3851907
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 72%
IMG OID: 641242146
Product: glycoside hydrolase family protein
Protein accession: YP_001507566
Protein GI: 158315058
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
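
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 3850564
END_BP = 3851907
GENE_LENGTH_BP = 1344
PROTEIN_LENGTH_AA = 447


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the record if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3851907 - 3850564 + 1 gives 1344 bp, and 1344 / 3 - 1 gives 447 aa, matching the reported lengths.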


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCAACG CCCGCGTCCC ACTCCACCGC GACTCCTCTC GACAGTTCCC CGCCGAGCCG 
GGGGCGCCCA CGGCGGCGCC GGCGACCACG GCTGCCTCAA AGGCCCCGAC GGACTCGTCG
AGGGCGGTGT TTCCCGATGG CTTTCTCTGG GGGGCGGCCA CCGCCCCGCA TCAGGTGGAG
GGCGGTAACG TCGGCTCGGA GATGTGGCGC TCCGAGTGGA TGCCGAACTC GACGTTCGCC
GAGCCGTCCG GGGACGCCTG CGACCACTAC CACCGGTATC CGCAGGACAT CGCCACTCTG
GCGGGGCTGG GCCTGAACGC CTACCGGTTC GGGGTCGAGT GGGCGAGGGT CGAGCCCGAG
GAGGGGTACT TCTCCCGCGC CGCGCTCGAC CACTACCGGC GCATGGTGGC CACCTGCCTT
GAGCACGGCG TGACACCGGT GGTGACCTAC AGTCATTTCT CGTTGCCCCG GTGGTTCGCC
GCGGCCGGCG GGTGGAGCAA CCCGGCGGCC CCGGACCAGT TCGCCCGGTA CGCGGCCCGG
TTGACCGCGC ACATCGGCGA TCTGGTGCCC TGGGTGTGCA CCCTCAACGA GTCGAACGTC
ATCTCGTTGT TGCTGCACCT GCGGGTCGCG CCGGCTGCCG CCCGCGAGGA CGGTCTCGGG
CTGGCGGAGG CCCTCAGGGC CCCGGCGCCC GCCGGGACAC CGAAACGCGG CGGTTGGCCG
CCCCCGGACG TCGAGATCAT GGCCAAGGTG CACCGCAGGG CGGTGGAGGC GATCAAATCC
GGTCCCGGCA ACCCGGCGGT GGGCTGGACG CTGGCCCTGA TCGACATCCA GGCGGCCGAA
GGCGGCGAGC AGCGTCAGCT GGCGGTGCGC CAGGCGGCCG AGCTCGACTG GCTCGAGGTG
TCCCGGGACG ACGACTTCGT GGGTGTACAG ACCTACACGC GGGAACGAGT GGGGTCTGAA
AAGGTGCTCC CGCCGCCGGA GGGCGCGGCC ACGACGCAGA CGGGCTGGGA GGTGTACCCG
CCCGCGCTCG GGCACACGGT CCGGCTCGCC GCCGAACACG CCAGGGTCCC GATCCTGGTC
ACCGAGAACG GCATGGCCAC CGATGACGAC GACGCCCGCG TCGCTTACAC CCGCGCCGCC
CTGCATGGCC TGGCCGCTGC CGTCGCCGAC GGCGTCGACG TGCGCGGCTA CCTGCACTGG
ACGCTGCTCG ACAACTTCGA GTGGACGTCC GGCTTCGCGA TGACCTTCGG CCTGATCGCG
GTCGACCGGA CGAACTTCGC GCGGGCGGTG AAGCCGTCGG CGCGCTGGCT CGGCGCGGTC
GCGCGCGCCA ACGGACTCGT CTGA
 
Protein sequence
MPNARVPLHR DSSRQFPAEP GAPTAAPATT AASKAPTDSS RAVFPDGFLW GAATAPHQVE 
GGNVGSEMWR SEWMPNSTFA EPSGDACDHY HRYPQDIATL AGLGLNAYRF GVEWARVEPE
EGYFSRAALD HYRRMVATCL EHGVTPVVTY SHFSLPRWFA AAGGWSNPAA PDQFARYAAR
LTAHIGDLVP WVCTLNESNV ISLLLHLRVA PAAAREDGLG LAEALRAPAP AGTPKRGGWP
PPDVEIMAKV HRRAVEAIKS GPGNPAVGWT LALIDIQAAE GGEQRQLAVR QAAELDWLEV
SRDDDFVGVQ TYTRERVGSE KVLPPPEGAA TTQTGWEVYP PALGHTVRLA AEHARVPILV
TENGMATDDD DARVAYTRAA LHGLAAAVAD GVDVRGYLHW TLLDNFEWTS GFAMTFGLIA
VDRTNFARAV KPSARWLGAV ARANGLV
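
The GC content field in the record can be recomputed from the gene sequence, which is printed in blocks of ten bases separated by spaces. The following is a minimal sketch, assuming the sequence is pasted in as a plain string; `gc_fraction` is a hypothetical helper name, not a function of the source database.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # Last line of the gene sequence as printed in the record; paste the
    # full sequence here to compare against the reported GC content.
    last_line = "GCGCGCGCCA ACGGACTCGT CTGA"
    print(f"{gc_fraction(last_line):.2%}")
```

Splitting on whitespace before counting means the spaced, multi-line layout used in the record can be passed in unchanged.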