Gene Franean1_5276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5276 
Symbol 
ID5675758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6347184 
End bp6349154 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content76% 
IMG OID641244133 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001509540 
Protein GI158317032 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.337424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0935603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCATGC GGCGTGTGCG GGTCGTCGGG GTGGCGCTCG CGGTCGTCCT GGCGACGCTG 
GCCGGTGTCA CGGCGTTCGC CTGGGCGGTC CGCGGCACCG ACGCGGGCGG CTCGATGGCG
GACCGGCTCA GCCTCGCCGA CGGCGACACC GCAGGCGGCT CCGCCGGGGC GGGGGACCCG
GGCGGCGGCG CGGCGGGCGC GGAATCCGGG CGCGGCACCG CGGAGCACCC CGCCGACGGC
CCGGAAACCG GCGCCGACGA GCTGTCGGCC GCCGACCCGG GCGGCGCGGG GAGCGGCCCC
GGCGCCGGTT CGGCCGAGGA GAACAGCTGG GTCGAGCCCA CGCTGGCCGG CCTGTCGCTG
GAGCAGCGCG TAGGCCAGAT GATGATGGGA TACGTCTTCG GTACCGCGGG CGCCGACCGG
AGCCCGGCCG TTGTCACCGC GAACCGGCGG ACGTCCGGTG TGGACACGGC CGCCGAAGCC
GTCGCGAGAC GGGGCCTGGG CGGCGTGATC TACTTCGACG CCGGCGGGAC GGGCCCGGGG
GCGCTCCCGG ACAACATCGT CAACCCGAAC CAGGTCAAGA CGCTGTCCGC GGACCTGAGC
GCGGCGGCCA GCATCCCGCT GCTGATCGCC GCGGACCAGG AGCAGGGAAC GGTGCTGCGC
GTCCGGGACG GCGTGACCCT GCTGCCCGGC CAGATGGCAC AGGGCGCGAC GGGACGTCCC
ACCGACGCGC GGGACGCCGC GCAGATCACC GGCGCGGACC TGCGCGCCCT GGGCATCAAC
GTCGACTTCG CCCCGGACGC AGACGTCAAC AGCGACCCGG CGAACCCCGT GATCGGTGAG
CGCTCCTTCG GCGACGACCC TACGGCGGTC GGGCGGTTCA CCGCGGCGGC GGTCGAGGGA
TACCGGCAGG TCGGGGTGGC CGCGGCCGCG AAACACTTTC CCGGGCACGG CGCGACGTCC
GTCGACAGCC ACGCCGACCT GCCGACGATC ACCAGAGACC GGGCGGCGCT GACGGCGCTC
GACCTGCCGC CGTTCCGGGC GGCGATCGCC GCGGGCGTAC CGATGGTCAT GGTCGGTCAT
CTGAACGTCC CCGCGCTCGA CCCGGCCGCG CCGGCGACGC TGTCGAAGCC GGTGGTCGAC
GGCCTGCTGC GCCACGAGCT CGGCTTCGAC GGCGTCATCG TCACCGACGC GCTGAACATG
GCCGCGATCA CCGAGCACAA CACGCCCGGC GGCGCGGCGG TGCGAGCCGT CCAGGCCGGC
GTCGACATGC TGCTGATGCC GCCGGACCTG ACGCAGGCGC TTGATGCTGT GGTTTCCGCT
GTGCGCTCCG GGGCGATCGT TCCGGAGCGG ATCGACGCCT CGGTACGTCG AATCCTGAGG
ATGAAGTGGA GGCTGGCGCA CACCGAGCCC GCCGCGGCCC GAACCCCCGA GGAGGCGGCG
GCCACCGCGG CCGCGATCGC CGAGCGGGCG ATCACGCTCC TTGACCAGCC GACGTGTGAC
CTGCTCCCGC TCAGCCGCGG CACCGCCGGC GCCGGTGCGG GCGGGACGCA GCCGGCGGTG
GAGGTCTCCG GGCCGTCGGG CGCCGCGAAG ATGCTCGTGG ACGCACTGGG TGCCCGGGGG
ATCGGCGCCC GGTTGGCCGC CCAGGGCTCC GGCCGCCCGC CAGCGCCGCC CGCAGCGGGC
GCCTCGGGAG CGGGTACGGC TGTGGTCCGG GTGGTGCTGG TCGGGAACTC GCCGCCGCCC
GTCACCGACC GGCGGACCGT GGTCGTGTCG ACCGGGACGC CCTACCGTCC GCCGGTGGCC
GCCGGGGCGT GGCTGGCGAG CTATTCACGC GACCCAGCGT CGATGAAGGC GCTGGCGGCG
GTGCTGGCCG GCGCGGTGCC ACCCTCCGGC CGGCTGCCGG TCGTCACCCG AACCGCGACC
GGCACCGCGT TGCCGCGCGG CGCAGGCCTA CCGACCCCGC GCGCCTGCTG A
 
Protein sequence
MVMRRVRVVG VALAVVLATL AGVTAFAWAV RGTDAGGSMA DRLSLADGDT AGGSAGAGDP 
GGGAAGAESG RGTAEHPADG PETGADELSA ADPGGAGSGP GAGSAEENSW VEPTLAGLSL
EQRVGQMMMG YVFGTAGADR SPAVVTANRR TSGVDTAAEA VARRGLGGVI YFDAGGTGPG
ALPDNIVNPN QVKTLSADLS AAASIPLLIA ADQEQGTVLR VRDGVTLLPG QMAQGATGRP
TDARDAAQIT GADLRALGIN VDFAPDADVN SDPANPVIGE RSFGDDPTAV GRFTAAAVEG
YRQVGVAAAA KHFPGHGATS VDSHADLPTI TRDRAALTAL DLPPFRAAIA AGVPMVMVGH
LNVPALDPAA PATLSKPVVD GLLRHELGFD GVIVTDALNM AAITEHNTPG GAAVRAVQAG
VDMLLMPPDL TQALDAVVSA VRSGAIVPER IDASVRRILR MKWRLAHTEP AAARTPEEAA
ATAAAIAERA ITLLDQPTCD LLPLSRGTAG AGAGGTQPAV EVSGPSGAAK MLVDALGARG
IGARLAAQGS GRPPAPPAAG ASGAGTAVVR VVLVGNSPPP VTDRRTVVVS TGTPYRPPVA
AGAWLASYSR DPASMKALAA VLAGAVPPSG RLPVVTRTAT GTALPRGAGL PTPRAC