Gene Franean1_0740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0740 
Symbol 
ID5669156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp862571 
End bp864355 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content71% 
IMG OID641239667 
Productglycoside hydrolase 15-related 
Protein accessionYP_001505104 
Protein GI158312596 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000780754 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCGCAT CCGACTACCC GCCGATCGAG GAGCACGCGG TCATCGGCGA CCTGCGCACC 
GTGGCCCTGG TCGCCACGGA CGGCACGATC GACTGGTACT GCCCGCAGCG CTTCGACGGT
CCGTCGGTCT TCGCGAGCCT GCTCGACGCC GACCGCGGCG GCTCCTTCCG CATCCACTGC
TCGCAGTCGA AGGCCAAGCA GCTCTACCTG CCCGACTCGA ACGTCCTGAT GACCCGCTTC
CTCACGCCGC ACTCGGTGGG CGAGGTCGTC GACTTCATGG TCCCGGTCGA CAACGACCTG
ACCGAGACGC CGCACGTGGT CGTCCGCCAG GCCCGCGCCG TGCGCGGCAC CGCCGGGTTC
CGGCTGCGCT GCGACCCCCG GTTCGACTAC GGACGGGCGT CCCACACGGT GACGCTGGTG
CCCGGCGCCG GCGCGGTCTT CGAGTCGGCC ACCGGCACCC TTGTGCTGCG CACGGCGCTG
CCGCTGCGGG TCGACGGCAC CGCGGTCGCC TGCGACTTCG AGCTGGCGAC GGGCGAGACG
GCCGACATCG TCCTCGAGTG GGACTCCCGG ATCCGCCCGC TGCAGCCCGG CGAGGCCGAC
AACCTGTTCA CCCGGACGCT GGAGTACTGG CAGACGTGGG TGCGCCGGGG CCGGTACCAC
GGCCGCTGGC GGGAGATGGT GCTGCGCTCG GCGCTGGTAC TCAAGCTGCT GGTCTACCGC
CCGACCGGAG CACTCATCGC GGCGCCGACC ACCTCCCTGC CGGAGGAGCT GGGCGGGGTG
CGCAACTGGG ACTACCGCTA CACCTGGATC CGCGACGCGG CGTTCACCAC CTACGCGCTG
ATGGCCCTCG GGTTCACCGA GGAGGCCGCC GCGTTCATGG ACTGGCTCGA GCAGCGCTGC
AAGGAGGCGC CGCCCGAGCT CGGCCTGGGC GTGCTGTACA GCGTGGACGG CAACGCCGAC
CTGGACGAGA TCGTCCTCGA CCACCTCGCC GGCTACCGGG GCTCGAAGCC GGTGCGCATC
GGCAACGGCG CCGCCGGCCA GCTTCAGATC GACATCTACG GCGAGCTGAT GGACTCGGTG
TACCTGTACA ACAAGCGGGT CCCGATCTCC TTCCAGCTGT GGGAGGCGCT CGGCCGTCAG
CTCGACTGGC TGGCCAGGCA CTGGGAGCAG CCCGACGAGG GCATCTGGGA GACCCGGGGC
GGACGGCAGC GCTTCACCTA CTCGGCGGTC ATGACCTGGG TGGCGTTCGA GCGTGCCTGC
CGGATCTCCC GGCAGCGCGG TCTGCCCGGG CCCACCAACG ACTGGAAGGA CCACGCCGGG
CGGGCCTACC GGTTCGTCCA GAACGAGGCC TGGGATCCCC GGAAGGGGGC CTACATGGAG
TTCCCGGGCT CGCCGCGGCT GGACGCCTCG CTGCTGTGCA TGCCGCTCGT GAAGTTCTCC
GGCCCGACCG ACCCCCGCTT CCTGTCGACG CTGGACCGGG TCGGCTCGGA GCTGGTGAGC
GACAGCCTGG TGCGCAGGTA CGCCGCGGAC GGCAGCGACG GCCTGACCGG GGACGAGGGC
ACGTTCAACC TGTGCTCGTT CTGGTACGTG GAGGCGCTGA CCCGTGCCGG CCGCACGACC
GAGGCCCGGC TGGTCTTCGA GAAGATGCTC ACCTACGCCA ACCACGTGGG GCTCTACGCC
GAGGAGATCG GCCCCTCCGG CGAGGCCCTG GGCAACTTCC CGCAGGCCTT CACCCACCTA
GCCCTGATCA GCGCGGCGAT CCACCTGGAC CGCGCCTTGG GCTGA
 
Protein sequence
MPASDYPPIE EHAVIGDLRT VALVATDGTI DWYCPQRFDG PSVFASLLDA DRGGSFRIHC 
SQSKAKQLYL PDSNVLMTRF LTPHSVGEVV DFMVPVDNDL TETPHVVVRQ ARAVRGTAGF
RLRCDPRFDY GRASHTVTLV PGAGAVFESA TGTLVLRTAL PLRVDGTAVA CDFELATGET
ADIVLEWDSR IRPLQPGEAD NLFTRTLEYW QTWVRRGRYH GRWREMVLRS ALVLKLLVYR
PTGALIAAPT TSLPEELGGV RNWDYRYTWI RDAAFTTYAL MALGFTEEAA AFMDWLEQRC
KEAPPELGLG VLYSVDGNAD LDEIVLDHLA GYRGSKPVRI GNGAAGQLQI DIYGELMDSV
YLYNKRVPIS FQLWEALGRQ LDWLARHWEQ PDEGIWETRG GRQRFTYSAV MTWVAFERAC
RISRQRGLPG PTNDWKDHAG RAYRFVQNEA WDPRKGAYME FPGSPRLDAS LLCMPLVKFS
GPTDPRFLST LDRVGSELVS DSLVRRYAAD GSDGLTGDEG TFNLCSFWYV EALTRAGRTT
EARLVFEKML TYANHVGLYA EEIGPSGEAL GNFPQAFTHL ALISAAIHLD RALG