Gene Franean1_7047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7047 
Symbol 
ID5675358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8598625 
End bp8601081 
Gene Length2457 bp 
Protein Length818 aa 
Translation table11 
GC content65% 
IMG OID641245893 
Productglycosyl hydrolase family 32 protein 
Protein accessionYP_001511284 
Protein GI158318776 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.545796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATAC TTGTCAGACT CATGCGTAGC AGCCTCACGC GGGTCAGGTT CACCGCGGTG 
ACGGTCGCTC TCGCGGCGGT GTTCACCTCC CTTCTGCCGG CCGTGCCCGC CCTGGCCGGA
CAGGTCAGCG ACTACCCCGA GTTCCCGTAC CCGACGACAG ACTACACCGA GCCCTTACGC
GGCCAGTTCC ACTTCAGCTC GCGCGGCGGC TGGATGAACG ACATCAACGC CCCGCTGTAC
CACAATGGCC TCTACCACGT CTTCTATCAG CACAATCCGC ACAGCCTCCT CTGGGAGACC
ATGCACTGGG GACATGCCAC CAGCCCCGAC CTGGTGCACT GGACGCAGAA GCCGATCGCG
CTGGAACCGG GTGTGCATCC GCATGACCTG TGGTCCGGGG CCGGGGTGGT CGACACCAAC
AACACCTCGG GTCTGCAGAC CGGGAGCGAG GCACCGATCC TCGTGTTCAC CGCCACCAAC
GGCGTGAGCA TCAACTACAG CAACGACGCC GCGAAAACGT TCCAGATCTA CAACCAAGGT
CAGAAGGTCG TCACACCGGC CGGCATCAGT CGTGATCCCA AGGTGTTCTG GCATGCGCCT
TCCAACCGGT GGGTGATGGT GGTCTGGTCC GACGCCGGGG GGAACGGCGT CAACATCTAC
ACCTCACCCA ACCTGTTGAC CTGGACGTTT CGCAGCCGGT ATGCCGCCGA CTGGCTGTAC
GAATGTCCGG ACCTGTTCTC CCTGGCCGTC GACGGCGACC CGGGGAACAC GAGGTGGGTC
ATGACCGACG CCGGCGGCGA GTACGTCATC GGTTCCTTCG ACGGTGTCAC CTTCACCCCG
GAGTGGACGT CGCCGCAACG GATGGACCAG GGGCACAACA CCTTCGAGGG CACCTTCTAT
GCCGGGCTGA CCTTCAACCA CATGCCGGAC AACCGGATCG TGCAGATGGC CTGGATGAGA
TCGAACCAGG GCAGCGTCTG GACCGGCAAT GCCTCCTTCC CCGCGGAACT GGGCCTGCGC
GCCTATCCCG AGGGGATACG CCTGACCCGC AATCCCGTCG GCGAGATCGC GTCCCTGCGC
GTCGATTCTC AATCATGGGT AAACCGCGAC ATCACCCCCG ATCCGGCCAG CGATCCCCTC
ACCAGCACCT TCGCCGACAC CTACGAGATC ATCGCCGAGT TCGACACGGC CACCGCCACA
GCGTCACGGT TCGGCTTCCG ATTACACACC CGCAGTGACG GAACCTTCGA CCGTGCCGTC
ACCTACGACC GGACTGCGCA GACGCTCTAC GGCGCACCGC TGGCGCCGAT CAACGGACGG
GTCAGGATGC GGCTACTGGT GGACCGCGGG CAACTGGAGA TCTTCGGCAA CGACGGCAAG
CTGTCCTGGA CCGACAACGT CAACTTCAAC TCGGCACCGT CGAGCCAGGG TGTGCAGCTG
TATGCCGAAG GCGGCAACGT CCAGCTGGTG TCGCTCCAGT TCCACCGGCT GCAGTCAGCG
TGGGGTTCTG GGGAGTCCAC CCTGGAGAGC AACCTGGCCG GCCCCTGGCA CCCGGCCGGC
GGGACGTGGG TCGACACCAC CACGGGCAAG CAGGGCACCG CGGGTGGGGA CGGTTTCTAC
CTGAGCAACC AGACCGGAGC CGACTTCACC TACGAGGGTG ATCTCCGCCT CGACACCGCC
GTGGCAGCCG GGATCACCTT CCGGGCCAAC AGCGACGCCA CCCAGCAGTA CACCGCCAAC
GTCGACGCCA ACGGACTGGT GAAACTGTGG CGCCCCGGCC GGGACATCGG GATCTTCTAC
ACCCCGATCT CCCAAGGCCG CACCTACCAC CTGAAGGTGG TGACCAGCGG CTCCATCATC
AGGGTCTATC TGGACCACCG CCCCACCCCC GTGATCGACG CCGTCGACAC CGCCTACACC
AGCGGGTACT TCGGGACCAA CGTCTTCGGC GGGACCGGCG TCGTGCAGAA TGCCAACGTC
AACGCCACCG GGTTCGTCTC CAACCTGGGA GCAACCTGGC GGCCGGCGAC CGGGCTGTGG
ACCGTCCCCG GTGCCGGTGT CAAGGGTCGG GTCGCCGGGG ACGGCTTCTA CCTCAGCGAC
CAGACCGGGA CCAACTTCAC CTATGAGGGT GACGTCAAGG TGATCAACGG GGTCGCCGCC
GCGCTGACCT TCCGGTCGAA CGCCGACGCG ACCGGGCACT ACACCGCCAA CGTCGACACC
AACGGCCTGG TGAAACTGTG GCGCCCCGGC TCGGTGATAG GCGTCTTCAA CACACCGATC
GTCGAAGGCC GGACGTACCA CCTGAAGGTG GTGGCCAACG GTCCCAACAT CAGAGTCTAC
TTCGACGGAG GAGCGACGCC GGTCATAGAC GCCGTTGACA GCACTTACAG CAGTGGGTTC
TTCGGTGTCA ACGTCTTCAG CGGGGTCGGC GTGATCCAGA ACGTCGTAAC AAGCTGA
 
Protein sequence
MSILVRLMRS SLTRVRFTAV TVALAAVFTS LLPAVPALAG QVSDYPEFPY PTTDYTEPLR 
GQFHFSSRGG WMNDINAPLY HNGLYHVFYQ HNPHSLLWET MHWGHATSPD LVHWTQKPIA
LEPGVHPHDL WSGAGVVDTN NTSGLQTGSE APILVFTATN GVSINYSNDA AKTFQIYNQG
QKVVTPAGIS RDPKVFWHAP SNRWVMVVWS DAGGNGVNIY TSPNLLTWTF RSRYAADWLY
ECPDLFSLAV DGDPGNTRWV MTDAGGEYVI GSFDGVTFTP EWTSPQRMDQ GHNTFEGTFY
AGLTFNHMPD NRIVQMAWMR SNQGSVWTGN ASFPAELGLR AYPEGIRLTR NPVGEIASLR
VDSQSWVNRD ITPDPASDPL TSTFADTYEI IAEFDTATAT ASRFGFRLHT RSDGTFDRAV
TYDRTAQTLY GAPLAPINGR VRMRLLVDRG QLEIFGNDGK LSWTDNVNFN SAPSSQGVQL
YAEGGNVQLV SLQFHRLQSA WGSGESTLES NLAGPWHPAG GTWVDTTTGK QGTAGGDGFY
LSNQTGADFT YEGDLRLDTA VAAGITFRAN SDATQQYTAN VDANGLVKLW RPGRDIGIFY
TPISQGRTYH LKVVTSGSII RVYLDHRPTP VIDAVDTAYT SGYFGTNVFG GTGVVQNANV
NATGFVSNLG ATWRPATGLW TVPGAGVKGR VAGDGFYLSD QTGTNFTYEG DVKVINGVAA
ALTFRSNADA TGHYTANVDT NGLVKLWRPG SVIGVFNTPI VEGRTYHLKV VANGPNIRVY
FDGGATPVID AVDSTYSSGF FGVNVFSGVG VIQNVVTS