Gene Franean1_0456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0456 
Symbol 
ID5675657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp538299 
End bp540581 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content67% 
IMG OID641239387 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001504825 
Protein GI158312317 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGAGG CACAGCATCT TGTTTCGACG CTCGCCCTCG AGGAGAAGGC CGAGCTGGTG 
TCTGGTGGCG GTCTCTGGGT GACCGCAGAG ATACCACACG CCGGGGTGCG GCCGGCGGTG
CTCACCGATG GCCCGCACGG GGTCCGCATG ACGCGGGATG GCGTTGATAC CGGGCATATC
GCGAACAGTT TTCCCGCTAC GGCATTTCCC ACGGCGGCGG CATTGGGATC GTCGTGGAAT
GAAGACCTGT TGCGCGAGAT CGGCGCGGCG CTCGGCGCCG AGTCCCGTGC CCTCGGCGTG
GACGTGCTAC TCGGCCCCGG AATCAACATC AAGCGCTCCC CGCTGTGCGG GCGCAACTTC
GAGTACTTCG CCGAGGATCC GCTGCTGGCG GGAGCACTCG GGGCGGCGTG GGTCGAGGGT
GTGCAGTCAC ACGGGGTCGG CGCCTCGGTC AAGCACTTCG CGGCGAACAA CCAGGAGACC
GACCGTATGC GGGTGAGCGC GGACGTCGAC GAGCGCACCC TGCGCGAGAT CTACCTGCCC
GCGTTCGAGC ACGTGGTCCG CCAGGCGCAC GTAGCCACCG TGATGTGCTC GTACAACCGG
ATCAACGGCG TGCGCGCTTC CCAGAACCGA TGGCTGCTCA CAACGGTACT GCGAGAGGAA
TGGGGCTTCG AAGGCTACGT TCTCTCGGAT TGGGGGGCCG TTCACGATCC AGTCGCCGCT
CTGCAGGCCG GCCTTGACCT GGAAATGCCT TCCAGCAGAG GCCGCAGCGC CGCCGAGATC
GTTGGTGCCG TCCGCGCCGG CGCGCTCGAC GAACGGTGCC TGGATCTGGC CGTAGAACGC
CAACTGGCCA CGCACCAGCG GCTGTGGCTC GCCCGCGGCG ACGGCATGGA GACGCCCGAT
TTGCCCGCAT ACCACGTGCT GGCCCGGCGC GCGGCGGCCG AGGGCGCGGT GCTGCTGAAG
AACGATGGCG ATCTCCTTCC GCTCGATCCG GCCACCGGCG GCCGGATCGC GGTCGTGGGT
GAGTTCGCCC GCAGCCCCCG CTATCAGGGC GCCGGTAGCT CGCAGGTCAA TCCGACCCGC
CTGGACGACG CGTTGACCGC GATCCTTGCC GCGACCTCAC GCGAGGTCAC CTTCGCTCCC
GGCTTCCGCC TCGACGGCAC CGCCGATCCG GTCCTCCTTG CCGAGGCCGT GCAGGCCGCC
CGCGACGCGG AGTCGGTGGT GATGTTCCTC GGTCTGCCGG AGCAGACGGA GTCGGAGGGC
TTCGACCGGA CCGACCTGGA CCTGCCCGCC ATCCAGGTGG AGCTACTCGA AGCCGTCGCT
TCGGTCAATT CGCGTGTTGC CGTCATCCTC AGCAACGGTG GTGTCGTGCT GACCGATCCG
GTGATCTCGC GCGCCGCCAC GCTGCTGGAG ATGTGGCTGT CGGGCCAGGC TGGTGGGAGC
GCCGCCGCCG ACCTCATCTT CGGCCATGCC GCTCCCGCCG GTCGGCTGGC CGAGACCATC
CCGCACCGCC TCCAGGACAT CCCGACCTAC GTGAACTGGC CTGGCGCCGA AGGGCACGTG
AACTACGGAG AACGCCTGTA TGTCGGCTAC CGATGGTACG ACCGCACCGA TGAGGACGTC
GCGTTCCCTT TCGGCTTCGG GCTGACCTAC ACCACCTTCG CCTATTCAGA CCTCGCCGTG
CACCTCCCCG ACCCGGCCAG GCCGGAGGCT CGCGTCGAGG TTGTGGTGAC CAACACCGGC
AGGCGGGAAG CGGCCGAGGT CGTCCAGCTG TACATCAGCG ACCCAGTCGC CGACGTTGAT
CGCCCGGTGC GTGAACTGCG CGGCTTCCGC AAGGTCCGCG TGGCCCCCGG ACACAGCGAA
CGGGTGGTGA TCGAGCTTGA CGCCCGTGCG TTCAGCTACT GGAGTACCCG ACGCGGGCAG
TGGGTCGTCG AACCTGGGGA GTACGGCATC CATGTGGGAT CCTCCTCACG CGACCTGCCG
ATGACCCAGA CGATCAACCT GGACATGGCG ATGCCGACAC CACCGCTCAC CGAAGAAAGC
ACGCTGGCCG AATGGTACGA CCATCCGAGT GGTCGACACA TCGTGCGAGA ACTCCTGGGC
AAGACGATGG GCGCGCAGGA CGGCCTGCTG GAGAACCTCG GAACCTGGCA GTTCGTCGCC
CAGATGCCAC TCACCGCTCT GCTCAAAATG TCCCCATCCG CGGGCACCGA GGCCAAGCCG
TTGAAGGACC TGCTGCACGC CGCGAACAAT GGAAACGCCG CGCAGCCAAC GGCGAACGTC
TGA
 
Protein sequence
MIEAQHLVST LALEEKAELV SGGGLWVTAE IPHAGVRPAV LTDGPHGVRM TRDGVDTGHI 
ANSFPATAFP TAAALGSSWN EDLLREIGAA LGAESRALGV DVLLGPGINI KRSPLCGRNF
EYFAEDPLLA GALGAAWVEG VQSHGVGASV KHFAANNQET DRMRVSADVD ERTLREIYLP
AFEHVVRQAH VATVMCSYNR INGVRASQNR WLLTTVLREE WGFEGYVLSD WGAVHDPVAA
LQAGLDLEMP SSRGRSAAEI VGAVRAGALD ERCLDLAVER QLATHQRLWL ARGDGMETPD
LPAYHVLARR AAAEGAVLLK NDGDLLPLDP ATGGRIAVVG EFARSPRYQG AGSSQVNPTR
LDDALTAILA ATSREVTFAP GFRLDGTADP VLLAEAVQAA RDAESVVMFL GLPEQTESEG
FDRTDLDLPA IQVELLEAVA SVNSRVAVIL SNGGVVLTDP VISRAATLLE MWLSGQAGGS
AAADLIFGHA APAGRLAETI PHRLQDIPTY VNWPGAEGHV NYGERLYVGY RWYDRTDEDV
AFPFGFGLTY TTFAYSDLAV HLPDPARPEA RVEVVVTNTG RREAAEVVQL YISDPVADVD
RPVRELRGFR KVRVAPGHSE RVVIELDARA FSYWSTRRGQ WVVEPGEYGI HVGSSSRDLP
MTQTINLDMA MPTPPLTEES TLAEWYDHPS GRHIVRELLG KTMGAQDGLL ENLGTWQFVA
QMPLTALLKM SPSAGTEAKP LKDLLHAANN GNAAQPTANV