Gene Franean1_5502 details


Gene Information

Locus tag: Franean1_5502
Symbol:
ID: 5673833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6666554
End bp: 6668044
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 67%
IMG OID: 641244357
Product: hypothetical protein
Protein accession: YP_001509763
Protein GI: 158317255
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4124] Beta-mannanase
TIGRFAM ID:
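
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, but both agree with the values listed. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(6666554, 6668044)
print(length)                  # 1491
print(protein_length(length))  # 496
```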


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0681729
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.325123
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACAGCG CGCGCGGCCA TCGCATCGTC CGCGCACCGA CCGGTCGTCG GCGGTCCTAC 
CTGCGATCGA CGATGCTGCG TCTCGCCGTC CTCGCGCCGA CCCTGTTGGC CGGGACGACG
ATCGCCGGCA GCGGCGCGGA GGCCTTGCCG GTGCCGGGAG CCCTCATCGA GGACACGGCG
ATCGGTTCGG GGGTCTCCCA GGTCAGCTAC TTCGGCGACT GGTCGGCGTG CACGCGGTGC
GGCCCGGCCA CGCCCAACAA TAGTTACCGG GAGTCGGTCC AGCCGTCGAG CGGCGCGGTG
CTCCGGTTCT CCGGGACACA GGTTGACGTC TACGGCGTCC TGGGCCCGTC CGGGGGCGTC
GCGACGATCA GTGTCGATGG CGGGACACCG ACCGCCGTCG ACACCTACGC GGCGGCCGGT
GCGGTGAGCC GCATCTACCA GTCCGGGTTG CTCAATCCCG GGATCCACAC CGCGGTGATC
GTCAACATCG GCTGGCGCAA CCCGGCTTCC AGTGGTGTTC GGGTCGCCTT CGACCGGGCC
CAGGTGTTCG TCGAGCAGGG CGGAGGGGAT CCGGGGAACC GGTCGGGTCA GCCGTGGCTC
TCCGGGGCGA ACGGAGATCC GATCCAGAAC TCGGCGAACG TGGACACGTT CTGCGAGCGC
CGGGGCAGTC CCTGCGACCT CGCTCATGTC TTCGTCTCCC GTAACAACTG GCAGAACATC
GTGCAGCCGT CCTGGACCCA GGCGAACTTC GCCGGATGGC CGGGCCGCCT CGTCATCTCG
GTGCCTCCCT TCCCCGAGAA CTCGGGGAGC ACACTCACCG CCTGCGCATC GGGTGCCTAC
GACTCGCAGT GGCGCACGTT CGGCCAGACA CTGAACTCCA CCGGACGGCA GAACTCGATT
ATCCGTATCG CGTGGGAGGC GAACGGGAAC TGGTACCAGT GGTCGGGTAG CAACCCGTCC
GCCTATGTGG GCTGTTGGCG GCGGATCGCC GACGCCATCA ACTCCACGGC CGAGCCTGAC
CCGCTGCTCG ACTGGACCAT CAACGCGCAC TACTCGCAGA ACCCCGCGAG CCATAACCCG
CTCGACCTGT ACCCGGGCGA CGCCTGGGTG GACATCGTGG GCATCGACGC CTACGACCAC
TACCCGCCGT CCCGTACCCT CGCCGAGTTC AACAACCAGG CGAACGCGGT CGGGGGCATC
ACCTGGCTGT ACAACTTCGC CCGCGCCCAC AACAAGTTGT TCGGTGTCGG TGAATGGGGG
GTCGTGAGCG GACGTAACGA GAACGGTGCC GGGGACAACC CGAACTTCAT CCAGTTCATG
CGCGACTGGA TGAATGCGCG CGCTGGACAG GGAATGTTCT ACGAGAACTA CTACAGCACC
TGCGAGCCGC CGAATGTCGG GTCCAACCTG TACCGGCCGA CCGGGCCGTC CTGCCTGTTC
ATCAACAACG CCTCCGCCCA GCGCTACACC GATCTGTGGA GCAGCCCTTA G
 
Protein sequence
MDSARGHRIV RAPTGRRRSY LRSTMLRLAV LAPTLLAGTT IAGSGAEALP VPGALIEDTA 
IGSGVSQVSY FGDWSACTRC GPATPNNSYR ESVQPSSGAV LRFSGTQVDV YGVLGPSGGV
ATISVDGGTP TAVDTYAAAG AVSRIYQSGL LNPGIHTAVI VNIGWRNPAS SGVRVAFDRA
QVFVEQGGGD PGNRSGQPWL SGANGDPIQN SANVDTFCER RGSPCDLAHV FVSRNNWQNI
VQPSWTQANF AGWPGRLVIS VPPFPENSGS TLTACASGAY DSQWRTFGQT LNSTGRQNSI
IRIAWEANGN WYQWSGSNPS AYVGCWRRIA DAINSTAEPD PLLDWTINAH YSQNPASHNP
LDLYPGDAWV DIVGIDAYDH YPPSRTLAEF NNQANAVGGI TWLYNFARAH NKLFGVGEWG
VVSGRNENGA GDNPNFIQFM RDWMNARAGQ GMFYENYYST CEPPNVGSNL YRPTGPSCLF
INNASAQRYT DLWSSP