Gene Franean1_5462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5462 
Symbol 
ID5673793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6604984 
End bp6606453 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content74% 
IMG OID641244317 
ProductBeta-glucosidase 
Protein accessionYP_001509723 
Protein GI158317215 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTACCA GCCCCGTCCC GGCGACGGAC ACGACAGCGG CCACCGGAGT ACCGGCGCCC 
GAGCCGTCGC CGGGAGCCGA CGAGCTCGCG GCCCGGATCG CGACCGGCCT GCCCACCGGA
TTCGTGTGGG GTGCCGCCAC CTCCGCGTAC CAGATCGAGG GCGGCGCCCG GGAGGGCGGC
CGCGGGCCGT CGATCTGGGA CGTCTTCGCC CGCACCCCCG GCATGACCGC GAACGGCGAG
AACGGCGACG TCGCCGCCGA CCACCTGCAC CGCTACCGCG AGGACGTCGC CCTCATGGCC
GGCCTCGGCC TGGGCGCGTA CCGGTTCTCG GTCGCCTGGC CGAGGATCGT CCCGGCCGGG
TCGGGGGCCG TGAACCCCGA CGGGCTGGCC TTCTACGACC GGCTCGTCGA CGAGATCCTC
GCCGCCGGGA TCGACCCGTG GCTGACGCTC TACCACTGGG ATCTGCCACA GCCGCTGCAG
GACGCCGGCG GCTGGCCGAA CCGCGACACG GCGCTGCGCT TCGCCGACTA CACCGAACAC
GTCGTCAGCG CGCTCGGCGA CCGGGTCCGG CACTGGTCGA CCCTGAACGA GCCCTGGTGC
TCGGCCTTCG AGGGACACCT GACCGGGCGG CACGCCCCCG GGGTGCGGGA CGCCGGGGCC
GCCGTCCGGG CCGTCCACCA CCTGCTGCTG GCCCACGGTC TCGGCACGGC CGTGCTGCGG
GCGCACGGCG CGGCGGACGT CGGCATCACC CTCAACCTGA TTCCGGCGGA ACCGCCGCCG
GACGCCGGCG GGGCGGGCGA CGCCGTGCGG CTCGTCGACG GGCAGCAGAT CCGGCTCTGG
CTCGATCCCG TCCTGCACGG CACGTACCCG TCCGACGTGA TCGCCGACTT CGCCCGGGCC
GGCGCCGAGC TGCCCGCCCA GGACGGTGAT CTCGCCGTCA TCGCGCAGCC GGTGGACTGG
ATCGGGGTGA ACTACTACGC GCCGCACATC GTCGCGCCCG GGCCGGACCC GTCCCCGAAA
CCGACCCCCT TCGCCGGCGC GGGGAACGTC ACGAGGGTGC CCACGAGCGA GGACGTGACG
GCGCTGGGCT GGCCGGTACG GCCGCGCAGC TTCCTGGCCC TGCTGCGCCG GCTCGGCGCG
GACTATCCGG GGGTCCCGTT CTACATCCAC GAGAACGGCG CGGCCTACGA CGACGAGGTG
TCCCCGGACG GGACGGTGCA CGACCCGCTG CGGGTGCGTT ACCTGGCGGG TCACCTCGAC
GCCGTCCGGC AGGCGTCCGA GGACGGGGTC GACGTTCGGG GATACTTCGT GTGGTCGTTG
CTGGACAACT TCGAGTGGGC CGAGGGGTAC CGGATGAGGT TCGGCATCGT GCACGTCGAC
TTCGAGTCCC TGGTGCGCAC GCCGAAGTCC AGCGGGCTGT GGTACTCGCG GCTGATCCGG
GAGCACCGCG CGGCAAGCGG GACGCGGTGA
 
Protein sequence
MTTSPVPATD TTAATGVPAP EPSPGADELA ARIATGLPTG FVWGAATSAY QIEGGAREGG 
RGPSIWDVFA RTPGMTANGE NGDVAADHLH RYREDVALMA GLGLGAYRFS VAWPRIVPAG
SGAVNPDGLA FYDRLVDEIL AAGIDPWLTL YHWDLPQPLQ DAGGWPNRDT ALRFADYTEH
VVSALGDRVR HWSTLNEPWC SAFEGHLTGR HAPGVRDAGA AVRAVHHLLL AHGLGTAVLR
AHGAADVGIT LNLIPAEPPP DAGGAGDAVR LVDGQQIRLW LDPVLHGTYP SDVIADFARA
GAELPAQDGD LAVIAQPVDW IGVNYYAPHI VAPGPDPSPK PTPFAGAGNV TRVPTSEDVT
ALGWPVRPRS FLALLRRLGA DYPGVPFYIH ENGAAYDDEV SPDGTVHDPL RVRYLAGHLD
AVRQASEDGV DVRGYFVWSL LDNFEWAEGY RMRFGIVHVD FESLVRTPKS SGLWYSRLIR
EHRAASGTR