Gene Franean1_3250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3250 
Symbol 
ID5671624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3844278 
End bp3846044 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content70% 
IMG OID641242142 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001507562 
Protein GI158315054 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCCC CCTCGGTCCC CGCCTCGGGT TCGGGAATCC CGGAGGTCAG CCTCCCCTAT 
CAGGACACGA CTCTTAGCAC CGACCGGCGC GTGGCGGACC TCCTCTCCCG CCTGGACCTG
GAAGCCAAAG CCGGCCTCCT GTTCCATCCC CTCGCGATGC TCGGCGGACT CGACGATCCC
GGCATGTTCG GCATGCCCTC GATGCGGTCC ATGCTGCACA AGCGCATCAA CCACTTCAAC
ATCGCCCTGG TGCCCTCTGC GCGGGAACTC GCGCAGTGGC ACAACAAGCT CCAGGAAGAA
GCACTCGGCA CGCCGCTGGG CATACCGGTG ACAATCTCCA GCGATCCGCG GCACTCCTTC
ACCGACAACC CGGCGACCGC GCTTCTGGCC GGGCCGTTCT CCCAGTGGCC CGAACCGCTC
GGATTCGCCG CGATCGGCTC GACCGAGCTG GTGGGGCGCT TCGCGGACAC CGTACGCCGG
GAGTACCTCG CCACGGGCAT CCGCGTCGCC CTGCACCCGC AGATCGACCT CGCCACCGAG
CCACGCTGGT CCCGGATCTC CGGCACGTTC GGGGAGGACG CCGATCTGGC CTCCCGGCTC
GCTCCCGCCT ACGTGCGCGG CCTGCGGGCT GACGCCCTGG GGCCGGAGTC CGTCGCGGCC
ATGGCCAAGC ACTTCCCGGG CGGCGGCCCG CAGAAGGACG GCGAGGACCC CCATTTCGCC
TACGGCCGGG AGCAGGTCTA CCCCGGGGGC CGGTTCGAGC TGCACCTCGA ACCGTTCCGA
GCGGTCATCG ACGCGGGCGT GTCGCAGATG ATGCCGTACT ACGGCCTGCC CGTCGGCCTC
GAGTTGGAGG AGGTGGGCTT CGCCTTCAAC AAGGCTGTCG TCACCGGAAT CCTGCGTGAG
CAGCTCGGCT TCGACGGCAT CGTGTGCACC GACTGGGGCG TCCTGACCCA GATGTCCTGG
GGCGTGGAAC ACCTCACCTT CGAGGAGCGC ATGCTGAAGG CCCTCGACGC GGGCGTGGAC
CAGTTCGGCG GTGAGCTGCG CCCCGACGTC CTGGTCTCCC TCGTGCGGAA CGGCTCGGTC
AGCGAGAGTC GTCTCGACGT CTCCGCCCGA CGGATGCTGC GCGAGAAGTT CCACCTCGGC
CTTTTCGACC ATCCGTTCGT CGACGTCGAG CGGGCGACCG TGCTGGTCGG TTCGGAGACC
GCCCGTGTAG CCGGCCTCGC CGCGCAGCAG GCCGCATACA CACTGCTCAA GAACGAGGCG
GACTCGCCCG CGCGGCTACC GCTGCGGCGC GGCCTGCGCG TCTACGCGGA AGGTCTCGCA
CCGGCGGCAC TGGCGGACCG CGCGGCGGTC GTCGCCACGC CGCAGGAGGC CGACGTGGCC
GTGATCAGGC TGTCGGCTCC CTTCGAGAAG CGCGGCGCGG AGGGCGAGTA CGAATCCTTC
TTCCACGCCG GATCCCTCGC CTTCCCCGCC GAGGAGGAGC GGCGCGTCCG GGAGATCTGC
GAAACCCTTC CGACCGTGCT CGACGTCTAC CTCGACCGCC CCGCCATCAT CGGCGGGCTC
GCCGCGGCCG CCGCCGCCGT CACGGTCAAC TTCGGCGCTT CGGAGCAGGC CTGCGCCGCG
GTCCTCTTCG GGGACGCGCA GCCGCAGGGA AACCTCCCCT TCGACATCCC CTCCTCCATG
GCCGCCGTGG AGAACAGCCG GTCCGACACG CCCTTCGACA CCACCGATCC GGCCTTTCGC
TTCGGATCCG GCCTCCGATA CGCATGA
 
Protein sequence
MPSPSVPASG SGIPEVSLPY QDTTLSTDRR VADLLSRLDL EAKAGLLFHP LAMLGGLDDP 
GMFGMPSMRS MLHKRINHFN IALVPSAREL AQWHNKLQEE ALGTPLGIPV TISSDPRHSF
TDNPATALLA GPFSQWPEPL GFAAIGSTEL VGRFADTVRR EYLATGIRVA LHPQIDLATE
PRWSRISGTF GEDADLASRL APAYVRGLRA DALGPESVAA MAKHFPGGGP QKDGEDPHFA
YGREQVYPGG RFELHLEPFR AVIDAGVSQM MPYYGLPVGL ELEEVGFAFN KAVVTGILRE
QLGFDGIVCT DWGVLTQMSW GVEHLTFEER MLKALDAGVD QFGGELRPDV LVSLVRNGSV
SESRLDVSAR RMLREKFHLG LFDHPFVDVE RATVLVGSET ARVAGLAAQQ AAYTLLKNEA
DSPARLPLRR GLRVYAEGLA PAALADRAAV VATPQEADVA VIRLSAPFEK RGAEGEYESF
FHAGSLAFPA EEERRVREIC ETLPTVLDVY LDRPAIIGGL AAAAAAVTVN FGASEQACAA
VLFGDAQPQG NLPFDIPSSM AAVENSRSDT PFDTTDPAFR FGSGLRYA