Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3250 |
Symbol | |
ID | 5671624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3844278 |
End bp | 3846044 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242142 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001507562 |
Protein GI | 158315054 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCCC CCTCGGTCCC CGCCTCGGGT TCGGGAATCC CGGAGGTCAG CCTCCCCTAT CAGGACACGA CTCTTAGCAC CGACCGGCGC GTGGCGGACC TCCTCTCCCG CCTGGACCTG GAAGCCAAAG CCGGCCTCCT GTTCCATCCC CTCGCGATGC TCGGCGGACT CGACGATCCC GGCATGTTCG GCATGCCCTC GATGCGGTCC ATGCTGCACA AGCGCATCAA CCACTTCAAC ATCGCCCTGG TGCCCTCTGC GCGGGAACTC GCGCAGTGGC ACAACAAGCT CCAGGAAGAA GCACTCGGCA CGCCGCTGGG CATACCGGTG ACAATCTCCA GCGATCCGCG GCACTCCTTC ACCGACAACC CGGCGACCGC GCTTCTGGCC GGGCCGTTCT CCCAGTGGCC CGAACCGCTC GGATTCGCCG CGATCGGCTC GACCGAGCTG GTGGGGCGCT TCGCGGACAC CGTACGCCGG GAGTACCTCG CCACGGGCAT CCGCGTCGCC CTGCACCCGC AGATCGACCT CGCCACCGAG CCACGCTGGT CCCGGATCTC CGGCACGTTC GGGGAGGACG CCGATCTGGC CTCCCGGCTC GCTCCCGCCT ACGTGCGCGG CCTGCGGGCT GACGCCCTGG GGCCGGAGTC CGTCGCGGCC ATGGCCAAGC ACTTCCCGGG CGGCGGCCCG CAGAAGGACG GCGAGGACCC CCATTTCGCC TACGGCCGGG AGCAGGTCTA CCCCGGGGGC CGGTTCGAGC TGCACCTCGA ACCGTTCCGA GCGGTCATCG ACGCGGGCGT GTCGCAGATG ATGCCGTACT ACGGCCTGCC CGTCGGCCTC GAGTTGGAGG AGGTGGGCTT CGCCTTCAAC AAGGCTGTCG TCACCGGAAT CCTGCGTGAG CAGCTCGGCT TCGACGGCAT CGTGTGCACC GACTGGGGCG TCCTGACCCA GATGTCCTGG GGCGTGGAAC ACCTCACCTT CGAGGAGCGC ATGCTGAAGG CCCTCGACGC GGGCGTGGAC CAGTTCGGCG GTGAGCTGCG CCCCGACGTC CTGGTCTCCC TCGTGCGGAA CGGCTCGGTC AGCGAGAGTC GTCTCGACGT CTCCGCCCGA CGGATGCTGC GCGAGAAGTT CCACCTCGGC CTTTTCGACC ATCCGTTCGT CGACGTCGAG CGGGCGACCG TGCTGGTCGG TTCGGAGACC GCCCGTGTAG CCGGCCTCGC CGCGCAGCAG GCCGCATACA CACTGCTCAA GAACGAGGCG GACTCGCCCG CGCGGCTACC GCTGCGGCGC GGCCTGCGCG TCTACGCGGA AGGTCTCGCA CCGGCGGCAC TGGCGGACCG CGCGGCGGTC GTCGCCACGC CGCAGGAGGC CGACGTGGCC GTGATCAGGC TGTCGGCTCC CTTCGAGAAG CGCGGCGCGG AGGGCGAGTA CGAATCCTTC TTCCACGCCG GATCCCTCGC CTTCCCCGCC GAGGAGGAGC GGCGCGTCCG GGAGATCTGC GAAACCCTTC CGACCGTGCT CGACGTCTAC CTCGACCGCC CCGCCATCAT CGGCGGGCTC GCCGCGGCCG CCGCCGCCGT CACGGTCAAC TTCGGCGCTT CGGAGCAGGC CTGCGCCGCG GTCCTCTTCG GGGACGCGCA GCCGCAGGGA AACCTCCCCT TCGACATCCC CTCCTCCATG GCCGCCGTGG AGAACAGCCG GTCCGACACG CCCTTCGACA CCACCGATCC GGCCTTTCGC TTCGGATCCG GCCTCCGATA CGCATGA
|
Protein sequence | MPSPSVPASG SGIPEVSLPY QDTTLSTDRR VADLLSRLDL EAKAGLLFHP LAMLGGLDDP GMFGMPSMRS MLHKRINHFN IALVPSAREL AQWHNKLQEE ALGTPLGIPV TISSDPRHSF TDNPATALLA GPFSQWPEPL GFAAIGSTEL VGRFADTVRR EYLATGIRVA LHPQIDLATE PRWSRISGTF GEDADLASRL APAYVRGLRA DALGPESVAA MAKHFPGGGP QKDGEDPHFA YGREQVYPGG RFELHLEPFR AVIDAGVSQM MPYYGLPVGL ELEEVGFAFN KAVVTGILRE QLGFDGIVCT DWGVLTQMSW GVEHLTFEER MLKALDAGVD QFGGELRPDV LVSLVRNGSV SESRLDVSAR RMLREKFHLG LFDHPFVDVE RATVLVGSET ARVAGLAAQQ AAYTLLKNEA DSPARLPLRR GLRVYAEGLA PAALADRAAV VATPQEADVA VIRLSAPFEK RGAEGEYESF FHAGSLAFPA EEERRVREIC ETLPTVLDVY LDRPAIIGGL AAAAAAVTVN FGASEQACAA VLFGDAQPQG NLPFDIPSSM AAVENSRSDT PFDTTDPAFR FGSGLRYA
|
| |