Gene Information |
Locus tag | ANIA_04102 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001302 |
Strand | - |
Start bp | 2022168 |
End bp | 2025149 |
Gene Length | 2982 bp |
Protein Length | 854 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | beta-glucosidase (Eurofung) |
Protein accession | CBF74704 |
Protein GI | 259481308 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCG GTTGGCTTGA GGCTGCGGCT TTGACGGCTG CCTCAGTGGC TAGTGCTCAG GTACGTTGAT TCCTAGGTTG GTTTTCGCGA GGAGGCTGAT TAGGTGAAGC AGGATGACCT CCCCGTATCG CCTCCGTACT ACCCTTCTCC ATGGTCGAAT GGTGAAGGCG AATGGGCCGA AGCATACAAT CGCGCCGTTC AGATCGTCTC GCAAATGACT CTCGATGAGA AGGTCAATCT TACAACTGGT ACAGGGTATG GTCAGTGGCC GCGGAAACTG TTCTGGTGCT AATTGGCTGC AGGTGGATGT CTGAGAAGTG CGTTGGACAG ACAGGAAGCG TTCCTAGGTA CAGCTGCCGG TTTTCGGAAC TGAGGTGATA CTGACTGCAG CAGACTCGGA ATAAATTCCA TTTGCTTACA AGACGGCCCT CTGGGTATCC GATTCAGTAG GTTGCCAATA AAAACGGGTA TTTGAGGGTA CGGTGCTGAC TTTACCAGCC GACTACAACT CAGCGTTCCC CGCCGGTGTT AACGTCGCAG CAACATGGGA TCGACAATTG GCTTACATCC GTGGTCATGC AATGGGTCAG GAATTCAGCG ACAAGGGCAT TGACGTACAA CTAGGCCCGG CCGCAGGTCC TCTCGGCAGA TTCCCCGACG GTGGCCGAAA CTGGGAAGGG TTCTCACCCG ATCCTGTTCT GTCCGGTGTT CTCTTCGCCG AGACTATCAA GGGCATTCAG GATGCCGGTG TGATCGCAAC CGCGAAGCAC TATCTTCTCA ACGAACAAGA GCATTTCCGA CAGGTACCTG AGGCGAATGG TTACGGCTAC AACATCACGG AGACCCTGAG TGAGAACGTT GACGACAAGA CCTTGCACGA ATTGTACCTC TGGCCCTTCG CAGATGCCGT GCGTGCTGGC GTGGGCGCCA TTATGTGCTC ATACCAACAT CTGAACAACA CGCAAGCCTG CCAGAACAGC CACCTTCTGA ACAAGCTCCT GAAGGCTGAA TTAGGCTTTC AGGGGTTTGT CATGAGCGAC TGGTCTGCGA CGCACAGCGG TGTCGGCTCT GCTCTGGCTG GTATGGACAT GACAATGCCT GGCGACATCG CCTTCAACGA TGGGCTCTCT TACTATGGCC CCAACCTGAC GATCAGTGTT CTCAACGGAA CCGTTCCCCA ATGGCGTGTC GATGACATGG CCGTCCGAGT CATGGCCGCT TTTTACAAGG TTGGCCGTGA CCGTTTGGCC ACCCCGCCCA ACTTCAGCTC CTGGACGAGG GCAGAAAAGG GGTACGAACA CGCTTCGATT GATGGAGGCG CCTACGGCAC AGTCAATGAG TTTGTTGATG TCCAGCAAGA CCATGCTTCT CTCATCCGTC GCGTTGGTGC AGACAGTATC GTACTCCTGA AGAACGAGGG TTCGCTTCCT CTCACCGGAA AGGAGCGCAA TGTCGCTATC CTTGGTGAGG ATGCGGGCTC CAATCCCTAC GGTGCAAATG GCTGCGATGA CCGTGGTTGT GCTCAAGGAA CCCTTGCCAT GGGTTGGGGT AGTGGTACAG CCAACTTCCC CTACCTGGTC ACCCCCGAGC AGGCGATCCA GCAGGAGGTC CTCAAGGGAC GTGGCAACGT GTTTGCGGTG ACGGACAACT GGGCCTTGGA TAAGGTTAAC AAGACCGCGT CTGAGTCTAC GTAAGTTTTC CAAGTCCTAT AGCAAGAGAT GGTGCTAATT TGTTAGGGTA TCTCTCGTGT TTGTCAACGC AGGTGCCGGA GAAGGATTCA TCAGCGTAGA CGGAAACGAG 
GGTGACCGCA AAAACCTTAC CCTGTGGAAG AACGGCGAGA ACCTGATAAA GGCGGCCGCA AGCAACTGCA ACAATACAAT CGTCGTCATC CACTCTGTTG GCGCCGTTCT CGTGGACCAG TTCTATGAGC ATCCCAACGT CACTGCCATC CTCTGGGCTG GTCTTCCCGG CCAGGAGTCT GGAAACTCGC TTGTCGACGT GCTCTACGGC CGTGTCAACC CCAACGGAAA GTCTCCGTTC ACCTGGGGCA AAACCCGCGA GGCATACGGC GCTCCTCTGC TGACTGAAGC CAACAACGGC AATGGTGCCC CGCAGACCGA CCACACCGAA GGCGTCTTCA TTGACTACCG CCACTTCGAT AGGACCAATC AGACTCCGAT CTACGAGTTC GGCCATGGGC TGAGCTACAC CACATTCAAG TACTCGAACC TTACTGTCCA AAAGCTGAAT GCCCCAGCGT ACTCTCCCGC ATCTGGCCAG ACAAAGGCTG CGCCCACATT CGGCACCATT GGTGAGGCCG AGGACTACGT GTTCCCTGAC AGCATCACAA GGGTCAGGGA GTTTATCTAC CCTTGGATCA ACTCTACTGA TCTAAAGGAG TCATCCGGAG ATCCCAACTA CGGATGGGAC GATGAGGACT ACATCCCCGA GGGTGCCAAG GATGGATCAC CACAAGACGT TCTGCCGTCT GGTGGAGGTG CAGGAGGAAA CCCACGCCTT TACGACGACC TCTTCCGAAT AACAGCCATC ATTAAGAACA CTGGGCCTGT GGCTGGTACA GAAGTTCCCC AGCTGGTAAG TCGCTTAATC TTTCCCTTTT TTTTTTTTTT CAGCTAGGAG CTAACCCGGC TAGTACGTTT CCCTGGGCGG GCCCAACGAA CCGAAGGTGG TTCTTCGCGG CTTTGACAAA CTTGTCATCC AGCCCGGCGA GGAGCGGGTT TTCACGACGA CCCTGACCCG CCGTGACCTG TCCAACTGGG ATATGGAGAA GGACGATTGG GTCATCACGT CTTATCCTAA AAAGGGTGTT CGTGGGAAGC TCCTCACGCA AGCTTCCTCT TAGGGCTTCG TTGCCGGCGG TTCAGTAGCG AATTACTTAT CTAACAGGAT GCTGGCCTCG GTTGGCTGGC TCCGATTTCT GGTATATTAA TTGATAGCCA ATTTACGACT TCCTATTGGT AA
|
Protein sequence | MKLGWLEAAA LTAASVASAQ QDDLPVSPPY YPSPWSNGEG EWAEAYNRAV QIVSQMTLDE KVNLTTGTGW MSEKCVGQTG SVPRLGINSI CLQDGPLGIR FTDYNSAFPA GVNVAATWDR QLAYIRGHAM GQEFSDKGID VQLGPAAGPL GRFPDGGRNW EGFSPDPVLS GVLFAETIKG IQDAGVIATA KHYLLNEQEH FRQVPEANGY GYNITETLSE NVDDKTLHEL YLWPFADAVR AGVGAIMCSY QHLNNTQACQ NSHLLNKLLK AELGFQGFVM SDWSATHSGV GSALAGMDMT MPGDIAFNDG LSYYGPNLTI SVLNGTVPQW RVDDMAVRVM AAFYKVGRDR LATPPNFSSW TRAEKGYEHA SIDGGAYGTV NEFVDVQQDH ASLIRRVGAD SIVLLKNEGS LPLTGKERNV AILGEDAGSN PYGANGCDDR GCAQGTLAMG WGSGTANFPY LVTPEQAIQQ EVLKGRGNVF AVTDNWALDK VNKTASESTV SLVFVNAGAG EGFISVDGNE GDRKNLTLWK NGENLIKAAA SNCNNTIVVI HSVGAVLVDQ FYEHPNVTAI LWAGLPGQES GNSLVDVLYG RVNPNGKSPF TWGKTREAYG APLLTEANNG NGAPQTDHTE GVFIDYRHFD RTNQTPIYEF GHGLSYTTFK YSNLTVQKLN APAYSPASGQ TKAAPTFGTI GEAEDYVFPD SITRVREFIY PWINSTDLKE SSGDPNYGWD DEDYIPEGAK DGSPQDVLPS GGGAGGNPRL YDDLFRITAI IKNTGPVAGT EVPQLYVSLG GPNEPKVVLR GFDKLVIQPG EERVFTTTLT RRDLSNWDME KDDWVITSYP KKGVRGKLLT QASS
|
| |
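The length and coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names `span_length` and `gc_fraction` are hypothetical helpers, not part of any database tool, and it assumes that Start bp and End bp are 1-based inclusive coordinates (an assumption that happens to reproduce the listed Gene Length). The GC calculation is shown on just the first two 10-base blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases; spaces and other non-ACGT characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
gene_length = span_length(2022168, 2025149)

# Bases needed to encode 854 amino acids, not counting a stop codon.
min_coding_bp = 854 * 3

# First two 10-base blocks of the gene sequence (a short prefix only).
prefix_gc = gc_fraction("ATGAAGCTCG GTTGGCTTGA")

print(gene_length, min_coding_bp, prefix_gc)
```

The computed span matches the listed Gene Length of 2982 bp. The coding requirement for the 854 aa protein is smaller than that span, which is consistent with the record marking the gene as spliced.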