Gene Information |
Locus tag | ANIA_02072 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001307 |
Strand | + |
Start bp | 2710988 |
End bp | 2714387 |
Gene Length | 3400 bp |
Protein Length | 1022 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | ubiquitin C-terminal hydrolase, putative (AFU_orthologue; AFUA_2G04720) |
Protein accession | CBF86122 |
Protein GI | 259487441 |
COG category | |
COG ID | |
TIGRFAM ID | |
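The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes the start and end positions are 1-based and inclusive, which is consistent with the reported 3400 bp. It also assumes the gene span runs from the start codon through the stop codon, so the difference between the gene length and the coding length approximates the total intron length of this spliced gene. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 2710988
END_BP = 2714387
GENE_LENGTH_BP = 3400
PROTEIN_LENGTH_AA = 1022


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

# Assumption: everything in the gene span that is not coding is intron sequence.
non_coding = gene_len - cds_len

print(f"span length:       {gene_len} bp (table reports {GENE_LENGTH_BP} bp)")
print(f"coding length:     {cds_len} bp")
print(f"non-coding length: {non_coding} bp")
```

Under these assumptions the span length matches the reported gene length, and the leftover non-coding bases are consistent with the "Is gene spliced | Yes" entry.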
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.701074 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCAG ATGCTGCGGC GCTTCCTTCA CCGCCGAGTT TCGCTCCGGA GGATGCTCCC AGCCGGCACA ACCTAGGTGG CACGGCTCCG GGTGGGGGGC ATCAGGATGG GCTCTCGCCG CGCTTTCCAA ATATAAAAGC CCTTCAGGAC GAGGCAGCAG CATTAGATGT GAACGAATCC ACGACGGTAA GTGGCTTGTG GCTTGTGCTA TGCTGAAAAC GACCGTCCTG ACCATTAGTT TGGAATATAG ATCACCGATT TGTTGGCCAC GGCTCAAGAC GCAATTACGA AGTTTCGAGG GTTCGCAGAC AGTGATCAAA TCGATAGGGC ATACGTACAA TATGTTCGCG CTTCGGAAAT CACCATAAAT CTTATCCCTA ATCACCCTGA CTATCGGTCC GCCTCCACGC AGATCCCTGG TTGGCATAAG CAGTTTTCGG ACTTGATGAT GGCAAGTCCT TCTCGGCGGA CCTGTTTTGC GATCGCGCTT CTCTTCGCTA ACTATATCTT TCGCTAGGCC GTACGAACCA AGCAGGGGAC CGCGGACTCA ATCAAGCAAC AAATTGTGGA GAATAATCTC AAGAGTATGG CCAAGCCGGC GGGCCAATCC TCACCAATCT CTCGACGACC AGTATCGCAG ATATTACCTG CTGGAACATA TTTACCTTCA AGTCAACGCG GCCGCCAGAG CAATGCGGGA GACTCGTCAA CGCGAATGCC TAGTCCTTCG CAGTTCCAGT CCTTCGGCTT GCAAGACCAA CCCAATAGAT ATTCGGCACC TCCAGATGCA CTTGCGCAGC GCCTTGCCAA GCTGAAGACA ACATCACCTA CAGCAAATGG CGTCAGTTCA GGTTCAGCCA GTTGGGGTGA CAATGAACTA GGACACGTTG AAAACCGATC TCCCCGGCCA TCCTCTTATA TCTCTCATGG GCCAACACAT GGGTCAACAC TAAGTTCACC TTCTAGACGA CCACTAGGCC CTCGAGACAT GGGAACCGCT CAGAGCGGCC CCTCTATACC CCCCAAGATT CCCCTCAACA CATCACTTCC GCGGGCACCG GATCCGACAT ATAGCCCAAT ATACACCGTA CCTTCTCAGC CTCCGTCTAA CCCGCCGAGA ACTTCAACAG AGAACTTTCG TCCGAATAAT GCACGATATT CCCATCTTGC AAATTCTCCT CATGGCAGCC AAAGTCCTAA TGGCTCCGAT GATAATCCTT ATAGGTCTCG AACACCGAAC GGCGTCCATG CGCTGAAGGG CGCCAGTAGT AGTACGCTTG ATTTACCGCA CAGATCTACT ATCAGCGCCC AGGAGTTGCT GGACTACCTC CGTCGTTATA GCATATTGCT GATCGACGTC CGCTCCCGAG AGGATTTTGA CAGCGGCCAT GTATATGCGA AATCGATCAT ATGCATAGAG CCCGTGGCAT TAAAGGAGAA CGTTTCCGCA GAAGAGCTTG AGGAGCGGTT GGTGGTGTCT CCTGAGCATG AACAGTCGTT GTTCGAGAGG CGAAATGAGT TTGATATGGT GGTCTACTAC GATCAGAGTA CCAGTTCGAA CAGCTACCTT GCCGGCCCAC CGGTGGGGAC CACAGCGCCT CACCTGAGGG CGTTGTATGA TACACTGTAC GAGTTCAATG CCTACAAACC TCTGAAGGAT GGCCGGCCAC CTATGCTTCT AGCGGGAGGA CTTGACGCTT GGATTGATCT CTTAGGGCAA CAGTCGCTGG CCACATCATC TACCGCTGCT GTAATAGCGT CTTTGCAAAC CAGGCGGCCT GTTGCGAGGC CGGGACGCCC 
TCTCGGTAGA GTCCCGACAA TGGCTAGCGC TAACTCCAGT CTGGAGATCA GAAAGCGGAG GCTTCGAAAA CTTTCACAGT TGAACCCAGA TGAGCTTACG GAGTGGTTGG AGAAGTCAAA GACAGAAGAG ATTGACGCAA GTGCCTATCT TGGAGAGGAT AACCTCTCAG AGGAGCCCGA GCCGGAGCAA CAAGCAGGAA CTCCCATCTC CCCCTTCATT CGTTCGTACG AGGATTTTCT GCGCCGTTTC CCGGAACCTC ATAATATTCA GCAATCTATG ATAACAGCCC AGCCCAGACC TTTGACCCCC GACTATGCGT CTCATGTGCC TATTGCTCCT TCAAGACCTC CACCTGCTGT TCCGCGTCCC AGCTACAGCG GTGTGTCGGA TGGCCGGCAG ATACAAGCGC CGCTGCAGAG ACAAAATTCA GCAACCAAGC ACGCGCTTTA CACGTCGAAC TCATTGCTTA ATCGTCTCAA ACTTCCGCGC ACCGGATTGG CAAATTTCGG AGTTACCTGT TATATGAACT CAACAATCCA GTGTTTAAGC GCAACAGTGC TCATGAGCAA GTTTTTCATT GATGACCGGT TCAGGTTTTA TGTCCAGAAG AATTGGAAGG GGTCTCAGGG TGTTATGCCT GGGCTATACG CCAATCTCAT CAGATCACTG TGGAAGAATG ATGTGCAAGT TATCATGCCG ACGTCATTTC GAAATTTCTG CGGACGGCTG AATCAGGAAT GGGCCATAGA CAGGCAACAG GATGCTAAGG AGTTCTTTGA TTTTGTTGTC GATTGCCTGC ATGAAGATCT CAACGTAAAT TGGCAGCGGA CACAGCTTAG GCCCCTGACT TTCGAGGAGG AAATGCAGCG GGAAAGGATG CCAGTGGCCA AGGTCTCCAA GATAGAATGG GATCGGTATT GTCACCGAGA GGAATCATTT ATCTCATCGT TGTTTGCGGG GCAACATGCA AGCCGGCTTC GGTGTACAAC TTGCAAGCGA ACATCAACCA CATATGAAGC TTTCTACAGT ATCAGTGTGG AGATCCCGGC ATCCGGAAAG GGCGATATCT ACCAATGCCT CCGTAGCTAC TGCCAGGAGG AAATGTTAAG CGGCGACGAG GTCTGGAAGT GTCCCTATTG CAAATGTGAG CGTGTCGCGA CCAAACAAAT CATTATCACT CGCGCACCTC AGATCCTGGT GGTTCATTTC AAACGCTTCT CTGCCTCCAA GACACAGAGT GCCCGTAAAA TCCACACCCC CATCGACTTT CCCCTTCACG GTCTTCGCAT GGACGACTTC GTCTTCTCGC AGCCTAAGCA GACATCCAAC GGCGATGGCC CCAGTTCAGG TCCACAGGAT CCAACTTCCG CCACAGAACC ACCCTTTACA TACGACGCAT ACGGTGTTTT ACGCCATTTG GGCTCGTCCA TGGGGAGCGG GCATTATATA TCACTAGTGC GTGATGCGTC ACGTCAATGC TGGCGCAAAT TTGATGACGG CCGCGCAACA GATTTTATAC CGCGTGATCT GCCTTTCAAA GACCGGTTGC AGAACGAACA AGCGTATATT GTGTTCTACG AGCGCGTTCC AGCGAAATAG
|
Protein sequence | MAPDAAALPS PPSFAPEDAP SRHNLGGTAP GGGHQDGLSP RFPNIKALQD EAAALDVNES TTAVRTKQGT ADSIKQQIVE NNLKSMAKPA GQSSPISRRP VSQILPAGTY LPSSQRGRQS NAGDSSTRMP SPSQFQSFGL QDQPNRYSAP PDALAQRLAK LKTTSPTANG VSSGSASWGD NELGHVENRS PRPSSYISHG PTHGSTLSSP SRRPLGPRDM GTAQSGPSIP PKIPLNTSLP RAPDPTYSPI YTVPSQPPSN PPRTSTENFR PNNARYSHLA NSPHGSQSPN GSDDNPYRSR TPNGVHALKG ASSSTLDLPH RSTISAQELL DYLRRYSILL IDVRSREDFD SGHVYAKSII CIEPVALKEN VSAEELEERL VVSPEHEQSL FERRNEFDMV VYYDQSTSSN SYLAGPPVGT TAPHLRALYD TLYEFNAYKP LKDGRPPMLL AGGLDAWIDL LGQQSLATSS TAAVIASLQT RRPVARPGRP LGRVPTMASA NSSLEIRKRR LRKLSQLNPD ELTEWLEKSK TEEIDASAYL GEDNLSEEPE PEQQAGTPIS PFIRSYEDFL RRFPEPHNIQ QSMITAQPRP LTPDYASHVP IAPSRPPPAV PRPSYSGVSD GRQIQAPLQR QNSATKHALY TSNSLLNRLK LPRTGLANFG VTCYMNSTIQ CLSATVLMSK FFIDDRFRFY VQKNWKGSQG VMPGLYANLI RSLWKNDVQV IMPTSFRNFC GRLNQEWAID RQQDAKEFFD FVVDCLHEDL NVNWQRTQLR PLTFEEEMQR ERMPVAKVSK IEWDRYCHRE ESFISSLFAG QHASRLRCTT CKRTSTTYEA FYSISVEIPA SGKGDIYQCL RSYCQEEMLS GDEVWKCPYC KCERVATKQI IITRAPQILV VHFKRFSASK TQSARKIHTP IDFPLHGLRM DDFVFSQPKQ TSNGDGPSSG PQDPTSATEP PFTYDAYGVL RHLGSSMGSG HYISLVRDAS RQCWRKFDDG RATDFIPRDL PFKDRLQNEQ AYIVFYERVP AK
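The sequences above are printed in space-separated blocks of ten residues, so they need to be normalized before any downstream use. The following is a minimal sketch of that step, together with a GC-content calculation of the kind summarized in the "GC content" row. The helper names are hypothetical, and the sample string is just the first two blocks of the gene sequence, not the full record.

```python
def normalize(seq_field: str) -> str:
    """Strip the spaces and line breaks used to group residues into blocks of ten."""
    return "".join(seq_field.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two ten-base blocks of the gene sequence, used as a small sample.
sample = "ATGGCCCCAG ATGCTGCGGC"
clean = normalize(sample)

print(clean)
print(len(clean), "bp")
print(f"GC: {gc_percent(clean):.1f}%")
```

Applying `normalize` to the full "Gene sequence" field and then `gc_percent` to the result would give a figure to compare against the table's rounded value. The same `normalize` helper works for the protein field, where the resulting length can be compared with the reported protein length.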
|