Gene ANIA_02072 details


Gene Information

Locus tag: ANIA_02072
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001307
Strand:
Start bp: 2710988
End bp: 2714387
Gene Length: 3400 bp
Protein Length: 1022 aa
Translation table:
GC content: 52%
IMG OID:
Product: ubiquitin C-terminal hydrolase, putative (AFU_orthologue; AFUA_2G04720)
Protein accession: CBF86122
Protein GI: 259487441
COG category:
COG ID:
TIGRFAM ID:
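
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon with no untranslated regions; neither assumption is stated in the record. The function and variable names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 2710988
END_BP = 2714387
PROTEIN_AA = 1022

gene_length = feature_length(START_BP, END_BP)

# Nucleotides needed to encode the protein plus one stop codon.
coding_nt = (PROTEIN_AA + 1) * 3

# Remainder of the gene span; under the assumptions above this would be
# intronic sequence, which fits the "Is gene spliced: Yes" entry.
non_coding_nt = gene_length - coding_nt

print(gene_length, coding_nt, non_coding_nt)  # 3400 3069 331
```

The computed span of 3400 bp matches the listed gene length. The 331 bp that do not code for protein are what one would expect to be removed by splicing if the assumptions hold.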


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.701074
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCCAG ATGCTGCGGC GCTTCCTTCA CCGCCGAGTT TCGCTCCGGA GGATGCTCCC 
AGCCGGCACA ACCTAGGTGG CACGGCTCCG GGTGGGGGGC ATCAGGATGG GCTCTCGCCG
CGCTTTCCAA ATATAAAAGC CCTTCAGGAC GAGGCAGCAG CATTAGATGT GAACGAATCC
ACGACGGTAA GTGGCTTGTG GCTTGTGCTA TGCTGAAAAC GACCGTCCTG ACCATTAGTT
TGGAATATAG ATCACCGATT TGTTGGCCAC GGCTCAAGAC GCAATTACGA AGTTTCGAGG
GTTCGCAGAC AGTGATCAAA TCGATAGGGC ATACGTACAA TATGTTCGCG CTTCGGAAAT
CACCATAAAT CTTATCCCTA ATCACCCTGA CTATCGGTCC GCCTCCACGC AGATCCCTGG
TTGGCATAAG CAGTTTTCGG ACTTGATGAT GGCAAGTCCT TCTCGGCGGA CCTGTTTTGC
GATCGCGCTT CTCTTCGCTA ACTATATCTT TCGCTAGGCC GTACGAACCA AGCAGGGGAC
CGCGGACTCA ATCAAGCAAC AAATTGTGGA GAATAATCTC AAGAGTATGG CCAAGCCGGC
GGGCCAATCC TCACCAATCT CTCGACGACC AGTATCGCAG ATATTACCTG CTGGAACATA
TTTACCTTCA AGTCAACGCG GCCGCCAGAG CAATGCGGGA GACTCGTCAA CGCGAATGCC
TAGTCCTTCG CAGTTCCAGT CCTTCGGCTT GCAAGACCAA CCCAATAGAT ATTCGGCACC
TCCAGATGCA CTTGCGCAGC GCCTTGCCAA GCTGAAGACA ACATCACCTA CAGCAAATGG
CGTCAGTTCA GGTTCAGCCA GTTGGGGTGA CAATGAACTA GGACACGTTG AAAACCGATC
TCCCCGGCCA TCCTCTTATA TCTCTCATGG GCCAACACAT GGGTCAACAC TAAGTTCACC
TTCTAGACGA CCACTAGGCC CTCGAGACAT GGGAACCGCT CAGAGCGGCC CCTCTATACC
CCCCAAGATT CCCCTCAACA CATCACTTCC GCGGGCACCG GATCCGACAT ATAGCCCAAT
ATACACCGTA CCTTCTCAGC CTCCGTCTAA CCCGCCGAGA ACTTCAACAG AGAACTTTCG
TCCGAATAAT GCACGATATT CCCATCTTGC AAATTCTCCT CATGGCAGCC AAAGTCCTAA
TGGCTCCGAT GATAATCCTT ATAGGTCTCG AACACCGAAC GGCGTCCATG CGCTGAAGGG
CGCCAGTAGT AGTACGCTTG ATTTACCGCA CAGATCTACT ATCAGCGCCC AGGAGTTGCT
GGACTACCTC CGTCGTTATA GCATATTGCT GATCGACGTC CGCTCCCGAG AGGATTTTGA
CAGCGGCCAT GTATATGCGA AATCGATCAT ATGCATAGAG CCCGTGGCAT TAAAGGAGAA
CGTTTCCGCA GAAGAGCTTG AGGAGCGGTT GGTGGTGTCT CCTGAGCATG AACAGTCGTT
GTTCGAGAGG CGAAATGAGT TTGATATGGT GGTCTACTAC GATCAGAGTA CCAGTTCGAA
CAGCTACCTT GCCGGCCCAC CGGTGGGGAC CACAGCGCCT CACCTGAGGG CGTTGTATGA
TACACTGTAC GAGTTCAATG CCTACAAACC TCTGAAGGAT GGCCGGCCAC CTATGCTTCT
AGCGGGAGGA CTTGACGCTT GGATTGATCT CTTAGGGCAA CAGTCGCTGG CCACATCATC
TACCGCTGCT GTAATAGCGT CTTTGCAAAC CAGGCGGCCT GTTGCGAGGC CGGGACGCCC
TCTCGGTAGA GTCCCGACAA TGGCTAGCGC TAACTCCAGT CTGGAGATCA GAAAGCGGAG
GCTTCGAAAA CTTTCACAGT TGAACCCAGA TGAGCTTACG GAGTGGTTGG AGAAGTCAAA
GACAGAAGAG ATTGACGCAA GTGCCTATCT TGGAGAGGAT AACCTCTCAG AGGAGCCCGA
GCCGGAGCAA CAAGCAGGAA CTCCCATCTC CCCCTTCATT CGTTCGTACG AGGATTTTCT
GCGCCGTTTC CCGGAACCTC ATAATATTCA GCAATCTATG ATAACAGCCC AGCCCAGACC
TTTGACCCCC GACTATGCGT CTCATGTGCC TATTGCTCCT TCAAGACCTC CACCTGCTGT
TCCGCGTCCC AGCTACAGCG GTGTGTCGGA TGGCCGGCAG ATACAAGCGC CGCTGCAGAG
ACAAAATTCA GCAACCAAGC ACGCGCTTTA CACGTCGAAC TCATTGCTTA ATCGTCTCAA
ACTTCCGCGC ACCGGATTGG CAAATTTCGG AGTTACCTGT TATATGAACT CAACAATCCA
GTGTTTAAGC GCAACAGTGC TCATGAGCAA GTTTTTCATT GATGACCGGT TCAGGTTTTA
TGTCCAGAAG AATTGGAAGG GGTCTCAGGG TGTTATGCCT GGGCTATACG CCAATCTCAT
CAGATCACTG TGGAAGAATG ATGTGCAAGT TATCATGCCG ACGTCATTTC GAAATTTCTG
CGGACGGCTG AATCAGGAAT GGGCCATAGA CAGGCAACAG GATGCTAAGG AGTTCTTTGA
TTTTGTTGTC GATTGCCTGC ATGAAGATCT CAACGTAAAT TGGCAGCGGA CACAGCTTAG
GCCCCTGACT TTCGAGGAGG AAATGCAGCG GGAAAGGATG CCAGTGGCCA AGGTCTCCAA
GATAGAATGG GATCGGTATT GTCACCGAGA GGAATCATTT ATCTCATCGT TGTTTGCGGG
GCAACATGCA AGCCGGCTTC GGTGTACAAC TTGCAAGCGA ACATCAACCA CATATGAAGC
TTTCTACAGT ATCAGTGTGG AGATCCCGGC ATCCGGAAAG GGCGATATCT ACCAATGCCT
CCGTAGCTAC TGCCAGGAGG AAATGTTAAG CGGCGACGAG GTCTGGAAGT GTCCCTATTG
CAAATGTGAG CGTGTCGCGA CCAAACAAAT CATTATCACT CGCGCACCTC AGATCCTGGT
GGTTCATTTC AAACGCTTCT CTGCCTCCAA GACACAGAGT GCCCGTAAAA TCCACACCCC
CATCGACTTT CCCCTTCACG GTCTTCGCAT GGACGACTTC GTCTTCTCGC AGCCTAAGCA
GACATCCAAC GGCGATGGCC CCAGTTCAGG TCCACAGGAT CCAACTTCCG CCACAGAACC
ACCCTTTACA TACGACGCAT ACGGTGTTTT ACGCCATTTG GGCTCGTCCA TGGGGAGCGG
GCATTATATA TCACTAGTGC GTGATGCGTC ACGTCAATGC TGGCGCAAAT TTGATGACGG
CCGCGCAACA GATTTTATAC CGCGTGATCT GCCTTTCAAA GACCGGTTGC AGAACGAACA
AGCGTATATT GTGTTCTACG AGCGCGTTCC AGCGAAATAG
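
The sequence above is shown in 10-base blocks for readability, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of joining the blocks and computing a GC fraction. The helper names `join_blocks` and `gc_fraction` are illustrative, and the example uses only the first two blocks of the gene sequence, so its result is not the whole-gene GC content listed in the record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, as displayed above.
first_blocks = "ATGGCCCCAG ATGCTGCGGC"

seq = join_blocks(first_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.7
```

Pasting the full blocked sequence into `join_blocks` should give a string whose length can be compared with the listed gene length.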
 
Protein sequence
MAPDAAALPS PPSFAPEDAP SRHNLGGTAP GGGHQDGLSP RFPNIKALQD EAAALDVNES 
TTAVRTKQGT ADSIKQQIVE NNLKSMAKPA GQSSPISRRP VSQILPAGTY LPSSQRGRQS
NAGDSSTRMP SPSQFQSFGL QDQPNRYSAP PDALAQRLAK LKTTSPTANG VSSGSASWGD
NELGHVENRS PRPSSYISHG PTHGSTLSSP SRRPLGPRDM GTAQSGPSIP PKIPLNTSLP
RAPDPTYSPI YTVPSQPPSN PPRTSTENFR PNNARYSHLA NSPHGSQSPN GSDDNPYRSR
TPNGVHALKG ASSSTLDLPH RSTISAQELL DYLRRYSILL IDVRSREDFD SGHVYAKSII
CIEPVALKEN VSAEELEERL VVSPEHEQSL FERRNEFDMV VYYDQSTSSN SYLAGPPVGT
TAPHLRALYD TLYEFNAYKP LKDGRPPMLL AGGLDAWIDL LGQQSLATSS TAAVIASLQT
RRPVARPGRP LGRVPTMASA NSSLEIRKRR LRKLSQLNPD ELTEWLEKSK TEEIDASAYL
GEDNLSEEPE PEQQAGTPIS PFIRSYEDFL RRFPEPHNIQ QSMITAQPRP LTPDYASHVP
IAPSRPPPAV PRPSYSGVSD GRQIQAPLQR QNSATKHALY TSNSLLNRLK LPRTGLANFG
VTCYMNSTIQ CLSATVLMSK FFIDDRFRFY VQKNWKGSQG VMPGLYANLI RSLWKNDVQV
IMPTSFRNFC GRLNQEWAID RQQDAKEFFD FVVDCLHEDL NVNWQRTQLR PLTFEEEMQR
ERMPVAKVSK IEWDRYCHRE ESFISSLFAG QHASRLRCTT CKRTSTTYEA FYSISVEIPA
SGKGDIYQCL RSYCQEEMLS GDEVWKCPYC KCERVATKQI IITRAPQILV VHFKRFSASK
TQSARKIHTP IDFPLHGLRM DDFVFSQPKQ TSNGDGPSSG PQDPTSATEP PFTYDAYGVL
RHLGSSMGSG HYISLVRDAS RQCWRKFDDG RATDFIPRDL PFKDRLQNEQ AYIVFYERVP
AK
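
Because the gene is spliced, translating the full genomic sequence directly would not reproduce the protein above; the introns would have to be removed first. The start of the gene can still be checked against the start of the protein. The sketch below assumes the standard genetic code, since the "Translation table" field of the record is empty, and translates only the first 30 bases of the gene sequence. The `translate` helper is illustrative.

```python
from itertools import product

# Standard genetic code (an assumption here), listed in TCAG base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence codon by codon from its first base."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence (30 bases = 10 codons).
gene_start = "ATGGCCCCAG ATGCTGCGGC GCTTCCTTCA"

print(translate(gene_start))  # MAPDAAALPS
```

The result matches the first block of the protein sequence, `MAPDAAALPS`.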