Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_07704 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001304 |
Strand | - |
Start bp | 2378090 |
End bp | 2380955 |
Gene Length | 2866 bp |
Protein Length | 811 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | polyubiquitin binding protein (Doa1/Ufd3), putative (AFU_orthologue; AFUA_5G08370) |
Protein accession | CBF79941 |
Protein GI | 259484050 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00597659 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGAA GCTTCGTAGT AGGCGGCAAA AGCTAAGCCA ATTTGGCTCA GTACCTTCGA AGCAATTACT CACTTCATCA TCACAGACCG ACTGCATCTT GCTGTTGATT ACTCATCTTT CACGAGGAGT ACTTCTAGCA GCTTCTGGGC CTCAGGCTTA CCGCCCACAT GACGCCCCTC TTTCCTTGGA GATCGCCGGG GCTGCCACGA GCCAGGTCCC CGTTGGAGCT GGGGTTACTA TTGGACTCAT CACCTACCTT CACCTCACTC TATCCTTTGG GACGTTCTTA CCTTGACCCC TCTATTTCAG TGCCACGTCT CCGACTCAAT CACAACTCTC TTGTTCCGCA GTCTGTCCTT TGTTTATTGT ATGCCTGAGT TCAAGATCTC CGCTGCTTTA GAGGGCCACG GCGATGATGT AAGACAGCTC CCCGCTCCTT GTGTTGACAA CGCAGCCTCC ACGAACCCAA GGCATTCGGC CACTAACCTT GTGTTATCTT CCGTCAGGTC CGTGCCGTGG CCTTTCCCAA CTCTAAAGCT GTTTTTTCCG CTTCGCGAGA TGCGACCGTC CGGCTCTGGA AGCTTGTATC GAGCCCGCCT CCAACATTCG ACTACACCAT CATCTGTCAC GGTTCCGCAT TCATCAATGC GCTGGCATAT TACCCTCCCA CCCCAGATTT CCCCGAAGGA TTAGTGTTTT CTGGAGGTCA AGACACTATC ATAGAGGCAA GGCAGCCCGG TAAGACCTCG AATGACAATG CGGATGCTAT GCTACTGGGC CATGCGCATA CGGTCTGTTC ATTGGACGTC TGCCCGGAAG GGGAATGGAT TGTCAGTGGG AGCTGGGATT CCACTGCCCG TCTATGGAGG ATTGGAAAAT GGGAGTCTGA AGTTGTTCTG GAAGACCATC AAGGGAGCGT TTGGGCAGTG TTGGCCTACG ATAAGAACAC TATAATCACA GGTAACTCCG CCTTGCACGG AGCACGGCAC AGGTGCTAAT CAGAACCCAG GTTGCGCGGA CAATATAATC CGAATATTCA ATTCGTCTGG GAAGCTGCTC AAACGCATTA AAGATTCTCG GGACGTTGTT CGGGCTCTGT GCAAACTGCC GCCTACACAT CCCACTGGTG CCAACTTCGT CTCTGCAAGC AACGACGGGG TGATCCGCCT ATTCACGCTG CAAGGAGATC TTGTTGGTGA ACTTCACGGT CACGAGAGCT TCATCTATTC ACTGGCCGTT TTACCAACGG GTGAGCTGGT CAGCTCCGGG GAGGATCGAA CAGTACGAAT CTGGAACGAA ACGCAGTGCG TACAAACTAT CACCCACCCT GCGATTTCTG TCTGGGGTGT CGCTGTCTGT CCAGAGAACG GCGATATCGT TACAGGGGCA AGCGATAGAG TTACGCGAGT TTTCACTCGG GCTCCCGAAA GACAAGCGAG CGCGGAGGTG CTGCAACAGT TTGAAACGGC TGTCAGGGAG TCGGCAATCC CCGCGCAGCA GGTCGGAAAA ATTAACAAAG AGAAGTTGCC TGGCCCCGAG TTCTTGCAGC AGAAATCCGG CACCAAGGAC GGTCAAGTGC AGATGATCCG CGAAGCGAAC GGCAGTGTTA CCGCTCATAC ATGGTCAGCT GCGTTGGGGA GATGGGAATC GGTCGGCACT GTTGTGGATT CTGCCGGTAG CAGCGGAAGG AAGACTGAAT ATCTGGGTCA GGATTATGAC TTTGTTTTTG ATGTTGACGT TGAGGACGGC AAGCCCCCTC TTAAACTACC CTACAATCTT TCGCAGAATC CATATGAAGC AGCGACCAAG TTCATCGGTG ACAACGAGTT ACCAATGAGT TACCTTGATC AAGTCGCTCA GTTCATCGTT CAAAACACCC AAGGCGCGAC TATCGGACAG CCCAGCCAGG AGACTGCAGG TGGTCCCGAT CCATGGGGTC AAGATAGGCG CTATCGACCT GGGGATGCCC CTGCTCAGTC AACAGCTATC CCTGAGTCGC GGCCAAAAGT ACTTCCACAA AAGACCTATC TATCAATCAA GTCTGCCAAT CTCAAAGTGA TCTCAAAGAA GTTGAACGAG CTCAATGGCA AGCTTGTCTC GGAAGGTTCA AAGGATCTGT CTCTGAGTCC TTCGGAGTTG GAGACGATAG TATCGCTGTG CAATGAGTTG GAAGCCTCGA ACACTTTGAA AGGCCCCTCG GCCGTGGAAG CGGTCGTGAT TTTGCTCTTT AAAGTCGCGA CAGTGTGGCC AGCAGCGAAT AGGCTGCCTG GTCTTGACCT CCTGCGCTTA TTTGCTGCCG CCACTCCTGT AACAGCCACG GCAGACTATA ATGGTAAAGA CTTGGTCTCC GGAATTATCG AGAGCGGAGT ATTCGACGCC CCAGTCAATG TCAACAACGC CATGCTTTCA GTCCGAATGT TCGCTAACCT CTTCGAGACC GATGCTGGCC GCCGTCTCAT CATTGACCGC TTCGATCAAG TCATCGCCGC TATTAGGACA TGCCTAACAA ACAGTGGGTC TTCGGTGAAC CGCAACCTCA CGATTGCAGT GGCGACTCTC TATATCAACA TCGCAGTATT TTCGACGTCG GAAGCGAGGA ATCTCAGCAT CGAGTCGAAC CAACGGGGTC TGATACTTCT AGAAGAACTT ACGGGGATGC TCCGCAATGA AAAGGACTCG GAGGCAGTAT ATCGCAGTCT TGTTGCTTTG GGGACTTTGG TCAAGGAACT GGTGAGCGAA GTGAAAGCGG CTGCGAAAGA AGTGTACGAT CTTGGTGCCA TTCTGCAGGC TATATCGAGC TCTAATCTCG GAAAGGAGCC AAGAATCAAA GGTATTGTCG CAGAGATTAA GGACTCTCTG CCGTGA
|
Protein sequence | MARSFVCHVS DSITTLLFRS LSFVYCMPEF KISAALEGHG DDVRAVAFPN SKAVFSASRD ATVRLWKLVS SPPPTFDYTI ICHGSAFINA LAYYPPTPDF PEGLVFSGGQ DTIIEARQPG KTSNDNADAM LLGHAHTVCS LDVCPEGEWI VSGSWDSTAR LWRIGKWESE VVLEDHQGSV WAVLAYDKNT IITGCADNII RIFNSSGKLL KRIKDSRDVV RALCKLPPTH PTGANFVSAS NDGVIRLFTL QGDLVGELHG HESFIYSLAV LPTGELVSSG EDRTVRIWNE TQCVQTITHP AISVWGVAVC PENGDIVTGA SDRVTRVFTR APERQASAEV LQQFETAVRE SAIPAQQVGK INKEKLPGPE FLQQKSGTKD GQVQMIREAN GSVTAHTWSA ALGRWESVGT VVDSAGSSGR KTEYLGQDYD FVFDVDVEDG KPPLKLPYNL SQNPYEAATK FIGDNELPMS YLDQVAQFIV QNTQGATIGQ PSQETAGGPD PWGQDRRYRP GDAPAQSTAI PESRPKVLPQ KTYLSIKSAN LKVISKKLNE LNGKLVSEGS KDLSLSPSEL ETIVSLCNEL EASNTLKGPS AVEAVVILLF KVATVWPAAN RLPGLDLLRL FAAATPVTAT ADYNGKDLVS GIIESGVFDA PVNVNNAMLS VRMFANLFET DAGRRLIIDR FDQVIAAIRT CLTNSGSSVN RNLTIAVATL YINIAVFSTS EARNLSIESN QRGLILLEEL TGMLRNEKDS EAVYRSLVAL GTLVKELVSE VKAAAKEVYD LGAILQAISS SNLGKEPRIK GIVAEIKDSL P
|
| |