Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_02689 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001306 |
Strand | - |
Start bp | 3086104 |
End bp | 3089234 |
Gene Length | 3131 bp |
Protein Length | 965 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | nuclear pore complex subunit Nup133, putative (AFU_orthologue; AFUA_5G14040) |
Protein accession | CBF84207 |
Protein GI | 259486401 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGTATCCTCT CATCCTCTTC TTAACCGCCC ATTCTTCATG GAGATATCTC AAAATTGGGC GATTCTTGCG ACCAGCTTTG TTGTATTTCT CTTTCGAAAG CTTTCAGTTG CCGCGAGTTT TCTCGTGTTG TTAACCGCGC TTTCATTTCT CGACTCTTTG TCGACACCGT CGCAGATACC CCTCCTCCTT CAATTCATTT TATTTCAACC TCAGTCTCGC ACCATCGCAC GCCATGGATT TCGTTACCAA TGGCCAGCCC TCTCCTAAGG ATACAGTTAT GAACGACGCG TCTCCGTACG ACGCTCTTCA TCATCAACGA ACTAATCGGC AATTCACTAG CGAAGACCTG GCTGATCCTA ACTACGTCCC TAACCCCGTC CCATTCGGCG CGAAGAAGAT AACGCCTGAC GAAACTCATC GCCCTTTCTA CCATATACAT TCGAGAAAGC CTAGTCAAAG GCCTGATATG GTCCTGAAGC ATCAGAAAAC AGCTTACAGC ACGACCCGAA GGCGTTATGA TTCACTCTCC CCACCCCAAT TCCAATTCAC ACGAGGACGA ACTGGATCCC AACATGAACA GGCACCACTT GTGTTGCCCC GCGTCAATGA GATTCAGCCC TACCGATTGA CCGCAACGAC CGCTTCTCGA TTGAACGCCA CAACCTACAG CAGTAGTCTC ATGAATCCGA TGCGCTCGTC TGGACACGAC TCGATACTTG GGGCTGGTCT TCGTGGTCGT GACCGTCCGC TCTCGCTGTT CGGATCCGAG GCTTCTCGCC AAGCTGCGAT GATCCGACCT AGGAAGCGCG ACCGCGAAGG CAATATCCTC GACACTACGG GCAGCATATT TGTCCGCAAC AACAATGCCA ATGATGGGCG CAACAATGAC CAACAGCATA TCGCGTCAGC CGACGCTGAT AGTCCCGTTT TGAAATATTG CCGTGGTGGC ACAGAGTCAT CATTTAGTGC TGCTGTCGAT AATATTAAGA AAGTAAATGG GGATCCAGTT CAACCTTTGG CGCAACGCCC TTCCTTCCAC TGGCAGCGCG CTTTGCCTTC AAAGACGACC GACCCCGCAA CTCCTGGTAA GCAAACAGGA AGCTCGACTG CTACAGGCCG TATTCCTGGC TGCTGGCCAT CGGCATCGAA ACATGGTTCG ATGCCGCTAC TTCCTGAGCC ACAGCAGACC GCTCAGACGC AGCACCAAAC CGAGTCTCCT GTAACTTCCC AAGAGATATG TGGCCAGGTC GACTTGCCCA GCAATGCAAA TCCGGAGCCG GCCACCGTTA ACCCTGACCA AGCTATTCTG GACGAAACTC CCTCTTGGAC TCAACACTAT TCTGGCGTTT ATGGCACCCT ACGGATAGCT TACTCTTTCC AGTGTGGTAT GGTGCAGACT GTTGCAAATG CATTCCATGT TGCACTCGCT GCTGCCAGCA CTATAACCCA TCAAACGCAA CAGGCGCTGG GAACTGTAAC ACAGCGGGTC ATGGCCATGT ACAGACAACG TCGCTTTGAT CGTGCGCGTT CGCGTGCTCG TGCAAGCCCT GCCGCTCCGG CTCGGCAACC TCCAACTACA ATAGCCTCTC CTGCTCGTGT GAACGTTGCG ACACTGCCGC CTGGGCAGCA GGAGCGTGTG CGAATCAACC AGTGGCGTAG ACGTCGAGGA TTTCCTGTCA ATGAAGAACT CCCATTCCCG AATATGACAA CGCCAATGGG AGCTCTATTC TATGATCCGC AAATAATCAC AACATCTTCG CCTAGCGTGC AGCGCAGTCT TGACCTCGTG GTAGATAATG CCTCCGGGGC TACTTTGCAC AGGCATCCCG CGCAGCGGCG AACATCTGTG AACGACCGCG ATGACAAAAA CCGGCCTCAA GCACCCAAAG CTGGGATTCT CAAAAAGAAT TCTCTGGTTC CCACCATGAG CCCCGCAACC CGACGTCGCC TCCTTCCTGG ATACATCACA CCGCGTGACC GTCGGCTTGG ACTACAGCAC AGGGTCCGTT TCCGGTCTCC CATAGTTCAG CCTTCGCCTT TGCGCCTTCG CCAGTGGGCT AATTCATCCG CCGAGTCGGG ACCCGGGCTT GATGAACTGT TGCGCACACA GCTGAACGGA GCCGATGCGC CGTCAACCGT GGCTAGCGAT CAGCGTACTG GACCCGATGA ACAGCTATAC GCGCAATTGG CTGCTTCTCT CGAGCCGTAT GTGGATCCTT GGGCGCAGCC GCGCGACTTC ACCAAAGGTA CTCCTAGGTC TGCTGTCAAA CTCGTCAAAC CCAAGATAGA GCCAGTCCCC GACGGCCGGT CCGAGTCGAT TTATGCAAAG GAATATGAAG AGATGCAAAA AATGAAGAAA CTGGAGTATG GGCCAGTTGG ACGACAGGTC CCTGAGGGTG TTGCCGTGCG TCCTCTCCCG GACAATTGGA AAGCTCGTCT CAAAGACCTC AAGAAGAAGG CGCATTGGGT AGAGGTTGCG ACCACCCCGT CCGGCGAGTC TCTGACTCGG GACGACATCG ACACATGCCT TACTCCAATG GCTTGGCTGA ACGACGAGGT GATCAACTCG TACCTTGGTC TTATCGTTAA CCACATGCGC CACGAGAATG GAAACGCGGG TCGTCACGAC AAGCCACGCT ACCATGCTTT TAATACATTC TTCTTCTCGA ATCTCCGAGA CAAGGGCTAC GACTCTGTCA AACGCTGGGC TAAAAGAGCT AAGATTGGCG GCAAAGATCT TCTGGATGTG GATACTGTTT TCATTCCGGT GCACAACAAG GCTCACTGGA CGTTGATTGT TGTAAAGCCA TCAGCTAGGA CAATCGAGCA CTTCGACTCA CTTGGCTCTC TCTCGCGCCG TCATGTTGAA ACCGTCAAGG GCTGGCTTCG AGGAGAACTC GGTGACTTGT ACGACGACGA TGAGTGGGAG GTTCTTCCAT CGGAGTCCCC ACAACAAGAC AACGGCAGCG ACTGCGGCGT GTTCCTTCTG ACAACAGCAA AAGCCGTAGC GCTTAATATT GAACCACTCG CTTATGGTGC CCGTGATACC CCTTTGCTTC GCCAAAAGAT AGTAGCCGAA CTTATTAACG GAGGGTTTGA AGGGGATTTT ACCCCTGACG GTGCGCTCTG A
|
Protein sequence | MDFVTNGQPS PKDTVMNDAS PYDALHHQRT NRQFTSEDLA DPNYVPNPVP FGAKKITPDE THRPFYHIHS RKPSQRPDMV LKHQKTAYST TRRRYDSLSP PQFQFTRGRT GSQHEQAPLV LPRVNEIQPY RLTATTASRL NATTYSSSLM NPMRSSGHDS ILGAGLRGRD RPLSLFGSEA SRQAAMIRPR KRDREGNILD TTGSIFVRNN NANDGRNNDQ QHIASADADS PVLKYCRGGT ESSFSAAVDN IKKVNGDPVQ PLAQRPSFHW QRALPSKTTD PATPGKQTGS STATGRIPGC WPSASKHGSM PLLPEPQQTA QTQHQTESPV TSQEICGQVD LPSNANPEPA TVNPDQAILD ETPSWTQHYS GVYGTLRIAY SFQCGMVQTV ANAFHVALAA ASTITHQTQQ ALGTVTQRVM AMYRQRRFDR ARSRARASPA APARQPPTTI ASPARVNVAT LPPGQQERVR INQWRRRRGF PVNEELPFPN MTTPMGALFY DPQIITTSSP SVQRSLDLVV DNASGATLHR HPAQRRTSVN DRDDKNRPQA PKAGILKKNS LVPTMSPATR RRLLPGYITP RDRRLGLQHR VRFRSPIVQP SPLRLRQWAN SSAESGPGLD ELLRTQLNGA DAPSTVASDQ RTGPDEQLYA QLAASLEPYV DPWAQPRDFT KGTPRSAVKL VKPKIEPVPD GRSESIYAKE YEEMQKMKKL EYGPVGRQVP EGVAVRPLPD NWKARLKDLK KKAHWVEVAT TPSGESLTRD DIDTCLTPMA WLNDEVINSY LGLIVNHMRH ENGNAGRHDK PRYHAFNTFF FSNLRDKGYD SVKRWAKRAK IGGKDLLDVD TVFIPVHNKA HWTLIVVKPS ARTIEHFDSL GSLSRRHVET VKGWLRGELG DLYDDDEWEV LPSESPQQDN GSDCGVFLLT TAKAVALNIE PLAYGARDTP LLRQKIVAEL INGGFEGDFT PDGAL
|
| |