Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_01983 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001307 |
Strand | - |
Start bp | 2403888 |
End bp | 2406826 |
Gene Length | 2939 bp |
Protein Length | 845 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | small nucleolar ribonucleoprotein complex component (Utp5), putative (AFU_orthologue; AFUA_4G10550) |
Protein accession | CBF85945 |
Protein GI | 259487344 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.000779795 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | CCAACGTCTA GGTGTCGCTC CCAGTCGAAG CTAAGCTAGA ACTATTGTTT CGCCAACTGG GGCTCTTGAA TGGTCTTCGT TCTTCATTGT TTCAAGTTTA TCGTGCATAG ACTCGATTGA GGATGGCCAA AAAGTCAAAC CAAAAGCCTG CCTCGAAGAC CTCTTCCGCG GCGGCTCTCG CAGTCGCGGA CTCTACAAAC ACAGGAAATA AGTCATCGAT ACTGAGGGCG GCCTTCGCCC CTTCTGGCTT TCAATTGGCC TTGTTCGCAT CCGTGATCCA GGGTCTTGAA GGTCAAAATC TTCGCATTCA CGATACAAAC ACCGGCCGGT TACAATGCGA GCATGTTTTG GGCCCGAAGG AACTAGTAAC GTCGTTGGAC TGGGGTCATT ACTATGGCCG ACGAGATCAG TCAAAGAGGA AGAGAAAGCG CCCTTCCGAC GTTAATGGGA CAGCCGAACT CGACCAAGGC GATGTAGTGG TTGCCTTCGG TACCAACGCG TCTGATATCC GCATGTTCTC GCCTGCCGAA GATAAGATTG TCGGAACTCT TGCCGGTGGG CATACAGGAG GAGTAAAGGA CTTCAAGTTT ACAGCAGATA GGCCTCAGGA AGGTTGGAGT ATCGGTGGGG ACAACAAGCT GGTACAGTGG GATCTTGTCA CCGGCCAAAG GACAAGGTAA GGGTTGCCTG ATATGACGAG CTGCTATTAA GCTAACTGAA ATAAAAGAGT GATCAGCCTT TCTACTACCT CTGCATTCAC TACTCTGTCC CGCCCGCTCG CCTCTAACCC GCCTGTCATC TGTGCATCTC AAACACCCCA CATTGTGAAT CTCGAAGATG AATCTCCTAT CAAGTTTCCT GCGATGCGAA ACTCAATAAA GACAATCATC ACATCCTCGA CAAGCTCCAT CTCTGATGGG CTGTTCCTAG CATCGGATAA TGATCGGTAT ATAAACGTCT TCGACCCGAA AAGCGGACAG CTTACGATGA ACCTTGTGGC TGAAAAAGAA GTAACTTCAC TATCGATATA TAAGACGCAG GGTACGGAAG CCAAACTAGC GCTGGAGAAA CAAGTTCTTG CGGCTGTTAC TCAAGATGGT ACTATTGAAC TCTTTGCGCG GCCATTCGTC CGACCTCAGG GCCTTGAAGG ATCAAAAGGT TCCAGTCTGA AGGCGCGATC TATGCAAATG ACGAGGCGAG CGGAGGCCTC GCTCAGAATA ATTAAGGCCT CTGAGTCGGA CGATTTAGTT CCTGTCGTTG CCGTGTCGTT CCAGGGTCCT GACCTGCTCG TTGCATGGGC ACAGGGAGGT ATCATTCCAT TATTCGAGCG AGTAAGGTGG CTTAACGAGG AAACCGACGA GCTGGCTTTT ACGGGCGTGA AGACTATTTC GAAGACCAAA TCTAGCTCCA TCCTTCAATC AGCGACGACT AACGGAATGA AGAATGCAAA CGAAAGCCAC GTCAATGAGA AGAAGATAGT CGTCGAGCAA GGTGATCTTG CTGATGACGA CGTCAACATG GAGGACTCCA AACAAGATGC GGTTTCTGAA GATGAGAGCG AAGTGGACTC CGAAAACGAT GACGGATTCA AGCAACAGAG GGAACCAGCT GCGCAGGACG AGGAAAAGGC CGGGAGCGAC GTGGAAATGC AGAACGCTGC AGAATCAGGT GCGGAAAATG AGGACGAAGA TGACGAAGAA GAGACGGGTG CTGAGCCATC TTTCGGAGAA CTCATGAGAG CACACGCTGC CGAAGAAATC GACGTCGAAG CCGAACTAGA AGACGATGTG CACACGCGGT CCCTCATCCC CGGAAAGCCT ATCACAACAG TCCAGCAAAT CCCCTCCGGA GTCTCCCTCT CTACCGTCCT CTCACAGTCC CTCAAGACAA ATGACAACGA CATGTTGGAA GCCTGCTTTC ACACGGGCGA CTCCGGCACC ATCCGCACCA CCATCCAACG TCTCGATTCT CCCCTGGCTG CAACCCTTTT GCAAAGACTT GCCGAGCGCC TCTCCGCCCG CCCTGGTCGA TATGGCCACT TGCTTGTCTG GGTACAATGG ACTTGTATCG CGCACGGAGG AGCCTTGGCA GGAAACAAGG ATCTCCTCAA GCAGATGTCC ACTCTATTCA AAGTAATGGA CCAGCGCTCC TCAACCCTCT CCTCTCTTCT CTTGCTCAAG GGCAAACTAG ACATGTTAGA CGCCCAGCTT GGCCTCCGCC AGTCGCTCCG CGAAAACGCA GACCACATGG ACAGCGAAGA CGAGGAGAAT GTAATCTACG TTGAAGGCTA TGATGAGGAC GAGGTTGAAG ATAGCGACGC TGAGGCAACA AAGAACATTG ACACGCCCCG GACAAAGGCA ATTCGCGACC AGACCGATAT CTCCATGATC GACGAAGACG AAGATGCAGG AAGCGAGGAC GACGAAGAAG ACGAGGAAGA AGAAGACGAA GAAGGCCCTA GCGCCATTTT CGACGTCGAA GCTGAAGAGT CCGCCGGCTC TTCAGATGCC GAGGAATCCC CTAATGACGA CGAGGACGAT GATGAAGATG AGGATGCGGA CAGCGCCGGG TCCATTGCGG ATTTCATCGC GGATACGGAG GACGACGACT CAGAGGTAGA TAATCTCTCG CGGCCTCCAC CGTCAAAGAA GGCCAGGTTA AGCCAAGGAG GTAGAAAAGG GAAGAAACAG GCTGGGTCAG GAAGGAAATA GACTACTTTC AATCCCTAGA TTTATACCCT GTTCTATTTT TCTCTTGCGG AGAGCACTTT GGATCGTTGT TGGCTTCCTC CGCCGTTCCT GTTTTTCATT ATCCGATGTG CGCACAAAAA AGTATAGGGA GAAGGCATCT ACATTTTTTT TTCTATTTTC TCTCTGTCTC CATCCAGTTG TTCTATATTC AAATTCTCTA AATCTGGAAC GGGTGGGAAA GATCCTGAT
|
Protein sequence | MAKKSNQKPA SKTSSAAALA VADSTNTGNK SSILRAAFAP SGFQLALFAS VIQGLEGQNL RIHDTNTGRL QCEHVLGPKE LVTSLDWGHY YGRRDQSKRK RKRPSDVNGT AELDQGDVVV AFGTNASDIR MFSPAEDKIV GTLAGGHTGG VKDFKFTADR PQEGWSIGGD NKLVQWDLVT GQRTRVISLS TTSAFTTLSR PLASNPPVIC ASQTPHIVNL EDESPIKFPA MRNSIKTIIT SSTSSISDGL FLASDNDRYI NVFDPKSGQL TMNLVAEKEV TSLSIYKTQG TEAKLALEKQ VLAAVTQDGT IELFARPFVR PQGLEGSKGS SLKARSMQMT RRAEASLRII KASESDDLVP VVAVSFQGPD LLVAWAQGGI IPLFERVRWL NEETDELAFT GVKTISKTKS SSILQSATTN GMKNANESHV NEKKIVVEQG DLADDDVNME DSKQDAVSED ESEVDSENDD GFKQQREPAA QDEEKAGSDV EMQNAAESGA ENEDEDDEEE TGAEPSFGEL MRAHAAEEID VEAELEDDVH TRSLIPGKPI TTVQQIPSGV SLSTVLSQSL KTNDNDMLEA CFHTGDSGTI RTTIQRLDSP LAATLLQRLA ERLSARPGRY GHLLVWVQWT CIAHGGALAG NKDLLKQMST LFKVMDQRSS TLSSLLLLKG KLDMLDAQLG LRQSLRENAD HMDSEDEENV IYVEGYDEDE VEDSDAEATK NIDTPRTKAI RDQTDISMID EDEDAGSEDD EEDEEEEDEE GPSAIFDVEA EESAGSSDAE ESPNDDEDDD EDEDADSAGS IADFIADTED DDSEVDNLSR PPPSKKARLS QGGRKGKKQA GSGRK
|
| |