Gene Information |
Locus tag | ANIA_03628 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001302 |
Strand | + |
Start bp | 3488331 |
End bp | 3491231 |
Gene Length | 2901 bp |
Protein Length | 833 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | N-terminal acetyltransferase catalytic subunit (NAT1), putative (AFU_orthologue; AFUA_4G11910) |
Protein accession | CBF75743 |
Protein GI | 259481843 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
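The Gene Length above follows from the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That reading is an assumption, though it agrees with the listed 2901 bp. The minimal sketch below checks this. It also estimates how much of the gene span is non-coding, assuming the 833 aa protein is encoded by 833 codons plus one stop codon. The function and variable names are illustrative and do not come from the source database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table above.
start_bp = 3488331
end_bp = 3491231
protein_length_aa = 833

gene_length_bp = feature_length(start_bp, end_bp)

# Assumption: coding nucleotides = 3 per residue plus one stop codon.
coding_bp = protein_length_aa * 3 + 3

# Whatever remains of the gene span is not translated. For a spliced gene
# this would include introns and possibly untranslated sequence.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 2901
print(coding_bp)       # 2502
print(non_coding_bp)   # 399
```

The 399 bp remainder is consistent with the record marking the gene as spliced, but the record does not say how that remainder divides between introns and other non-coding sequence.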
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.108078 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.314019 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCAGC AATTGAGCTC GAAAGATGCC TCCTTGTTCC GACAGGTGGT CCGACACTAC GAGAACAAGC AGTACAAAAA GGGTGAGTCG TTGCCAATGC CATGCCGAGT TTACCACGCT GGTCGTCGAT GTTTCATGCG GGCGGACGTA AAAGGGCTGA CTGTCATACT TTGCTTTTCT CGAGGGAATT ACAGGCATCA AAACAGCCGA TCAGGTTCTC CGGAAAAACC CAAACCATGG CGATACGTTA GCGATGAAGG CTCTGATCAT GAGCAATCAA GGTGAACAGC AGGAGGCCTT CGCCTTGGCC AAGGAAGCTC TTAAGAATGA CATGAAGTCG CATATCTGCT GGCACGTGTA CGGCTTGCTC TATCGCGCGG AGAAGAATTA CGAGGAGGCT ATTAAGGCAT ACCGGTTTGC TTTGCGGATC GAACCGGACT CCCAGCCCAT TCAGCGCGAT CTCGCCCTCC TCCAGATGCA GATGCGGGAC TACCAGGGCT ACATACAGAG TAGAAGCACT ATGCTGCAGG CTCGACCCGG CTTCCGACAA AACTGGACCG CTCTCGCTAT CGCACACCAC CTCTCCGGCG ACCTAGAGGA GGCTGAAAAG GTGCTGACGA CATATGAGGA GACGCTGAAG ACCCCACCAC CTCTGTCAGA TATGGAACAT TCCGAGGCGA CACTGTACAA AAACATGATT ATTGCGGAGT CAGGAAATAT CCAGAAGGCG TTGGAACACC TCGAGTCTGT AGGACACCGC TGCTCAGATG TGCTCGCTGT GATGGAGATG AAGGCGGACT ACCTTCTACG CCTGGACAAG AAGGAAGAAG CCGCTGCAGC CTACACTGCT CTTCTGGAAC GCAACTCGGA GAACTCTCTC TACTACGATG GCTTGATTAA GGCCAAGGGT ATCTCCAGCG ACGACCACAA AGCCCTCAAG GCCTTGTATG ACTCCTGGGC GGAGAAGTAC CCCCGCGGTG ATGCGCCTCG CAGAATTCCC TTGGACTTCC TTGAAGGCGA TGATTTTAAG CAGGCTGCCG ATGCCTATCT GCAGCGCATG CTCAAGAAAG GCGTGCCATC GCTCTTTGCG AACATTAAGC TTTTGTACAC CAACTCTAGC AAGCGTGACA CAGTACAAGA GTTGGTCGAA GGTTACGTCT CAAACCCCCC AGCGAACGGT GCGGCTGACG GTTCCGAAAA CACCGAATTC CTCTCCTCCG CATATTACTT CCTCGCACAG CACTACAATT ATCACCTTAG CCGCGATCTA TCCAAGGCTC TCCAGAACGT TGACAAGGCC CTTGAATTGT CCCCCAAGGC GGTAGAGTAC CAGATGACCA AGGCTAGGAT ATGGAAGCAT TATGGCAACC TTGAGAAGGC AGCAGAGGAG ATGGAGAACG CCAGAAAGAT GGATGAGAAG GATCGTCACA TCAACTCCAA GGCCGCCAAA TACCAGCTTC GCAACAACAA CAACGACAAG GCGCTTGACA AAATGAGCAA GTTTACGCGC AATGAGACCG TTGGCGGTGC CCTTGGCGAC CTCCATGAAA TGCAATGTGT GTGGTATCTG ACGGAAGATG GCGAGGCTTA CCTGCGGCAG AAGAAGCTCG GGCTCGCCCT CAAGCGTTTT CACGCCGTCT ACAATATCTT TGACGTGTGG CATGAGGACC AGTTTGATTT CCACAGTTTC TCTCTGCGGA AGGGTATGAT TCGAGCCTAC GTTGACATGG TTCGCTGGGA GGATCGTCTG CGCGAACATC CTTTCTACAC CCGAGCTGCG CTTTCCGCGA TCAAGGCCTA 
TATACTTCTC CATGACCAAC CGGATCTAGC TCATGGGCCT CTTCCTGAGA TTAACGGTGC TGATGGGGAC GATGCTGAGC GCAAGAAGGC TCTGAAAAAG GCTAAGAAGG AGCAGCAGCG GCTCGAGAAA CTCGAGCAAG AGAAACGGGA GGCTGCTAGA AAGGCTGCCG CTAACCCCAA GAGCCTAGAC GGAGAGGTCA AGAAAGAGGA CCCCGACCCC CTCGGCAACA AGCTTGCGCA GACACAAGAA CCGCTAAAGG AAGCACTAAA ATTCCTCACA CCCTTGCTGG AGCACTCTCC CAAGAATATC GAGGCTCAAT GCCTTGGGTT CGAGGTACAC CTTCGAAGGG GTATGTTTCA AGAGTTCCGC AAAATGCGCG TGTACATAAG ACTGACTACT CTTCTAGGCA AATATGCACT TGCGCTCAAG TGTCTCGCAG CGGCCCATTC TATCGATGCG TCCAACCCTA CTCTTCACGT CCAGCTACTT CAATTCCGCC AAGCTTTGAA CAAGCTATAC GAGCCTCTTC CGCCTCAGGT CGCGGAAGTT GTCGACTCGG AATTCGAGGC TCTCCTGCCG AAGGCGCAGA ACCTCGAGGA GTGGAACAAA TCCTTCCTCT CGGCACACAA GGACAGTATC CCACACAAGT ATGCTTACCT TACCTGCCAA CAGCTCCTGA AACCCGAGTC CAAGTCGGAA AATGAGAAGG AGCTCGCTGC TACCCTGGAT GCAGGCATTA TGTCACTTGA GACAGCCCTT GCTGGTCTAG ACCTACTAGG CGAGTGGGGA AGCGACAAGG CCGCAAAGAC TGCTTATGCT GAAAAGGCTA GCAGCAAGTG GCCGGAGTCG ACCGCCTTCC GAGTTAATTG AATGTCGACA CTGCTTGCTT CTGTCTCTGT TGGTGTCTGA ATATGGTTCT TGGTCTTATT GGTTTCATTG TGCGCACTTA GCGAAGCCTA TAGAACCAAA ATGAGATGTC TACTTCAGCC GAATTGTCAA GATGAATGGA AAGACCATGT ATGAGTATTT ATGGCTACTT GAGTGGGGGT ACGTTGCTTG GAGTATATAC TAGGTACGGA GTATAGCCAA AAATCCAACA A
|
Protein sequence | MPQQLSSKDA SLFRQVVRHY ENKQYKKGIK TADQVLRKNP NHGDTLAMKA LIMSNQGEQQ EAFALAKEAL KNDMKSHICW HVYGLLYRAE KNYEEAIKAY RFALRIEPDS QPIQRDLALL QMQMRDYQGY IQSRSTMLQA RPGFRQNWTA LAIAHHLSGD LEEAEKVLTT YEETLKTPPP LSDMEHSEAT LYKNMIIAES GNIQKALEHL ESVGHRCSDV LAVMEMKADY LLRLDKKEEA AAAYTALLER NSENSLYYDG LIKAKGISSD DHKALKALYD SWAEKYPRGD APRRIPLDFL EGDDFKQAAD AYLQRMLKKG VPSLFANIKL LYTNSSKRDT VQELVEGYVS NPPANGAADG SENTEFLSSA YYFLAQHYNY HLSRDLSKAL QNVDKALELS PKAVEYQMTK ARIWKHYGNL EKAAEEMENA RKMDEKDRHI NSKAAKYQLR NNNNDKALDK MSKFTRNETV GGALGDLHEM QCVWYLTEDG EAYLRQKKLG LALKRFHAVY NIFDVWHEDQ FDFHSFSLRK GMIRAYVDMV RWEDRLREHP FYTRAALSAI KAYILLHDQP DLAHGPLPEI NGADGDDAER KKALKKAKKE QQRLEKLEQE KREAARKAAA NPKSLDGEVK KEDPDPLGNK LAQTQEPLKE ALKFLTPLLE HSPKNIEAQC LGFEVHLRRG KYALALKCLA AAHSIDASNP TLHVQLLQFR QALNKLYEPL PPQVAEVVDS EFEALLPKAQ NLEEWNKSFL SAHKDSIPHK YAYLTCQQLL KPESKSENEK ELAATLDAGI MSLETALAGL DLLGEWGSDK AAKTAYAEKA SSKWPESTAF RVN
|
| |
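The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any value such as GC content can be recomputed. The sketch below shows one way to do that in Python. The helper names are hypothetical, and the short test string is only the first twenty bases of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGCCGCAGC AATTGAGCTC"

print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 55.0
```

Passing the complete gene sequence to `gc_percent` should give a value close to the listed 52%, assuming the database computes GC content over this same sequence.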