Gene Information |
Locus tag | ANIA_05795 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001305 |
Strand | - |
Start bp | 1423099 |
End bp | 1427094 |
Gene Length | 3996 bp |
Protein Length | 1220 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | Histone-lysine N-methyltransferase, H3 lysine-4 specific (EC 2.1.1.43) (COMPASS component SET1) (SET domain-containing protein 1) [Source:UniProtKB/Swiss-Prot;Acc:Q5B0Y5] |
Protein accession | CBF81174 |
Protein GI | 259484715 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
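The Gene Length value is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. A minimal sketch of that arithmetic, assuming this convention applies to the record; the function name `gene_length` is illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(1423099, 1427094)
print(length)  # 3996, matching the Gene Length field
```

Because the Strand field is "-", the gene is read from End bp towards Start bp on the replicon, but the length calculation is the same for either strand.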
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.000000960866 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGCGCT CTTCCGCAGG CTTCGCAGAC TTCTTTCCCA CCGCCCCATC GGTCATCCAG CAGAAACGAT ATCAAGCCAC TCGAGAACGA CAGCGGTCGA GACCTCATCT CAGCAGGGAG CATGCCGACG AGGAACAGAT TGTCACCGGA TCCCGGACCT CTGGGGAAAC TGTCAACGGC AATAGCCCTC AGAATCTTGG ACAAGAACTC CGGTCAGACC TGAACAAGTC GCGCAAGGAA GTCGCGGAAG ACGGTTCCGC GAGCCATGGA GAGGCGAACA CTCCGGCGAA CAACACATCG GGACCCGGAA CAGGTTCATC GAATGATACC CGCCTGGATA CTCTTACGCC GCTCACAAAC ACGGAGTCAT CCCCGCAGAA CAATCCGAGC CCGTCTCAAG CGAAGGCGCC GAACGGAGAT GAACCTGATG GGTTTAGACA AGCAAGAGCA AATGTCTCAA ACAGCACCAT GACACCGCTT CATACGCCGC CAACACCAAC AACGCATTCA CTGAGTCAGC GTGTTGCAAT CGTCAAGGGC AGCAAGTTGG TACATGACCC TGATAGAGCC CCGTCCAAGG ATAAACGAAA GAGGCCTTGC TACGTGGATA TTGTTAGTGA TGAACAAGAA GGACGTTTGT CCGACCCTCG ATTGAGCATA CAAAATTATA CTCGCGGTGC TGGATGCCGA CAGAAGACGA AGTACCGACC CGCTCCGTAC GTCCTAAGGC ACTGGCCCTA CGACCCTGCC AGCACGGTAG GACCGGGGCC TCCCGTGCAA ATTGTTGTTA CCGGTTTCGA CCCATTGACG CCTCTCGCAC CGATAAGTAC GCTCTTTTCG AGTTTTGGCG AGATAGCAGA GATCAATAAC CGAACGGATC CCGACACGGG CAGATTCCTG GGCATATGCT CCGTCAAATA TAAGGACAGC GCATCGTTTC ACGGTAGCGG TCCTGTTTCC GCGTCGTTGG CAGCGAAGAA TGCATTTCAT GAATGCAAAA AGGGGCAGCG TATTGGCAAC AATCGCATCA AGGTTGAGTA TGATCGCGAT GGGCAGACTT CGGAAAAACT AGCTTCGAGG GCTATTGCAG CTCAAAGAAT TGATTCGAAG ATTGATATGC CCGTGGTGGG TGAGCCTAAA TCAGAGGCAC AGGTGAACAA AAATGAACCA CCCCCGACAG CTCCAAAAGG GCCCTCTGGA AGATCCTTCA TGCGCCCTTC GGCTGTCATC CCTGAAGGAC CTAGGGCAAG CTTTCAGAAA CCGGCAATAC CATCATTGAT TGAAGAAACT CCCATACTAA ACCAGATAAA GCGTGATCCC TATATCTTCA TCGCACATTG TTATGTACCT GTTCTAAGCT CGACTCTGCC GCACTTAAAA AAGCGATTAA AGGCGTTCAA CTGGAAAGAT ATTCGATGCG ACAGAACCGG ATACTATATC ATATTTGAGA ACTCTCGGCG CGGTGAAGAG GAAACTGAAC GCTGTTACAA GTTCTGCCAT ATGAAACTTT TATTCACGTA TATCATGAAT ATGGAAAGCC AACCGTACGG CAATCCGCAT TACGAGCGCA GCCCCAGTCC CGAGCGCATG AAGCAAGAAC AACGGCAAAA AGCGGAAACT GAACGATTGA AGAAAGAGGC AGAGTTGGAC ATTGAAGAAG AAAAGAAACA GCGAGCCCTG GATTTGGATC CCTGCACTGA AGTTTTGGCG ATCGTAATAA AGGATCTAAG GGACAAACTG CTCGAAGACG TTAAGTCACG CATTGCTGCC CCCGCTCTGT ATGATTATCT CGACCCAGAA 
CGGCATGCAT CAAGGAGGAA ACAACTGGGG ATACCTGATC CTGAAGGCAT TAAACGTCCT ATGTTCCGCC TTGATTTTGA CAGCCGTGAT AGTACACCCG ACCCACATGC AAAATTCTTG AATAAACGGC ACCCTTCAGG CGTATCTGGT CTGAATATAC TCTCTGCCCT TCCAAGGATT AGGAAAGCTC ATCGCCTCGA CCGTACGGAC GTCGCTTTCT TAGACGAGAG GCGCAAGCAA CCGCTACGGA GAAGAAATGT CCGACCACTT TACCATCGAT TACAGCAGCT ACATGATGCA GAAGACTCTG ATGAAGAGCA GCATACTCCA CTTTCCCGAG ACACCGATGA TCAGGACAGT CGTCCTCCCA GTCGAATTGG CTCAGAAACC GAATCTGAGG ATGCGGATGA AGATGCAGCC GAAGCATTGG ATAATTCAAC AGAAAGGCTC GATAATGAAG ACCGACATTC CGAGATAGGC GACTTGGAGG CTGCAGTACA AGACTACTCT CCGTCGCGGA AGAGGAAAAG AACCAGTGAA AGTCCCAGCC ACCGCAAGAA ACAAAAGGAA TCTGATGATT TCTCTGCTGT GGGCGAGGGA ACTCGGACGG ATGACATTCC TCAGGTTTTG GATGGAGTTC ACAAGGGCAC CGTTTCTCAG GGATTATCTG ACTCGGCTGA CGAGTCATCA CGCCTTGACC ACAACAAGGT ATTGCTTGAA GAACTAGTTG AAGACATAAA AACCACGCAC TCAGAAGAGC CTGGCATCAA AACCCATCAT GTTCAAGTTA GACAATCAGC GGAAAATATG GTCGAGGGGG CCGAATATGG CGAGGCGGCC CGGCACGAGG TTGAATGGAG AGTATCAAAT GATGAGCCAC GGCCGATTGT GGACGATGAT GATTCTGTTG TCATGGATCT TGACGGATGG CAAGACGTCG TGAAGGACGA GGAAGATTTA CAGTTCCTTC GCAATATCCT GGAAAAGCAG CCGATGTCTG TGATTGGAAA CCTATCAGCT TGGGCATGGA GACAGAAGGA AATCAAGGCT CTCAATCGCC CCGGCGATGT AGGGCCAACG CGCCAGGCTG CAAGCATCGA AGGTTATTAT GTTCCTAATA TTACAGGAGC TGCTCGAACC GAAGGCAGAA AGAGGATTCT CGAATCGGAA AAATCAAAAT ATCTACCCCA TCGCATTAAG GTCCAGAAGG CTCGCGAGGA ACGCGAGGCG AAGGCCAAGA GCGATCCACA GAATGCAGCT GCCGAGGCGG CTCGAATCGC AGCAGCAAAG ACAATATCAA AGTCAACATC TCGTTCGACA AGGGTGAATA ACCGCAGACT GATTGCTGAC ATCAATGCTC AAAAGCAAGC TCTACCATCG CAGGGTGGCG ATAGCGATGT TCTTCGATTT AACCAATTGA AAAAACGGAA AAAGCCTGTG CGTTTTGCAC GATCAGCTAT TCATAATTGG GGGCTCTATG CTGAAGTCAA TATATCTGCA AACGAAATGA TCATCGAATA TGTGGGGGAG AAGGTACGGC AGCAGGTTGC CGATATGAGG GAGCGACGAT ACCTAAAGAG CGGCATTGGC AGTAGCTATC TCTTTCGAAT TGATGAGAAC ACAGTCATTG ATGCCACAAA GAGAGGCGGT ATTGCTAGAT TTATCAACCA CAGCTGCACA CCAAACTGCA CAGCGAAGAT CATCAAGGTT GATGGAAGCA AACGAATAGT TATTTATGCA TTGAGGGATA TCGAAAGGGG TCAGTCCTTT AAATGAGATT TGCTGATTTG GATGGTAACA TGCTGACCCT 
ACTGCAGATG AAGAATTAAC TTACGACTAC AAATTCGAGA GGGAATGGGA CAGCGATGAC AGAATACCAT GCCTTTGCGG CTCGGCCGGA TGTAAAGGCT TTCTGAATTA GATTTCTTGA CTCAAGAAAC AGCGAGCACA CTCCTTTCGT TCGAACGTTT ATCATTGATT TTTATTCTAA CGACATGGGG CCTTTTTATA GGCTTGTTTC ACGGCAAAGG TATAGGCCCG GGGCAGCGTT GTGCATGTTA CATTGAGGCG TTCCGGATGA GTCTGCAATT GTTGGTTGCA ATTGGTTCTC CTTATCTTCC ACCCTGAGAC AATGGTGTGA CACGGATTAT GTATAACTTG CAAGTGCTGT ATCATATGAG ATCTCCCGCT TCATTT
|
Protein sequence | MSRSSAGFAD FFPTAPSVIQ QKRYQATRER QRSRPHLSRE HADEEQIVTG SRTSGETVNG NSPQNLGQEL RSDLNKSRKE VAEDGSASHG EANTPANNTS GPGTGSSNDT RLDTLTPLTN TESSPQNNPS PSQAKAPNGD EPDGFRQARA NVSNSTMTPL HTPPTPTTHS LSQRVAIVKG SKLVHDPDRA PSKDKRKRPC YVDIVSDEQE GRLSDPRLSI QNYTRGAGCR QKTKYRPAPY VLRHWPYDPA STVGPGPPVQ IVVTGFDPLT PLAPISTLFS SFGEIAEINN RTDPDTGRFL GICSVKYKDS ASFHGSGPVS ASLAAKNAFH ECKKGQRIGN NRIKVEYDRD GQTSEKLASR AIAAQRIDSK IDMPVVGEPK SEAQVNKNEP PPTAPKGPSG RSFMRPSAVI PEGPRASFQK PAIPSLIEET PILNQIKRDP YIFIAHCYVP VLSSTLPHLK KRLKAFNWKD IRCDRTGYYI IFENSRRGEE ETERCYKFCH MKLLFTYIMN MESQPYGNPH YERSPSPERM KQEQRQKAET ERLKKEAELD IEEEKKQRAL DLDPCTEVLA IVIKDLRDKL LEDVKSRIAA PALYDYLDPE RHASRRKQLG IPDPEGIKRP MFRLDFDSRD STPDPHAKFL NKRHPSGVSG LNILSALPRI RKAHRLDRTD VAFLDERRKQ PLRRRNVRPL YHRLQQLHDA EDSDEEQHTP LSRDTDDQDS RPPSRIGSET ESEDADEDAA EALDNSTERL DNEDRHSEIG DLEAAVQDYS PSRKRKRTSE SPSHRKKQKE SDDFSAVGEG TRTDDIPQVL DGVHKGTVSQ GLSDSADESS RLDHNKVLLE ELVEDIKTTH SEEPGIKTHH VQVRQSAENM VEGAEYGEAA RHEVEWRVSN DEPRPIVDDD DSVVMDLDGW QDVVKDEEDL QFLRNILEKQ PMSVIGNLSA WAWRQKEIKA LNRPGDVGPT RQAASIEGYY VPNITGAART EGRKRILESE KSKYLPHRIK VQKAREEREA KAKSDPQNAA AEAARIAAAK TISKSTSRST RVNNRRLIAD INAQKQALPS QGGDSDVLRF NQLKKRKKPV RFARSAIHNW GLYAEVNISA NEMIIEYVGE KVRQQVADMR ERRYLKSGIG SSYLFRIDEN TVIDATKRGG IARFINHSCT PNCTAKIIKV DGSKRIVIYA LRDIERDEEL TYDYKFEREW DSDDRIPCLC GSAGCKGFLN
|
| |
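The gene and protein sequences in this record are displayed in space-separated blocks of ten residues, so the blocks need to be joined before any analysis. A minimal sketch, assuming plain whitespace separation and an unambiguous A/C/G/T alphabet for the nucleotide sequence; the helper names `join_blocks` and `gc_fraction` are hypothetical:

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence in this record.
example = join_blocks("ATGTCGCGCT CTTCCGCAGG")
print(len(example), gc_fraction(example))
```

Applied to the full gene sequence, the result can be compared with the GC content field. The record does not state whether that field was computed over exactly the displayed sequence, so a small difference would not necessarily indicate an error.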