Gene ANIA_05795 details


Gene Information

Locus tag: ANIA_05795
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001305
Strand:
Start bp: 1423099
End bp: 1427094
Gene Length: 3996 bp
Protein Length: 1220 aa
Translation table:
GC content: 49%
IMG OID:
Product: Histone-lysine N-methyltransferase, H3 lysine-4 specific (EC 2.1.1.43) (COMPASS component SET1) (SET domain-containing protein 1) [Source:UniProtKB/Swiss-Prot;Acc:Q5B0Y5]
Protein accession: CBF81174
Protein GI: 259484715
COG category:
COG ID:
TIGRFAM ID:
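
The start, end, and gene length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption, since the record does not state it. A minimal Python sketch of the check, using a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
gene_length = span_length(1423099, 1427094)
print(gene_length)  # 3996, matching the listed Gene Length
```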


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.000000960866
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCGCGCT CTTCCGCAGG CTTCGCAGAC TTCTTTCCCA CCGCCCCATC GGTCATCCAG 
CAGAAACGAT ATCAAGCCAC TCGAGAACGA CAGCGGTCGA GACCTCATCT CAGCAGGGAG
CATGCCGACG AGGAACAGAT TGTCACCGGA TCCCGGACCT CTGGGGAAAC TGTCAACGGC
AATAGCCCTC AGAATCTTGG ACAAGAACTC CGGTCAGACC TGAACAAGTC GCGCAAGGAA
GTCGCGGAAG ACGGTTCCGC GAGCCATGGA GAGGCGAACA CTCCGGCGAA CAACACATCG
GGACCCGGAA CAGGTTCATC GAATGATACC CGCCTGGATA CTCTTACGCC GCTCACAAAC
ACGGAGTCAT CCCCGCAGAA CAATCCGAGC CCGTCTCAAG CGAAGGCGCC GAACGGAGAT
GAACCTGATG GGTTTAGACA AGCAAGAGCA AATGTCTCAA ACAGCACCAT GACACCGCTT
CATACGCCGC CAACACCAAC AACGCATTCA CTGAGTCAGC GTGTTGCAAT CGTCAAGGGC
AGCAAGTTGG TACATGACCC TGATAGAGCC CCGTCCAAGG ATAAACGAAA GAGGCCTTGC
TACGTGGATA TTGTTAGTGA TGAACAAGAA GGACGTTTGT CCGACCCTCG ATTGAGCATA
CAAAATTATA CTCGCGGTGC TGGATGCCGA CAGAAGACGA AGTACCGACC CGCTCCGTAC
GTCCTAAGGC ACTGGCCCTA CGACCCTGCC AGCACGGTAG GACCGGGGCC TCCCGTGCAA
ATTGTTGTTA CCGGTTTCGA CCCATTGACG CCTCTCGCAC CGATAAGTAC GCTCTTTTCG
AGTTTTGGCG AGATAGCAGA GATCAATAAC CGAACGGATC CCGACACGGG CAGATTCCTG
GGCATATGCT CCGTCAAATA TAAGGACAGC GCATCGTTTC ACGGTAGCGG TCCTGTTTCC
GCGTCGTTGG CAGCGAAGAA TGCATTTCAT GAATGCAAAA AGGGGCAGCG TATTGGCAAC
AATCGCATCA AGGTTGAGTA TGATCGCGAT GGGCAGACTT CGGAAAAACT AGCTTCGAGG
GCTATTGCAG CTCAAAGAAT TGATTCGAAG ATTGATATGC CCGTGGTGGG TGAGCCTAAA
TCAGAGGCAC AGGTGAACAA AAATGAACCA CCCCCGACAG CTCCAAAAGG GCCCTCTGGA
AGATCCTTCA TGCGCCCTTC GGCTGTCATC CCTGAAGGAC CTAGGGCAAG CTTTCAGAAA
CCGGCAATAC CATCATTGAT TGAAGAAACT CCCATACTAA ACCAGATAAA GCGTGATCCC
TATATCTTCA TCGCACATTG TTATGTACCT GTTCTAAGCT CGACTCTGCC GCACTTAAAA
AAGCGATTAA AGGCGTTCAA CTGGAAAGAT ATTCGATGCG ACAGAACCGG ATACTATATC
ATATTTGAGA ACTCTCGGCG CGGTGAAGAG GAAACTGAAC GCTGTTACAA GTTCTGCCAT
ATGAAACTTT TATTCACGTA TATCATGAAT ATGGAAAGCC AACCGTACGG CAATCCGCAT
TACGAGCGCA GCCCCAGTCC CGAGCGCATG AAGCAAGAAC AACGGCAAAA AGCGGAAACT
GAACGATTGA AGAAAGAGGC AGAGTTGGAC ATTGAAGAAG AAAAGAAACA GCGAGCCCTG
GATTTGGATC CCTGCACTGA AGTTTTGGCG ATCGTAATAA AGGATCTAAG GGACAAACTG
CTCGAAGACG TTAAGTCACG CATTGCTGCC CCCGCTCTGT ATGATTATCT CGACCCAGAA
CGGCATGCAT CAAGGAGGAA ACAACTGGGG ATACCTGATC CTGAAGGCAT TAAACGTCCT
ATGTTCCGCC TTGATTTTGA CAGCCGTGAT AGTACACCCG ACCCACATGC AAAATTCTTG
AATAAACGGC ACCCTTCAGG CGTATCTGGT CTGAATATAC TCTCTGCCCT TCCAAGGATT
AGGAAAGCTC ATCGCCTCGA CCGTACGGAC GTCGCTTTCT TAGACGAGAG GCGCAAGCAA
CCGCTACGGA GAAGAAATGT CCGACCACTT TACCATCGAT TACAGCAGCT ACATGATGCA
GAAGACTCTG ATGAAGAGCA GCATACTCCA CTTTCCCGAG ACACCGATGA TCAGGACAGT
CGTCCTCCCA GTCGAATTGG CTCAGAAACC GAATCTGAGG ATGCGGATGA AGATGCAGCC
GAAGCATTGG ATAATTCAAC AGAAAGGCTC GATAATGAAG ACCGACATTC CGAGATAGGC
GACTTGGAGG CTGCAGTACA AGACTACTCT CCGTCGCGGA AGAGGAAAAG AACCAGTGAA
AGTCCCAGCC ACCGCAAGAA ACAAAAGGAA TCTGATGATT TCTCTGCTGT GGGCGAGGGA
ACTCGGACGG ATGACATTCC TCAGGTTTTG GATGGAGTTC ACAAGGGCAC CGTTTCTCAG
GGATTATCTG ACTCGGCTGA CGAGTCATCA CGCCTTGACC ACAACAAGGT ATTGCTTGAA
GAACTAGTTG AAGACATAAA AACCACGCAC TCAGAAGAGC CTGGCATCAA AACCCATCAT
GTTCAAGTTA GACAATCAGC GGAAAATATG GTCGAGGGGG CCGAATATGG CGAGGCGGCC
CGGCACGAGG TTGAATGGAG AGTATCAAAT GATGAGCCAC GGCCGATTGT GGACGATGAT
GATTCTGTTG TCATGGATCT TGACGGATGG CAAGACGTCG TGAAGGACGA GGAAGATTTA
CAGTTCCTTC GCAATATCCT GGAAAAGCAG CCGATGTCTG TGATTGGAAA CCTATCAGCT
TGGGCATGGA GACAGAAGGA AATCAAGGCT CTCAATCGCC CCGGCGATGT AGGGCCAACG
CGCCAGGCTG CAAGCATCGA AGGTTATTAT GTTCCTAATA TTACAGGAGC TGCTCGAACC
GAAGGCAGAA AGAGGATTCT CGAATCGGAA AAATCAAAAT ATCTACCCCA TCGCATTAAG
GTCCAGAAGG CTCGCGAGGA ACGCGAGGCG AAGGCCAAGA GCGATCCACA GAATGCAGCT
GCCGAGGCGG CTCGAATCGC AGCAGCAAAG ACAATATCAA AGTCAACATC TCGTTCGACA
AGGGTGAATA ACCGCAGACT GATTGCTGAC ATCAATGCTC AAAAGCAAGC TCTACCATCG
CAGGGTGGCG ATAGCGATGT TCTTCGATTT AACCAATTGA AAAAACGGAA AAAGCCTGTG
CGTTTTGCAC GATCAGCTAT TCATAATTGG GGGCTCTATG CTGAAGTCAA TATATCTGCA
AACGAAATGA TCATCGAATA TGTGGGGGAG AAGGTACGGC AGCAGGTTGC CGATATGAGG
GAGCGACGAT ACCTAAAGAG CGGCATTGGC AGTAGCTATC TCTTTCGAAT TGATGAGAAC
ACAGTCATTG ATGCCACAAA GAGAGGCGGT ATTGCTAGAT TTATCAACCA CAGCTGCACA
CCAAACTGCA CAGCGAAGAT CATCAAGGTT GATGGAAGCA AACGAATAGT TATTTATGCA
TTGAGGGATA TCGAAAGGGG TCAGTCCTTT AAATGAGATT TGCTGATTTG GATGGTAACA
TGCTGACCCT ACTGCAGATG AAGAATTAAC TTACGACTAC AAATTCGAGA GGGAATGGGA
CAGCGATGAC AGAATACCAT GCCTTTGCGG CTCGGCCGGA TGTAAAGGCT TTCTGAATTA
GATTTCTTGA CTCAAGAAAC AGCGAGCACA CTCCTTTCGT TCGAACGTTT ATCATTGATT
TTTATTCTAA CGACATGGGG CCTTTTTATA GGCTTGTTTC ACGGCAAAGG TATAGGCCCG
GGGCAGCGTT GTGCATGTTA CATTGAGGCG TTCCGGATGA GTCTGCAATT GTTGGTTGCA
ATTGGTTCTC CTTATCTTCC ACCCTGAGAC AATGGTGTGA CACGGATTAT GTATAACTTG
CAAGTGCTGT ATCATATGAG ATCTCCCGCT TCATTT
 
Protein sequence
MSRSSAGFAD FFPTAPSVIQ QKRYQATRER QRSRPHLSRE HADEEQIVTG SRTSGETVNG 
NSPQNLGQEL RSDLNKSRKE VAEDGSASHG EANTPANNTS GPGTGSSNDT RLDTLTPLTN
TESSPQNNPS PSQAKAPNGD EPDGFRQARA NVSNSTMTPL HTPPTPTTHS LSQRVAIVKG
SKLVHDPDRA PSKDKRKRPC YVDIVSDEQE GRLSDPRLSI QNYTRGAGCR QKTKYRPAPY
VLRHWPYDPA STVGPGPPVQ IVVTGFDPLT PLAPISTLFS SFGEIAEINN RTDPDTGRFL
GICSVKYKDS ASFHGSGPVS ASLAAKNAFH ECKKGQRIGN NRIKVEYDRD GQTSEKLASR
AIAAQRIDSK IDMPVVGEPK SEAQVNKNEP PPTAPKGPSG RSFMRPSAVI PEGPRASFQK
PAIPSLIEET PILNQIKRDP YIFIAHCYVP VLSSTLPHLK KRLKAFNWKD IRCDRTGYYI
IFENSRRGEE ETERCYKFCH MKLLFTYIMN MESQPYGNPH YERSPSPERM KQEQRQKAET
ERLKKEAELD IEEEKKQRAL DLDPCTEVLA IVIKDLRDKL LEDVKSRIAA PALYDYLDPE
RHASRRKQLG IPDPEGIKRP MFRLDFDSRD STPDPHAKFL NKRHPSGVSG LNILSALPRI
RKAHRLDRTD VAFLDERRKQ PLRRRNVRPL YHRLQQLHDA EDSDEEQHTP LSRDTDDQDS
RPPSRIGSET ESEDADEDAA EALDNSTERL DNEDRHSEIG DLEAAVQDYS PSRKRKRTSE
SPSHRKKQKE SDDFSAVGEG TRTDDIPQVL DGVHKGTVSQ GLSDSADESS RLDHNKVLLE
ELVEDIKTTH SEEPGIKTHH VQVRQSAENM VEGAEYGEAA RHEVEWRVSN DEPRPIVDDD
DSVVMDLDGW QDVVKDEEDL QFLRNILEKQ PMSVIGNLSA WAWRQKEIKA LNRPGDVGPT
RQAASIEGYY VPNITGAART EGRKRILESE KSKYLPHRIK VQKAREEREA KAKSDPQNAA
AEAARIAAAK TISKSTSRST RVNNRRLIAD INAQKQALPS QGGDSDVLRF NQLKKRKKPV
RFARSAIHNW GLYAEVNISA NEMIIEYVGE KVRQQVADMR ERRYLKSGIG SSYLFRIDEN
TVIDATKRGG IARFINHSCT PNCTAKIIKV DGSKRIVIYA LRDIERDEEL TYDYKFEREW
DSDDRIPCLC GSAGCKGFLN
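
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before use. The sketch below shows one way to do that in Python and to compute a GC fraction. The function names are hypothetical, and the sample is only the first two lines of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two lines of the gene sequence above, used as a small demonstration.
sample = """ATGTCGCGCT CTTCCGCAGG CTTCGCAGAC TTCTTTCCCA CCGCCCCATC GGTCATCCAG
CAGAAACGAT ATCAAGCCAC TCGAGAACGA CAGCGGTCGA GACCTCATCT CAGCAGGGAG"""

seq = clean_sequence(sample)
print(len(seq))  # 120
print(round(gc_fraction(seq), 2))
```

Applied to the full blocks, the cleaned lengths can be compared against the listed Gene Length (3996 bp) and Protein Length (1220 aa), and the GC fraction against the listed GC content.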