Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_06121 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001301 |
Strand | - |
Start bp | 1193788 |
End bp | 1197110 |
Gene Length | 3323 bp |
Protein Length | 1047 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | SET domain protein (AFU_orthologue; AFUA_2G08775) |
Protein accession | CBF70143 |
Protein GI | 259479689 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.26496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0261867 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACTCG TCAACAATCT AATCGAGCTC CTCTACATAT CCGCCAGCAT GGGCGTCAAC TCCTGGAGTG ACCTCCTCGC TATTGAATTG ATTCATCTGA GCGTTCTCAG CAAGAACGAT CTTCTTAGTG AGCTAGTCGA GTTGTTCCTC CTGCTCACCA TGCAGAAGCT GTTAGGTTTC TTAGAGTCAC GTGAGTATCC TTCACTCCAT TTAACTGCCT CGCCGTCGAT ATGATTAAAA GACCATGCAA GCAAAGACAC TTACTTCTCT CCTTGTGCTT GTGTTTGTGC CTTGGCTTTC TTTCTTTTCT GATGCACTTC TACATTTGAT CTAGTTCAGT TGGCTAACTT GTTCTTCTGA TCTTCATAGT AGGGGCTACC GCTAGTTGTT TTGAAGGCAA CGCTTGTCCT AACCCTGAAC AAAGTGAACC TGAGTCCGAT CCTGCTGAGC CTGCGGATAC TAACACTAAG GCTGACATTG ATCGTGATCC CAAGGCAGAA GCTACGCAAG CTGATTTAGA AGCTGTCGTG AACGGCACGT CTACCTCTGA CGCTGGCTCC AGATTTGAAT CTGCATCTGT GTTTGTGTTA CTACCTGAAC CTAAGCATAC TCCGGCATCC AACCTTGAGC ATCATCCTGT CTCAGGTTCT GACTTTGAGT CAGTCTCTTC GTCTACTTCT GAGCTTGTAG CCGAGAGCAA GCCTCAAGGC CTGGACGAGA GAGAAAGTCA GACCAATCTC GAGCCTGAAT CCCAGATTGA AGATCCATTG GTGGTCAAAG AAAAACAAGA CGCACAATGC CAGATCGAAT CTGAAAATGC TCTTATGCTT GCGAAGGGTC TAAGGGTTGA GAGTCAAGAT CAACACGAAG AGTATATCTC GAGCGAGCAT CAGATCAACA GCCAGCAAGG TCTCCTCGTT GAATCCCAAA ACCAAGACAG ACAGAACCTG GTTAACTCTG AACCCGGGAG CAATACCGAC ATCAAGAGTG ACGACCGTTA CGGACATCAA ATCCACAGCG GAGACCAGAA AGAAAGTCCA CTGTTGGTAA ACTACGAGGA TCAAGTCCAC AAAGAACACA TCTCAGCTGG CTCTGAATTG GCTCCTGTAT CCGGGTCTGA GTCTGAGCCT GAGGCGGGTT TTGACTCTGA TTCTGTATCT TCACCCGCTC CTACACCTGA GCTTGTCGAG GAAGATAAAA CTAGTGTTAC CAAACACCAA GGGCCCCATT TAGGAGCAGC GACCGGACCG TTAATCCTAC AAGCGAAGCC TGGTGCACTT ATTCTCAGAC AGAATTCAGA CCTACTGGGT CTACGAATCA CTGCCCCTCG TGCTCGTTCT CCTTCTCCCC TTCGCTCATT CTCTCGCTCT GCGATTGAAT GCTTGGGAAT ACCAATGCCG TCAGAAATGC CTCGCCCTTC AACCTCAAAC GTGCCAGATT CATTGGCTGA TATACCGCCT GAGTACCTGC AGTTACCGTT TGTCGACGAA CATGTGAACA GCTCAGCCCC AAAAGGCAAC AAGAACTGGC ACTCGGTCGA CCTCCCACAC GAGCAGCTCA CCGCCGCCAA CCTCACTCGT GCTCAGGATC TCTCGAATTG TGATCTGCGA GAGCCCGTTG TCTCGGACTG CTTTGAAAAG CGCTGGATTA ACGATAAAAG CGGCTTTGGT TTGGTGGCCA CAAAGCACAT TCCTGCTGGT ACAGTTATTA TTGCGGACGA GCTCATGATT TTGTGGTCAG GTGAGCACAA GAAGTGCAAG TCTTATGGAG ACACGAATGC GATGCTACAG GCGAAGGCGG CGAGAATGGG GCCGGAGTGG AACAATGAGT TCCTTTCACT CGCGAAAGGT CAGAAGAAGA AGGGCTTTGC GATCAGAAAG CGCAGACTTG GTCTTGAAGG CGCAATATGG GACCAGCATG CTTTGCCTAC GGCGTGGGAG GGAAACGTCG GTGAGGTGCT TGGTTTGAAC CTGGCTTGGT TGAATCACTG CTGTATTCCG AACTGTGTTC TGCGGTTCCG GAACGAGTAT CCCACGAACA AAAAGGGCGA GATTTGCTAC GACAAGAAGC CGAGATTGGG AAAAGCGGTC GTCCGCGCTT GTGCAGACAT CAAGCCTAAC GTGGAGATCA GTATCGCATA CATGCAGACA GAAGGAACGG CCAGAGAAAG GCGAGCAGCT ATGAATCGCC GCTTCGGCTT TCTATGTGCG TGTCGGTTCT GCGCGACGCC GCATCCTTCA GCCGACAAAG CCATGTGTCA CTACCGCCGC CTCAAGCGCC GAATCGAGGA CCCGAACATC GTATCTAATA AACCGGCTAT CGCGTACCTG CATGCGTCTA TGCTGCTCGA CCATCTCGCG GCAATGCGCG TACAAGACTT CCGAGTTGCC GATATCTGGG TCAAATGTGC TATGATTGCG GGTCACCACA CTGATTTGGC AAGAGCATGG TGTTTTCTCA GGGAAGCGAG GGAGCTGTAC CTCGTGCTCG AAGGACTTAA CGGCGCTATT TATCATCAGG TCGAAGGTTG GTATCGTGAC CCGACCAGTA TGCCAGGATT TGGGGGTACA AGGCGAGGTT TGAGCTCTCG CATGAAGGCG TTCACAGACT TCAACCAGGG CATCATGCCA AAGAAAATGC TATTCATGCT GGATGCGAAG CCTAACGAGT ACATCGCAGT CCCCCGTTAT CGTCCTCTTC CCCGCACTGA TAACGACAAG GGACCGTCCT ATGAAATTAT CGATGGGCCT GACTTTGCCC CGGCAGAGCT CAAGTTCGAC TCCAAGAACC CTTCTGATAT GTGCCGTACC CACGACAACT GTCAGACGCT TGTGGACCAT TTAGACCGCG TTCGTGAGAA GCGCAGGAAC CGCGGACCTG CGCGAGAAAT GACAGATGAC CATGATTCGG AGCCTTATTG TGACGGTTTC AAGCAAGACT TCCTGGCAGC GTTTATGGCC GTGGCGAGGG AGCGCTTCGG GAAAGCCGCG GTAGATGCTG AACTAGAAAA AGACCATGAT AATCCTGAAG AATCGTTTAA TCAAGACGTT CAGGTGTTTA CATCTTGCGG AGAGACATGT CCCAAGCCCG GTTGCACGCA CCAGCAGTAT GAAGTCGAGC TTCAGAATCC TTCATGCGCC TGCGACAAGG AGCCCGGTGA AGAAGGCCAG CCCGATAACA AGGAGAACGA GGAGTCTCAT ATTCAAGAAC CGACCGTTGG AGCGCGGAAC CCCGTAGTTA AGCACGTTGG AGCTGAAAAG AAACCGCAAT TTGTCAATTT GGATGATTTT GCCTACTGCG ATGCAGGTGT TGAGTATAAC TGA
|
Protein sequence | MTLVNNLIEL LYISASMGVN SWSDLLAIEL IHLSVLSKND LLSELVELFL LLTMQKLLGF LESLGATASC FEGNACPNPE QSEPESDPAE PADTNTKADI DRDPKAEATQ ADLEAVVNGT STSDAGSRFE SASVFVLLPE PKHTPASNLE HHPVSGSDFE SVSSSTSELV AESKPQGLDE RESQTNLEPE SQIEDPLVVK EKQDAQCQIE SENALMLAKG LRVESQDQHE EYISSEHQIN SQQGLLVESQ NQDRQNLVNS EPGSNTDIKS DDRYGHQIHS GDQKESPLLV NYEDQVHKEH ISAGSELAPV SGSESEPEAG FDSDSVSSPA PTPELVEEDK TSVTKHQGPH LGAATGPLIL QAKPGALILR QNSDLLGLRI TAPRARSPSP LRSFSRSAIE CLGIPMPSEM PRPSTSNVPD SLADIPPEYL QLPFVDEHVN SSAPKGNKNW HSVDLPHEQL TAANLTRAQD LSNCDLREPV VSDCFEKRWI NDKSGFGLVA TKHIPAGTVI IADELMILWS GEHKKCKSYG DTNAMLQAKA ARMGPEWNNE FLSLAKGQKK KGFAIRKRRL GLEGAIWDQH ALPTAWEGNV GEVLGLNLAW LNHCCIPNCV LRFRNEYPTN KKGEICYDKK PRLGKAVVRA CADIKPNVEI SIAYMQTEGT ARERRAAMNR RFGFLCACRF CATPHPSADK AMCHYRRLKR RIEDPNIVSN KPAIAYLHAS MLLDHLAAMR VQDFRVADIW VKCAMIAGHH TDLARAWCFL REARELYLVL EGLNGAIYHQ VEGWYRDPTS MPGFGGTRRG LSSRMKAFTD FNQGIMPKKM LFMLDAKPNE YIAVPRYRPL PRTDNDKGPS YEIIDGPDFA PAELKFDSKN PSDMCRTHDN CQTLVDHLDR VREKRRNRGP AREMTDDHDS EPYCDGFKQD FLAAFMAVAR ERFGKAAVDA ELEKDHDNPE ESFNQDVQVF TSCGETCPKP GCTHQQYEVE LQNPSCACDK EPGEEGQPDN KENEESHIQE PTVGARNPVV KHVGAEKKPQ FVNLDDFAYC DAGVEYN
|
| |