Gene ANIA_10020 details

Gene Information

Locus tag: ANIA_10020
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001308
Strand:
Start bp: 4465498
End bp: 4468583
Gene Length: 3086 bp
Protein Length: 906 aa
Translation table:
GC content: 50%
IMG OID:
Product: 26S proteasome regulatory subunit Mts4, putative (AFU_orthologue; AFUA_5G11720)
Protein accession: CBF90150
Protein GI: 259489680
COG category:
COG ID:
TIGRFAM ID:
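
The gene length listed above follows from the start and end coordinates when both ends are counted: 4468583 - 4465498 + 1 = 3086 bp. The sketch below reproduces that check. It assumes 1-based, inclusive coordinates (the record does not state its convention, but this is the one that matches the listed length), and the function name is illustrative rather than part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints belong to the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates listed for ANIA_10020 on replicon BN001308.
length = gene_length(4465498, 4468583)
print(length)  # 3086, matching the listed gene length
```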


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.857696
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.62117
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGG AAGGCGAGCG GTCAGCTCCG GCCGACAAGG GCAAGGGCAA GGTTGATGAT 
GTCAAGGATC TTGGAGGGAG TAAAGAGAAG CCTGAGGAGA AGACACAAGG CAACGGGAAG
AAGAAGGACG ATGAGCCGCA GGAAGGTAAG CAGCACCTCT TTTATCGTGA TGGAAGCATG
ATTCGGTTCC CATCATCAGC TGTGACGTGT ATCTAATTAT GTTTTCTGCA GAGGAGCTCA
GTGAAGAGGA TCAACAGCTA AAGAGTGAAC TCGAGATGCT TGTTGAAAGG TTACAGGTAT
GATGGCAGAG ATTGCCTGCT GTTCACTTTG CCAACGCTGA CCGATCTGCC GCATCGCAGG
AACCGGATAC TTCGCTTTAC GGACCCGCTT TGGACGCCAT CAAGACTTTT ATTAAAACTT
CTACCTCTTC AATGACTGCA GTTCCTAAGC CTCTGAAATT CCTACGACCA CACTACGATG
ATCTAGCGGC GCTCTATGAC AAGTGGTCCG CCGGCGCAAC CAAGGTGGGT TCAGGACGAA
TAGTACACAA GACCGAAAAC TCATGTTTTG CCTTATTTTC AGGATTCGTT GGCGGATATG
CTTTCTGTCC TCGGAATGAC GTACGGGGAC GAAGAGAAAC TCGAAACGCT CAAATACCGA
CTTCTCACCA AATCGGATGA CCTCGGTTCC TGGGGCCACG AATACGTCAG GCACCTGGCG
TTGGAGATCG GCCAGGAATA TCAGAACAGA GTAAACGACG AAAAGGAAGT AGACGATCTG
ATCAAACTCG CGGTTTCGCT TGTTCCATAT TTCCTTAGAC ACAATGCAGA AGCCGATGCC
GTTGATCTTA TGAGCGAACT TGAGATTATA GAGGAGATTC CTCAGTTCGT GGATGAGAAC
ACATATTCAA GGGTTTGCTT GTATATGGTC AGCATGGTGC CTCTCCTTAC CTACCCCGAG
GACCACCAGT TCCTCCGGAC GGCACACGAA ATCTACGTTC GTTACAAGGA GCTCACGAAA
GCTATTGTGC TCGCTATCCG CCTAAACGAT GTTGACCTCA TCAAGAGTGA CCTTGAAGCG
ACGTCGGATC GGTCGCTCAA GAAACAGATG GCTTTCCTAG TTTCTAGGCA ACAAATATGG
CTCGATGACT TGGGCGATGA CGAGCAGGAC GAGACTTTCA TGGAGTGTCT GAACAACACC
TCGATCCCAA AGCATTTCAA GTCGCTTGGG AAGGAACTGA ACATCCTCGA CCCAATTATG
CCGGAAGACA TCTACAAAAC CCACTTAGAA AGCAGCCGAG GAGCAGGCCT CACCAATGTC
GACTCTGCCA GACATAATCT TGCAAGTGCC TTTGTCAATG CATTCGCAAA TGCCGGTTTT
GGCAACGATG AGATGATGAT TGTCGAAGGT GACAAGGGTT CTTGGGTTTG GAAGACAAAG
GATGATGGCA TGTTGTCTAC CACCGCCTCA ATGGGTATGC TCCTGCACCG AGATGTCGAC
ACTGGTTTGG ACAAAATTGA TAAGTACACG TACGCCTCCG AGGATCAGAT CAAGGCCGGT
GCTTTATTGT CTATTGGAAT ACTCAATTCA GGCGTGCGCC TTGATTCTGA CCCCGCGTTG
GCCCTTCTGT GTGACAACGA GAACTTGGAG GCAAAGAATA TTCCCATGAG AGTTGCCACA
ATCATGGGCC TTGGTTTAGC GTACGCCGGG TCCAACAAGC AGGAAATTCT TGACGCTTTA
CTGCCTATCG TGGAAGATGT ATCTCTCGAT ATGCAACTCT CCGCAATGGC GGCTGTCTCA
CTTGGTCTTG TCTTTGTTGG GTCATCGAAT CACCAAGTCA GTGAGGCAAT CGCTACCACC
CTCATGGACG AGGAGCGCCA GAAGCAGCTT AAGGATAAAT GGACTCGCTT CATGGCTCTT
GGTCTAGCGC TTTTGTACTT CGGTCGCCAG GAAGAAGTTG ATGTGATCCT CGACATCCTC
AAGGCTGTCG ATCATCCTAT GGCGAAGCCT ACCTCCGTCC TCGCCTCCGT CTGTGCTTGG
GCAGGTACCG GCACCGTTCT GAAGCTGCAG GAGCTTCTCC ACATCTGCAA CGATGTCATT
GAGGAAAGTG ATGAGAAGCA GGGTGAAGAG CTTGTGCAAT CTTACGCCGT GCTAGGTCTG
TCGTTGATTG CGATGGGAGA AGATGTTGGT CAGGATATGA TTCTTCGACA GTTCGGCCAT
CTCATGCACT ACGGCGCTAG CAACATTCGA AAGGCGGTTC CTCTTGCTAT GGGTCTTATC
AGCCCAAGTA ACCCTCAGAT GAAGGTGTAC GACACTTTAT CGAGGTACAG TCACGACAAT
GATAATGATG TTGCCATTAA TGCCATTTTC GCCATGGGTC TCTGTGGTGC CGGTACGAAG
AACTCGCGTT TGGCGCAACT ATTGAGGCAG TTGGCCAGCT ACTACCACCG CGACCAGAAC
TCCTTATTCA TGGTGCGTAT TGCTCAGGGT CTACTGCACA TGGGCAAGGG CACTATGACA
CTAAACCCAT TCCACACCGA CCGCCAGGTG CTGAGCCGAG TATCGGCTGC TGGCTTGCTC
ACTGTTCTCG TGTCGTTGAT CGATGCGAAG CAGTTCATCC TTGCTGAGCA CCATTACCTC
CTCTACTTCC TCATCACAGC CATGTACCCG CGCTTCCTTG TCACGCTCGA CGAAGACCTC
CAGCCGCTTC CGGTCAACGT CCGCGTCGGA CAGGCTGTTG ATGTTGTTGG ACAGGCTGGA
AGGCCAAAGA CGATCACTGG TTGGCAGACA CAGAGCACCC CTGTGCTGCT TTCCTACGGT
GAGCGAGCAG AGCTGGAGGA TGAGAAATAT ATTCCTCTCA GTAGCACATT GGAGGGTTTG
GTTATCTTGC GTAAGGTAAG TCATCGCAGC TTCTCTGTAT TGTGTACGAA CATTTCTAAC
ATTCTCACAG AACCCTAACT GGGAGGAAGA AAGCTCCGCC TGAGCAACAG TGTCCTGAAT
GGTATCTAGT GAGATAGACC AAACGCAATA TCTGGCGTTC CTATACAGCT TAGGCCTTAA
TGAATTCACA AAAGTCCAAA GCATAG
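
The listing above is printed in groups of 10 bases, so the spaces and line breaks have to be removed before the sequence can be measured or analysed. A minimal sketch of that cleanup, with a simple GC-percentage helper, is given below. The helper names are illustrative, and only the first row of the listing is used, so its GC value will not necessarily match the whole-gene figure reported in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGGCCAAGG AAGGCGAGCG GTCAGCTCCG GCCGACAAGG GCAAGGGCAA GGTTGATGAT"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full listing into `clean_sequence` instead of the first row gives a string whose length can be compared with the gene length reported in the record.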
 
Protein sequence
MAKEGERSAP ADKGKGKVDD VKDLGGSKEK PEEKTQGNGK KKDDEPQEEE LSEEDQQLKS 
ELEMLVERLQ EPDTSLYGPA LDAIKTFIKT STSSMTAVPK PLKFLRPHYD DLAALYDKWS
AGATKDSLAD MLSVLGMTYG DEEKLETLKY RLLTKSDDLG SWGHEYVRHL ALEIGQEYQN
RVNDEKEVDD LIKLAVSLVP YFLRHNAEAD AVDLMSELEI IEEIPQFVDE NTYSRVCLYM
VSMVPLLTYP EDHQFLRTAH EIYVRYKELT KAIVLAIRLN DVDLIKSDLE ATSDRSLKKQ
MAFLVSRQQI WLDDLGDDEQ DETFMECLNN TSIPKHFKSL GKELNILDPI MPEDIYKTHL
ESSRGAGLTN VDSARHNLAS AFVNAFANAG FGNDEMMIVE GDKGSWVWKT KDDGMLSTTA
SMGMLLHRDV DTGLDKIDKY TYASEDQIKA GALLSIGILN SGVRLDSDPA LALLCDNENL
EAKNIPMRVA TIMGLGLAYA GSNKQEILDA LLPIVEDVSL DMQLSAMAAV SLGLVFVGSS
NHQVSEAIAT TLMDEERQKQ LKDKWTRFMA LGLALLYFGR QEEVDVILDI LKAVDHPMAK
PTSVLASVCA WAGTGTVLKL QELLHICNDV IEESDEKQGE ELVQSYAVLG LSLIAMGEDV
GQDMILRQFG HLMHYGASNI RKAVPLAMGL ISPSNPQMKV YDTLSRYSHD NDNDVAINAI
FAMGLCGAGT KNSRLAQLLR QLASYYHRDQ NSLFMVRIAQ GLLHMGKGTM TLNPFHTDRQ
VLSRVSAAGL LTVLVSLIDA KQFILAEHHY LLYFLITAMY PRFLVTLDED LQPLPVNVRV
GQAVDVVGQA GRPKTITGWQ TQSTPVLLSY GERAELEDEK YIPLSSTLEG LVILRKNPNW
EEESSA
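
The two lengths in this record can be cross-checked against each other. A 906-residue protein needs 906 × 3 = 2718 coding bases, or 2721 with a stop codon, which is fewer than the 3086 bp of the gene span; this is consistent with the record marking the gene as spliced. The sketch below does that arithmetic. It assumes a single stop codon and that the protein is translated entirely from this gene span. It only reports how many bases are non-coding, not where the introns or any untranslated flank lie. The function name is illustrative.

```python
def noncoding_bases(gene_length_bp: int, protein_length_aa: int) -> int:
    """Bases in the gene span that do not encode the protein or its stop codon."""
    coding = 3 * protein_length_aa + 3  # three bases per residue plus one stop codon
    if coding > gene_length_bp:
        raise ValueError("protein is too long for the given gene span")
    return gene_length_bp - coding


# Lengths listed for ANIA_10020: 3086 bp gene, 906 aa protein.
print(noncoding_bases(3086, 906))  # 365
```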