Gene Information |
Locus tag | ANIA_00990 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001308 |
Strand | - |
Start bp | 1810885 |
End bp | 1814092 |
Gene Length | 3208 bp |
Protein Length | 884 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | Endoribonuclease ysh1 (EC 3.1.27.-)(mRNA 3'-end-processing protein ysh1) [Source:UniProtKB/Swiss-Prot;Acc:Q5BEP0] |
Protein accession | CBF88384 |
Protein GI | 259488717 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCAA AGCGAAAAGC CGCGGCGATG AACGCCGTCG ATGACGAGCC CGTCGATCCG TCGGATGAAC TGGCGTTTTA CTGCTTAGGT GGTGGTAATG AAGTTGGAAG ATCATGCCAT ATCATCCAGT ATAAAGGGAA AACTGTTATG GTGTGTATTT GCCAACCAAT CCCGTCCATA TGGTCTAATG CGACATCAAC AGCTTGATGC TGGGATGCAC CCTGCGAAAG AGGGATTCTC AGCCCTTCCG TTTTTCGATG AGTTCGATCT GAGCACGGTG GATATACTCT TGATCAGCCA GTATGTGGAA TACCTAGTCT CCTTGTTGTT GTTCTCCCTT CTAGTCGTAG GGATGTCAGG ATGCCATAGG GTTCTTCGAC ACCTCGCGGA TGATGGCACG ACGAAGTCAT CGAGTATCAA CGGTGGAGTG GCTCTTTTGG CAATCCGGCC CCTGCATCTT TCATGTATAC AAGAAACGCA ATACGAACTA GTTCATCTTG TATTCTGTGT CCCATCATCT GTATTCTGTA TCATCATTAG TTCTGGATTC ATGCGCTGGT CTTCTCAACC GATAGACTAA TAGACTTTTG TCGCATCACA GTTTCCATGT CGACCACTCA TCCGCGCTTC CCTATGTCCT CAGCAAAACG AACTTCAAGG GCCGTGTCTT CATGACGCAT GCTACAAAAG CTATATACAA GTGGCTGATT CAGGATAATG TGCGAGTCAA CAACACGGCC TCCTCCTCTG ACCAACGGAC TACCCTATAC ACTGAACATG ATCACCTCTC AACGCTGCCG CTGATTGAGA CCATTGATTT CAACACAACA CATACGATAA ATAGCATTCG CATCACTCCT TATCCTGCCG GGCACGTTCT TGGAGCTGCC ATGTTCCTAA TATCAATTGC GGGTTTAAAT ATCCTTTTTA CCGGCGACTA CTCCCGCGAA GAGGACCGCC ACCTTATTCC AGCTACGGTT CCCCGGGGAG TGAAGATTGA TGTTCTTATT ACCGAATCAA CATTTGGAAT TTCCTCCAAC CCCCCTCGCC TGGAGCGTGA GGCCGCCCTT ATGAAGTCTA TAACTGGAGT TCTAAATCGT GGTGGTCGAG TTCTTATGCC GGTTTTTGCG CTTGGTCGTG CGCAGGAACT CCTTCTTATC CTTGAGGAAT ACTGGGAGAC ACACCCCGAG CTGCAAAAGA TCCCTATATA CTACATTGGT AACACGGCTA GGAGGTGCAT GGTTGTCTAC CAGACTTATA TCGGAGCCAT GAATGATAAC ATTAAGCGGC TTTTCCGCCA GCGAATGGCC GAAGCAGAAG CGAGCGGTGA CAAAAGTGTC AGCGCGGGGC CCTGGGACTT TAAATATGTT CGCAGTCTGC GCAGTCTTGA ACGATTTGAC GACGTTGGAG GATGCGTCAT GCTAGCGTCC CCGGGTATGC TGCAGACTGG TACAAGTCGA GAACTATTGG AACGATGGGC TCCGAACGAA CGAAATGGCG TTGTCATGAC GGGATATAGC GTGGAGGGCA CGATGGCTAA GCAATTGCTC AACGAGCCAG ACCAGATCCA TGCCGTGATG TCACGGGCAG CCACCGGCAT GGGTAGAACG CGCATGAACG GCAACGACGA GGAACAGAAG ATCATGATTC CTAGACGATG CACTGTCGAC GAGATATCCT TTGCGGCCCA CGTTGACGGC GTCGAGAACC GAAACTTCAT CGAAGAAGTA TCTGCACCTG TCGTGGTATG TCCCCTTCCA CTAACACGTT CCTTCTCGAC TGCGCCATTA ACGTCCATAG 
ATCCTAGTCC ACGGCGAGAA GCACCAAATG ATGCGCCTAA AATCCAAATT GCTCAGCCTC AACGCAGAGA AGACAGTGAA AGTCAAAGTT TACACGCCAG CCAACTGTGA AGAAGTCCGA ATCCCCTTCC GCAAAGACAA AATCGCCAAG GTAGTCGGTA AACTAGCGCA AACAACTTTG CCGACCGATA ATGAAGATGG AGACGGGCCG CTCATGGCCG GTGTCCTCGT CCAGAACGGC TTCGACCTAT CGCTGATGGC ACCGGACGAT CTGCGCGAAT ATGCCGGTCT AGCGACCACG ACTATTACCT GCAAGCAGCA TATCACCCTA AGCTCAGCTA GCATGGATTT AATTAAGTGG GCGCTGGAGG GTACATTCGG TGCCATTGAA GAAATTGGCA CCGACGAGGA TGCAGAGAAG GAGGATCAAC AAAGCGAAAG TGAAGAGAAA CAACGCATGA AAGAGGAGGC TGATGAGGAA ATACCCATGG AGAAGCCTCA AGCATACCTG GTTATGGGCT GCGTTGTAAT CAGATACCAC CCCCGGACTC GCGAGGTCGA GCTTCAGTGG GAAGGGAACA TGATGAATGA CGGGATTGCC GATGCAGTCA TGGCCGTCCT ACTTACTGTG GAGAGCAGCC CAGCATCCGT GAAACGTATG TCCCCGCGCA ACTCATACTT GACTTCTTTA TCCATATTGA TTGTTCCCTG CTAACAAAAT CTACCACCAG AATCCGCCAA GCACAATAAA CACCACCACC ACCACCATCA CGATGAAACC GACACCCTCA AATTCCTTAA TCCGCACGCT GCTCAAGATG CAGAAGAACG ATTCGCCCGC CTGCTCATGA TGCTGGAAGC TCAATTTGGC TCTGACATCG CCCCAATCGA GCGTCCACGC GTCCCATCCT CAACCGAATC TGCAACCACA ACTACGAACG GCAACGGCAA TTCCAAGTCC GACTCCGAGC AGCTTAGTTC ACTCGAGTCA AAAACAGATG GTGCAACTCC TCAAGATCCT GACACACTAT CGGAACTCGA AGCTGCAGAG CTATCCCGCC TCCATGCGCT GGGTATCCCG GTGCCGGGTA TCGAAATCAA AGTTGACAAG CATGTTGCTC GGGTCTGGTT GGAGGATCTG GAGGTCGAGT GTGCGAATGC AGTGCTGAGG GACCGCGTCC GGGTTGTGAT TGAGCGGGCT GTTGAGACGG TTGCAAGTAT GTGGTCTGTC GGCCGGTCTT CGAAGACCAT TACGAATGGT GGTGGGAAGG AAATTGCTGG TACTGGTGCG GATGATGTGG CTTCGAAACC TGGATTGGAA GTTGCGGCAA GGGCTTAGTT TTATTGCCTT TGCTATGTGC CTTGTAATAT ACGTATCTTA TTGAACCTAC GGTGAAGGTT ATTCCTCT
|
Protein sequence | MAAKRKAAAM NAVDDEPVDP SDELAFYCLG GGNEVGRSCH IIQYKGKTVM LDAGMHPAKE GFSALPFFDE FDLSTVDILL ISHFHVDHSS ALPYVLSKTN FKGRVFMTHA TKAIYKWLIQ DNVRVNNTAS SSDQRTTLYT EHDHLSTLPL IETIDFNTTH TINSIRITPY PAGHVLGAAM FLISIAGLNI LFTGDYSREE DRHLIPATVP RGVKIDVLIT ESTFGISSNP PRLEREAALM KSITGVLNRG GRVLMPVFAL GRAQELLLIL EEYWETHPEL QKIPIYYIGN TARRCMVVYQ TYIGAMNDNI KRLFRQRMAE AEASGDKSVS AGPWDFKYVR SLRSLERFDD VGGCVMLASP GMLQTGTSRE LLERWAPNER NGVVMTGYSV EGTMAKQLLN EPDQIHAVMS RAATGMGRTR MNGNDEEQKI MIPRRCTVDE ISFAAHVDGV ENRNFIEEVS APVVILVHGE KHQMMRLKSK LLSLNAEKTV KVKVYTPANC EEVRIPFRKD KIAKVVGKLA QTTLPTDNED GDGPLMAGVL VQNGFDLSLM APDDLREYAG LATTTITCKQ HITLSSASMD LIKWALEGTF GAIEEIGTDE DAEKEDQQSE SEEKQRMKEE ADEEIPMEKP QAYLVMGCVV IRYHPRTREV ELQWEGNMMN DGIADAVMAV LLTVESSPAS VKQSAKHNKH HHHHHHDETD TLKFLNPHAA QDAEERFARL LMMLEAQFGS DIAPIERPRV PSSTESATTT TNGNGNSKSD SEQLSSLESK TDGATPQDPD TLSELEAAEL SRLHALGIPV PGIEIKVDKH VARVWLEDLE VECANAVLRD RVRVVIERAV ETVASMWSVG RSSKTITNGG GKEIAGTGAD DVASKPGLEV AARA
|
| |
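The coordinate and sequence fields in this record can be cross-checked with a few lines of code. The sketch below is illustrative only: `gene_length` and `gc_fraction` are hypothetical helper names, not part of any database API, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the sequences are stored in space-separated blocks of ten characters, as shown above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence.

    Whitespace (the blocks-of-ten layout used in the record) is removed
    before counting, and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Coordinates taken from the Gene Information table above.
length = gene_length(1810885, 1814092)
print(f"Gene length: {length} bp")

# First two blocks of the gene sequence, used only as a small example.
print(f"GC fraction of sample: {gc_fraction('ATGGCGGCAA AGCGAAAAGC'):.2f}")
```

With the coordinates from this record, 1814092 - 1810885 + 1 gives 3208 bp, which agrees with the Gene Length field. Passing the full Gene sequence value to `gc_fraction` would be the way to compare against the reported GC content.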