Gene Information |
Locus tag | ANIA_07447 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001304 |
Strand | + |
Start bp | 1511245 |
End bp | 1514271 |
Gene Length | 3027 bp |
Protein Length | 941 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | mRNA splicing factor (Prp1/Zer1), putative (AFU_orthologue; AFUA_2G06070) |
Protein accession | CBF79397 |
Protein GI | 259483750 |
COG category | |
COG ID | |
TIGRFAM ID | |
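The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (which agrees with the stated gene length of 3027 bp) and that the coding sequence ends with a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, stop_codons: int = 1) -> int:
    """Nucleotides needed to encode the protein plus an assumed terminal stop codon."""
    return 3 * (protein_aa + stop_codons)


# Values taken from the record: Start bp, End bp, Protein Length.
gene_bp = span_length(1511245, 1514271)
cds_bp = coding_length(941)
non_coding_bp = gene_bp - cds_bp  # bases in the gene span that are not translated

print(gene_bp, cds_bp, non_coding_bp)  # 3027 2826 201
```

Under these assumptions, 201 bp of the gene span are not translated, which is consistent with the record marking the gene as spliced.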
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.143301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.632834 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGTCTG GACGAAAAGA TTTCCTCAGT CAACCCGCCC CCGAGAATTA CGTTGCTGGT CTAGGTCGAG GAGCGACCGG CTTCACCACC CGCTCAGATC TAGGTCCTGC TCGAGAGGGT CCAACGCCGG AGCAGATCCA AGCTGCGCTT GCGAAAAGAG CACAGTTACT TGGAGCAGCA CCTCCCACGG CTTATGGGGC TACAAGAGAA AAGGGTAAAG GAGAGGAAAA GCCGGCCGAG GAAGAAGATG ATGAACGGTT CCAGGACCCC GACAATGAAG TTGGGCTCTT TGCCTACGGC CAGTTTGATC AAGAAGATGA TGAGGCGGAT CGCATCTACA GAGAGGTAGA TGAGAAGATG GATAGACGGC GCAAAGCACG AAGGTTAGTG GACTTCGTAC ACTCTATTTT TTTATTTTCC CTTTTTGCCT TAAGATGAGT TGTTTATCGG GTTCACCCCG TCTATTCTTG GACAGATGAC CGACTTGTAC CTTTACCGCA GGGAAGCTCG AGAGCGTCAG GAGCGGGAAG AGTATGAACG GAAGAATCCC AAAATTCAAC AGCAATTCGT CGATTTGAAG CGGTCTCTTG CGTCGGTCTC GGAAGACGAA TGGGCAAACC TCCCCGAAGT CGGTGACCTT ACGGGTAGGA ATAGACGAAC GAAGCAGAAC TTACGTATGC AACAACGTTT TTACGCGGTC CCCGATAGTG TGCTCGCGAG TGCAAGAGAT TCATCTCAGT TCGATACAAC CGTTGCGGAC GATGGAACAG CAACAGATGC TGGTGCTAAC GGGGCGGACG GAATGATAAC GAACTTTGCC AACATTAGTG CTGCTCGTGA CAAAGTATTA CAGGTTAAGC TTGATCAGGC GGCAATGGGG TCCTCTGGGG ACGCGGCATC TGGAAGTGCG ACTAGCATCG ATCCAAAGGG CTACCTCACA AGTCTTACGC AATCAGAGCT GAAGGCAGGT GAAATCGAAG TGGGAGACGT CAAACGTGTG CGCGTCTTGC TGGAATCTGT AACAAGGACG AATCCCAAGC ATGCTCCGGG GTGGATTGCG CTGGCGCGCC TGGAAGAGCT GGCGGGCAGG ATAGTCACCG CTCGGAATGT GATTGCAAAA GGATGTGAGC TCTGCCCAAA GAGTGAAGAT GCGTGGCTTG AGAACATTCG ACTTAACGAA GGTCACAATG CCAAAGTCAT TGCTGCAAAC GCAATCAAAA ACAATGACCA CTCCACTCGG CTTTGGATCG AAGCTATGCG ATTGGAAACA GAGCCACGTG CAAAAAAGAA CGTGTTGAGA CAAGCTATTC TGCATATTCC GCAATCCGTC ACAATCTGGA AGGAGGCGGT TAACCTGGAA GAGGACCCCG CAGACGCACG CCTTTTACTG GCTAAAGCAG TTGAACTGAT ACCGCTCTCG GTTGAGTTAT GGCTGGCGCT CGCTCGTCTT GAGACACCTG AAAACGCCCA AAAAGTTTTG AACGCGGCGC GAAAGGCCGT GCCTACCAGC CATGAGATCT GGATTGCTGC TTCTCGACTT CAGGAGCAAA TGGGAACCTT CAACAAAGTG AATGTTATGA AGCGAGCTGT TCAATCGTTG GCGAGAGAAA ATGCTATGCT TAAACGGGAG GAATGGATAG CGGAGGCAGA GAAGTGTGAG GAGGAAGGGG CTGTCCTCAC TTGCGGTGCG ATCATTCGGG AGACGCTCGG ATGGGGGCTG GATGAAGATG ACGATCGGAA AGACATCTGG ATGGATGACG CAAAGGCGAG TATTTCCAGA GGGAAATATG AGACGGCAAG 
GGCTATCTAT GCGTATGCCT TGCGTGTCTT CGTCAATCGC CGATCCATAT GGGTTGCAGC AGCGGACCTT GAACGCAACC ACGGCACCAA GGAAGCGTTA TGGCAGGTAC TTGAAAAAGC AGTTGAGGCT TGCCCTCAAA GCGAAGAGCT ATGGCTACAG CTTGCGAAGG AGAAGTGGCA GTCAGGAGAG ATTGACGATG CCAGACGAGT GCTCGGACGT GCATTTAACC AGAACCCTAA TAACGAGGAT ATCTGGCTTG CTGCTGTCAA GCTGGAGGCG GATGCTCAGC AGACGGACCA AGCCCGAGAG CTTCTTGCAA CAGCTCGACG CGAAGCAGGA ACAGATCGCG TATGGATAAA GAGCGTCGCC TTCGAGCGGC AACTGGGTAA TGTTGACGAT GCGCTCGACC TTGTCAATCA AGGTCTTCAG TTGTATCCCA AGGCCGATAA ACTCTGGATG ATGAAGGGCC AGATATACGA GTCACAGAAT AAGCTCCCTC AGGCCCGCGA AGCATATGGC ACTGGTACTC GAGCATGTCC GAAATCTGTC GCTCTATGGC TATTGGCGTC ACGACTGGAA GAAAAGGCCG GGGCAGTGGT CAGAGCCCGA TCTGTTCTTG ATAGGGCTCG TCTAGCAGTA CCAAACAGCC CCGAACTGTG GACAGAGAGT GTCCGAGTTG AACGGCGGGC AAATAACATC CCTCAGGCGA AGGTTCTGAT GGCCAGAGCA TTACAGGAGG TCCCATCATC CGGCCTTCTG TGGAGCGAAA GCATTTGGCA CCTCGAACCG CGCTCGCAGC GGAAGGCTCG CAGTCTGGAA GCTATCAAGA AGGTTGACAA TGATCCAATC CTCTTCATCA CAGTAGCGCG AATCTTCTGG GGCGAACGTC GACTTGAGAA GGCGATGACA TGGTTTGAAA AGGCGATCAT ATCAAACAGT GATTTCGGCG ACGCGTGGGC CTGGTACTAC AAGTTCCTGC TGCAGCATGG TACAGATGTA AGTTTTCTCC TCTCCTGCAT ATAATCTCTT TTTTGCCACT CAGTGGAAAG AACCCCTGTT CTAATATTCA TTCTTCATAG GAAAAACGAG CCGACGTCAT TTCGAAATGT GTACTTTCTG AGCCTAAGCA CGGTGAAGTC TGGCAGTCCA TAGCGAAAAA TCCCGCTAAT GCCTATAAAT CAACCGAGGA TATCCTAAAG TTAGTTGCGG ACAGTCTTGT CCAATAA
|
Protein sequence | MASGRKDFLS QPAPENYVAG LGRGATGFTT RSDLGPAREG PTPEQIQAAL AKRAQLLGAA PPTAYGATRE KGKGEEKPAE EEDDERFQDP DNEVGLFAYG QFDQEDDEAD RIYREVDEKM DRRRKARREA RERQEREEYE RKNPKIQQQF VDLKRSLASV SEDEWANLPE VGDLTGRNRR TKQNLRMQQR FYAVPDSVLA SARDSSQFDT TVADDGTATD AGANGADGMI TNFANISAAR DKVLQVKLDQ AAMGSSGDAA SGSATSIDPK GYLTSLTQSE LKAGEIEVGD VKRVRVLLES VTRTNPKHAP GWIALARLEE LAGRIVTARN VIAKGCELCP KSEDAWLENI RLNEGHNAKV IAANAIKNND HSTRLWIEAM RLETEPRAKK NVLRQAILHI PQSVTIWKEA VNLEEDPADA RLLLAKAVEL IPLSVELWLA LARLETPENA QKVLNAARKA VPTSHEIWIA ASRLQEQMGT FNKVNVMKRA VQSLARENAM LKREEWIAEA EKCEEEGAVL TCGAIIRETL GWGLDEDDDR KDIWMDDAKA SISRGKYETA RAIYAYALRV FVNRRSIWVA AADLERNHGT KEALWQVLEK AVEACPQSEE LWLQLAKEKW QSGEIDDARR VLGRAFNQNP NNEDIWLAAV KLEADAQQTD QARELLATAR REAGTDRVWI KSVAFERQLG NVDDALDLVN QGLQLYPKAD KLWMMKGQIY ESQNKLPQAR EAYGTGTRAC PKSVALWLLA SRLEEKAGAV VRARSVLDRA RLAVPNSPEL WTESVRVERR ANNIPQAKVL MARALQEVPS SGLLWSESIW HLEPRSQRKA RSLEAIKKVD NDPILFITVA RIFWGERRLE KAMTWFEKAI ISNSDFGDAW AWYYKFLLQH GTDEKRADVI SKCVLSEPKH GEVWQSIAKN PANAYKSTED ILKLVADSLV Q
|
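The sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch of how they might be processed, the code below strips the block spacing, computes a GC fraction, and translates the first three blocks of the gene sequence. It assumes the standard genetic code (NCBI translation table 1), since the record's "Translation table" field is empty. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1). This is an assumption:
# the record leaves the "Translation table" field empty.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1; unknown codons become 'X', a trailing partial codon is dropped."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First three display blocks of the gene sequence in this record.
first_blocks = "ATGGCGTCTG GACGAAAAGA TTTCCTCAGT"
print(translate(first_blocks))              # MASGRKDFLS
print(round(gc_fraction(first_blocks), 3))  # 0.467
```

The translated prefix matches the first ten residues of the protein sequence. Because the gene is spliced, translating the full gene sequence this way would not reproduce the protein; the introns would have to be removed first.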