Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_01698 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001307 |
Strand | - |
Start bp | 1599925 |
End bp | 1603231 |
Gene Length | 3307 bp |
Protein Length | 920 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | transcription initiation protein spt5 (AFU_orthologue; AFUA_4G08500) |
Protein accession | CBF85391 |
Protein GI | 259487039 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.80397 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.06585 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCGGA ATTTGATGGA CCAGGACTTC GGGTCCGAGG AGGAAGATGA TGACTTCAAC CCCGCACCCG CTGAAGAGTC GGATAATGAG GAGGCTCATC ATGACAAGGT TTGTGTGCAG TTGGCTGCTG TTCGATGATG TTTAAGCTGA TGGGCAGGAC TTTAGACCAG AAAACCAGAC CGTGATTCTG ATGCGCGAAA TGGAAGCGAT GATGAAGGCG CTGATGAGGC TGGCGAGGAA GACGAGGAGG AAAACGAGGA AGGCGGAGAA GGAGAAGGGG ATGAAGAGGA AGATGAGGAG GAAGATGAAG ACGACGACGA TGTTTCGGTG AGCAACTCCA GATTATGTCG GAAGCTTACC AACGATCACT AATAATGGGT CTTGCCTGTT TCCTAGAAAC CACGAAAGCG AAGAAAGGGA CATGGGGGGC TTAGTGCCTT CATTGATTAT GAAGCTGGTG TTGATGAGGA GGAGGATGAG GTTGAAGACG AAGAGGAAGA AGAGGGTTAT GGCTTAGAAC AACATCCTGA TGATGTTCTG CCTGCAGGAG CTGAAACCGA TGATCGTCAA CATCGTCGAC TCGATCGTGA GCGTGAATTA GCCGCCACTC TGGACGCCGA GAAGCAGGCA CAGTTACTCA AGGAGCGCTA CGGAAGAAAC CGTGCCGCTG CTACAGATGC CGTCATAGTC CCTAAGCGGC TGCTGCTCCC CAGTGTTGAT GATCCCAGTA TATGGGGTGT CCGGTGCAAG GCGGGCAAGG AGCGTGAGGT TGTCTTCTCC ATCCAGAAAC GCATCGAGGA CAGACCGCCC GGTTCTCGAA ATCCCATCAA GATCATCTCG GCCTTCGAGC GTGGGGGCGC GATGAGTGGC TACATCTATG TCGAAGCGCG GAGACAGGCC GATGTCATGG ATGCTTTACA AGACATGTCA AACGTTTATC CAAGGACGAA GATGATTCTG GTTCCGGTTA AGGAAATGCC GGACCTACTG CGAGTGCAGA AGTCCGAGGA GCTCAACCCT GGCGGTTGGG TCCGGATCAA GCGCGGCAAG TACATGAATG ATCTTGCTCA GATTGAGGAG GTTGAGACGA ACGGCCTGGC TGTCACTGTT CGTTTGGTCC CTCGTCTGGA CTACGGTATG AATGAGGATT CTGGTGCTCC TATCATGGAC CCTAAGAGGA AACGGCCGGG AGCGAATCCC GCTGTTGCAC GTCCTCCACA GCGGCTGTTC AGCGAGGCCG AGGCCAAGAA GAAGCACAGC AAGTATCTCA CTGCGACGGC CGGCTTGGGT GCCAAGTCAT GGAACTACCT TGGCGAGACT TATATTGACG GTTTCCTGAT CAAGGACATG AAAGTCCAGC ATTTGATCAC GAAGAACGTC AACCCTCGAC TGGAGGAGGT CACCATGTTT GCTCGTGACT CCGAGAATGG TACTTCAAAT CTTGATCTTG CTTCCCTTGC GGAAACACTC AAGAATTCCA CTGCGGAAGA ATCATATCTC CCTGGGGACC CAGTGGAGGT GTTCAAGGGC GAACAGCAGG GTCTTGTCGG TCGCACTAGT TCCACCCGTG GAGATATTGT GACTATATTG GTTACAGAAG GTGAGCTGGC TGGCCAAACG ATTGAAGCTC CTGTGAAGAC TCTTCGCAAG CGCTTCAGGG AGGGAGACCA CGTCAAAGTT ATCGGCGGTA GTAGGTACCA AGACGAGTTG GGTATGGTTG TCCAAGTGAG AGACGACACA GTCACTCTTC TTAGCGACAT GAGCATGCAA GAGATTACTG TATTCAGTAA AGACCTTCGC CTGTCCGCCG AGACGGGCGT TGATGGGAAA CTCGGCATGT TCGATGTTCA TGATCTCGTC CAATTAGAGT GAGTTCCTAT CTCCCTTTTA TGTCCGTTTT CACTAATGTC ATAAAAGTGC CGCTACTGTT GCTTGCATTG TCAAAGTCGA TCGTGAATCA TTACGAGTCT TGGACCAGAA CGGTTCTATT CGCACTATTC TCCCCTCCCA AGTCACGAAC AAAATCACAC CACGACGAGA CGCGGTGGCT ACCGACAGAA ATGGTGCTGA AATTCGGCAT GGAGATACTG TTAGAGAAGT ATACGGCGAG CAACGCAGTG GTGTAATCCT GCACATCCAC CGCTCCTTCC TGTTCATCCA CAACAAAGCC CAGGCTGAAA ACGCAGGTAT TGTCGTTGTG CGTACTACCA ATGTTGTGAC AGTATCCGCA AAAGGAGGCA GACCTACAGG ACCAGATCTC TCCAAGATGA ACCCAGCTCT GATGAGGAAT GGTGCTCCGG GTGGTATGAT GGCACCTCCT CCCTCGAAGA CTTTCGGACG GGATCGACTT CTCGGCAAGA CAGTCTTGGT TAAAAAGGGG CCTTTCAAGG GACTTCTTGG TATCGTCAAG GATACGACCG ATGTTCAGGC TCGAGTGGAA CTTCATTCGA AGAACAAGCT TGTTACTATA CCGAAAGAGC TCCTTGTTGT CAAAGATCCT GTTACCGGCC AGACTATCGA TATTGGTCGT GGTAGGGGTG GTCCCCGGGT CCCTCAAAAC TCAGCCGCTC CGTCATCTGG CTGGCAGGGC GGTCGTACCC CAATGGCCGC TGCCGATTCT TCGAGGACAC CCGCCTGGGG TGCGGCAATG TCGTCAAGAA GTAAGTTTCT CAAGCTCTTG CTTTACTCTC AAGGTTTTCT AACATAAGTC CTGCAGCGCC TGCTTGGAGC GGTGCTGGTC TTGGTTCTCG CACACCAGCC TGGAAAGCCG ACGGCAGCCG GACAGCCTAT GGCGGCGCAG GATCACGCAC ACCCGCCTGG AACGCAGGTG CTCGAACCCC CTACGGCGGA GGCTTCGGTT CCGGCTCCGG CAACAGCGAC TTCGACGCTT TCGCCGCCGG GTCCCGAACG CCCGCCTGGG GCGCAGCATC CGGCAGCCGC ACCCCAGCCT GGTCAGCTAG TGCTAATACC ACCAGCAGGA ACGACAACAA GGCCTACGAC GCCCCGACAC CTGGGGCAAC ATACTCGGCA CCCACGCCTG GCGCATATGG CGGCGCTCCT ACACCTGGTC TTTCAGCACC CACGCCTGGA GCCTGGGCTG ATAGCGCGCC AACACCTGGG GCGTACAATG CGCCTACGCC CGCCGACTTT GGCGAAGGAA GTCGTCCGTA TGATGCGCCA ACACCAGCAA TGGGCGGCGC GGCTGCTACA CCGGGTGCTG GGGCATATGG TGATACGGAT GATGGTGCTC CCAGGTATGA GGAAGGAACG CCCAGTCCTT GATATTCGTG TGACGCATGT TTATGAG
|
Protein sequence | MSRNLMDQDF GSEEEDDDFN PAPAEESDNE EAHHDKTRKP DRDSDARNGS DDEGADEAGE EDEEENEEGG EGEGDEEEDE EEDEDDDDVS KPRKRRKGHG GLSAFIDYEA GVDEEEDEVV FSIQKRIEDR PPGSRNPIKI ISAFERGGAM SGYIYVEARR QADVMDALQD MSNVYPRTKM ILVPVKEMPD LLRVQKSEEL NPGGWVRIKR GKYMNDLAQI EEVETNGLAV TVRLVPRLDY GMNEDSGAPI MDPKRKRPGA NPAVARPPQR LFSEAEAKKK HSKYLTATAG LGAKSWNYLG ETYIDGFLIK DMKVQHLITK NVNPRLEEVT MFARDSENGT SNLDLASLAE TLKNSTAEES YLPGDPVEVF KGEQQGLVGR TSSTRGDIVT ILVTEGELAG QTIEAPVKTL RKRFREGDHV KVIGGSRYQD ELGMVVQVRD DTVTLLSDMS MQEITVFSKD LRLSAETGVD GKLGMFDVHD LVQLDAATVA CIVKVDRESL RVLDQNGSIR TILPSQVTNK ITPRRDAVAT DRNGAEIRHG DTVREVYGEQ RSGVILHIHR SFLFIHNKAQ AENAGIVVVR TTNVVTVSAK GGRPTGPDLS KMNPALMRNG APGGMMAPPP SKTFGRDRLL GKTVLVKKGP FKGLLGIVKD TTDVQARVEL HSKNKLVTIP KELLVVKDPV TGQTIDIGRG RGGPRVPQNS AAPSSGWQGG RTPMAAADSS RTPAWGAAMS SRTPAWSGAG LGSRTPAWKA DGSRTAYGGA GSRTPAWNAG ARTPYGGGFG SGSGNSDFDA FAAGSRTPAW GAASGSRTPA WSASANTTSR NDNKAYDAPT PGATYSAPTP GAYGGAPTPG LSAPTPGAWA DSAPTPGAYN APTPADFGEG SRPYDAPTPA MGGAAATPGA GAYGDTDDGA PRYEEGTPSP
|
| |