Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_08079 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001302 |
Strand | + |
Start bp | 682390 |
End bp | 684595 |
Gene Length | 2206 bp |
Protein Length | 680 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | Putative Zn(II)2Cys6 transcription factor (Eurofung) |
Protein accession | CBF73839 |
Protein GI | 259480836 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0152693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.153171 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCGC CCATCTCCAC CCACGATCCG AGCGTCTGGG AACGAAGTCC TGATGAGAGC GTTACCGTCA GCGTCAACGG TGATCGAGAG CGCGAGGGTG AAGCTGATGC TGCGCCGAGG AAGAGGCGGA GAAAGTACAT TGCTAAGGCC TGGTTTGTAG ACTGTCCATT ATTCTAGGGC CGCAATCTCC GACCTTGCTG ACAGTTTCAA GCAACGCATG CAGACAACGG AAGATCAAGT GTAACGGGGA GCGACCGTGC CGCAGGTGTG GTCGGCAGAA CATATCCTGC GTCTATGAAA ATTCTCATGA GCCTGGAGGA AATGAGAATA TGTGAGTAAG TGCTGTCTTG ACGAGGCTTA GGCTAACTGG GTAGTGGGAT CGAGCGGCTG TACGAGCAGA TGAATGCCAT GCAGGCCCAG ATTAGTGCCC TTACCGCGTC TGTCCACTCG CTCGCCCAGA GTAATGCATC AGCTTCGATG CCACGTTCCG AGGCTGGACC GCGACTACAC AGGCGTATCT CCATGGCGTC AAAGGAATTG ACTTTCCAGG GTCCCACGAC GTCGGGTTTT AGCTTCGATC TTGCCAAGTC GAGTCTCAAG GAACGCGGAA TTGAAGTTGA GCGTAATGAA GGCGACATAA CGCGGGAACC GTCGCCACTG CCGACTCCCC CATCGCCAGG ATCCTGTCAT GTTGGGGATC CACTATGGAG TATCAGCAAG ACGGAAGCCC TCCGGCTGTG TCGAGTGTAT GAAGAGGAGA TGGGAGTCAT GTACCCGGTA GTGGAGCTTG ATCAGCTTCT ACATAATGTC CAGTTGTTGT ACGGACCGAC AGAAACCGGG CCCTGGCTTC AAGCGCCTGC TCATGCACAA GGCATTGGGG AGTTGGACAG TGATGACGTT CACATCCTGC GATTGGTATT CGCGTGCGCT CTGACAGCAG AGGCGAGTGG GAGTAGTGAG TTAGCGATGA GTTTGTTTGC GACTGTTCGA GATGCTGCAG ACCACTATGT CTGGGCAGCC CCAGAGCTCA AAAGCATTAC GTTACTCGCG CTTGTGGTAA GTCGCACCTC TGGAGTACCA ACGAATCTAA TGGGCGTAGG CGATATTGTA TTTCCAGATC GACGAGGAGA CCCTTGCTTG GCGAACGATC GGGATTGTGG AACGAATGTG CTTAGAGAAA GGTCTGCACC GAAGAGAAAC ACTCAAGCAT CCTGCTATTA TGAAAGCAGG AAAGAACCGC GTGCTCAAGC TGTTTTGGTC GGTTCGGGTA CTGGATCTGA GGTGGAGCTT CGGAACGGGC ATGCCGTTTA GCATGGATGA CTCTGATATC GACCCCTGGC TCCCAGAGCC AGAGGAGGAG GACTCATACT TGCGCGTGAT GGTCCGGTAT AGTAGGATAG CAGCTAAGGT CTGGAGGTTT ATATCAGCCT TCAATAACAC CAACGAGCTC AAGAAGGACG AGATGAACTA CCTAGACTGG CAAGTCCTGC AGTGGGTTGC TGCTCTGCCA GACTCTTTGC GCCTGCGAAG TACCTCAGGC TATGCAGAAG CCGAGACGCG AAGCCTCCGG CGACTCCGCT CCTTGGTGTA CCTCCGGGCA AACCAGCTGC GGATGCTCAT CCATCGCCCT GTCCTACATT CAGCAGCACA TATGATGCGC TTCCCAAACG AGACCCAGAC CGTAGTCGAC ATGGCCAAAG ACTCAATTCG TTTCATAACG CAGCTCCACG CATCGTCTGA TATTTACCAG CTGCAGCAAG TGGTCTTCAA CTGGTTCCTG GTGTCGGCAG TGATGGCACT GTTCCTTGCT GTCGCCCAGA TACCCAGCCA GTACAGCGTA GCCTGCCGAG AAGAGTTTTA TATGGCTCTT GAGCTAGTCA AAGGTGTCTC CGCGCGGTCG TACATCTCAC GACGGCTGTG GAAGTCGATC AAAGGCCTTC GCCGGCTTGG CCCGCAGCTT ATGCATCGAC AAAGCGACGT TGATGGTGCC ACCACTGAAG GCCTCGCCGA CCATGCCGTA ACGTCTAGCA CCCAGAGTCA GACACCAGAT GGAGCCCAGA TGACGCAAGA GTTGAAGGAT TGGTTCGAAG CCGTTGGCAA TTTGGAAGAT CAGATCATGG GTGTCGGATC GCTGGACAAT TTTCAGGGAG GATATATGAT GGATTACAGC AATGGACTAT CGAGCATGAT AAATCATTGC TTTTAG
|
Protein sequence | MNPPISTHDP SVWERSPDES VTVSVNGDRE REGEADAAPR KRRRKYIAKA CNACRQRKIK CNGERPCRRC GRQNISCVYE NSHEPGGNEN IGIERLYEQM NAMQAQISAL TASVHSLAQS NASASMPRSE AGPRLHRRIS MASKELTFQG PTTSGFSFDL AKSSLKERGI EVERNEGDIT REPSPLPTPP SPGSCHVGDP LWSISKTEAL RLCRVYEEEM GVMYPVVELD QLLHNVQLLY GPTETGPWLQ APAHAQGIGE LDSDDVHILR LVFACALTAE ASGSSELAMS LFATVRDAAD HYVWAAPELK SITLLALVID EETLAWRTIG IVERMCLEKG LHRRETLKHP AIMKAGKNRV LKLFWSVRVL DLRWSFGTGM PFSMDDSDID PWLPEPEEED SYLRVMVRYS RIAAKVWRFI SAFNNTNELK KDEMNYLDWQ VLQWVAALPD SLRLRSTSGY AEAETRSLRR LRSLVYLRAN QLRMLIHRPV LHSAAHMMRF PNETQTVVDM AKDSIRFITQ LHASSDIYQL QQVVFNWFLV SAVMALFLAV AQIPSQYSVA CREEFYMALE LVKGVSARSY ISRRLWKSIK GLRRLGPQLM HRQSDVDGAT TEGLADHAVT SSTQSQTPDG AQMTQELKDW FEAVGNLEDQ IMGVGSLDNF QGGYMMDYSN GLSSMINHCF
|
| |