Gene Information |
Locus tag | ANIA_07118 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001304 |
Strand | - |
Start bp | 956254 |
End bp | 959330 |
Gene Length | 3077 bp |
Protein Length | 991 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | Putative transcription factor with C2H2 and Zn(2)-Cys(6) DNA binding domain (Eurofung) |
Protein accession | CBF79046 |
Protein GI | 259483558 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATCCGG TCCCCTACTC ACCATCACGC GGAGGGTGTT CGATCCAGTG CCATCTCTGC CAAACAACAT TCACTCGGCA AGAACATCTC AGCCGTCACC TCCGCAGCCA TAATGCAGAA AAACCGTTCC ATTGTGTGAA ATGCGGCAAA TCATTCAGCC GCTTGGACGT GCTTCACCGC CATAGTCAGT TGCATAGGTA TAAGCGTAAC GCGTCAGAAG GTCGAGGTGG GAGCAGCTCG AGGGCTTGCA AGGAGTGCGC GTTGAGTCGC GTCCGCTGCT CCCGAGGAAG TCCCTGCGAC CGCTGCTCAG TGAAGGAAAT TCAGTGTGAA TATCCAGTAC CGCGGAAGAG AAAAAGCCAA TCCGGGTTGG GTCAAACTGG CTCGCTTCCT CAGACAGCAG AGAGCGGCTT CTATCTCAGC GATGTCAGCT CTTTCCAGCT TCATTCGTCC CAACAAACTC TGAACAATTC GATAATTGAT GGCCTCCCGT CAACTTCACA CTTGTTAGGC GAAGAGTCGG AAAGTCAGGC CCCGAATGCT GGAATTACAC AGGATTGGCT GCCGTTGCAG CAATCCGGCA CACTGGCCAG TGCGAGTCCT GGGTTGGTTA GTGAAGGTAT GCTAGCATTC GAGAATAATA CGCTAGAGAA CCATTTAGGC CACTGGATTG GGAGCCTGAC GTCTGTGAAT TGGCTCTCCC CCGAAAGTAA TCATTTCCCA GAAGCTCAGT TCGAGGATTT ACTGCTGCCA AAGGGCAAGG ACGGAACTAC CGCGCGGGTT GAGAGTATGG ATAGACTGGG TGAGCACGCT TCTCAGACAA ATCGCAACAA TGCTGTCCCC GTGGGTCAGG GGAATACCAA TGCTACCCAG CCCTCAACGC CGTTGTGGAC GCCCTCGGGA ACCGGACAAG CATTGACGGC TGGCCAATCC ACGTGCACTG ATACAACTGA TACGCCGACA GTGCGACATT CCGAAGGCGT GTACTACGTG GAAGGAAGTG CCGGACGGGC ACCATTTCAG GGGAGATCAA GCTGGAGAGC TAGAATGACT TCTCGCTGGA GCATCGGTAA TGCGTCGGCC GAGCAAGGGG TACCAGACTC TGAGAGCTCC ATAAGGTCAA GGGGGCACTA TGTGTCCGAA ACGATCTACC AATCCTGTCG TGAAAGACTC GAGGCAGAGT CAAATCACTT CGGGCTTGGG ATAGATATGG ACAGTATCCC GCAGCTCGGT GAGATGCAGG ACCTGGTAAA TCTCTACTTC GATGGTTTCC ATCCATCTTA TCCCTTTTTA CGCAAAAGTC AGTCCATTTT TGTCAAGAGC TCATGCTGGA TTCTGCTCTT GGCTGTGGCC GCAACTGGGT CGCGATATAG TACTGAGGCT AGGCATCACA AGCTCGGGGA GTCTCTTGTT GATATGGTAG ATCAGCTTGT ATCGATGCGG CTGCAAAATC CTGTATTGGC GGGCAGTGAT CCGACGTGGA AGCCATGTGC TGGGTCTGAC GAGGGGTCTC TGGACACCGT AACCCTCCAG GCCGCGTTGC TGAATTCTAT ATCCCTTCTG CACTGTGGAA AGGAACACGG CGTTCGACGT GCCTTGCGCC GGAGATTTTA CTTTTTCGAA GCCTACCACG CTCTGAAACA GGCCACATCC ATAAAGAGGA GGTCATCGCA GTTACGAGAA GGAACCGAGG AAGATACCTT TCAACATTGG GTAGACACAG AGTCGCTTAT CAGGACGAGT TGGATGATCT GGGTAGGTGC CTCTGAAGGT CGGCTCCGCG TTACGTCTAG TGCCTCGCTC 
ACGGTTTGCA GTTTCTTGAT TGTATTGCCC TATACCAATT TCGCCACGCT CCGCTGATTC AATTGGGAGA CTCAAAAGCT CCTCTTCCCT GTCATGAGGA CCTCTGGGAC GTTTCCTCAC TAACCGAGGG TTTCAGCAAT GCAGACCATC AATCAGGTTC GTTTTATTCA CTTGGTCCCT TCGAACAGAC CTATAACCTG TCGAAGGCTG ACTTTGATAT CGCATTACTG ATAGTTACCT TGCTGGAAGC CCTTGAGCTG CTCCATATGG AAAAGACATT ACCTCCTAAG TTGGGAAATT TCAGCACTAC GATCATCATC TTTGGCATCT GCCGTCGTAA TCAAGAAGCC ACCGTGCAGC ACCAAACCAA CTTAACCCTT TGGTTACCCA GCGCGCAGAA ACAGTCGCGC CCTCCGTTGC ATCCGATAGA AGAGGCATGG CCGCCAACTG TCTCCTCCCT GTCCAGGTGG CGAAGCAGCG CTTGCGATTG CCTTGACATC TTGCATTGGA ACGCAAATAG CATAGCTGCG AGTGTGGGTG GCTGGGAACA TCCGACGATT CTGCACCTCC ACCTCGCGCG ACTTCTGCTG TTGGCTCCGG TACAGCACAT CGAGACACTT GGTAGCGAGT CAACGATATC TCACACTCCC CAAACTTCCA GCTCGACTGC ATACACGATA GCTCGATACC ACACCCTCCG CTGGGCAATC CGCGATCAGT ATAAGGCGAG ACTCTGCCTT GTTCACGCAG GAGCCCTATT CTGGCACGTT CGACGATACA GCAGTAATAG CTTTCTGGAA CCATTTAGCG TATATACTGC CACGCTTGTC ATTTGGGCAT ATAGTATGGC AATGCACACC ATGCGAGGCC AAGGCCGCGA AAAGGCGATT CTTTCCGAAA CTCATCTAAG CCCGCGCGAT CCCGTGCAGC AAGAAGCGCC GTGTCTTGAG GAGATCGGTC TGGATGACAA GAGTAGTGAT AGTGACGCTG AGGTGATGGT TATACAGCTC GACCGCCCGT GTGATGATGA GATTGTTCAG AACTTTGTTC GCTTTGGGCA CACCATGTCC GCGCGCATGC ATCGGGTTGG GGATATCCAA GAACAAAGTG CACCACGACG GATCCTCAAG CAGGGTCTAC GGTTGTTAAC CGGCGCCTTA TCAGATTCTG ACAGAGCAGT CCCTAGTTGG GGTGTGGAAA AGTCCTTCAT TGATTCCCTA AATACCTTTA TTGAGCTCCC GATGGTCACT TCAAAGAACG ACAGGTTACC TGGATGA
|
Protein sequence | MDPVPYSPSR GGCSIQCHLC QTTFTRQEHL SRHLRSHNAE KPFHCVKCGK SFSRLDVLHR HSQLHRYKRN ASEGRGGSSS RACKECALSR VRCSRGSPCD RCSVKEIQCE YPVPRKRKSQ SGLGQTGSLP QTAESGFYLS DVSSFQLHSS QQTLNNSIID GLPSTSHLLG EESESQAPNA GITQDWLPLQ QSGTLASASP GLVSEGHWIG SLTSVNWLSP ESNHFPEAQF EDLLLPKGKD GTTARVESMD RLGEHASQTN RNNAVPVGQG NTNATQPSTP LWTPSGTGQA LTAGQSTCTD TTDTPTVRHS EGVYYVEGSA GRAPFQGRSS WRARMTSRWS IGNASAEQGV PDSESSIRSR GHYVSETIYQ SCRERLEAES NHFGLGIDMD SIPQLGEMQD LVNLYFDGFH PSYPFLRKSQ SIFVKSSCWI LLLAVAATGS RYSTEARHHK LGESLVDMVD QLVSMRLQNP VLAGSDPTWK PCAGSDEGSL DTVTLQAALL NSISLLHCGK EHGVRRALRR RFYFFEAYHA LKQATSIKRR SSQLREGTEE DTFQHWVDTE SLIRTSWMIW FLDCIALYQF RHAPLIQLGD SKAPLPCHED LWDVSSLTEG FSNADHQSGS FYSLGPFEQT YNLSKADFDI ALLIVTLLEA LELLHMEKTL PPKLGNFSTT IIIFGICRRN QEATVQHQTN LTLWLPSAQK QSRPPLHPIE EAWPPTVSSL SRWRSSACDC LDILHWNANS IAASVGGWEH PTILHLHLAR LLLLAPVQHI ETLGSESTIS HTPQTSSSTA YTIARYHTLR WAIRDQYKAR LCLVHAGALF WHVRRYSSNS FLEPFSVYTA TLVIWAYSMA MHTMRGQGRE KAILSETHLS PRDPVQQEAP CLEEIGLDDK SSDSDAEVMV IQLDRPCDDE IVQNFVRFGH TMSARMHRVG DIQEQSAPRR ILKQGLRLLT GALSDSDRAV PSWGVEKSFI DSLNTFIELP MVTSKNDRLP G
|
| |
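The derived fields in this record (Gene Length, GC content) can be recomputed from the coordinates and the gene sequence. The sketch below is illustrative only: the function names are hypothetical and not part of any database API, the coordinates are assumed to be 1-based and inclusive, and the expected coding length assumes one stop codon and that the gene span covers only coding sequence plus introns.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


def expected_cds_length(protein_aa: int) -> int:
    """Coding length in bp for a protein, assuming a single stop codon."""
    return 3 * (protein_aa + 1)


# Values taken from the record above.
print(gene_length(956254, 959330))                       # 3077
print(expected_cds_length(991))                          # 2976
print(gene_length(956254, 959330) - expected_cds_length(991))  # 101

# First two 10-base blocks of the gene sequence, as a small check.
print(gc_percent("ATGGATCCGG TCCCCTACTC"))               # 60.0
```

Under these assumptions, the 101 bp difference between the gene span (3077 bp) and the expected coding length (2976 bp) is consistent with the record marking the gene as spliced. Passing the full Gene sequence field to `gc_percent` should give a value comparable to the listed GC content.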