Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_08177 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001302 |
Strand | + |
Start bp | 1025725 |
End bp | 1028781 |
Gene Length | 3057 bp |
Protein Length | 867 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | Putative Zn(II)2Cys6 transcription factor (Eurofung) |
Protein accession | CBF74045 |
Protein GI | 259480950 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.681278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.988868 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCGG CCTACTATAT TGGACCCTTC GGGCGTATGT TACCTATTGC CGATGGTCCC GCGGAGCAAG AACCGGATCG TCCAGCTAAC CTATCAGTTC CGGGATCCCA TCCATATCAA CTTCCACCGC CTCGTACTTC AGCACCATTA CAGTTTGGAA CAGATCCATT CTTACGTCCA CGAAATCGAG CCGACAGGCT AGACGAGAGA GAAGAGCCTT CCGTTTACTC GAGCAGTCAT GGGTACCAGC AAACGAACGA GCAGCTTCCC TCTGTCAGTC AACTTCTTAC ACCTACCGCG CAATTAAGCC GGTCGCCATC ACCATATAAC CCAAGTGCTT TTGGGATCTA CTTGGCGCCA CACGAGCCTG GAGAATCACC CCAGCGCTAT AATGAGACAA CCGCGCCCCC TACACAGGCG CGCTCAAGCA TCTATGAAAG ATCCAGGGCG TTCCCGGACG TAGCAACCTC GCCGCAATCG AAAAACTTAC CTCCTATATC TCACATTTCG ACTCATGCTC CTGGCCGTAA TACGCCGACG TACTTGAGTA ACAACTTGAA TTCAATTCAC CCGCCATACC TACCTAGTTT TCATGGCTAT AATGAGGAAC CCAACGGCAG AAATTTCATG AAGACCCCTT CACTCAGTAC AACTGAGTTG AGTCAGCGCG GTGCTTCTGG CAAATCAGCG AAACCTCAAG TACGTCTGCA CGTCGTGGAT GAGCGTTTCA TTGAGGGTGA AGGCCTGTGT TATATATACG CTGACGGCTC TCACTGCCCC AAAATCATTG ATGGCATGCC AGTTAACGCC AACTGGGGGG TCACGAAAGC TGGCAAGCCT CGAAAAAGAT TAGCGCAAGC CTGTCTTACA TGCCGAGAAA AGAAGATAAA GTGTCATCCA AACCTGCCGA AGTGCGACCA ATGCCAGAAG TCTGGGAGGG AATGCAGATT TGAGAGTGCG TAAGTATATG GACAACAGAG TGCCACGAAC CTGAATTGCT GATGACGTTC AGACCGCGTG GACACCGCGC AGCATCGAAG GCGTCGCAAT TCACGAGGAA ATATGATATA AGACATAATA CTGCAACGGA GGACTCTAAC AATGCAGGTA CCTCCAGTTC ACTATATTCT GTTGCGAGGG CTTCAGAGAG CTCTACCTCC CTCCCTGGAA CAAACTCACA GTCTCCCCTA TCTGATGACT CCATGCTTAC GCCTTCTGCT GTGGATAGCA ACCATAACAA CATTAGCGAT CCCGACCCGG CAATATGCGA CAAGGCCGCA GCACCTTCCC GGTGAACGTG AGCGTATGCC GCGGCATTCA ACAGGCAGCG CTGCCAGCTC ACCATCCGCG GACTACGCGG AGATCTTGAC GGAGATCAAG GACTTGGATG AACACGACCC ACTGGCGACT GACTGGAGAA CAGACCCTTA CGCGGTCGAT CCGGAATCTG CAACTCATTT CACCGAATTG TACTTCACAT ACGTGAATGA CCGCTTATAC TATTTGTTTC CGCGAAGAAG GTTTCTCCTT TGGCTCAAAT CATGCCATAC GAAGTCTCTC GCCGATAATA TGCTTCTTTA CTGCATCATG GCACTGGGAT CTGTCTTCTC AGACCGCCCT GGTAAGATCA CAGCTATGAG GAGATACTCC CGCATTGCAA CATACGCCCT CGAGCACAGC CAGCACAGTC TATCCTTACA GCTTGCACAG AGCCGCATTA TAATTAGCCT TTGGTACTAC GCAATTGGCG CACTCGTGAA ATCTTGGGAT GCTGCCGGCG CCGCGGTGCG AACGGTATGC GGCCTGCGTT ACAATGTCGA AATGGGGGGA GTTATTGTAG AGCAAAGCCA GCCTTGCGAG TATGGCCTAC ATCCACAAGC TCTGATAGAA TGTCGTCGGC GAACCTTCTG GATCGCTTTT CTGACTGATG TGAGTTACAT ATCAATCGCC CTCTCCCCGC TTTTACCCCT TCTGAGCTCA GTTGTCTCTA ATTTCCGTTA CACAGCGCCT GTCGTGCTTC TATGCCCCTT CAACGACCTT CATCTCCTCC CAAACAGCCT TCCTACGTCT CCCTTGCCGC GAAGAGATTT ACGAAGCCCA GGAATATACC ACAGTTCCCT TCTTCCAAAA TTTCCTTAAT CAAGTCCCCT CCGAATCGGA CGAACTCTCC AACCTAAGCG TCTTGGCACT CCTCATAGAC GTGATATCAC ATGATGGGGC GATGTCTCTG ACCACGTTTT CCGCCTATCT CTCATCCCGG CAGATTCATA CAACAAACTC TTCGAGGATT TCTATACCGC CATAGTCCGC CGATCAGACC AGTGGCTCTC AAGGCTTCCA AACCACCTAA CATTTACGGC TGTAAACCTC GAACGCAGCA TCCAAGCACG AAACACTGAC CATTTCATCT CAATTCACCT TTTGTATCAT GCCGCCCTTT TAAAACTCAA CCGTTACGCA CGCGCACAGC TCCTTAGACC TGGAATGGCA AAACAGTACG TTCACACAGC CCGCAACCAT GCTGCAGAGA TACTCCGCAC CGCACTCGCG CTTGAACGCT ACGCCTCCGA TCACAACGTC TCTCCAATGA CAGCTGACCC GACCCCAAGG TCCGAAACAC TACTGCTGGA TCCCTTCCTT GGCTACATAA TCCTCTCCGC AGTAGACGTT CTCAGCGCCG GTGGTCTAGT TATCGACTTG CCTGAGTGCA TCAACCTTAT CCGCGGGGGA CTTGACGTTG TCCGTGACCT CAGCAGCTTC TGGAACAGTA CGAAGCCGCT GGTGTCGGCT ACGGAATCAC GCTTAGAGGC GCTGATTGAG GCGCACCGCT CTGTTTCTAC GAGCCGCACC ACACTTGAAG GGAGAGTGGC TTTCTTGTTC GATGGCCCCT CGCTGGACAG TCAAATCCAG AATGGCGTGC AGAAGCAGGA TTCGTCCGTG AATGAGGACC TACTATATGG CGGTCTGCCA AGGGAGCAGC TATTCCTTGC GTTTGGGGTG ATGGATGTGT CGTGTTCGTT GAGGAATGTG GTTTGGGTTC GAGCGAGGCG TGAGTGA
|
Protein sequence | MSSAYYIGPF GRMLPIADGP AEQEPDRPAN LSVPGSHPYQ LPPPRTSAPL QFGTDPFLRP RNRADRLDER EEPSVYSSSH GYQQTNEQLP SVSQLLTPTA QLSRSPSPYN PSAFGIYLAP HEPGESPQRY NETTAPPTQA RSSIYERSRA FPDVATSPQS KNLPPISHIS THAPGRNTPT YLSNNLNSIH PPYLPSFHGY NEEPNGRNFM KTPSLSTTEL SQRGASGKSA KPQVRLHVVD ERFIEGEGLC YIYADGSHCP KIIDGMPVNA NWGVTKAGKP RKRLAQACLT CREKKIKCHP NLPKCDQCQK SGRECRFESA LPYLMTPCLR LLLWIATITT LAIPTRQYAT RPQHLPGERE RMPRHSTGSA ASSPSADYAE ILTEIKDLDE HDPLATDWRT DPYAVDPESA THFTELYFTY VNDRLYYLFP RRRFLLWLKS CHTKSLADNM LLYCIMALGS VFSDRPGKIT AMRRYSRIAT YALEHSQHSL SLQLAQSRII ISLWYYAIGA LVKSWDAAGA AVRTVCGLRY NVEMGGVIVE QSQPCEYGLH PQALIECRRR TFWIAFLTDR LSCFYAPSTT FISSQTAFLR LPCREEIYEA QEYTTVPFFQ NFLNQVPSES DELSNLSVLA LLIDVISHDG AIIQARNTDH FISIHLLYHA ALLKLNRYAR AQLLRPGMAK QYVHTARNHA AEILRTALAL ERYASDHNVS PMTADPTPRS ETLLLDPFLG YIILSAVDVL SAGGLVIDLP ECINLIRGGL DVVRDLSSFW NSTKPLVSAT ESRLEALIEA HRSVSTSRTT LEGRVAFLFD GPSLDSQIQN GVQKQDSSVN EDLLYGGLPR EQLFLAFGVM DVSCSLRNVV WVRARRE
|
| |