Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_04013 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001302 |
Strand | - |
Start bp | 2283309 |
End bp | 2286172 |
Gene Length | 2864 bp |
Protein Length | 864 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | C2H2 transcription factor, putative (AFU_orthologue; AFUA_1G04110) |
Protein accession | CBF74886 |
Protein GI | 259481402 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCATCA GAAAGTGCAA CATCTGTGAC CGTCGGTTTA AGAAGACGGA GCATTTCAAG CGCCATGAAC GATCACGTCA GTGCAGTGAT TCTGCTGAGA AGTATCTTTA CTGACATGCT CTAGACACCA AGGAAAAGCC TTACGAGTGT AGTGTCTGCC ATAAGCGATT TTCCCGCAGG TGAGTTAACT CCAACCTGCA CCAACTGCGA CGCTGATCGC TATCCCAACA GCGATGTGTT GAGCCGTCAC GCCAAAGGAC ATAATACGAA TGGAACAGCT GCGACTGCGA CTCCGGCGCA GAATAAGCCG CGCGCTCCGT CGACCGCCCA GCCTCCCACT CCTCAGCAAC CGCCCGCTCA GACCGATCCT GCCCGCTTCC TGTCAATGGA CGGCGATGCT CATCCGTCGC ACAATCTCCC TGTCGCGCCG CGCGATATCT CGCTTTCCAA TCCTGCCCAC CTCCAGTCTT CGTCGCTGAA CTTTCTCGCG GACATCTCCG CCCACCATGG CCGGGCCGAA CCGGATGTCG GGTCCATGAT GGTCGAAGAA CAGCAGACGT ATTTCGGCTG GAACGAAATG CCCGCTCCTC CGCCGCCTCA GCCTGTCGCT GCTCCGCCGT CTGATCAGCA GGCTTACCGA ACGCCGATGT TCGATGCTAT GCCGAACGAT ATGCTACAGT TTTGGTTGGA GCCGCGCGGG GACACCGCCT CACATCATGG GTCACTTGAT ATGTTGGGCG AGCCCAGTTT CAGTCTTATG GGAGATAATG TGGCAATTAC GCCGGAACAG CAAGCCCGGC CGTCGAACGA CGATCTAAGC AGCAAACATA CTGGAGATAT CCCCAATGAG CGATTCGCAA GGGTCCAAAG GTACTGGGTG GCCCCGTCAA ATGCCGCCGG GAGACTCGTG AACAATCTTT GGCGGGACAT TGCTGCCAGC GACGTTGATA ACATTTTCTC TCTCCAACCC TCGCATTCTT TCCATAGCCC GTCTGGACTG CTTCCCGGAT CTCGATATGG CCTGGACGAG GAATGCCGTC AGCAGTTGCA AGCGGTTTTC GGAAACCTTA GACACTACAA CCAACTACAC TCCCCGAATA GTGCGGTATC GCCTACGTCC AGTTCGACTT TTGGAGGCCG TCCAAGTTTT CCTCCAGCTG AGATACTGGA CATGGCTCTT GACTTATACT TTCGCAATTT CCACCCTCTT ATCCCTTTCG TTCATGTTTC GACTTTTTCG GTGAAGAATA CTCGTCTCCC AGTGCTATAT GTCATGTGTC TCATTGGGAT GATCATGTTA GGGACTAAAG GAACAACTAC TTTTGTCGCC AAGAACTTTT CCGTGAGTCC ATCGTCGTGG CGTTGTATTC TATCACTGAC ATCTGCAGTT TGTCCTGGAA CGGATCACTG CCGACCTCGC AAAATGCGCG GTGGGTGTTG AGAACACCTT GGACACTATG AACACCTTCG CGGCTGCCTT CCTCTTCCTC AACTTGGCGG CGATGACGGG GGTAAGCCGA AGCCTGCTAT TTGGTGACAC CCGGCTAACA GCAGCAGGAA AAAGAACATC TTGAGAAATC GCAGATGCTT TACGTCAACC TCATGTCGGT ATGTTAACCA CCGTGCTGAC TCTTCAAGGC TAACAGTCAA AGATCGCACA ACGCCATGGT CTGTTTGCAG CTACTGAAGG ACAAATACTC GATATCACCC TCTTCGAAGC GGTACCTGAT ATTGATATCA GGTGGAAAAC CTGGAGCAAA GTTGAATCCG TCAAGCGGTA TGTCGACTGA TCAATTTATT CTACCAAATA CTGACCAGTC AGGTTGATTA CTGGGCTTCT CTTATTGGAT TCCTGGTATT CCTCTTTCCT CTCGACCAGT CCCATTATCG TTCCCGACTC AATCCAACTC ATTCTACCCT GCAACGAAGG TCTCTTCCGG GCCAACGGCT CCATGCGATG GATTCAATTG GTCCGTAGCG GCAAACGTCT ACTAATGCCA ACAGTCATGG CACCTTCGGA GAACGTCACC GTGCCCGTCT TGGAAAGCCC TGTCGACGAT TTTTGTATTC ACGGCGTTCT AGCAATGGTC CAGCTCCGCT TATCAGAAGC TTATCACCGC CTTCTTTCGA ATAGGGCAAG CTATCCTTTT GCACCTTGCC ATACGTATGC CATGGATGGA CGCGCCAGAT GTTTACCCTC ACTCCAGCTG CAAATTGCAG ACAAATACGG CGAGGTTCTC GAGCGGCTCA ATCCTAACGC CGCCGTCATG TGGCACAACA TATGCATGAC CTTAACCGCA GATACTCAAA TATTCGATCT AGCTGCCGGC CGCGCAGGAC CAGGACCCGC CAAGAAAGCC CTAGATGACA TTGCAACCTG GTCACAGACA CCAGCTGCCA GACGAGCATG CCTTCACGCC GCACACATCT ACAAGGCGAT GACCAACCGC AAAGCATCAG ATCACACAAT GTTCCACTCG GTTTTCTCTC TGTTTTCAGC AGCCCTCGTC CTCGGCCTCT ACATCTTCAT GGTACCGAAC CCCAGTGAGC TACAAGTCGG CGGGACGTCA ATCGAACTCC TTGATGACAT CGACTGGGAG CGTGTCGGGA CAGAGGGATT CACCAGTTTT ATGGAACCAC GCGGGACTCA GACATTCACG CCGTCGGATG ATCCGGCAGT GAACTTCATT CGGAATGGAG GCACTGTTTA TTTCCGTGGT GTACCGTTTC AAGGTGGGTA TCAGTCTGCA AGACGTATCC TGCTGGATTA TGCAGGCCTC CTGAAGGATG CGGGGAAGTG GAGCGTACGC AAGTTTTCGT ATGTTCTGCA TATCATGAGT GATGTTCTTA TGGAGGTTGA GTAG
|
Protein sequence | MVIRKCNICD RRFKKTEHFK RHERSHTKEK PYECSVCHKR FSRSDVLSRH AKGHNTNGTA ATATPAQNKP RAPSTAQPPT PQQPPAQTDP ARFLSMDGDA HPSHNLPVAP RDISLSNPAH LQSSSLNFLA DISAHHGRAE PDVGSMMVEE QQTYFGWNEM PAPPPPQPVA APPSDQQAYR TPMFDAMPND MLQFWLEPRG DTASHHGSLD MLGEPSFSLM GDNVAITPEQ QARPSNDDLS SKHTGDIPNE RFARVQRYWV APSNAAGRLV NNLWRDIAAS DVDNIFSLQP SHSFHSPSGL LPGSRYGLDE ECRQQLQAVF GNLRHYNQLH SPNSAVSPTS SSTFGGRPSF PPAEILDMAL DLYFRNFHPL IPFVHVSTFS VKNTRLPVLY VMCLIGMIML GTKGTTTFVA KNFSFVLERI TADLAKCAVG VENTLDTMNT FAAAFLFLNL AAMTGQQEKE HLEKSQMLYV NLMSSKIAQR HGLFAATEGQ ILDITLFEAV PDIDIRWKTW SKVESVKRLI TGLLLLDSWY SSFLSTSPII VPDSIQLILP CNEGLFRANG SMRWIQLVRS GKRLLMPTVM APSENVTVPV LESPVDDFCI HGVLAMVQLR LSEAYHRLLS NRASYPFAPC HTYAMDGRAR CLPSLQLQIA DKYGEVLERL NPNAAVMWHN ICMTLTADTQ IFDLAAGRAG PGPAKKALDD IATWSQTPAA RRACLHAAHI YKAMTNRKAS DHTMFHSVFS LFSAALVLGL YIFMVPNPSE LQVGGTSIEL LDDIDWERVG TEGFTSFMEP RGTQTFTPSD DPAVNFIRNG GTVYFRGVPF QGGYQSARRI LLDYAGLLKD AGKWSVRKFS YVLHIMSDVL MEVE
|
| |