Gene Information |
Locus tag | ANIA_06747 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001301 |
Strand | + |
Start bp | 2995077 |
End bp | 2998255 |
Gene Length | 3179 bp |
Protein Length | 962 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | Putative transcription factor with C2H2 and Zn(2)-Cys(6) DNA binding domain (Eurofung) |
Protein accession | CBF71378 |
Protein GI | 259480339 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00110078 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0171603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAGTG CCCTCGACTC TCCGGGTTCC AGGTCGCGGC AAGGCTACTT CCAATGCGGC TTCGGGTCGT GTCGCAAGGC CTATAATCGG GCGGACCATC TGATCCGCCA TGTACGCTCA CGTCTGTCTA TGTGAACATT TATCTTAGAA AACGTTCATT GACAAGTATA GATACTCGAG AGAAGCCATA CGTCTGTCAA GTTTGCAACA AGGGATTCTC GCGACCGTGA GTTTCGTCCT AAAAGTCTTC CCCCTATCGT GGATGCCCGC TGAAGAGAGG ATCAGAGACC TGCTGAAACG ACACGCCGCA GGCCATAGCC ACTTGCAAGA CGGAAAGCGC AGACGGACTT TGTCGTACTC AAAGAGCGGG CGTGTCTCGC AGGCTTGTAA GGCGTGCGCG ACGTCGAAAC TCAAGTGCGA CGAGGAAAAG CCGTGTCGGC GGTGTCGAGA TCGGAAGCTG TTCTGTGATT ATGCTGATGC GAATGCACAG GATGGTGAGC AACAAGGTAC GTCTCCAGTC TATGGACTCG GGGCGACGCG GGTGAACATA CTGACAGGGT AGGGTCCAAC GATGAACACG AGCATGAGCA TGAGCAGGAG GAATACTCCC CCCTTGAATC GCAGACCTTT TCGCCGCTCA ATCCGGAGAT GAACTTCCAG CAACTGGTTA CGCCTGACGT GCAGATTGAT TACCTGCCGC CGAATCCAGG CCCTCCGCCG CCGATACCCG CATATGATGG CAAGTCATGC TTGGACAAAG CGCAGACGCA AGCTGACCAG GTCCCAGCTC AAACTGTTTC TTCTCTTGTC GACCACGAAA GCGGCGTTTT CAGCGTCGAC GGGACCTTTT TTCCTGAATT TATCCCAGAC TCCCTCGTCT CATTATCGCG TCCTGGAGAA GCCGACCCAT CCGCATTCCC TCCGAACGAC TATTACGCCC ACGGACTCTT TGACTATAAC GGTAAATTCG ACTTCGATCT GACAGAAGTT GATTTTGGCC TGATCGACTT CTACAACTCC CGAGGCTCCG CCAATCCCGC TCCCCTGCAG CCCGACGAGA CCGACTGTGA CGCAGACAGA GACAGCGGCA TTGCCCTCGG CGCAGAGGCA TACAGCCGGT CCAGCCTCTC AGCCTGGAAG CCAGGGCATT CCGACCACGC CTTCGCCGAC CAGAATGACC TGTCCGTGCC CAAGTCGATC GACAGCCCTG AAGCGAGCGC GCAGTGCAGA CATCAGATCC TTTCAGAACG GTTGTCACCG GGTAGTCGTG ATCTCATCTT TGGAATGGTC CTGCAGACGA GCCAGCGCGC TAACCTAGCA CGCATTATGA AGTCGTTTCC CAGCACGGAG CTGCTCGACA GCCTGATCCA GGATTTCTTT GCGTACCAGG CTCAGCAGGT GGACTCGTGG ATCCATGGGC CGACCTTCCA TCCGAACGAG GAGAGCCCTG ATATGGTAGG CATCGTTGCT GCTGCGGCTG CAGTCAAGTC GACTATCCCA ACGATTAGGA AGTTGGGGTA TGCGTTAATG GAGGTTGTGC GGTTGCAGAT GAGTTTGAAG GTAGCCCGTC ACTTTCTTTG CCTTTACTGC AAATCATGCC TCTCTGGGTG GCTGATAATA GGCTTCACAG TATGAAAACG ACAACACTAC AATCCGCGAC CTCCGGGCGT CGCAGACGTT TGCCCTGACG ATCGATATAG GCATGTGGAG TGGGAGTGGA CGGAGGACAG AAATCGCCGA GAGCTTCCAG CAGCCCGTCC TTACGGTATG ATTTTATTCT ATCAATACAC TTACCCTGTG 
CTCTGTCCTA AGTTACCTAA GTTAATCGTG GCTATCTAGA TGCTCCGCCG CGGGCTGCGT TTCCGACGCT CGCTCTACCC GACTATCGTA CCGTGTTTAG AAGACACGCC GTCAACATTA GAGCGCAAAT GGCGCGACTG GGCGGAACAG GAGTCATTCA AGAGGTCTGC CCGTTTCATT CCTCCTTCGT ACTCTATCAG ACTGACCGGC AGGTTAGACT CGTCCATCAC CTCTTTCTCC ACGACGCCCA ATCCAGCCTG ATGCTGAATA TTAACCCCCT TATCTCCTAC GCAGACCTGG AACTCCCTCT CCCCATGATC CGAGCACTCT GGGATGCAAA ATCTGCAACT GAATGGCGAG ACATTTACAT CGCTACGTCA GCATCAGCCT TGGCTCCAGA GAGACTCCCC TCTCTCGTCG ACACCCTACG AGACATGTCT GCATACCAGG GCCGTATCGA CCACCAGCTC TCGGCCTCGG TGATTCTGCA TGGCCTCTCG GCTCTTATCA ATGAATACCA CCGGCTCAAA TTAATTGCGC AGGGGAGCTC AAAACACTGG AACGCGCTGG TCATCAATTC CAGACAGCAG GAGCTCGAGC AGGTGTTGCA ACATTTTCGT ATGATCAGCG TCGACACAGA CCCCACCGCT ATGTCTAGCC CTAGTTCCAG TCATGAGATA TCGCTTCTGC ACGAGGTGAT CTCCATGTTC CTCTTCATGT CGCTCGAAGA CCTCCAGCTC TTCGCCGGAA AAGAGGACAG GAACGAAGCG CGGCGCGTGT ATGACAGCGC CCTGGAATGG ATTGGCAGCG CCGACAGTCG TAAAGCGATC TGGCATGCCG GGCAGGTGAT CCGGGCCGCG AGAGCCATGT CCATGGCCAA GGAAGGCTCG CTCACGGGGT TCCTTGCCAT CAGCGTTTTC TATGCGTCGT TGGCGTTCTG GTCATACGGG GTTGTCAGCA GAGCTCGTCG CTCAAAGATC TCTACCACCT CTGCAACCTC TGCCGGAACC ATTACCTCTA CCGCCAGCAC TAATACTGTA TCTAGTGTGC TGAGTCCGAA ATTGGGCCTC GGCAGCTCGA CCTCGGCCTC CGGCTCTGAG CTGTTAGTCT TCCTAGACGG CGAAGAAACG GCCGACGTCC ATAGATTCAT CTCTCTGGCG CGCGGGTGCC CTGCTCTCCG GGGATTCAGT CTAAGTGACG GGCCGGCTCT TGTTTCTGAT CCGGGGAAAG TGATGGACGT CGCTCAGAGA CTCCTAAGAG GTGATGCGAG TGCCCAGACG CCGTATGAGG CGCTGTCGCC GCTGGTACAG GGCTTGTGTC AGCTGATGCA TGGGCTGGGG AGTGCGGCTG GGAGAGAGCC GGACAGTGAT AAGCAGTAG
|
Protein sequence | MQSALDSPGS RSRQGYFQCG FGSCRKAYNR ADHLIRHKTF IDKYRYSREA IRLSSLQQGI LATVSFVLKV FPLSWMPAEE RIRDLLKRHA AGHSHLQDGK RRRTLSYSKS GRVSQACKAC ATSKLKCDEE KPCRRCRDRK LFCDYADANA QDGEQQGSND EHEHEHEQEE YSPLESQTFS PLNPEMNFQQ LVTPDVQIDY LPPNPGPPPP IPAYDGKSCL DKAQTQADQV PAQTVSSLVD HESGVFSVDG TFFPEFIPDS LVSLSRPGEA DPSAFPPNDY YAHGLFDYNG KFDFDLTEVD FGLIDFYNSR GSANPAPLQP DETDCDADRD SGIALGAEAY SRSSLSAWKP GHSDHAFADQ NDLSVPKSID SPEASAQCRH QILSERLSPG SRDLIFGMVL QTSQRANLAR IMKSFPSTEL LDSLIQDFFA YQAQQVDSWI HGPTFHPNEE SPDMVGIVAA AAAVKSTIPT IRKLGYALME VVRLQMSLKY ENDNTTIRDL RASQTFALTI DIGMWSGSGR RTEIAESFQQ PVLTMLRRGL RFRRSLYPTI VPCLEDTPST LERKWRDWAE QESFKRLVHH LFLHDAQSSL MLNINPLISY ADLELPLPMI RALWDAKSAT EWRDIYIATS ASALAPERLP SLVDTLRDMS AYQGRIDHQL SASVILHGLS ALINEYHRLK LIAQGSSKHW NALVINSRQQ ELEQVLQHFR MISVDTDPTA MSSPSSSHEI SLLHEVISMF LFMSLEDLQL FAGKEDRNEA RRVYDSALEW IGSADSRKAI WHAGQVIRAA RAMSMAKEGS LTGFLAISVF YASLAFWSYG VVSRARRSKI STTSATSAGT ITSTASTNTV SSVLSPKLGL GSSTSASGSE LLVFLDGEET ADVHRFISLA RGCPALRGFS LSDGPALVSD PGKVMDVAQR LLRGDASAQT PYEALSPLVQ GLCQLMHGLG SAAGREPDSD KQ