Gene Information |
Locus tag | ANIA_04351 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001303 |
Strand | + |
Start bp | 2302762 |
End bp | 2305631 |
Gene Length | 2870 bp |
Protein Length | 847 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | pH-response regulator protein palA/RIM20 [Source:UniProtKB/Swiss-Prot;Acc:P79020] |
Protein accession | CBF77701 |
Protein GI | 259482839 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.237971 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.155453 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTCGT ACGTACGTCC AGTACTTTGC AAAAATATCA CCTATTGACA GAAACAGAAA TATCCTCCAG ATTCCCTTCC GCCGCTCGCA CACTGTCTCC CTCTCGACCG CCTTGACCCA ATACATTTCC ACCAAATATG ACCAGCGCCC TGACATGTTT GCAGATGACT TGCTCATTAT CGATCGGTTA CGAAATGAGG CCATAAACGT GCAGGAACCA CATGTCAGCG GAATCAGCCG GCTGGTTACT TACGCCGCGC AACTGAAATG GCTTGGGGGA AAGTTTCCAG TTGATGTACG TATCAATAAT ATCATAGATA TTCCGGGCTG AGTTGACGGT CGTAGGTCGG GGTCGAGTTC CCCTGGTATC CTGCTTTTGG GTTCAACACA AGTCGGCCAG GTACGACGTT TCTCACAGCG AATCTGCTAG GATGTTGGCT AATGTACGAC AGTCTCACAG GATAACATCC GCTTCGAGCT GGCAAACGTC ATCTTCAACC TCGCCGCACT CTACTCCCAG CTCGCCTTCG CCGTAAACCG CACAACAACC GACGGTCTCA AGCAAGCATG CAACTATTTC TGCCAGGCAG CCGGTATCCT AGCACACCTC CGAACAGACA TCGTCCCTGA CATGCGCTCC GCCCCGCCGG AAGACATGGA CGAGATGACC CTCCGAAGCC TGGAAGAGCT GCTCCTCGCA CAAGCTCAGG AATGTTTCTG GCAGAAGGCC GTGATGGATG GGCTAAAGGA TGCATCAATT GCACGACTCG CGGGCCAAGT GTCGGACTTT TATGGCGATG CGTGCGATCA CGCCGTCAAG TCGAATGCGA TCAGCCCCGA ATGGATCCAC CATATGACGG CGAAACAGCA TCATTTTGCA GCTGCAGCGC AGTATCGCCA GTCGCTGGAT TGCCTGGAGA AGCGCAAGTA TGGAGAGGAG GTGGCACGGT TACGGGACGC TGTGGCTTGT GTGAATGAAG CGCTCAAAGA GAGCCGGTGG ATCAATCGCA CGGTGCTGGG TGATTTGCAG GGGTTAAAGA ATAGAGTAAC GGAGGATTTG AAGCGCGCCG AGAAGGACAA CGATATGATT TATCTCAACC CCGTGCCGCC CAAGTCGGAG CTCAAGCTTA TTGATCGGGC GTGTATGGTT GCGGCTAAGG CGCCGTCGCA GGTCACGGAC GCAATCTCGA TGCTGGGGGA AAAAGGGCCG TTGGGACAGC CGCTCTTTTC GAAGCTCGTC CCGTATGCCG TGCACATTGC GGCGAGCATT TACTCGGACA GGAGAGACCG TCTTGTCAAT GAGCGGATTA TCGGCGAATT GGAGAACATG ACGGACAAGC TACGCGAGTA TGTCACATCC TTTTACATTC TCTATTTATG CTAACAGGCC AGTCTACTAT CATCGCTCAA TCTCCCCGGC TCGCTGCAAG CCCTTGAGAA GCCCCTGGGC CTGCCGCCAT CACTGGTCGC CCATGCCGAA GAAATGCGTC AACAAGACGG CCTCAACCGC CTCCGCAAGT CCCTTCTCGA CATCGCCAAA GTCAAATCCA ACGACCGCGC CGTCTATACC GAAGGCGTGG AACTCCTCGC CGCTGAAAAA GCTGAGGACG ACGCTTCGCG CCGGAAATTT GGCACCGACC GCTGGACGCG CGAGGCCTCT GAAGCCGCCG CTCCTAAACT CTACACCACC GCCCGGGAAA TCGACGGCTA CTTCACCTCA GCGCAGAGCA GTGACAACCT GGTTGAGCAG AAACTGCACG ACTCGGAAGC TGTCTTTCGC GTTTTGACCG GCACGAATCG 
CGACCTTGAG GCTTTTGTCC CAAGTAGTCG ACGCGCAACG ATACCGCCTG AAGTCGAACG CGAGGTGAGT CGGCTCCGCA GCTGCATCAG CGAAGTAAAT CGACTCGAAA GCCGGCGAAA GCGCAAGGCT CAGGCCGTCA AGGACAAAGC GCGCGCGGAC GACATCAGTT CTGCACTTGT CCGCGAGGCA GCACGTTTAG AGCGCGAGTT CCCAATGCAG GCTATCCAGG CTAGTCAATT CGAGGACCTC TTTGAATCAC GACTGCGCGA TTACGACGTC GATCTAGATA TGGTTGCGCA GGAGATGCAC GATCAGGATC AGATCGTCGC GCAGGTGCGG GACGCGAATC GTGCGTTTAC GCGCGCCCAT ACGGGTGATG CTTCTACAAA GGAGCGTGAG AAGGCGCTCC AGGAGCTGGA GAACGGGTAC CTGAAGTATA AGGAGATCAT TTCAAATATC GAAGTCGGAC GGAAGTTCTA TAATGATCTC GCGAAGATCG TGGGGCGGTT CAGGGATGAT GTCAAGGCGT TTGTGCATAA GAGGCGTATG GAGGCTAGTC AGCTTGAGCA GTGCGTCTTC CCCCCCACCT TTTTTTTTAT AACCGTCTAA TGCGATATAG GGACATATCC AGCGTCGCCG CAATGGCATC CCTGAATATC TCGCCTATCA GGCAACCACC CCAGCAAACG GTAGTTTCAG CCCCAGTTTC AGTCTCTGCC GCGGCGTCAG TTCCAGCGCC GACACATTTT AATCCAGTCA AACCCCAACC TCAACCTCCG TCGCAGGCAA TCCCACCACA GTCACAACCA CAACCGCAAC CGCAACCCCT GCGGACACCT CTAACAGCGC CCCAGCCAAC ACGCAGCGTT CCGCAAGTGA CGCCAGGGAT GTGGTCCCCT GAAATGGGAA TACGCTTTGG GCCGGGGGGT ACGACGGCCC AGCAGTCTCA GCAAACGTGG GATCCGTCGA AGGGGATGAA GTTCTCATGA ACCTTAATAC GCTACTAAAC ATATGCTTTG ATGTAACTAA TGGTGTTAGA CCAGTACATA CTTGCATACA TACACGCACA
|
Protein sequence | MASNILQIPF RRSHTVSLST ALTQYISTKY DQRPDMFADD LLIIDRLRNE AINVQEPHVS GISRLVTYAA QLKWLGGKFP VDVGVEFPWY PAFGFNTSRP VSQDNIRFEL ANVIFNLAAL YSQLAFAVNR TTTDGLKQAC NYFCQAAGIL AHLRTDIVPD MRSAPPEDMD EMTLRSLEEL LLAQAQECFW QKAVMDGLKD ASIARLAGQV SDFYGDACDH AVKSNAISPE WIHHMTAKQH HFAAAAQYRQ SLDCLEKRKY GEEVARLRDA VACVNEALKE SRWINRTVLG DLQGLKNRVT EDLKRAEKDN DMIYLNPVPP KSELKLIDRA CMVAAKAPSQ VTDAISMLGE KGPLGQPLFS KLVPYAVHIA ASIYSDRRDR LVNERIIGEL ENMTDKLRDL LSSLNLPGSL QALEKPLGLP PSLVAHAEEM RQQDGLNRLR KSLLDIAKVK SNDRAVYTEG VELLAAEKAE DDASRRKFGT DRWTREASEA AAPKLYTTAR EIDGYFTSAQ SSDNLVEQKL HDSEAVFRVL TGTNRDLEAF VPSSRRATIP PEVEREVSRL RSCISEVNRL ESRRKRKAQA VKDKARADDI SSALVREAAR LEREFPMQAI QASQFEDLFE SRLRDYDVDL DMVAQEMHDQ DQIVAQVRDA NRAFTRAHTG DASTKEREKA LQELENGYLK YKEIISNIEV GRKFYNDLAK IVGRFRDDVK AFVHKRRMEA SQLEQDISSV AAMASLNISP IRQPPQQTVV SAPVSVSAAA SVPAPTHFNP VKPQPQPPSQ AIPPQSQPQP QPQPLRTPLT APQPTRSVPQ VTPGMWSPEM GIRFGPGGTT AQQSQQTWDP SKGMKFS
|
| |
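The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a hypothetical consistency check, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with one stop codon; the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 2302762
END_BP = 2305631
PROTEIN_LENGTH_AA = 847


def gene_length(start: int, end: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (3 per codon).

    Assumption: one terminal stop codon is counted when include_stop is True.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


length_bp = gene_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
# Bases in the gene span that are not translated codons
# (for a spliced gene this would include intron sequence).
untranslated_bp = length_bp - cds_bp

print(f"gene span: {length_bp} bp")
print(f"coding sequence (with stop): {cds_bp} bp")
print(f"not translated: {untranslated_bp} bp")
```

Under these assumptions the computed span matches the reported Gene Length of 2870 bp. The 847-residue protein accounts for only part of that span, which is consistent with the record marking the gene as spliced.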