Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_05157 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001305 |
Strand | + |
Start bp | 1112531 |
End bp | 1115618 |
Gene Length | 3088 bp |
Protein Length | 993 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | armadillo repeat protein (AFU_orthologue; AFUA_1G07050) |
Protein accession | CBF80979 |
Protein GI | 259484609 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.304489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCGCG CCGAGGCGCC GCCTATCTTC CTCCAGCTAC AAAATGCGGA TTCCTTGTCA TCACAAGCTG CTGCTCTGAG AGCCCTGAAG AATGAGACAA TTGGCCATGA TCAAAGGAAA GAGGCCTGGG TACGATTGGG GCTCATTCCC ATACTTTCCA ACGTGCTTGC GTCTCGGGCA CTCGACAAGA GCGAGCTCAA TAACGGCACC AAGCAGCCCG AGTTGCCTGG CTCTAGAGAA GAATCGGATG ATGTTTGCTT ACAGGCAATA ATTCTTGTTG GGAGCTTAGC GCAAGGTACT AATTATTTTT TTTTGATCAC TCGGCATGCA ATCACTAACG TCCCAGCAGG AGGCACACCT TTCCTATCGC CGATCTTATC GAGCAATATA CTTCCGATAC TCCTCTCCAT TCTATCATCC AACTGCCCTT CTTCCTTCGT TCTTCCTATT CTCCGGGTTT TGAATAGTGT GGCGGATAGA TTGCCTCTAC AGAGCCAGCA ACAATGGCCC AGGGATACTC GCTTGGCGGA CATTCTTTTC TCAACGGAGC ACATTGGCTG CTTGACCCGC ATCCTTGGCC AGGATTACAG CAGCCACAGT CGGCTGACTG CGATTGAACT GGCTGCAGGT CTTATTGGGA AGCTGTGCAC AGAGGAAAGC CACAAGGCTG TTCTGGCTGA AAGTGGTGTT TTAGACGCTC TGGCGGTCAA AGTCGCATCG TTTATAGTTG CGCAGGGATT CGTTTTCCCC GGCGCAGAGA GCCACCTAGA TGATGTAGGC GCTCTGGGGT CACTGCCACC TCCTGCGCCC CGCGGGGCTA AGCTTGCGCC CATTTTACGT GCTGTGACGG TCATCGTTGA GCATTCCAAG TGGCGAGCGG AGCATTTTCT CTCTTCTCCA GGTATAGTTA CTGTGTTTCC ACGGCAAATA CCAGGCTTTT CCCCATCGGA TATCAAGAAG GGCCCTTGGG GCTCCACTTA TTTTTCAGGG TCCGCGGTGC CACGGCACCT TGGAGGGACG CCTCTAGAGT ATCTTCTTCC ATCTATTCCT TTGTCACAGT TGAAGCCCTC TGCTAGCTCA TCCAACTTTC CACCGCTAGG TCAGTATGGG CAGCATCGCC GACAGAGCCA TTCATTTCCC ACCCCGCTGT CCAGTTTCGA ACCGCCCACG GCTGAGGACG ATGAGAATCC GGTTGTCCCC TGGCTGCTAT ACCTCGTCCG TGCTGAGAGC GGCATGGCTC GTCTGATGGC AGCCCGCTTT GTGACGGTAT TATGCCGCCT GGGACTAACC AAAAAGCACA GGATCTCCAT GCTCTGCTAT CTGTTAATCC CGATTCTGCT TCGCATGCTC GATAAGGACT ACGAGGCCTC TGACGACGGT GTCCAATACG GTGGACTTAT TTCTTCCTCG CAACGCATTA AGGAGGAAGC TCCGGGTGTG CTGGCCACCT TACTTGTTGA TGATCGAGAA CTGCAGAAAC ATGCGGTTGA GGGGGATGCG ATCAAGCGAC TATCCCAGCT TCTCAAAGAA ACTTATAATC CAATCCATGA GCCAGCTCGA ACAATGTGGC ATGCTGAAGG CCAACCGAAG GTTGAGGACC ATGACTCGCA GCCGGCGGAG TGTCGATTAG GCCCTCCTGG ATACTCACCC CTCCGTTACC ATATCTTGAG ATATCGGGAA AATATATTGA AAGCCTTGGC TGCACTGGTT CCTTTCAAGG ACGAGTATCG CAAGGCGGTA TGCGAGCACG GTGTTGTGCC ATATATCATT GATTCTCTCA AACCCTTCCC AGACCAAATA CCAGCAGAGT CCTCCGATCC AGGAAACACT GCTGCTGACG GCAACCCAAC ACCGACCCTT CTGGCAGCCT GTGGTGCAGC CCGCATGCTC ACTCGCTCCG TTAGCGCTTT ACGAACGAGC TTGATTGACG CCGGCGTCTC AACCCCGCTT TTCGCTTTGA TTCGACATCC TGATATTGAG GTGCAAATTG CCGCGACCTC AGTAATCTGC AATCTTGCTC TAGATTTCAG TCCTATGAAA GAGGTACAAT CTGCTCGGCC CTTGTGACCT GAAGCTGCTA ACGTATGTCC TATAGGCAAT TATATCGGCC GAAATTCTTC CCATTCTGTG TGAGCATGCA CACTCATCGA ATACTAAACT TCGGATTGAA TCATTATGGG CGCTAAAGCA CGTCGCCTAT AACTCGGCAA ATGACGTCAA AATCAAGATC ATCGAGGGCT TGGGGCCGGA ATGGATTAAA CAAGTTATTA CTCAGGATCC GACAAGTGTT CTCGCGAAGC GTGGGCTTGA GGACGATACA GACAGTAACA CTCCAAGCGG GATGAGTCGG GCCAATTCAG CTGGCGAACG GGTAGACTTG CTGAATCCGA TGGATGACTT CCGGGAGAGG GATGAGGACA TGAAAATGAC CGATCCTGTG CCATCATCCA AAGTCAGTCT AGATATGTTC TTTCCAGACG CCACTAGACG ACGTAAGCTC GCTTTGCATG GCGATCTTGA CCAAACCACA CAAGCCCGTC AGGATGACAT TGCGGCGCAA GAGCAAACCT TTGATCTTCT AAGAAATGTC ATATGTGGGC CTGGTGCATC GGAAATGATT GACTATCTCT TCAAGGAACT CGGCCAGGAT TTGCTGCTGG ATACCTTGGC CGATAAACTG CGCCCAAGGT CTATCCAGCT GCCTCATCGG CGAGAGTCCC CAAACCATCG CGCGCTTCAG GTCCCCACTG AGATTTTGGT CGCAGTAACG TTCGTTATCA TCCACCTCGC TGCAAGCCTT CCATGGTACC GGCAGCTCAT AGTCTCACAC CGCGATCTCA TTCGTTATTT GATGGGTTAC TTTAACCACA GCCACCGAGA CGTCCGTGCC AATTGCGTGT GGGTGGTAAT TAACCTCACA TATGAGGATG ATGTTCACGA TCGAGAGGGT TGCCGGAAAC GCGCACTCGA GCTACGTTCA ATTGGGGTAC TAGATCGACT GGCTAGCCTT GAACATGACC CGGACCTTGA CGTTCGCGAG CGAACGAAGA CGGCACTGCA CTTGGTAAAC TCGTTGACAC ACTCTTAG
|
Protein sequence | MTRAEAPPIF LQLQNADSLS SQAAALRALK NETIGHDQRK EAWVRLGLIP ILSNVLASRA LDKSELNNGT KQPELPGSRE ESDDVCLQAI ILVGSLAQGG TPFLSPILSS NILPILLSIL SSNCPSSFVL PILRVLNSVA DRLPLQSQQQ WPRDTRLADI LFSTEHIGCL TRILGQDYSS HSRLTAIELA AGLIGKLCTE ESHKAVLAES GVLDALAVKV ASFIVAQGFV FPGAESHLDD VGALGSLPPP APRGAKLAPI LRAVTVIVEH SKWRAEHFLS SPGIVTVFPR QIPGFSPSDI KKGPWGSTYF SGSAVPRHLG GTPLEYLLPS IPLSQLKPSA SSSNFPPLGQ YGQHRRQSHS FPTPLSSFEP PTAEDDENPV VPWLLYLVRA ESGMARLMAA RFVTVLCRLG LTKKHRISML CYLLIPILLR MLDKDYEASD DGVQYGGLIS SSQRIKEEAP GVLATLLVDD RELQKHAVEG DAIKRLSQLL KETYNPIHEP ARTMWHAEGQ PKVEDHDSQP AECRLGPPGY SPLRYHILRY RENILKALAA LVPFKDEYRK AVCEHGVVPY IIDSLKPFPD QIPAESSDPG NTAADGNPTP TLLAACGAAR MLTRSVSALR TSLIDAGVST PLFALIRHPD IEVQIAATSV ICNLALDFSP MKEAIISAEI LPILCEHAHS SNTKLRIESL WALKHVAYNS ANDVKIKIIE GLGPEWIKQV ITQDPTSVLA KRGLEDDTDS NTPSGMSRAN SAGERVDLLN PMDDFRERDE DMKMTDPVPS SKVSLDMFFP DATRRRKLAL HGDLDQTTQA RQDDIAAQEQ TFDLLRNVIC GPGASEMIDY LFKELGQDLL LDTLADKLRP RSIQLPHRRE SPNHRALQVP TEILVAVTFV IIHLAASLPW YRQLIVSHRD LIRYLMGYFN HSHRDVRANC VWVVINLTYE DDVHDREGCR KRALELRSIG VLDRLASLEH DPDLDVRERT KTALHLVNSL THS
|
| |