Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_10621 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001303 |
Strand | - |
Start bp | 304832 |
End bp | 308038 |
Gene Length | 3207 bp |
Protein Length | 945 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | DNA mismatch repair protein Msh2, putative (AFU_orthologue; AFUA_3G09850) |
Protein accession | CBF76294 |
Protein GI | 259482119 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGGTC AAGATTACTT TGACGTGACG CGAGATCACG TGCCCCGTCG CGTCTTACGC GCCGAGTTAA TCTATCTATC AACCGATAAG AGGGAGAGCT TAAGACATAC ATTCTTCACA GCACACAAGC CAGTATTCAG GTCATCCTCA AGAGCAGGAG TCATTGCTGG CATCATCATG TCGTCTCGGC CAGATCTCAG GGTAAGTTCT TTTTAGCCTA GAAACCGAAC AGCTTCTTAC TAACTTTCTT TCTAAAGGTT GACGACGAAG TCGGCTTTAT CCGCTTCTAC CGCTCCCTCG CCTCCGATGA CTCCCATAAT AATGAAACAA TTCGCATCTT CGACCGTGGG GATTGGTACT CAGCCCACGG CAAAGAAGCC GAATTCATCG CTCGCACAGT CTACAAAACA ACCTCTGTCC TTCGTAATCT CGGCCGCAGC GAAACGGGCG GCTTGCCGTC CGTCACAATG AGCATTACTG TCTTCCGTAA TTTTTTACGT GAGGCTCTAT TCCGGCTAAA TAAGAGGATT GAGATCTGGG GCTCCGCCGG CACGGGCAAA GGGCACTGGA AGAAGGTTAA GCAGGCGAGT CCCGGAAATC TGCAGGATGT GGAGGAGGAA TTAGGGGCAA TGGGTATGGA GGGAAGTAAC GGAGCGCCCA TTATCATGGC AGTGAAGCTT AGTGCAAAGG CCGGGGAGGC GCGAAATGTA GGTGTTTGTT TTGCAGATGC AAGTGTGCGC GAGCTTGGTG TGAGTGAGTT CCTGGACAAT GATGTTTACT CAAACTTTGA GGCGCTTGTT ATCCAGCTCG GTGTGAAAGA GTGTCTCGTT GTGCAGGATG TCAATCGGAA GGATGTGGAG GTGGCCAAGA TCCGAGCAAT ATGTGATAAC TGCGGGATAG CGATATCGGA GCGCCCGGCA TCTGATTTTG GGGTTAAGGA TATTGAACAG GACCTTACAA GGTTGCTGAG GGATGAGCGG TCGGCTGGGA CACTGCCGGA GACGGAGCTG AAGCTTGCGA TGGGCGGTGC GGCGGCGCTA ATTCGGTATT TGGGCGTGAT GTCGGATGCG ACAAATTTCG GGCAGTATCA ACTCTACCAG CATGATTTGG CGCAGTACAT GAAGCTCGAT GCGGCGGCAT TGAGAGCTTT GAATCTTATG CCTGGGCCGA GGGATGGATC AAAATCGATG AGTTTATTTG GGCTGTTGAA TCATTGTAAA ACGCCTGTTG GGAGCCGGTT GCTGGCACAG TGGCTGAAAC AGCCGTTAAT GGATCTGGCG GAGATTGAAA AGCGGCAAAG GCTTGTTGAG GCGTTTGTCG TGAGCACGGA GCTTCGGCAG ATGATGCAGG AGGAGCATCT ACGATCTATT CCGGATCTGT ATCGGCTTGC GAAACGATTC CAGCGAAAAC AGGCGAATCT GGAAGATGTA GTGCGTGTGT ATCAGGTTGC TATTCGGCTG CCTGGGTTTG TGAACTCTCT GGAGAATGTT ATGGATGAGG AGTACCAGAC GCCGCTTGAG ACAGAGTACA CGGCCAAGCT ACGCAACCAT TCGGCGAGCC TGGCGAAACT GGAGGAGATG GTCGAGACGA CGGTTGATCT GGATGCCCTC GAGAATCACG AGTTCATCAT CAAGCCCGAA TTCGATGATA GTCTGCGCAT CATTCGCAAA AAGCTGGATC AGTTGCGCCA TGATATGTAC CTTGAGCATA AGGCTGTCGC GAGAGACCTA GATCAGGAAA TGGACAAGAA GCTGTTCCTG GAGAACCACC GCGTGTACGG ATGGTGTTTC CGTCTGACGC GGAATGAGGC GGGTTGCATT CGCAACAAGA AGGCCTACCA GGAGTGCTCA ACGCAGAAGA ACGGTGTGTA CTTTACCACA TCGACGATGC AATCTCTCCG CCGGGAACAT GATCAGCTCT CCTCCAATTA CAACCGCACC CAGACGGGAC TTGTCTCGGA GGTTGTCAAC GTTGCAGCAT CGTACTGTCC GGTCCTGGAA CAACTAGCCG GCGTCCTGGC TCACCTCGAT GTCATTGTGA GCTTTGCGCA CGCCTCTGTA CACGCGCCAA CAGCCTATAC GAAACCCAAG ATCCACCCGC GCGGCACGGG CAATACAGTC CTTAAAGAAG CACGCCACCC CTGCATGGAG ATGCAGGACG ACATCTCCTT CATAACTAAT GATGTCTCCC TTATCCGCGA CGAGTCCTCA TTCCTTATCA TCACTGGCCC CAATATGGGC GGTAAATCGA CCTACATCCG CATGATTGGC GTTATAGCGC TCATGGCGCA GATAGGCTGC TTCGTGCCCT GCACCGAAGC AGAGTTGACG ATCTTTGACT GCATCCTTGC CCGTGTTGGT GCGAGCGATT CGCAGCTTAA AGGCGTTTCT ACGTTCATGG CGGAGATGCT CGAAACGTCA AACATCCTCA AGTCAGCGAC CTCTGAGTCC TTGATCATCA TTGATGAATT GGGCCGCGGC ACTAGCACTT ACGACGGATT CGGTCTCGCC TGGGCGATTT CAGAGCACAT TGTGACCGAA ATCCGCTGCT TTGGACTATT CGCGACACAC TTCCACGAAC TCACGACACT TGCAGATCGA TACCCCAAGT CTGTCAAGAA CCTGCACGTC GTTGCATTCA TCGGCGACGG AACAACAGCG AACGAAGAAG ACGAAAAAGA GAAGAGAAAG ACCCGGCAGA AGGTAACATT GCTCTACCGC GTTGAACCAG GCATCTGCGA CCAGTCTTTC GGCATCCACG TCGCTGAGCT CGTCCGCTTC CCAGAGAAGG TAGTCAACAT GGCGCGGCAG AAGGCCGAGG AGCTGGAGGA TTTTACGTCC GCTGATTCCG CTGGAAATGC TGCGTCAGCG ACGATTGATA AGTACTCGCA GGAGGAAGTC GAAGAAGGGA GTGCGCTGTT GAAAGCGCTG CTGGTGAAGT GGAAGAGTGC AATTGAGGAG CCGGGGAGAG AGCTGACGCT TGAGGAGAAG AGGCAGGTTA TGAGGGATTT GGTAAAAGGG GACGAGAAGT TGCAGGCGAA TAGAGTCTTC CAGGGGATTC AGGCGCTGTG AGCCATCATC CAGAGTTGTT TCTGGTCTTC GGTTTCGGGT TATCTATGCT TCTGTCTGCA ATGGTGGTAG GTAGTTATGG CGTTCGGGTA GATTCTACGC AAGGTTGTAT AGGTCATTCA ACTGCCAGGT AATGGAT
|
Protein sequence | MIGQDYFDVD DEVGFIRFYR SLASDDSHNN ETIRIFDRGD WYSAHGKEAE FIARTVYKTT SVLRNLGRSE TGGLPSVTMS ITVFRNFLRE ALFRLNKRIE IWGSAGTGKG HWKKVKQASP GNLQDVEEEL GAMGMEGSNG APIIMAVKLS AKAGEARNVG VCFADASVRE LGVSEFLDND VYSNFEALVI QLGVKECLVV QDVNRKDVEV AKIRAICDNC GIAISERPAS DFGVKDIEQD LTRLLRDERS AGTLPETELK LAMGGAAALI RYLGVMSDAT NFGQYQLYQH DLAQYMKLDA AALRALNLMP GPRDGSKSMS LFGLLNHCKT PVGSRLLAQW LKQPLMDLAE IEKRQRLVEA FVVSTELRQM MQEEHLRSIP DLYRLAKRFQ RKQANLEDVV RVYQVAIRLP GFVNSLENVM DEEYQTPLET EYTAKLRNHS ASLAKLEEMV ETTVDLDALE NHEFIIKPEF DDSLRIIRKK LDQLRHDMYL EHKAVARDLD QEMDKKLFLE NHRVYGWCFR LTRNEAGCIR NKKAYQECST QKNGVYFTTS TMQSLRREHD QLSSNYNRTQ TGLVSEVVNV AASYCPVLEQ LAGVLAHLDV IVSFAHASVH APTAYTKPKI HPRGTGNTVL KEARHPCMEM QDDISFITND VSLIRDESSF LIITGPNMGG KSTYIRMIGV IALMAQIGCF VPCTEAELTI FDCILARVGA SDSQLKGVST FMAEMLETSN ILKSATSESL IIIDELGRGT STYDGFGLAW AISEHIVTEI RCFGLFATHF HELTTLADRY PKSVKNLHVV AFIGDGTTAN EEDEKEKRKT RQKVTLLYRV EPGICDQSFG IHVAELVRFP EKVVNMARQK AEELEDFTSA DSAGNAASAT IDKYSQEEVE EGSALLKALL VKWKSAIEEP GRELTLEEKR QVMRDLVKGD EKLQANRVFQ GIQAL
|
| |