Gene Information |
Locus tag | ANIA_03749 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001302 |
Strand | + |
Start bp | 3128516 |
End bp | 3131923 |
Gene Length | 3408 bp |
Protein Length | 1091 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | DNA mismatch repair protein msh3 (MutS protein homolog 3) [Source:UniProtKB/Swiss-Prot;Acc:Q5B6T1] |
Protein accession | CBF75474 |
Protein GI | 259481704 |
COG category | |
COG ID | |
TIGRFAM ID | |
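Note on the coordinate fields: the Gene Length above matches End bp - Start bp + 1, which suggests 1-based inclusive coordinates. The short Python sketch below checks that arithmetic. The function names are hypothetical, and the inclusive convention is an assumption inferred from the numbers, not something the record states. It also estimates how many bases of the gene span are not accounted for by the 1091 aa protein plus a stop codon. The gene is marked as spliced, so that remainder is plausibly intron sequence, but the record does not list exon boundaries.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
span = gene_length(3128516, 3131923)
cds = coding_length(1091)

print(span)        # 3408, matches the Gene Length field
print(cds)         # 3276
print(span - cds)  # 132 bases not explained by the coding sequence
```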
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.642565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTAC CCTCATCCCA GCCGTCCGCA TCCTCCTCTC CAAATCTAAA GCGGAAGCAA CCCACTATCT CCAGCTTCTT CACGAAAAAA CCACAGGCGC CAAAGCAATC CACTTCGAAC GAAGGCCCTG CGCCGATCGA CAACGATTCC GAAATTACAG ACAAGTTAGC GGAGGACGAT GAGGAGGATA TAGTTGCCCC TGTTCCGAAG CGGACAAAAT CAAATGGGTC TCTTACCGTA AACCGGCCAC AGAGTCCCAA GGCGAAGTCC GTATCGCGGG TGGAGCAGGA AAGCAGCCAA CGGACCGAGC TCTCTAAATT CGCCAGTTCT CCTGCTATTG AGACGGAGGG AAATGAAGCG ACCGAATTGG ACGGGTCAGC GAAAGTCCGA CAGCAGGAGA GGGAGAAACT GCATCAAAGA TTCGTCCGAA AACTCGGCGG GCCGGATTGT CTGGTGGGAA TTGGTCGTAA TTGCGTCGGC GAAACAACAT CGATTGAGGA GGCTGCAGAG GGGGATGAAG ACGATGAGAC GCCGCAACCA GTACAACCAA AGGGGAAGGC AGGGAAGAAG GGTGGAGGTA AACTCACTCC GATGGAAAAG CAAGTCATTG AGATCAAGAA AAAGCACATG GACACAATCC TGCTTATCGA AGTCGGGTAC AAGTTTCGAT TCTTTGGTGA AGATGCAAGA ATCGCTGCAA AGGAACTTAG CATTGTATGC ATTCCTGGCA AGTTCCGATA TGATGAACGT GAGTACCCGT ACTCTATACG TGGTTGGATT ATTGACGTTG TTCAGATCCT TCAGAAGCCC ATCTTGATCG GTTCGCCTCG GCGAGCATAC CTGTACAGCG ACTTCATGTT CACGTAAAGC GTCTCGTCGC TGCCGGTCAC AAGGTCGGCG TCGTTAGGCA ATTGGAAACT GCTGCGCTGA AAGCAGCTGG AGATAACCGC AACGCACCGT TTGTTCGTAA ATTGACGAAT GTTTACACGA AAAGCACCTA TATTGATGAT ATCGAGAGCC TTGAAGGGTC TACGGCTGGG GCATCTGGTG CATCGGCCAC GGGATATATT CTTTGCATAA CGGAGACGAA CGCTCGGGGC TGGGGGAATG ACGAAAAAGT ACATGTGGGT ATTGTTGCCG TGCAGCCGAC TACCGGGGAT ATCGTTTACG ATGAGTTCGA TGATGGCTTC ATGCGGAGCG AGATAGAAAC AAGATTGCTC CATATCGCGC CCTGCGAAAT GCTAATAGTC GGTGAGCTAT CGAAAGCGAC GGAGAAGCTT GTGCAGCATC TTTCCGGGAG CAAGATGAAT GTATTCGGTG ACAAGGTGCG GGTGGAGAGA GCACCCAAAG CGAAGACTGC AGCTGCCGAA TCGCACAGCC ATGTTTCGAG TTTCTACGCT GAAAAAATGA AATCTGCAGA CGCTGCGGAT GATGAGGTTG CGAGTAACCT GCTCCAGAAG GTGCTTGGCT TGCCGGACCA GGTCACGATA TGCCTCTCTG CCATGATCAA ACATATGACT GAGTATGGCC TGGAACACGT TTTACAGCTG ACAAAATATT TCCAGCATTT TTCTTCACGC TCTCATATGC TTCTCAATGG AAACACCCTG ACAAGCCTTG AGATATACCA AAACCAGACT GATTATTCGT CCAAAGGCAG TTTGTTTTGG ACTCTAGATC GGACACAGAC CCGATTTGGG CAAAGAATGC TTCGAAAATG GGTTGGACGA CCGTTGTTGG ATAGGCGTCA ACTTGAGGAT CGAGTCAATG CTGTAGAAGA GCTTAAGGAC TTCCGAAATG 
TCGTAATGGT CGAACGAATC AAAGGTTTGC TTGGTAAAAT CAAGCACGAT CTAGAGAAAG GCCTGATCCG GATATACTAT GGAAAGGTGA GTAACACTGA CCCTCGTCTG ACGTGGCTAA CAGTGAAAGT GCTCCCGGCC GGAACTTTTG ACCATCTTGC AAACAATGCA GATGATAGCA CAGGAATTTG CCGATATCGA GTCACCAGCA GATACCGGGT TTTCCTCACC TGCCATCAGC CAAGCAATCA TGTCTCTGCC TACAATTTTG AAAGATGTCG TGTTTTTCCT GAACAAAATA AACATGCACG CGGCTCGAAA TGATGACAAG TACGAATTCT TCCGCGAAGA AGAAGAGACG GAGGAAATTA GCGAGCACAA ACTCGGAATT GGGGCCGTTG AGCATGAACT TGAGGAGCAT CGTCCTGTAG CCGGAGAAGC TTTAGGGAAG AAAATGGTCA CCTATGTCTC GGTTGCAGGC ATCGACTATT TGGTGGAAGT CGAGAACAAT TCGCCGGCCA TCAAGCGAGT GCCGGCATCA TGGATGAAAA TAAGCGGCAC AAAAAAGGTG TCAAGATTTC ACACTCCGGA GGTTGTCAAG ATGATTCGGC AGAGAGACCA ACACAGAGAA GCGCTCGCCG CAGCCTGCGA TAAGGCGTTT TTGGCCCTCC AGGCCGAGAT AGCGACCAAT TACCAGGCGC TACGTGACTG CGTTCAATCC CTGGCAACGC TAGACTGTCT GGTGTCATTG GCCACCTTAG CCAGCCAGCC GGGGTACGTG AAACCTGAAT ATACGGAAGA GACGTGCATC CATGTCGAGC AAGGGCGTCA CCCGATGGTG GAGCAACTCC TTCTAGACAG CTATGTGCCC AATGACATCA ACCTGGATAG CAGCAAGACG CGCGCTCTTC TTGTGACTGG CCCTAATATG GGTGGGAAGT CCAGCTACGT GCGCCAGGTG GCACTTATTG CAATAATGGG GCAGATTGGC TCATATGTCC CAGCACAGGC CGCAAAGCTT GGTATGCTGG ACGCGGTGTT CACCCGGATG GGCGCATTCG ACAATATGCT CGCAGGCGAG TCTACCTTCA TGGTTGAGCT TTCCGAGACG GCAGATATAC TGAAGCAAGC AACGCCCCGC TCTTTAGTAA TACTAGACGA GCTGGGCCGA GGCACGTCTA CCCATGATGG AGTCGCCATT GCACAGGCCG TTCTCGACTA CATGGTGCGG TCTATCCGCA GTCTCACCCT CTTCATCACA CATTACCAGC ATCTTTCTGC CATGGTGCAT TCGTTTCCTG ATGGCGAGCT GCGAAATGTG CACATGCGAT TCAGCGAGTC GGGGACTGGC GCGGACGAAG ACATTACCTT TCTTTATGAG ATTGGAGAAG GTGTCGCGCA TCGTAGCTAT GGGCTTAATG TTGCGCGGCT GGCAAACTTG CCTGCGCCAC TTTTGGAGAT GGCCAAGCAG AAGAGTGCCG AGCTGGAGGA GAAAATTCGT CGCCGAAGAC TTGCTGGTTT TGTTGCTGCG GTTGGAGCGG TAGTGCAGTC GAATCAGGCC GATGAGAGTG TAATCGAGCG GCTGGTTAGC AGTATGGAGG AGCTGTAA
|
Protein sequence | MPLPSSQPSA SSSPNLKRKQ PTISSFFTKK PQAPKQSTSN EGPAPIDNDS EITDKLAEDD EEDIVAPVPK RTKSNGSLTV NRPQSPKAKS VSRVEQESSQ RTELSKFASS PAIETEGNEA TELDGSAKVR QQEREKLHQR FVRKLGGPDC LVGIGRNCVG ETTSIEEAAE GDEDDETPQP VQPKGKAGKK GGGKLTPMEK QVIEIKKKHM DTILLIEVGY KFRFFGEDAR IAAKELSIVC IPGKFRYDEH PSEAHLDRFA SASIPVQRLH VHVKRLVAAG HKVGVVRQLE TAALKAAGDN RNAPFVRKLT NVYTKSTYID DIESLEGSTA GASGASATGY ILCITETNAR GWGNDEKVHV GIVAVQPTTG DIVYDEFDDG FMRSEIETRL LHIAPCEMLI VGELSKATEK LVQHLSGSKM NVFGDKVRVE RAPKAKTAAA ESHSHVSSFY AEKMKSADAA DDEVASNLLQ KVLGLPDQVT ICLSAMIKHM TEYGLEHVLQ LTKYFQHFSS RSHMLLNGNT LTSLEIYQNQ TDYSSKGSLF WTLDRTQTRF GQRMLRKWVG RPLLDRRQLE DRVNAVEELK DFRNVVMVER IKGLLGKIKH DLEKGLIRIY YGKMIAQEFA DIESPADTGF SSPAISQAIM SLPTILKDVV FFLNKINMHA ARNDDKYEFF REEEETEEIS EHKLGIGAVE HELEEHRPVA GEALGKKMVT YVSVAGIDYL VEVENNSPAI KRVPASWMKI SGTKKVSRFH TPEVVKMIRQ RDQHREALAA ACDKAFLALQ AEIATNYQAL RDCVQSLATL DCLVSLATLA SQPGYVKPEY TEETCIHVEQ GRHPMVEQLL LDSYVPNDIN LDSSKTRALL VTGPNMGGKS SYVRQVALIA IMGQIGSYVP AQAAKLGMLD AVFTRMGAFD NMLAGESTFM VELSETADIL KQATPRSLVI LDELGRGTST HDGVAIAQAV LDYMVRSIRS LTLFITHYQH LSAMVHSFPD GELRNVHMRF SESGTGADED ITFLYEIGEG VAHRSYGLNV ARLANLPAPL LEMAKQKSAE LEEKIRRRRL AGFVAAVGAV VQSNQADESV IERLVSSMEE L
|
| |
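Note on the sequence fields: both sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before the text can be used as a sequence. The Python sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the example runs only on the first three blocks of the gene sequence. It does not recompute the 51% reported in the GC content field.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGCCTTTAC CCTCATCCCA GCCGTCCGCA"
seq = clean_sequence(first_blocks)

print(len(seq))                   # 30
print(round(gc_percent(seq), 1))  # 60.0
```

Pasting the full Gene sequence field into `clean_sequence` in place of `first_blocks` gives a string whose length can be compared with the Gene Length field.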