Gene ANIA_03749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_03749 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001302 
Strand
Start bp3128516 
End bp3131923 
Gene Length3408 bp 
Protein Length1091 aa 
Translation table 
GC content51% 
IMG OID 
ProductDNA mismatch repair protein msh3 (MutS protein homolog 3) [Source:UniProtKB/Swiss-Prot;Acc:Q5B6T1] 
Protein accessionCBF75474 
Protein GI259481704 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.642565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTAC CCTCATCCCA GCCGTCCGCA TCCTCCTCTC CAAATCTAAA GCGGAAGCAA 
CCCACTATCT CCAGCTTCTT CACGAAAAAA CCACAGGCGC CAAAGCAATC CACTTCGAAC
GAAGGCCCTG CGCCGATCGA CAACGATTCC GAAATTACAG ACAAGTTAGC GGAGGACGAT
GAGGAGGATA TAGTTGCCCC TGTTCCGAAG CGGACAAAAT CAAATGGGTC TCTTACCGTA
AACCGGCCAC AGAGTCCCAA GGCGAAGTCC GTATCGCGGG TGGAGCAGGA AAGCAGCCAA
CGGACCGAGC TCTCTAAATT CGCCAGTTCT CCTGCTATTG AGACGGAGGG AAATGAAGCG
ACCGAATTGG ACGGGTCAGC GAAAGTCCGA CAGCAGGAGA GGGAGAAACT GCATCAAAGA
TTCGTCCGAA AACTCGGCGG GCCGGATTGT CTGGTGGGAA TTGGTCGTAA TTGCGTCGGC
GAAACAACAT CGATTGAGGA GGCTGCAGAG GGGGATGAAG ACGATGAGAC GCCGCAACCA
GTACAACCAA AGGGGAAGGC AGGGAAGAAG GGTGGAGGTA AACTCACTCC GATGGAAAAG
CAAGTCATTG AGATCAAGAA AAAGCACATG GACACAATCC TGCTTATCGA AGTCGGGTAC
AAGTTTCGAT TCTTTGGTGA AGATGCAAGA ATCGCTGCAA AGGAACTTAG CATTGTATGC
ATTCCTGGCA AGTTCCGATA TGATGAACGT GAGTACCCGT ACTCTATACG TGGTTGGATT
ATTGACGTTG TTCAGATCCT TCAGAAGCCC ATCTTGATCG GTTCGCCTCG GCGAGCATAC
CTGTACAGCG ACTTCATGTT CACGTAAAGC GTCTCGTCGC TGCCGGTCAC AAGGTCGGCG
TCGTTAGGCA ATTGGAAACT GCTGCGCTGA AAGCAGCTGG AGATAACCGC AACGCACCGT
TTGTTCGTAA ATTGACGAAT GTTTACACGA AAAGCACCTA TATTGATGAT ATCGAGAGCC
TTGAAGGGTC TACGGCTGGG GCATCTGGTG CATCGGCCAC GGGATATATT CTTTGCATAA
CGGAGACGAA CGCTCGGGGC TGGGGGAATG ACGAAAAAGT ACATGTGGGT ATTGTTGCCG
TGCAGCCGAC TACCGGGGAT ATCGTTTACG ATGAGTTCGA TGATGGCTTC ATGCGGAGCG
AGATAGAAAC AAGATTGCTC CATATCGCGC CCTGCGAAAT GCTAATAGTC GGTGAGCTAT
CGAAAGCGAC GGAGAAGCTT GTGCAGCATC TTTCCGGGAG CAAGATGAAT GTATTCGGTG
ACAAGGTGCG GGTGGAGAGA GCACCCAAAG CGAAGACTGC AGCTGCCGAA TCGCACAGCC
ATGTTTCGAG TTTCTACGCT GAAAAAATGA AATCTGCAGA CGCTGCGGAT GATGAGGTTG
CGAGTAACCT GCTCCAGAAG GTGCTTGGCT TGCCGGACCA GGTCACGATA TGCCTCTCTG
CCATGATCAA ACATATGACT GAGTATGGCC TGGAACACGT TTTACAGCTG ACAAAATATT
TCCAGCATTT TTCTTCACGC TCTCATATGC TTCTCAATGG AAACACCCTG ACAAGCCTTG
AGATATACCA AAACCAGACT GATTATTCGT CCAAAGGCAG TTTGTTTTGG ACTCTAGATC
GGACACAGAC CCGATTTGGG CAAAGAATGC TTCGAAAATG GGTTGGACGA CCGTTGTTGG
ATAGGCGTCA ACTTGAGGAT CGAGTCAATG CTGTAGAAGA GCTTAAGGAC TTCCGAAATG
TCGTAATGGT CGAACGAATC AAAGGTTTGC TTGGTAAAAT CAAGCACGAT CTAGAGAAAG
GCCTGATCCG GATATACTAT GGAAAGGTGA GTAACACTGA CCCTCGTCTG ACGTGGCTAA
CAGTGAAAGT GCTCCCGGCC GGAACTTTTG ACCATCTTGC AAACAATGCA GATGATAGCA
CAGGAATTTG CCGATATCGA GTCACCAGCA GATACCGGGT TTTCCTCACC TGCCATCAGC
CAAGCAATCA TGTCTCTGCC TACAATTTTG AAAGATGTCG TGTTTTTCCT GAACAAAATA
AACATGCACG CGGCTCGAAA TGATGACAAG TACGAATTCT TCCGCGAAGA AGAAGAGACG
GAGGAAATTA GCGAGCACAA ACTCGGAATT GGGGCCGTTG AGCATGAACT TGAGGAGCAT
CGTCCTGTAG CCGGAGAAGC TTTAGGGAAG AAAATGGTCA CCTATGTCTC GGTTGCAGGC
ATCGACTATT TGGTGGAAGT CGAGAACAAT TCGCCGGCCA TCAAGCGAGT GCCGGCATCA
TGGATGAAAA TAAGCGGCAC AAAAAAGGTG TCAAGATTTC ACACTCCGGA GGTTGTCAAG
ATGATTCGGC AGAGAGACCA ACACAGAGAA GCGCTCGCCG CAGCCTGCGA TAAGGCGTTT
TTGGCCCTCC AGGCCGAGAT AGCGACCAAT TACCAGGCGC TACGTGACTG CGTTCAATCC
CTGGCAACGC TAGACTGTCT GGTGTCATTG GCCACCTTAG CCAGCCAGCC GGGGTACGTG
AAACCTGAAT ATACGGAAGA GACGTGCATC CATGTCGAGC AAGGGCGTCA CCCGATGGTG
GAGCAACTCC TTCTAGACAG CTATGTGCCC AATGACATCA ACCTGGATAG CAGCAAGACG
CGCGCTCTTC TTGTGACTGG CCCTAATATG GGTGGGAAGT CCAGCTACGT GCGCCAGGTG
GCACTTATTG CAATAATGGG GCAGATTGGC TCATATGTCC CAGCACAGGC CGCAAAGCTT
GGTATGCTGG ACGCGGTGTT CACCCGGATG GGCGCATTCG ACAATATGCT CGCAGGCGAG
TCTACCTTCA TGGTTGAGCT TTCCGAGACG GCAGATATAC TGAAGCAAGC AACGCCCCGC
TCTTTAGTAA TACTAGACGA GCTGGGCCGA GGCACGTCTA CCCATGATGG AGTCGCCATT
GCACAGGCCG TTCTCGACTA CATGGTGCGG TCTATCCGCA GTCTCACCCT CTTCATCACA
CATTACCAGC ATCTTTCTGC CATGGTGCAT TCGTTTCCTG ATGGCGAGCT GCGAAATGTG
CACATGCGAT TCAGCGAGTC GGGGACTGGC GCGGACGAAG ACATTACCTT TCTTTATGAG
ATTGGAGAAG GTGTCGCGCA TCGTAGCTAT GGGCTTAATG TTGCGCGGCT GGCAAACTTG
CCTGCGCCAC TTTTGGAGAT GGCCAAGCAG AAGAGTGCCG AGCTGGAGGA GAAAATTCGT
CGCCGAAGAC TTGCTGGTTT TGTTGCTGCG GTTGGAGCGG TAGTGCAGTC GAATCAGGCC
GATGAGAGTG TAATCGAGCG GCTGGTTAGC AGTATGGAGG AGCTGTAA
 
Protein sequence
MPLPSSQPSA SSSPNLKRKQ PTISSFFTKK PQAPKQSTSN EGPAPIDNDS EITDKLAEDD 
EEDIVAPVPK RTKSNGSLTV NRPQSPKAKS VSRVEQESSQ RTELSKFASS PAIETEGNEA
TELDGSAKVR QQEREKLHQR FVRKLGGPDC LVGIGRNCVG ETTSIEEAAE GDEDDETPQP
VQPKGKAGKK GGGKLTPMEK QVIEIKKKHM DTILLIEVGY KFRFFGEDAR IAAKELSIVC
IPGKFRYDEH PSEAHLDRFA SASIPVQRLH VHVKRLVAAG HKVGVVRQLE TAALKAAGDN
RNAPFVRKLT NVYTKSTYID DIESLEGSTA GASGASATGY ILCITETNAR GWGNDEKVHV
GIVAVQPTTG DIVYDEFDDG FMRSEIETRL LHIAPCEMLI VGELSKATEK LVQHLSGSKM
NVFGDKVRVE RAPKAKTAAA ESHSHVSSFY AEKMKSADAA DDEVASNLLQ KVLGLPDQVT
ICLSAMIKHM TEYGLEHVLQ LTKYFQHFSS RSHMLLNGNT LTSLEIYQNQ TDYSSKGSLF
WTLDRTQTRF GQRMLRKWVG RPLLDRRQLE DRVNAVEELK DFRNVVMVER IKGLLGKIKH
DLEKGLIRIY YGKMIAQEFA DIESPADTGF SSPAISQAIM SLPTILKDVV FFLNKINMHA
ARNDDKYEFF REEEETEEIS EHKLGIGAVE HELEEHRPVA GEALGKKMVT YVSVAGIDYL
VEVENNSPAI KRVPASWMKI SGTKKVSRFH TPEVVKMIRQ RDQHREALAA ACDKAFLALQ
AEIATNYQAL RDCVQSLATL DCLVSLATLA SQPGYVKPEY TEETCIHVEQ GRHPMVEQLL
LDSYVPNDIN LDSSKTRALL VTGPNMGGKS SYVRQVALIA IMGQIGSYVP AQAAKLGMLD
AVFTRMGAFD NMLAGESTFM VELSETADIL KQATPRSLVI LDELGRGTST HDGVAIAQAV
LDYMVRSIRS LTLFITHYQH LSAMVHSFPD GELRNVHMRF SESGTGADED ITFLYEIGEG
VAHRSYGLNV ARLANLPAPL LEMAKQKSAE LEEKIRRRRL AGFVAAVGAV VQSNQADESV
IERLVSSMEE L