Gene ANIA_10621 details

Gene Information

Locus tag: ANIA_10621
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001303
Strand:
Start bp: 304832
End bp: 308038
Gene Length: 3207 bp
Protein Length: 945 aa
Translation table:
GC content: 52%
IMG OID:
Product: DNA mismatch repair protein Msh2, putative (AFU_orthologue; AFUA_3G09850)
Protein accession: CBF76294
Protein GI: 259482119
COG category:
COG ID:
TIGRFAM ID:
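
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the coding sequence ends with a single stop codon. The function names are hypothetical and only serve the illustration.

```python
# Values copied from the record above.
START_BP = 304832
END_BP = 308038
PROTEIN_LENGTH_AA = 945


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_bases(protein_length: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_length + (1 if include_stop else 0)
    return 3 * codons


length = span_length(START_BP, END_BP)
cds = coding_bases(PROTEIN_LENGTH_AA)

print(length)        # 3207, matching the reported gene length
print(cds)           # 2838
print(length - cds)  # 369 bases not accounted for by coding sequence
```

Under these assumptions the span matches the reported 3207 bp, and the 369 bases left over are what one would expect to see as non-coding sequence in a gene flagged as spliced.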


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGGTC AAGATTACTT TGACGTGACG CGAGATCACG TGCCCCGTCG CGTCTTACGC 
GCCGAGTTAA TCTATCTATC AACCGATAAG AGGGAGAGCT TAAGACATAC ATTCTTCACA
GCACACAAGC CAGTATTCAG GTCATCCTCA AGAGCAGGAG TCATTGCTGG CATCATCATG
TCGTCTCGGC CAGATCTCAG GGTAAGTTCT TTTTAGCCTA GAAACCGAAC AGCTTCTTAC
TAACTTTCTT TCTAAAGGTT GACGACGAAG TCGGCTTTAT CCGCTTCTAC CGCTCCCTCG
CCTCCGATGA CTCCCATAAT AATGAAACAA TTCGCATCTT CGACCGTGGG GATTGGTACT
CAGCCCACGG CAAAGAAGCC GAATTCATCG CTCGCACAGT CTACAAAACA ACCTCTGTCC
TTCGTAATCT CGGCCGCAGC GAAACGGGCG GCTTGCCGTC CGTCACAATG AGCATTACTG
TCTTCCGTAA TTTTTTACGT GAGGCTCTAT TCCGGCTAAA TAAGAGGATT GAGATCTGGG
GCTCCGCCGG CACGGGCAAA GGGCACTGGA AGAAGGTTAA GCAGGCGAGT CCCGGAAATC
TGCAGGATGT GGAGGAGGAA TTAGGGGCAA TGGGTATGGA GGGAAGTAAC GGAGCGCCCA
TTATCATGGC AGTGAAGCTT AGTGCAAAGG CCGGGGAGGC GCGAAATGTA GGTGTTTGTT
TTGCAGATGC AAGTGTGCGC GAGCTTGGTG TGAGTGAGTT CCTGGACAAT GATGTTTACT
CAAACTTTGA GGCGCTTGTT ATCCAGCTCG GTGTGAAAGA GTGTCTCGTT GTGCAGGATG
TCAATCGGAA GGATGTGGAG GTGGCCAAGA TCCGAGCAAT ATGTGATAAC TGCGGGATAG
CGATATCGGA GCGCCCGGCA TCTGATTTTG GGGTTAAGGA TATTGAACAG GACCTTACAA
GGTTGCTGAG GGATGAGCGG TCGGCTGGGA CACTGCCGGA GACGGAGCTG AAGCTTGCGA
TGGGCGGTGC GGCGGCGCTA ATTCGGTATT TGGGCGTGAT GTCGGATGCG ACAAATTTCG
GGCAGTATCA ACTCTACCAG CATGATTTGG CGCAGTACAT GAAGCTCGAT GCGGCGGCAT
TGAGAGCTTT GAATCTTATG CCTGGGCCGA GGGATGGATC AAAATCGATG AGTTTATTTG
GGCTGTTGAA TCATTGTAAA ACGCCTGTTG GGAGCCGGTT GCTGGCACAG TGGCTGAAAC
AGCCGTTAAT GGATCTGGCG GAGATTGAAA AGCGGCAAAG GCTTGTTGAG GCGTTTGTCG
TGAGCACGGA GCTTCGGCAG ATGATGCAGG AGGAGCATCT ACGATCTATT CCGGATCTGT
ATCGGCTTGC GAAACGATTC CAGCGAAAAC AGGCGAATCT GGAAGATGTA GTGCGTGTGT
ATCAGGTTGC TATTCGGCTG CCTGGGTTTG TGAACTCTCT GGAGAATGTT ATGGATGAGG
AGTACCAGAC GCCGCTTGAG ACAGAGTACA CGGCCAAGCT ACGCAACCAT TCGGCGAGCC
TGGCGAAACT GGAGGAGATG GTCGAGACGA CGGTTGATCT GGATGCCCTC GAGAATCACG
AGTTCATCAT CAAGCCCGAA TTCGATGATA GTCTGCGCAT CATTCGCAAA AAGCTGGATC
AGTTGCGCCA TGATATGTAC CTTGAGCATA AGGCTGTCGC GAGAGACCTA GATCAGGAAA
TGGACAAGAA GCTGTTCCTG GAGAACCACC GCGTGTACGG ATGGTGTTTC CGTCTGACGC
GGAATGAGGC GGGTTGCATT CGCAACAAGA AGGCCTACCA GGAGTGCTCA ACGCAGAAGA
ACGGTGTGTA CTTTACCACA TCGACGATGC AATCTCTCCG CCGGGAACAT GATCAGCTCT
CCTCCAATTA CAACCGCACC CAGACGGGAC TTGTCTCGGA GGTTGTCAAC GTTGCAGCAT
CGTACTGTCC GGTCCTGGAA CAACTAGCCG GCGTCCTGGC TCACCTCGAT GTCATTGTGA
GCTTTGCGCA CGCCTCTGTA CACGCGCCAA CAGCCTATAC GAAACCCAAG ATCCACCCGC
GCGGCACGGG CAATACAGTC CTTAAAGAAG CACGCCACCC CTGCATGGAG ATGCAGGACG
ACATCTCCTT CATAACTAAT GATGTCTCCC TTATCCGCGA CGAGTCCTCA TTCCTTATCA
TCACTGGCCC CAATATGGGC GGTAAATCGA CCTACATCCG CATGATTGGC GTTATAGCGC
TCATGGCGCA GATAGGCTGC TTCGTGCCCT GCACCGAAGC AGAGTTGACG ATCTTTGACT
GCATCCTTGC CCGTGTTGGT GCGAGCGATT CGCAGCTTAA AGGCGTTTCT ACGTTCATGG
CGGAGATGCT CGAAACGTCA AACATCCTCA AGTCAGCGAC CTCTGAGTCC TTGATCATCA
TTGATGAATT GGGCCGCGGC ACTAGCACTT ACGACGGATT CGGTCTCGCC TGGGCGATTT
CAGAGCACAT TGTGACCGAA ATCCGCTGCT TTGGACTATT CGCGACACAC TTCCACGAAC
TCACGACACT TGCAGATCGA TACCCCAAGT CTGTCAAGAA CCTGCACGTC GTTGCATTCA
TCGGCGACGG AACAACAGCG AACGAAGAAG ACGAAAAAGA GAAGAGAAAG ACCCGGCAGA
AGGTAACATT GCTCTACCGC GTTGAACCAG GCATCTGCGA CCAGTCTTTC GGCATCCACG
TCGCTGAGCT CGTCCGCTTC CCAGAGAAGG TAGTCAACAT GGCGCGGCAG AAGGCCGAGG
AGCTGGAGGA TTTTACGTCC GCTGATTCCG CTGGAAATGC TGCGTCAGCG ACGATTGATA
AGTACTCGCA GGAGGAAGTC GAAGAAGGGA GTGCGCTGTT GAAAGCGCTG CTGGTGAAGT
GGAAGAGTGC AATTGAGGAG CCGGGGAGAG AGCTGACGCT TGAGGAGAAG AGGCAGGTTA
TGAGGGATTT GGTAAAAGGG GACGAGAAGT TGCAGGCGAA TAGAGTCTTC CAGGGGATTC
AGGCGCTGTG AGCCATCATC CAGAGTTGTT TCTGGTCTTC GGTTTCGGGT TATCTATGCT
TCTGTCTGCA ATGGTGGTAG GTAGTTATGG CGTTCGGGTA GATTCTACGC AAGGTTGTAT
AGGTCATTCA ACTGCCAGGT AATGGAT
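
A figure such as the GC content reported in the record can be computed directly from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases. The function name gc_content is hypothetical, and the record does not state how its own value was derived or rounded.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a sequence.

    Whitespace (the spaces and line breaks of the 10-base block layout)
    is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above.
print(gc_content("ATGATTGGTC"))  # 40.0
```

Passing the whole gene sequence as one multi-line string gives the value for the full 3207 bp.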
 
Protein sequence
MIGQDYFDVD DEVGFIRFYR SLASDDSHNN ETIRIFDRGD WYSAHGKEAE FIARTVYKTT 
SVLRNLGRSE TGGLPSVTMS ITVFRNFLRE ALFRLNKRIE IWGSAGTGKG HWKKVKQASP
GNLQDVEEEL GAMGMEGSNG APIIMAVKLS AKAGEARNVG VCFADASVRE LGVSEFLDND
VYSNFEALVI QLGVKECLVV QDVNRKDVEV AKIRAICDNC GIAISERPAS DFGVKDIEQD
LTRLLRDERS AGTLPETELK LAMGGAAALI RYLGVMSDAT NFGQYQLYQH DLAQYMKLDA
AALRALNLMP GPRDGSKSMS LFGLLNHCKT PVGSRLLAQW LKQPLMDLAE IEKRQRLVEA
FVVSTELRQM MQEEHLRSIP DLYRLAKRFQ RKQANLEDVV RVYQVAIRLP GFVNSLENVM
DEEYQTPLET EYTAKLRNHS ASLAKLEEMV ETTVDLDALE NHEFIIKPEF DDSLRIIRKK
LDQLRHDMYL EHKAVARDLD QEMDKKLFLE NHRVYGWCFR LTRNEAGCIR NKKAYQECST
QKNGVYFTTS TMQSLRREHD QLSSNYNRTQ TGLVSEVVNV AASYCPVLEQ LAGVLAHLDV
IVSFAHASVH APTAYTKPKI HPRGTGNTVL KEARHPCMEM QDDISFITND VSLIRDESSF
LIITGPNMGG KSTYIRMIGV IALMAQIGCF VPCTEAELTI FDCILARVGA SDSQLKGVST
FMAEMLETSN ILKSATSESL IIIDELGRGT STYDGFGLAW AISEHIVTEI RCFGLFATHF
HELTTLADRY PKSVKNLHVV AFIGDGTTAN EEDEKEKRKT RQKVTLLYRV EPGICDQSFG
IHVAELVRFP EKVVNMARQK AEELEDFTSA DSAGNAASAT IDKYSQEEVE EGSALLKALL
VKWKSAIEEP GRELTLEEKR QVMRDLVKGD EKLQANRVFQ GIQAL