Gene ANIA_00126 details


Gene Information

Locus tag: ANIA_00126
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001308
Strand:
Start bp: 4460034
End bp: 4462413
Gene Length: 2380 bp
Protein Length: 744 aa
Translation table:
GC content: 52%
IMG OID:
Product: DNA mismatch repair protein Mlh1, putative (AFU_orthologue; AFUA_5G11700)
Protein accession: CBF90147
Protein GI: 259489678
COG category:
COG ID:
TIGRFAM ID:
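The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding region ends with a single stop codon. The helper names `span_length` and `coding_length` are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 4460034
END_BP = 4462413
GENE_LENGTH_BP = 2380
PROTEIN_LENGTH_AA = 744


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_span = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
non_coding_bp = gene_span - cds_bp

# True if the coordinates agree with the reported gene length.
print(gene_span == GENE_LENGTH_BP)
# Coding bases (with stop codon) and the remainder within the gene span.
print(cds_bp, non_coding_bp)
```

Under these assumptions the span matches the reported gene length, and the bases left over after accounting for the coding region are consistent with the record flagging the gene as spliced.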


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.621771
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCCTG GAATGGACAT CGACGTCCCC GAGGCTCGGG GCACGAAACG GCCCATTGAT 
GATACTGAGG GTCCTCGTAA ACCCAGAAAG ATCAGAGTAA GCTATTACCG TTCGTGAATG
TTCAGGCCAA CTAATGATTG AAACAGGCGT TAGACCCAGA TGTCGTGAAC AAAATTGCGG
CAGGGGAAAT CATTGTTGCC CCGATGCATG CACTTAAAGA GTTGATAGAG AACGCAGTCG
ATGCCGGATC GACTTCGATT GAGATTCTTG TGAAGGAAGG GGGTCTCAAA CTGCTTCAAA
TTACAGATAA TGGTCACGGG ATTGATGTAA TATTGTCTAG CTCTGGTAAT GATGCACCTG
CTAATATCTC CTAGCGCGAT GACCTTCCCA TTCTCTGCGA GAGGTTCACT ACTTCGAAGC
TAAAGGAATT TGAGGACCTT TCGTCAATAG CCACATATGG GTTTCGTGGC GAAGCTCTGG
CTAGTATCAG CCATATCGCC CATCTCACTG TCACCACGAA GACGGCTGAC TCAAGCTGTG
CTTGGAGAGC TCATTATGCC GATGGAAAGC TGGTCCCTCC CAAACCAGGG CAGAGCGCAG
CTCCTAAGGC CACTGCGGGG CGTGGAGGGA CGCAGATCAC AGTTAGTCAA TGACCATATC
AAGATGAAGA CGTTTCTAAC ACTATCAGGT CGAAGATCTG TTCTATAATG TGCCTACCAG
ACGCAGAGCA TTTCGTTCAG CCAGCGAAGA GTACGCCAAG ATCCTTGATG TAGTCGGCCG
CTACGCTGTA CATTGCTCTG GTGTTGCTTT TTCATGCCGT AAACATGGTG ACGCTGGCGT
CAGCATCTCT ACGGCGGTTG CTCTCAACAC CATTGACCGA ATCCGCCAAA TCCATGGAAG
TGCAGTGGCC AACGAGCTGG TTGAGTTCAG CGTCAAGGAC GAAAAGTTAG GCTTCACGTC
ATCTGGACTC GTCACAAATG CAAACTACCA TGTCAAACGA ACCACCATCC TACTCTTCAT
AAATCACCGC TCGGTTGAGT CCACCGCTAT TAAGCGAGCC GTCGAGCAGA CATACGCCAG
CTTCCTACCT AAAGGCGGCC ACCCTTTTGT CTACATCGAC CTCGAGATTG AGCCACATCG
GCTAGACGTA AACGTGCATC CCACAAAACG AGAGGTCAAC TTCCTTAACG AAGACGAAAT
CATCGACAAC ATCTGCGCCG AAATCAGATC CAAGCTCTCT CAAGTTGACT CCAGCCGGAC
ATTCCTAACC CAGACACTCC TGCCCTCTAT CCAGACCCCC AAACGGTCAA GCCAAGTCCA
AGACGCTGAT GCAGCCCCTA AAACACCCGC CCCGACCAAA AAGCCGTACG AAAACAGCCT
CGTCCGCACA GACTCCCGTG TCCGCAAAAT AACCTCCATG CTCTCCCCTG CAACCAGTCA
ACCCCCATCC GCAACGCTCA ACCTCGAGGG CCAACTGGAA AATACTCAAA CCGTTCTTGA
CGACGGACTC ATCTATACAA CAACCGACCG CGAACCGCTC AAAATAGCCC TAACTTCCGT
AAAAAACCTC CGTGCGGCCG TGCGGTCCTC AATGCACCAG TCTCTCACCG AAACCATTGC
TTCACACACA TATGTTGGCC TCGTCGACGT AAACCGCCGC ATCGCAGCCG TGCAAGCCGG
CGTCAAACTC TATCTTATCG ACTACGGCAT GTTCTGCGCC GAGTTCTTCT ACCAGCTTGG
TCTCACTGAT TTCGGCAACT TCGGGACGAT CCAGCTAGAA CCGCCGCCTA AATTAATAGA
TCTGCTACAT ATAGCAGCGG AGTCCGAGCT GCAGCAAGCT AGCGAGGATT ATGAAGAGAA
AAGGGAGATA TTTTCGGCCG CCCCAGAGCT CGTAGCGAAA ACGCTCATTG ATAGGAGGGA
AATGCTCTCC GAGTATTTCT CTATCCAAAT CTCAGACGAC GGATACCTGC TTACTATCCC
CTTATTGCTG AAGGGCTATG TCCCTTGCCT AGGTAAACTA CCGCGGTTTC TGCTTCGTCT
CGGTCCGTAC GTCGACTGGA CAAGCGAGGA AGAATGTTTC CGCACGTTTT TGGCTGAGCT
CGCGGCATTC TATACACCCG AGCAGCTGCC TAGAATGCCG CCATCAGAAG AATTGAGAGC
CGAGTCTAGA GCTTCACAGG GGCACTCGGA CGCTGGTGAC GCAGACGCCG ATGCAGAGAA
TGAGTTCGTT AGCAAGCGCA GGGTGCAGCT AGCGAGCGCG TTAGAACATG TGATCTTCCC
CGCGTTGAGG GCGCGATTGG TTGCTACGAC GAAGTTATTG AGGGGTGCTG TGGAGGTAGC
GGACCTGAAG GGATTGTATC GGGTGTTTGA ACGATGCTAA
 
Protein sequence
MEPGMDIDVP EARGTKRPID DTEGPRKPRK IRALDPDVVN KIAAGEIIVA PMHALKELIE 
NAVDAGSTSI EILVKEGGLK LLQITDNGHG IDRDDLPILC ERFTTSKLKE FEDLSSIATY
GFRGEALASI SHIAHLTVTT KTADSSCAWR AHYADGKLVP PKPGQSAAPK ATAGRGGTQI
TVEDLFYNVP TRRRAFRSAS EEYAKILDVV GRYAVHCSGV AFSCRKHGDA GVSISTAVAL
NTIDRIRQIH GSAVANELVE FSVKDEKLGF TSSGLVTNAN YHVKRTTILL FINHRSVEST
AIKRAVEQTY ASFLPKGGHP FVYIDLEIEP HRLDVNVHPT KREVNFLNED EIIDNICAEI
RSKLSQVDSS RTFLTQTLLP SIQTPKRSSQ VQDADAAPKT PAPTKKPYEN SLVRTDSRVR
KITSMLSPAT SQPPSATLNL EGQLENTQTV LDDGLIYTTT DREPLKIALT SVKNLRAAVR
SSMHQSLTET IASHTYVGLV DVNRRIAAVQ AGVKLYLIDY GMFCAEFFYQ LGLTDFGNFG
TIQLEPPPKL IDLLHIAAES ELQQASEDYE EKREIFSAAP ELVAKTLIDR REMLSEYFSI
QISDDGYLLT IPLLLKGYVP CLGKLPRFLL RLGPYVDWTS EEECFRTFLA ELAAFYTPEQ
LPRMPPSEEL RAESRASQGH SDAGDADADA ENEFVSKRRV QLASALEHVI FPALRARLVA
TTKLLRGAVE VADLKGLYRV FERC
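
The sequences above are displayed in space-separated groups of ten residues, so they need cleaning before any downstream use. The following is a minimal sketch of that step, together with a plain G+C fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not state how its own GC content figure was computed, so the simple fraction used here is an assumption.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGGAGCCTG GAATGGACAT CGACGTCCCC GAGGCTCGGG GCACGAAACG GCCCATTGAT"
seq = clean_sequence(first_line)

print(len(seq))                     # residues on one display line
print(round(gc_fraction(seq), 3))   # G+C fraction of this line only
```

Applying `clean_sequence` to the full gene and protein blocks and comparing the resulting lengths with the reported 2380 bp and 744 aa is a quick way to confirm that nothing was lost when copying the sequences.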