Gene ANIA_03947 details


Gene Information

Locus tag: ANIA_03947
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001302
Strand:
Start bp: 2493433
End bp: 2495470
Gene Length: 2038 bp
Protein Length: 544 aa
Translation table:
GC content: 50%
IMG OID:
Product: major apurinic/apyrimidinic endonuclease (Eurofung)
Protein accession: CBF75032
Protein GI: 259481477
COG category:
COG ID:
TIGRFAM ID:
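
The reported gene length can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; the function name `gene_length` is hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
length = gene_length(2493433, 2495470)
print(length)  # 2038, matching the reported gene length
```

Under the same assumption, a 544 aa protein needs 544 x 3 + 3 = 1635 coding bases including a stop codon. That is fewer than 2038, which is consistent with the gene being marked as spliced.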


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCCCC GAGGAGCTCG AAAAGCGGCG GGGATTGAGA CGAACAAACG GGAATTAGGT 
GATGACCACG AGTTACTCTC TACAAGACGG CAGTCGAAAC GTATCAAGTC GGCCGAACAA
CTTCGGTCTA CCCCATCTAG CCCACAACCC AAATCTAGAA GAGTTCAGGT GAAGAGAGAG
GCTCGATATT TGCAGGAGGA GACAACAAAA GCTGAAAACC CCACGCTTTT AAACACAGAG
GAGCTTACGG AGGTGAAATC GGAAGTCTCT GTTAAAGTCA ATGAAGAGGA GATAAAGGAA
CAAACATCTG CGACAAAGGC CACTAGAAAA CGGAGGTCTA CAAAGAAAGA AGAAGCTGAG
ATGACGCCAT TAAGGGTACG AACACAGGGA TTGCGGATGT TCGTCGGCGC TCATGTAAGC
GCAGCAGGAG GTAAGAATCT TTCCGGCTGC TTCGCTCCGG TCAAGGCTTC GAAGCTTATA
CTCATAATTT TATCAGGAGT TTTCAATGCC GTGAATAACA GCAAACACGT TGGGTTAGTC
CTCAGCTATA GCAAAGCCCA CCTCATATAT AACTGTGGCT GATTATGCTC AGCGGCAACG
CTTTCGCCCT TTTCCTGAAA TCTCAGAGGA AGTGGGACAA CCCTCCGCTC CAGGATGAGC
ATAGAGATAA CTTCGTCAAG CTTTGCAAGG AGCACGACTA CGATGCTGCG AAGTACGTCA
GAGTGTCCTG TCTAAAGGGA GACGCTTGGC CGGACTTGGT GCGCTAATAT GGCGCAGATA
TGTTCTCCCT CATGGCTCTT ATCTGGTTAA TCTAGCACAG GAGGATGAGG CCAAAGCGAA
ACAAGCATAT GACGCGTTCT TAGACGATCT GCGTCGCTGC GAAGCACTCG GGATCAGGCT
TTACAACTTT CAGTAAGGCA GCAGCTTCCA CACTCGTTCA CTGCTAACGC GAGTAGCCCC
GGAAGCACAA ATAAGACTCC CCTCTCCAGC GCACTTGCTC GGCTTGCCAA AGCCCTGACC
AACGCCCTTG CCGCGACGTC CAATGTGGTA CCCGTCCTAG AGACCATGTG TGGTCATGGC
TCAACAATTG GTGGCTCTCT GTCCGAATTC CGGGACCTTC TTGCTCTCAT CCCAAAAGAA
TACCATTCAC GCATTGGCGT GTGTATAGAT ACCTGTCACA GCTTTGCAGC CGGGTATGAT
CTCGTGTCTC CGGCCGGATT CCAAGCATTT TTGAAAGAGT TTGAGGACTT AATTGGCATA
CAGCATCTAC GTGCCCTCCA CCTTAACGAC TCTAAAGCGC CCGGCGGCAG CAAACGCGAC
CTGCACGCGA ATATCGGAAC TGGCTTCCTT GGTCTAAGAG CGTTTCACAA TGTCATGAAC
GAGCCTCGCT TTGAGGGATT ACCAATGATT CTCGAAACGC CAATCGATCG AATACCAGCT
GCCGCTACAA ATAGGGCAAA ACAAGAAGCA GCGGAGGAAG ATGTTGAGTC AGGAGCCAGT
GATGACGAAA TCAAGCCAAA GACCAAGAAA AAACAACAGA AGAAACCCGC TGCGGCCAAG
GCAGTCCCAG ATTACTCTAT CTGGGCTCGC GAAATCGCCC TCCTTGAGTC TTTGATTGGC
ATGGATCCCG AAAGTGATGA GTTCCGTGCG CTTGAAGCCG AGCTCTCTGA AGAGGGCCGA
GAGACGCGGG AGAAGCACAT GGAGCAGTAT CTACGGAAAC AAGAAACTGA GGAAAAGAAG
AAGGCCAAGA GTGGGGGAAA GCAGAAAACA CTGATAGGAA TGATGAATGG AAGCAAGGGA
AGAGGTAAAT CGACCACGAA GAAGGGATAC GAGACAGAAA GTGAGGACGA AGGTTGCCAG
AGCTGCTGAC GCGACGCGTT TTGGCGAAGG CTAGCTAAGA GCGCTGGCGG TGACAATTAA
AGGTTAACTG AGAAAATACC AGCAGGGAAT AAATGCCGAG CGCAATGCGC TCTGAAAAAC
ACTCCAAAGC ATTGAACGTC TGCCCAGATA TATAGTCTCA GCGGGGATGA GACTTTTA
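
The gene sequence is printed in blocks of ten bases, so it has to be joined before it can be measured. The following is a minimal sketch of how one might do that and compute GC content; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGCCTCCCC GAGGAGCTCG AAAAGCGGCG GGGATTGAGA CGAACAAACG GGAATTAGGT"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Pasting the full sequence block into `clean_sequence` should give a string whose length can be compared with the reported 2038 bp, and `gc_percent` can then be compared with the reported 50%.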
 
Protein sequence
MPPRGARKAA GIETNKRELG DDHELLSTRR QSKRIKSAEQ LRSTPSSPQP KSRRVQVKRE 
ARYLQEETTK AENPTLLNTE ELTEVKSEVS VKVNEEEIKE QTSATKATRK RRSTKKEEAE
MTPLRVRTQG LRMFVGAHVS AAGGVFNAVN NSKHVGGNAF ALFLKSQRKW DNPPLQDEHR
DNFVKLCKEH DYDAAKYVLP HGSYLVNLAQ EDEAKAKQAY DAFLDDLRRC EALGIRLYNF
HPGSTNKTPL SSALARLAKA LTNALAATSN VVPVLETMCG HGSTIGGSLS EFRDLLALIP
KEYHSRIGVC IDTCHSFAAG YDLVSPAGFQ AFLKEFEDLI GIQHLRALHL NDSKAPGGSK
RDLHANIGTG FLGLRAFHNV MNEPRFEGLP MILETPIDRI PAAATNRAKQ EAAEEDVESG
ASDDEIKPKT KKKQQKKPAA AKAVPDYSIW AREIALLESL IGMDPESDEF RALEAELSEE
GRETREKHME QYLRKQETEE KKKAKSGGKQ KTLIGMMNGS KGRGKSTTKK GYETESEDEG
CQSC
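
Because the gene is marked as spliced, translating the whole genomic sequence would not be expected to reproduce the protein. The start of the gene can still be checked against the start of the protein. The sketch below assumes the standard genetic code (the Translation table field in the record is empty), and `translate` is a hypothetical helper rather than part of any library.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Amino acids are listed for codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string codon by codon, ignoring a trailing partial codon."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases of the gene sequence and first 20 residues of the protein.
gene_start = "ATGCCTCCCC GAGGAGCTCG AAAAGCGGCG GGGATTGAGA CGAACAAACG GGAATTAGGT"
protein_start = "MPPRGARKAA GIETNKRELG".replace(" ", "")
print(translate(gene_start) == protein_start)  # True
```

The match covers only the first 60 bases. Beyond that, the intron positions would be needed, and they are not given in this record.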