Gene ANIA_04372 details

Gene Information

Locus tag: ANIA_04372
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001303
Strand:
Start bp: 2231480
End bp: 2232635
Gene Length: 1156 bp
Protein Length: 361 aa
Translation table:
GC content: 56%
IMG OID:
Product: Endo-polygalacturonase [Source:UniProtKB/TrEMBL;Acc:Q1HFT2]
Protein accession: CBF77661
Protein GI: 259482818
COG category:
COG ID:
TIGRFAM ID:
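
The listed gene length agrees with the start and end coordinates if they are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check follows; the helper name `span_length` is hypothetical. It also works out how many bases are left over once the coding sequence implied by the protein length is subtracted, assuming the gene span covers only exons, introns and the stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record.
gene_length = span_length(2231480, 2232635)
print(gene_length)  # 1156, matching the listed Gene Length

# 361 codons plus one stop codon, three bases each.
coding_bases = (361 + 1) * 3
# Bases not accounted for by coding sequence (assumption: these are intronic).
non_coding_bases = gene_length - coding_bases
print(coding_bases, non_coding_bases)  # 1086 70
```

A non-zero remainder is consistent with the record marking the gene as spliced.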


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.217421
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTTCC TTCAAAACTC CCTTATTGCT GCGGCAATGG GCGCTGCGCT GGTCGCTGCC 
GCCCCTGCTG CTGATCTTGA TGCTCGAAGC TCGTGCACCT TCACCTCTGC CTCTGCGGCC
AAGTCTGGCG CATCCAAGTG CTCCACTGTC ACCCTCAAGA GCATCCAAGT TCCTGCCGGT
GAGACCCTTG ACCTGACCGG TCTCAAATCG GGTGCTACTG TACGTAGTTC CCCTTCCCCC
TCGCCGCCTC ATTGATCTCG GCTTTCTAAC GGAGGAGCAG GTTATCTTTG AAGGCGAGAC
AACCTTTGGC TACAAAGAAT GGAAAGGACC GCTGATCTCC ATGTCCGGTG ACAAAATCAC
GGTTAAGCAA GCCTCTGGCG CAAAGATCAA CTGCGACGGG GCCCGCTGGT GGGACACCAA
GGGCAGCAAC GGCGGCAAGA CCAAGCCCAA GTTCTTCAGC GCGCATAAGC TGAACAACTC
CAAGATTCAG GGGCTGAAGA TCTACAACAC CCCTGTCCAG GGATTCAGTA TCCAGTCCGA
CCACCTGACC ATTTCGGACG TGACCATCGA CAACTCCGCC GGCACCAGCA AGGGCCACAA
CACCGATGCC TTTGACATCG GCTCCTCGAC GTACATTACC ATCGACGGTG CGACTGTCTA
CAACCAGGAT GACTGTATTG CCATTAACTC CGGCGAGCAC ATCACCTTCA CCAACGGATA
CTGTTCCGGC GGCCATGGCT TGTCTATTGG CTCCGTCGGC GGCCGCAGCG ACAACACCGT
CAAGAGCGTC ACCATCTCCA ACAGCAAGGT CGTCGACTCC CAAAACGGCG TCCGCATCAA
GACCGTCTAC AAGGCTACCG GCTCCGTCAC CGATGTCACC TTCCAGGACA TCGAACTCTC
TGGAATCACC AAGTACGGCC TCATTGTTGA GCAGGACTAT GAGAATGGTA GCCCAACAGG
TACCCCTACC AACGGTGTCG AGGTTGAAGA TATCACCTTC AAGAAGATTA CCGGCTCTGT
GGATAGCAGT GCCACACGTG TCAATATCCT CTGCGGGTCA GGGAGCTGCA AAGACTGGAC
TTGGTCTGGG GTTGATATTA CCGGCGGAAA GAAGAGCTCT AAGTGCAAGA ATGTTCCGTC
TGGTGCTTCG TGCTAG
 
Protein sequence
MHFLQNSLIA AAMGAALVAA APAADLDARS SCTFTSASAA KSGASKCSTV TLKSIQVPAG 
ETLDLTGLKS GATVRETTFG YKEWKGPLIS MSGDKITVKQ ASGAKINCDG ARWWDTKGSN
GGKTKPKFFS AHKLNNSKIQ GLKIYNTPVQ GFSIQSDHLT ISDVTIDNSA GTSKGHNTDA
FDIGSSTYIT IDGATVYNQD DCIAINSGEH ITFTNGYCSG GHGLSIGSVG GRSDNTVKSV
TISNSKVVDS QNGVRIKTVY KATGSVTDVT FQDIELSGIT KYGLIVEQDY ENGSPTGTPT
NGVEVEDITF KKITGSVDSS ATRVNILCGS GSCKDWTWSG VDITGGKKSS KCKNVPSGAS
C
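
Both sequences are printed in blocks of ten residues separated by spaces, so the whitespace has to be removed before they are used elsewhere. A minimal Python sketch of that cleanup is given below, together with a simple GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here only as a short example.
example = clean_sequence("ATGCATTTCC TTCAAAACTC\n")
print(len(example))         # 20
print(gc_percent(example))  # 35.0
```

Applied to the full gene sequence, the same two functions give a length and GC percentage that can be compared with the Gene Length and GC content fields of the record.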