Gene ANIA_08341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_08341 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001305 
Strand
Start bp172275 
End bp174208 
Gene Length1934 bp 
Protein Length608 aa 
Translation table 
GC content53% 
IMG OID 
Productarylsulfatase, putative (AFU_orthologue; AFUA_8G02520) 
Protein accessionCBF80344 
Protein GI259484268 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.333893 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTTG CAAGGACAAC ATTATGGCTC ACGTTGGCCC TGGCAGTCAA TGCCTTCCCT 
TCCCTTTTCG ATCCTCTTCA GCAGATCCTC GGTCCAACAC CGCAAACATT ACCGAATAAG
CCGAACTTCG TCTTCATCAT CACTGATGAT CAGGACCTTC AGCTCGACTC TATTGATTTT
ATGCCCCTGG TCTCAAAGCA CCTCAAGCAG AAAGGTACCT TCTTCAGCAA CCACTTTGTC
ACCACGGCGC TCTGCTGCCC GTCGCGCGTG AGCCTGTGGA CAGGGCGTCA GGCTCACAAC
ACGAATGTCA CTGACGTGAC TCCTCCATAT GGTAACCCTA ACCCTATTTT GTCAGGCGCA
ATGGGTTCTA ATAGCATTGT AGGCGGGTAC CCCAAGTTCG TTGACCGCGG CTTCAATGAC
AACTTCCTGC CAGTCTGGCT CCAAAGCGCC GGATACGACA CTTACTATAC GGGTAAATTG
TTCAACGCCC ACACCGTCGA TAACTACCAC AGCCCCTACG TCAACGGGTT CACCGGCTCT
GATTTCCTCT TGGACCCTTT CACCTATTCG TACCTAAATT CGACGTACCA ACGAAACGGC
GACGAGCCGG TGAGCTATGA AGGGCGACAT ACCGTCGACG TCATCACTGA GAAGGCTCTG
GGGTTTTTGG ACGACGGCCT AAATGGCGAC CGTCCGTTCT TTCTGACCGT TGCTCCGGTT
GCTCCTCATT CGAACGTGGA TGTCAGCGCT CTTGGGGCAG ATCGCAGAGC GCCTACGATT
ATGACAGAGC CTATCCCGCT CGATAGACAC AAGTCTCTGT TCCAAGACGT GAAGGTTCCT
CGTACCAAGC ATTTTAATCC AGACGAACCG AGCGGCGTCA GCTGGATCAG AGACCTGCCT
CAGCAGGATG AATCGACCAT CGAATACAAC GACCACTTTT ACCGGCAACG GCTGCGCGCT
CTGCAGGGTG TCGATGAGCT TGTCGACTTG ATTGTCACTA GACTCGAGGC CAGCGGCCAG
CTCGATAACA CGTACATCAT CTATACTTCC GACAATGGGT ACCACATCGG ACAGCACCGG
CTGCCGCCCG GTAAAGCATG CGGGTTCGAG GAGGATATTC GCGTGCCCTT GTTCATCCGT
GGGCCTGGTG TGCCTGAGAA CAAAGTGAAG GAAGCCGTTA CCACGCATAT TGACCTCGCG
CCCACGATCT TTGACCTAGC GGAGATACCA CTGCGCGAAG ACTTTGACGG CACGCCAATC
CCTCTTCCAA GCGCGGAAGA TAATGCCTCT ATACGCCATG AGCACGTCAC TGTCGAGTAT
TGGGGAAAGT CGTACCTTGA GGGGGAGAAG GGCCCTCTGA GTAAGTGTAC CAGTTTCGGT
TCCTGTTGTA GAGCTGACTC GCTGTCGTAG GCAACCCAGA GAACCTCCCA TTCTTCACCA
ACAACACTTA CAAGTCAGTA CGAATTATTG GGGAGGGCTA TAACCTCTAC TATTCCGTAT
GGTGTAATAA CGAACACGAG CTGTACGACT TAACTGTAAG TACTAACCCA CGGTATGGCC
CAGTCTAATA CGACACTAAC CTGCCATTCC AGGCCGATCC TTACCAGCTG AACAACCTCT
ACAACTCCAA CGACAAGTCA GTCATGGTCT TCGGCCACAG CCTCTCGCAG GTAATCAGCA
GGCTAGACTC AATTCTGCTC GTGCTCAAAT CCTGCAAAGG CGCTACTTGT ACCAAGCCGT
GGGAAGTCCT CCACCCGCGC GGCCATGTGA AGAACCTAAA GGACGCATTG AATCCGTTAT
ACAATGCATT CTACGCAGAC CAGGCCAGGG TTTCCTTTGA TCACTGTGAG CATGGGTATA
TCCCTGAAGT AGAGGGGCCA CAGGACGCGC TCCCGTTCAC GAGATATGGG TTGAATTGGG
ATATCTGGAC GTGA
 
Protein sequence
MKVARTTLWL TLALAVNAFP SLFDPLQQIL GPTPQTLPNK PNFVFIITDD QDLQLDSIDF 
MPLVSKHLKQ KGTFFSNHFV TTALCCPSRV SLWTGRQAHN TNVTDVTPPY GNPNPILSGA
MGSNSIVGGY PKFVDRGFND NFLPVWLQSA GYDTYYTGKL FNAHTVDNYH SPYVNGFTGS
DFLLDPFTYS YLNSTYQRNG DEPVSYEGRH TVDVITEKAL GFLDDGLNGD RPFFLTVAPV
APHSNVDVSA LGADRRAPTI MTEPIPLDRH KSLFQDVKVP RTKHFNPDEP SGVSWIRDLP
QQDESTIEYN DHFYRQRLRA LQGVDELVDL IVTRLEASGQ LDNTYIIYTS DNGYHIGQHR
LPPGKACGFE EDIRVPLFIR GPGVPENKVK EAVTTHIDLA PTIFDLAEIP LREDFDGTPI
PLPSAEDNAS IRHEHVTVEY WGKSYLEGEK GPLSNPENLP FFTNNTYKSV RIIGEGYNLY
YSVWCNNEHE LYDLTADPYQ LNNLYNSNDK SVMVFGHSLS QVISRLDSIL LVLKSCKGAT
CTKPWEVLHP RGHVKNLKDA LNPLYNAFYA DQARVSFDHC EHGYIPEVEG PQDALPFTRY
GLNWDIWT