Gene Information |
Locus tag | ANIA_07229 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001304 |
Strand | - |
Start bp | 604013 |
End bp | 606865 |
Gene Length | 2853 bp |
Protein Length | 925 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | sulfatase domain protein (AFU_orthologue; AFUA_2G17610) |
Protein accession | CBF78807 |
Protein GI | 259483427 |
COG category | |
COG ID | |
TIGRFAM ID | |
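The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence includes one stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 604013
end_bp = 606865
protein_length_aa = 925


def gene_length(start, end):
    """Span length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def coding_length(protein_aa, include_stop=True):
    """Coding bases for a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = gene_length(start_bp, end_bp)        # matches "Gene Length | 2853 bp"
cds_len = coding_length(protein_length_aa)      # 926 codons including the stop
non_coding = gene_len - cds_len                 # bases in the span that are not translated

print(gene_len, cds_len, non_coding)  # 2853 2778 75
```

Under these assumptions, the gene span is 75 bp longer than the coding sequence. A positive difference is consistent with the "Is gene spliced | Yes" entry, although the arithmetic alone does not show how those bases are distributed between introns and other untranslated sequence.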
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0143959 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.252406 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACCG TACACCAACT CAAGCTCTAT CGCAGCTGCA TCTCCGTCTG GAACCGTCTG CGGTCGTCGC CTGCTGCCTT CTTCGATGTC TTCTGGGACT GGACCCGGCG TTTCTTCTTT ACGCTGAGCT TGCTCGCACT CTTCTCTGCC AAATCGCTGC ATCTCTACGC GCATCTGCAT TCCCTCCCCG CCGACAAGTT CCTGCTTTGG GGGGTCACCT TTTTTACGCA GGATGTCGCG TGCACTCTTC TGATCCGGAT CTGCACCCAA AAATTCCCGT GGCGATGGCT GGACGCTCTG GCGGCGCTGG TCGTGATTCC GTTTAGGTAC TTGGTTTCTG CTTCAGAGAT AACTGCGTGG CTAATTATGT TGTTTGTGTT AGTCTGACCA TGTCCGGAAT GGCCTCGGCC AATATCTCGT TCTTCGTCAC CGCCGGTGCT GAGATCAATT GGCGACAGGC GAAATCGTTC CACCGTGATG CGGCGGCGAT CCGCACTCTG CTCACGGGCT TGACCGGATT CTTGATCGTC GAGGCGATCA TGACGGTCAT CGCCTGGCTT GTTGCCCCTT TCCTGCACCG GCTCGTCGGC GGCGTATTGC ATATACTCGC CGAGCCGTTC AAGATCCTGT TTAGACCGCT GTTCACCCGC GCTGCCAGCC TGTCGTCGCG GATCTGGCGC AGGCGGCTCG GCAGCGAGAC GCTGCCCGAC CCGGACGTCT ACGAGCAGAT CGCCGTCGAG GATTATCACG ACTACAAGAG CGACGAGGAG GATGAATATT CGGATTATTC CCAGAGGCCG CCGCAGCGCA TATCGCTCAT CAAACGGCTG GTGGTCTGGT TGCCATTGCT TTGTCTGGGC CTCCTGCGTA GCGTGCGGCC ACCGTACCCG TCGTATATTT TCCTCTCCAG CGCCCTTCCG ATGACGCCGT TTGCTGGAAT GCACCGTCCA ACGTTGGGCG AGCAGGCGGG GAATATCCCC GACTACGCCT GGCTTGAGGG TAAGACGTCG CTTGGCAAGC CCCCCAACTG GGACTGGATG CCGGAGGAGA CACTGCCTGG GTTCAGGGAC TGGCACAACA AGCGCGAGCA TTATACTCCG TCGCAAGACC CGCTGCATCT GTCGAACCTC CAGGCACCGG TCCTGGACGA ACTAAAGGAC GTCCTGGCCA GCGGCGAGGT GAACATCAAG CATGTCATCC TCCTCAAACT CGAGAGCACG CGAGGCGATG TCTTTCCTCT GCGCAACGGC TCCTTCATGT GGAATAAGAT CGTGGACTCG TTTGACGGAA AGGAAATGCC AGAGAGTGCT GTACGAACAG TGGCCAACCT CACTCGCACG GCCGAGTATC TGACTGGGTT CGACTCTGGC TTTGACCAGT ACCGCGACGG CGAGCGTAAA TCGTACGGCG GGATCAGCGC CAGCAACGCC TTCACGACAG GGACATATAC CATCAAATCG GTGGCCGGAA CGGTGTGTGG GATCTCACCG CTTGTCGCCG ACTTTAACCG CGAGTACAAG TATCACTTGT ACAACCCGTG CATGCCGCAC GTCGTCAATG CACTCAGCCA CCAGGCCGAC ATCACCAACG GCTCGGATTA CCGCACCTGG CCATGGGAGT CGATCTGGAT GCAGTCGGTC ACAGACACCT ATGACCATCA GGATCTCCTG ACGCCACGAT TGGGCTTCCG AGATATCTAT ACCAAAGAGC GCATAGAGAA CCCGGGTGCA AAGCACTACC CGGTCAAGTC GAAAGAGGTC AACTACTATG GCTACCCGGA TACAGAGCTG AAGGAGTACA 
TCCGTGACGC CTTCGACGAC GCGGAGGAGA ACAATAAGCG TCTCTTCCTC GCCCATCTCA CCGGAACAAC GCACCACCCC TGGGGCATGC CGGACGACAA CTACGAGAAC ATCATGGGCC CGTCCTTCAA GGGCAAGAAC AACGATATGA ACAAGTATCT GAATACGATC GGGTTTGCAG ACCGCTGGAT CGCTCAGATC CTCGATATCC TGGAGGAAAA AGGCGTCCGT AATGAGACCC TCCTCGTGAT GGCGGGCGAC CACGGCCTCT CCCTCCCCAA CGACGGCGGG ATTACGCCCT ACAGCAACCC GCACATTGGC TCCTTCCACG TCCCCATCGT TTTGGCGCAC CCGAAACTAC CATCTATCGA AGTGAAGGAC CCTGTCATTT CCCTTCAGAT CGTCCCCACC ATCATCGACC TACTGATCGA ATCCTCTTCT CTTGGCCCCA ACTCCACACA AGCGGCACGG GACATCCGCG GGCTCTACGA GGGCCAATCT CTCATCCGCC CACTCTTCCA AGAGGCAAAC GGCATGCAGG ACTGGCAGTT CAGCGTCATG AATACTGGCG GGTCGTGGCT CGCCGTCCGG TCGGCCGCAC GGCCTCAGTG GCGCATAGTC GTCCCGCTCA TCGACGACGT CGAGTGGCGG TTCACCGACA TCGAAAAGGA TCCCCAGGAG ACGAAGCCGT TGACGAGCTT CAGCTTCTTC GATCTCATGG ATACGCTTTG GCGCGAGTAT CGGGATGACA AGAGCAATAC TCCCGAAGAG CATGACGAAG AGCCCGTCGA GCGCGATGGT CATGAATTCT TCAAACACCC TCCTCCTCCA CCTCCCCCCG GGCATCACCC TCACCACCCT CCCCCTCCAG GCGGCGGCCC TCCTCCGCCT CCCATCACGA TCACGCCGTC TCGTCCGCAT CCCGAAGATG CACCTTGGGA GCCAGAGGTC GTGCGTTGGG CACGCGACGC TGCACATATG GCCGAATGGT GGATTGCAGA CAACCACCGG CGGTATGGCT ATAAGAAGCC GTGACCGGTC AGGGAAGGTT GAT
|
Protein sequence | MRTVHQLKLY RSCISVWNRL RSSPAAFFDV FWDWTRRFFF TLSLLALFSA KSLHLYAHLH SLPADKFLLW GVTFFTQDVA CTLLIRICTQ KFPWRWLDAL AALVVIPFSL TMSGMASANI SFFVTAGAEI NWRQAKSFHR DAAAIRTLLT GLTGFLIVEA IMTVIAWLVA PFLHRLVGGV LHILAEPFKI LFRPLFTRAA SLSSRIWRRR LGSETLPDPD VYEQIAVEDY HDYKSDEEDE YSDYSQRPPQ RISLIKRLVV WLPLLCLGLL RSVRPPYPSY IFLSSALPMT PFAGMHRPTL GEQAGNIPDY AWLEGKTSLG KPPNWDWMPE ETLPGFRDWH NKREHYTPSQ DPLHLSNLQA PVLDELKDVL ASGEVNIKHV ILLKLESTRG DVFPLRNGSF MWNKIVDSFD GKEMPESAVR TVANLTRTAE YLTGFDSGFD QYRDGERKSY GGISASNAFT TGTYTIKSVA GTVCGISPLV ADFNREYKYH LYNPCMPHVV NALSHQADIT NGSDYRTWPW ESIWMQSVTD TYDHQDLLTP RLGFRDIYTK ERIENPGAKH YPVKSKEVNY YGYPDTELKE YIRDAFDDAE ENNKRLFLAH LTGTTHHPWG MPDDNYENIM GPSFKGKNND MNKYLNTIGF ADRWIAQILD ILEEKGVRNE TLLVMAGDHG LSLPNDGGIT PYSNPHIGSF HVPIVLAHPK LPSIEVKDPV ISLQIVPTII DLLIESSSLG PNSTQAARDI RGLYEGQSLI RPLFQEANGM QDWQFSVMNT GGSWLAVRSA ARPQWRIVVP LIDDVEWRFT DIEKDPQETK PLTSFSFFDL MDTLWREYRD DKSNTPEEHD EEPVERDGHE FFKHPPPPPP PGHHPHHPPP PGGGPPPPPI TITPSRPHPE DAPWEPEVVR WARDAAHMAE WWIADNHRRY GYKKP