Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_10964 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001304 |
Strand | + |
Start bp | 2001193 |
End bp | 2004384 |
Gene Length | 3192 bp |
Protein Length | 979 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | conserved hypothetical protein |
Protein accession | CBF79686 |
Protein GI | 259483911 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.296574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.628256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAGC CGAGTCGACC TGCAACTGCC AGATTTCCTC CTTCCCGGCA GGTCCGGGGC ACGAGGCTTA ATACTATAAT CGAGGACGCG CGCGAGACGC AATATGCAGA CAATCGGGGG CCGAGTCCTA CTGACAATGG TCCGAAGAGT CCCACGCCGC GACTGAAGCT GAAGACGACC GGGCTGTCGT TGCCGCTGGT TAGACAACAA CGGAAGTTTC TGTCGCCGCT GTCGGCTGGT TCAGTCTCGT CGTGTTCAGA TGTGGATTGG CAGAATCAGA TGAGGACTTT TGATGAACTT TATGATGCTA CGGACGATGA GTCGGACTTC AGTGATGAGT GCATGTCATT CACAAGTACC CGGCCTACCA GCCTGACAAC ACCGACCACG AGGGCGGACT CGGTGACCTC TTCGAATTCT CGGAGGCGGT ATCCGGCTCT CTCTATCCCA ACGTCGAGTA TATGGCCTTC GCTCAATGGG GCTCCGAAGA GCTCGCCCGT TCCACCAACT CCTCCGCAGA GAATCCCCGT TTCTCCTGCC GCCCTCTCGC TTTTGTCACG TTCAGTACCG GCAATGCACG CGACTCCGTC GTTGGATGGC AGCGTGTCAT CGGACCAGGT GTCTAATCTG AGCACTCCAT CGACTCCGTC CTTGCGGTCT CTTCCAGACA CCGACTGGAA CTCGCGCGAG ATCCATGTTC TTCCAGACCT CGACGATTCG CAGCACCCGA ACATTGCGAA CCCTGAGACT GAGGAAGTAC CGAGCATCGA AATTCCTATC GAAGATGCGG ACGACGACTG GCGACGCTTC ATTGGGGAGT TTCCGCAGAT TCCTGGCCAA ACTACATCGC AAGCCGGCTA TGTTGGGGTT GAGCCAGCCC GCGAGGACAC CCCTTCTGAC CCAGGCGTTG CTCTTCCTGA AGGTGCTCTG GCAACGTTGC AGTTTATTCC TCTGGAAGGA ACGCCGGAAC CCTGGTCGGA GACTTCGGAA CCGAATGAGG AAATGTGGCA GGTTGCAGCT CCGCCAGAAC CACGCCGGCT TGACGATGAA ACGCCAGTCT CTGAGTTGTC AGAGTATAGC TTCACTGGAC TGAGCATACC ATCTCCTGGA GGATTCTTCA ACTCACTGGC CCCTCGGGCT CGCCATACCT GGTCACTTCC GAAGCTCAAC CAGCCGCCTA CCTCAGCCAC TGCTGAGAGA TTCTACAACC TTCCGTTCAA CAGGGAGGAA GGGGAAATTA TTGAGCAAGT CATTGATCTC CCAGAGAGAT TGAATGATGA GCAGCTGACA GCAATCTACG CACCACCGAC AGCAATTAAG ATCCCAGAGA GTCCCGCACA CCCACCAACG GAGGGTTCCA TTAGTCCTGT CAGCGAACGT GTGCACGAGA TTTCGAGACC TGCTACCGCC TACGACCCCG ACGAGCAAGA CGAGAATTAT GCGGAGGAGC TGCACAAAAA AGCGCTGTCC AGTTTGGACA GAACTAGCGT CTGGCTGGCG GCACAAGCTT CGTATCTTGC GGCGCTCAGA GAAACGAACC CCGTCAACAA CCTTCCAGAT GAAGACGAGC GGCCTCAAGA CGTCGACGAG AGCCCCCAAC ACGTTTCTCC GGCACTCGAG CGAAATGCTT CCGTTTGCTT CACTGGGATG TTTCCGGAAC CTCCCAGTTC GCTCCCTGCT GCTAATGCGA GCAAGGATTC AATCTATTGG CGCGGATTCC GTTTCTTGCT TGATCAGTCC AGAAGCCGGG ATACCTTTGT GCATCGTAGT ACTCGCTTTG ACGCTGTGCA ATCTTTCCGT CTTGGTCTGT CTGGTTTGCA CAACAAGTGC CTGCTTGGGA ACTATGAGTT GGTGCTTCCA GACCGACCCG CCTACAGTGG GCCGTTCGCG AAAGCGCCAC GCCATTCTGT TCTTCCCGGA ATTCTCCAAC AAAAGGCTGA GTTCTCCATG ATTGAGAAGG AGCAACTTGT CCTTTCCCAA ATCAGCCAGC CAATGTGGGC CATGGAAGCC TTGCGATACC TCCAAGGGGG CAATCTTGTT GTGAGTCCTG CTCGGAAGCG CTTCTCGAAA CGTGCGACAG CCGCAGCCCC CCACAAGACC CCCAAGCGCC GTCAAGTGAG GGTTCTTGAT CTGGGTGGTC ATGCCACAGC CGAGTGGGCC TGGCATCTCG CGCACGACTA TCCTCATGTC AAGGTCTACA CTGTGTACAC AGAGCACCAG CAAGTCAACA AAGCCATCAA GGGCCCCCCG AACCACCGTC ACATTCAAGT GCCCCAGCTA TGGAAGCTCC CCTTCCCTGA CAATAAGTTC GACGTGATCT CAGCCCGCTC CCTACCTGCA TTCCTGAAGA CGGAGCGTCC GGCTGGAGAT TGTCTAGACG AGTACGATCT TTGTCTGAAG GAATGCCGTC GCTGTCTCAA GCCAGGCGGG TATCTAGAGT ACCTCGTGAT GGACGCCGAG ATAGCTCGCG CAGGTCCATA CGCATCTGCA ACATCCATCG AGTTCTCGTT CAGTTTGAAA ATACGAGGTT ACGACCCAGT TCCAACGAAG CAATTCGTTG GTCGCCTGCG CAAGCAAGGC TTCGTGGGCA TCAAACGAGG CTGGATGTAT CTACCAATGG GAACAGAGCC ACCAAAGCCT CAAGTTCCAA GAGAGACTCC TGATCCCAGG GTCAAGAGTC TGATTGAGGA CTACGAGGCG GTTCAGGGGC CGTTGGGAAG CACGGCTGAT ATTGCGTCCA TAACAGGTTT ACTGGGTGGT TGGATCTGGG AACAGTGGCT ACTTAAGCTC CAGGTTGAGA TGGGTCGAGA TAAGAGCAAA TTGCTTGAGG GTATCGGGAG TATGTTTGAC GAGGGGCGGA AGAATGGGTC TGGCTGGACA TGCTTGTCTG GTTGGGCGAT GAAACCATTA AAGAAGCAGA GTGACGCTAC AGGTTTTTGA GGGCCTCTAT GAATCTTCGG TTGCGGCTTG CGCCCATTTC ACTCTCTAAA TTTTCATATT CAGTCCTGGT GGTTGTCTTT GTTCCCTCTA TCGTCCTGGT GGCGATCGAT CATTATCTGA TAACTAAACA TTTTTTGGCG TGGTTCTGTT GAGCGTTCTG TGATGGGTGA GATCTGAGAT GGCAAGAGTA CGACGGCATG AGACTTGATA CCAATGACCT AGTATAACAT GAAAAAAACC CTATCTGTTA GA
|
Protein sequence | MAEPSRPATA RFPPSRQVRG TRLNTIIEDA RETQYADNRG PSPTDNGPKS PTPRLKLKTT GLSLPLVRQQ RKFLSPLSAG SVSSCSDVDW QNQMRTFDEL YDATDDESDF SDECMSFTST RPTSLTTPTT RADSVTSSNS RRRYPALSIP TSSIWPSLNG APKSSPVPPT PPQRIPVSPA ALSLLSRSVP AMHATPSLDG SVSSDQVSNL STPSTPSLRS LPDTDWNSRE IHVLPDLDDS QHPNIANPET EEVPSIEIPI EDADDDWRRF IGEFPQIPGQ TTSQAGYVGV EPAREDTPSD PGVALPEGAL ATLQFIPLEG TPEPWSETSE PNEEMWQVAA PPEPRRLDDE TPVSELSEYS FTGLSIPSPG GFFNSLAPRA RHTWSLPKLN QPPTSATAER FYNLPFNREE GEIIEQVIDL PERLNDEQLT AIYAPPTAIK IPESPAHPPT EGSISPVSER VHEISRPATA YDPDEQDENY AEELHKKALS SLDRTSVWLA AQASYLAALR ETNPVNNLPD EDERPQDVDE SPQHVSPALE RNASVCFTGM FPEPPSSLPA ANASKDSIYW RGFRFLLDQS RSRDTFVHRS TRFDAVQSFR LGLSGLHNKC LLGNYELVLP DRPAYSGPFA KAPRHSVLPG ILQQKAEFSM IEKEQLVLSQ ISQPMWAMEA LRYLQGGNLV VSPARKRFSK RATAAAPHKT PKRRQVRVLD LGGHATAEWA WHLAHDYPHV KVYTVYTEHQ QVNKAIKGPP NHRHIQVPQL WKLPFPDNKF DVISARSLPA FLKTERPAGD CLDEYDLCLK ECRRCLKPGG YLEYLVMDAE IARAGPYASA TSIEFSFSLK IRGYDPVPTK QFVGRLRKQG FVGIKRGWMY LPMGTEPPKP QVPRETPDPR VKSLIEDYEA VQGPLGSTAD IASITGLLGG WIWEQWLLKL QVEMGRDKSK LLEGIGSMFD EGRKNGSGWT CLSGWAMKPL KKQSDATGF
|
| |