Gene Information |
Locus tag | ANIA_02739 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001306 |
Strand | + |
Start bp | 2918374 |
End bp | 2921566 |
Gene Length | 3193 bp |
Protein Length | 901 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | DNA-directed DNA polymerase theta, putative (AFU_orthologue; AFUA_1G05260) |
Protein accession | CBF84114 |
Protein GI | 259486348 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
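The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (which matches the reported 3193 bp) and assumes the gene span runs from start codon to stop codon with no untranslated regions; the function name `span_length` is hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(2918374, 2921566)

# Assumption: the span covers start codon through stop codon only,
# so the coding portion is 3 bp per residue plus one stop codon.
protein_length_aa = 901
coding_bp = 3 * (protein_length_aa + 1)

# Under that assumption, the remainder would be intronic sequence,
# consistent with "Is gene spliced | Yes".
non_coding_bp = gene_length - coding_bp

print(gene_length, coding_bp, non_coding_bp)  # 3193 2706 487
```

If the span also included untranslated regions, the last figure would overestimate the total intron length.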
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.539021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.18376 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCTC ACAGAGCTAT CCAGAACAGT GGCTTTCAGA CGTCAGTCGA TATCGCCCGC CAGCAGCCGT TTGCCGTAGC CCCGCTGGCG GGCCAGAAAC GGCCCCCCAG CGGCGCTCTA AACGAGGATA CAGATAATCC CAGTTCTCAT GCAATAAATG GGACCTCACG TCAGAATCTG CCTAATGGCC AAGGATTGGA TTTTACCCGG CCTCAAGTGC ATCTACCGAA GAGCCGTTTG ATTGCTAGTG AGATATGTAC TGTTGCAGGT TCTGAGAATG AACAGAAACC ACGACCAGAA GATCCGTCAA AGTCTCAAGG TTCCCTTCAG TCGCTGAATG ACCCTAAATT CGGACTGCCT CCAGCTCTGG TGGCCAACTT TGCTGCTGCC GGTGTCACCA GCATCTACCA ATGGCAGGCA TCATGCTTGC TTGGTGAAGG CCTCCTGAAA GGGAAGCGGC ATCTAATCTA TACGGCACCC ACGGGCGGTG GGAAGTCGCT TGTGGCTGAT GTTCTGATGC TGAAGCGGAT CATCGAGAAC CCTACGCGCA AAGCGATTCT GGTCCTCCCG TACGTGGCCT TGGTCCAAGA GAAACTTAAG TGGCTCCGGC GCATAGTCCA AGATGTTGAA AAATACACCG TTGACGATGA ACACCCCGAC GCGAGTCATC ACCGTTGGAG GAAAATGCAG AAATCTATTC GCATTTCGGG TTTCTTTGGA GGGAGCAAAA CTACCGCCTC TTGGGAGGAT ACAGACATTG CAGTTTGCAC CATTGAAAAG GTTTGCTATC GCAGAATACT GGCATGACCT CGATCTGACA GTGTTAGGCG AACTCATTGA TCAACACTGC TATTGAGGAA TGCAGTATTG GAGAACTGGG GGCAGTCGTG TTGGATGAAT TGCATATGCT TGACGACGAG AATCGAGGAT ACTTGTTGGA ACTGATGGTG ACCAAGCTGC TTCTACTGCA GCAGGATATT CAGATCATTG GAATGAGCGC TACCATCTCG GTAAGTGACA TCCTAATTAC AAGATGCAAT ATCTGACAGG TCAAGAATAC GGAGCTGTTG GCAGACTGGA TTAATGCTAG ATACTTTGTA TCAACCTATC GTCCAGTGCC CGTGGACGAA TATCTTATCT ATGATAATGC GATCTACCCA GCCGCGACTT CGAGACAGCT CTTTCAGACA ATCTCGAAGC TGACAGCCAC AGGAGGACCC TTCTTGAGCG AGGCCGTGCC TCCCCAGCGC ACAATCAAAC CTTCTGCCTT CAGGGAATTA TCTAACCCGA TGTCAAACGC AATGGTAGCT ATGGCAATCG ATACGGTCAC TGCAGGATAC GGTGCTCTGG TGTTCTGTGG CAGTAGAGTA GCCTGTCAAG TCCACGCTGC GCTCATAAGC GAGGCAATGC CTGATCCAGG TACGCTTGGC GCAGAGGATC TGGGTAAGCG ACTCGACCTG CTGGCAGAGC TTCGCAGTCT TCCCAGCGGA CTGGATCCGG CTCTGGAGAA AACCCTCATC AAAGGCGTGG GATTCCACAG TGAGCAAGTT TCTGACCAGA CACTGAGGGC TCGATACTGA TCATGCTTAG ACGCAGGGAT GACGACCGAA GAGCGTGAAG CCATTGCACA GGCATATGAT CAAGGTGTTC TAAAGGTGCT GGTTGCCACC TGCAGTCTCG CTGCTGGCGT CAACCTTCCG GCGAGAAGAG TAATCATAAA TGGAGCACGC ATGGGCCGTG AACTAGTTGG GCCAGCAATG TTGTAAGTCG CCCAGGTACC AGCAGGATTA GGGGTTGCTA 
ATGGACAGGC GCCAAATGTG CGGTCGAGCC GGCCGCAAAG GCAAAGATGA GGCGGGTGAG ACATACCTTA TTGTCGGAAA ATCTGATCTC CAGGCTGTTT GCGACCTTCT GGAGGCCGAT ATGCCAGCAA TTGAAAGTTG TTTGGCGCCG GAAAAGAGAG GACTGAAACG GTAAGTGTCC ACTGCTGCCA GGTAAGAAAG TTGTTCACAT TCTTAGAGCA CTCTTGGAGG CAATTGCAAC CGGTCTTGTC TCAGGCGTTG CCGCCATCAA AGAATATGTG AAATGTACCC TTTTATATCG GACTGTTGAT AAGAAGCTGT CGTACAGCAT CATGGACTCA GCCCTTCAAG AGCTGGCAGA AGAAAAGCTC ATTCAACTGA ATGAAGACGA GTCTTATGTA GCCACTCAGC TTGGACAAGC CGTGGTTGCT TCTGCCTTTG CGCCAGATGA CGGTCTTTTC ATGTATGAGG AGCTGAAGCG AGCGCTCCAG GCTTTCGTGA TGGACGGCGA CATGCATGTT TTCTACATGT TTACTCCGCT CCAAGCCGCG GCACAGACTC AGATTGATTG GCCAACATTC AGGGACTTAT TGGATACCCT GGATGACAGT GGTATACGCG CTTTGCAGTT TGTTGGAGTA AACCCTGGCT TTGTGAACTC AATGTACGGT TATCCATAGA TCCATCTTGA ACTTCCGATA CTGACTTCTG CAGGGTTCAA AGTGGTGCAT CACTGAAAGA GGACACCCCG GAACAAGTGA CTCAAGCAAG GATATATCGG CGCGCATATA CAGCCTTCCA GCTCCGTGAT CTCAGCAACG AGGTTCCATT ACCTGTGATT TCAAGTCGGT ACAAGATTCC CCGTGGAACA ATCCAGACTC TAGCGCAGCA GTGTCATGGA TTCGCCGCGG GAATAGTGAA GTTTTGTCAG CGCATGGGCT GGGGTATGTT AGCCGCAGTT CTCGATCATA TGCGGGATCG GTTGGAAGCA GGTGCGCGAG CTGACCTTCT CGAAATGGCC CAAGTGACCT ATGTCAAAGG CTGGACGGCA AGGTTACTTC GCGACAATGG ATTTCGGAAC CTGAGAGCAT TAGCTGAGGC CGATCCCAAG GATGTTGTAC CCGTATTGAA GATGGTAAGG CACTTAGACT GTAACTCTAT TATCAAACTA ACCCTATTTC TAAAGGTTAA TCCTCGTAAG ACCCAGCGAA ACCAGCTTCA CCCAACTGAA GCTGAGCGCT ACGCTGGGAA GTTACTCGCT AAAGCAGAGG TCATTGTCGC ATCGGCTAAT AGGATTTGGG GTAAGCATAC TGAAACGCTT GATTTCGGTG TGTGAGGCTA ATCACGACCT TGTCTATCCA GAACGAGAAA TGCAGGTTGA TCTGGATGAG TGA
|
Protein sequence | MSAHRAIQNS GFQTSVDIAR QQPFAVAPLA GQKRPPSGAL NEDTDNPSSH AINGTSRQNL PNGQGLDFTR PQVHLPKSRL IASEICTVAG SENEQKPRPE DPSKSQGSLQ SLNDPKFGLP PALVANFAAA GVTSIYQWQA SCLLGEGLLK GKRHLIYTAP TGGGKSLVAD VLMLKRIIEN PTRKAILVLP YVALVQEKLK WLRRIVQDVE KYTVDDEHPD ASHHRWRKMQ KSIRISGFFG GSKTTASWED TDIAVCTIEK ANSLINTAIE ECSIGELGAV VLDELHMLDD ENRGYLLELM VTKLLLLQQD IQIIGMSATI SVKNTELLAD WINARYFVST YRPVPVDEYL IYDNAIYPAA TSRQLFQTIS KLTATGGPFL SEAVPPQRTI KPSAFRELSN PMSNAMVAMA IDTVTAGYGA LVFCGSRVAC QVHAALISEA MPDPGTLGAE DLGKRLDLLA ELRSLPSGLD PALEKTLIKG VGFHNAGMTT EEREAIAQAY DQGVLKVLVA TCSLAAGVNL PARRVIINGA RMGRELVGPA MLRQMCGRAG RKGKDEAGET YLIVGKSDLQ AVCDLLEADM PAIESCLAPE KRGLKRALLE AIATGLVSGV AAIKEYVKCT LLYRTVDKKL SYSIMDSALQ ELAEEKLIQL NEDESYVATQ LGQAVVASAF APDDGLFMYE ELKRALQAFV MDGDMHVFYM FTPLQAAAQT QIDWPTFRDL LDTLDDSGIR ALQFVGVNPG FVNSMVQSGA SLKEDTPEQV TQARIYRRAY TAFQLRDLSN EVPLPVISSR YKIPRGTIQT LAQQCHGFAA GIVKFCQRMG WGWTARLLRD NGFRNLRALA EADPKDVVPV LKMVNPRKTQ RNQLHPTEAE RYAGKLLAKA EVIVASANRI WEREMQVDLD E
|
| |
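The record reports a GC content of 51% but does not state how it was computed. A minimal sketch, assuming GC content is the fraction of G and C bases over all bases in the gene sequence, with the 10-base block spacing and line breaks ignored; `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a nucleotide sequence, ignoring whitespace."""
    # Remove the spaces and line breaks used to group bases into blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Small illustrative input, not taken from the record.
example = "ATGC GGCC"
print(f"{gc_content(example):.0%}")  # 75%
```

Passing the full "Gene sequence" field to `gc_content` and rounding to a whole percentage would give a value that can be compared with the reported figure.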