Gene ANIA_02739 details

Gene Information

Locus tag: ANIA_02739
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001306
Strand:
Start bp: 2918374
End bp: 2921566
Gene Length: 3193 bp
Protein Length: 901 aa
Translation table:
GC content: 51%
IMG OID:
Product: DNA-directed DNA polymerase theta, putative (AFU_orthologue; AFUA_1G05260)
Protein accession: CBF84114
Protein GI: 259486348
COG category:
COG ID:
TIGRFAM ID:
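
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is an illustrative check, not part of the database record. It assumes the start and end positions are 1-based and inclusive (consistent with the listed 3193 bp), and that the gene span contains only coding sequence, a stop codon, and introns with no untranslated regions. The function names are hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def estimated_intron_total(gene_len_bp: int, protein_len_aa: int) -> int:
    """Rough intron total for a spliced gene.

    Assumes the gene span is coding sequence plus one stop codon plus introns,
    with no UTRs included. This is an assumption, not a stated property of the record.
    """
    coding_bp = 3 * protein_len_aa + 3  # three bases per residue, plus the stop codon
    return gene_len_bp - coding_bp


gene_len = feature_length(2918374, 2921566)       # values from the record above
introns = estimated_intron_total(gene_len, 901)   # 901 aa from the record above
print(gene_len, introns)
```

Under these assumptions the span works out to 3193 bp, matching the listed gene length, and about 487 bp of that would be intronic, which fits the "Is gene spliced: Yes" field.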


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.539021
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.18376
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCTC ACAGAGCTAT CCAGAACAGT GGCTTTCAGA CGTCAGTCGA TATCGCCCGC 
CAGCAGCCGT TTGCCGTAGC CCCGCTGGCG GGCCAGAAAC GGCCCCCCAG CGGCGCTCTA
AACGAGGATA CAGATAATCC CAGTTCTCAT GCAATAAATG GGACCTCACG TCAGAATCTG
CCTAATGGCC AAGGATTGGA TTTTACCCGG CCTCAAGTGC ATCTACCGAA GAGCCGTTTG
ATTGCTAGTG AGATATGTAC TGTTGCAGGT TCTGAGAATG AACAGAAACC ACGACCAGAA
GATCCGTCAA AGTCTCAAGG TTCCCTTCAG TCGCTGAATG ACCCTAAATT CGGACTGCCT
CCAGCTCTGG TGGCCAACTT TGCTGCTGCC GGTGTCACCA GCATCTACCA ATGGCAGGCA
TCATGCTTGC TTGGTGAAGG CCTCCTGAAA GGGAAGCGGC ATCTAATCTA TACGGCACCC
ACGGGCGGTG GGAAGTCGCT TGTGGCTGAT GTTCTGATGC TGAAGCGGAT CATCGAGAAC
CCTACGCGCA AAGCGATTCT GGTCCTCCCG TACGTGGCCT TGGTCCAAGA GAAACTTAAG
TGGCTCCGGC GCATAGTCCA AGATGTTGAA AAATACACCG TTGACGATGA ACACCCCGAC
GCGAGTCATC ACCGTTGGAG GAAAATGCAG AAATCTATTC GCATTTCGGG TTTCTTTGGA
GGGAGCAAAA CTACCGCCTC TTGGGAGGAT ACAGACATTG CAGTTTGCAC CATTGAAAAG
GTTTGCTATC GCAGAATACT GGCATGACCT CGATCTGACA GTGTTAGGCG AACTCATTGA
TCAACACTGC TATTGAGGAA TGCAGTATTG GAGAACTGGG GGCAGTCGTG TTGGATGAAT
TGCATATGCT TGACGACGAG AATCGAGGAT ACTTGTTGGA ACTGATGGTG ACCAAGCTGC
TTCTACTGCA GCAGGATATT CAGATCATTG GAATGAGCGC TACCATCTCG GTAAGTGACA
TCCTAATTAC AAGATGCAAT ATCTGACAGG TCAAGAATAC GGAGCTGTTG GCAGACTGGA
TTAATGCTAG ATACTTTGTA TCAACCTATC GTCCAGTGCC CGTGGACGAA TATCTTATCT
ATGATAATGC GATCTACCCA GCCGCGACTT CGAGACAGCT CTTTCAGACA ATCTCGAAGC
TGACAGCCAC AGGAGGACCC TTCTTGAGCG AGGCCGTGCC TCCCCAGCGC ACAATCAAAC
CTTCTGCCTT CAGGGAATTA TCTAACCCGA TGTCAAACGC AATGGTAGCT ATGGCAATCG
ATACGGTCAC TGCAGGATAC GGTGCTCTGG TGTTCTGTGG CAGTAGAGTA GCCTGTCAAG
TCCACGCTGC GCTCATAAGC GAGGCAATGC CTGATCCAGG TACGCTTGGC GCAGAGGATC
TGGGTAAGCG ACTCGACCTG CTGGCAGAGC TTCGCAGTCT TCCCAGCGGA CTGGATCCGG
CTCTGGAGAA AACCCTCATC AAAGGCGTGG GATTCCACAG TGAGCAAGTT TCTGACCAGA
CACTGAGGGC TCGATACTGA TCATGCTTAG ACGCAGGGAT GACGACCGAA GAGCGTGAAG
CCATTGCACA GGCATATGAT CAAGGTGTTC TAAAGGTGCT GGTTGCCACC TGCAGTCTCG
CTGCTGGCGT CAACCTTCCG GCGAGAAGAG TAATCATAAA TGGAGCACGC ATGGGCCGTG
AACTAGTTGG GCCAGCAATG TTGTAAGTCG CCCAGGTACC AGCAGGATTA GGGGTTGCTA
ATGGACAGGC GCCAAATGTG CGGTCGAGCC GGCCGCAAAG GCAAAGATGA GGCGGGTGAG
ACATACCTTA TTGTCGGAAA ATCTGATCTC CAGGCTGTTT GCGACCTTCT GGAGGCCGAT
ATGCCAGCAA TTGAAAGTTG TTTGGCGCCG GAAAAGAGAG GACTGAAACG GTAAGTGTCC
ACTGCTGCCA GGTAAGAAAG TTGTTCACAT TCTTAGAGCA CTCTTGGAGG CAATTGCAAC
CGGTCTTGTC TCAGGCGTTG CCGCCATCAA AGAATATGTG AAATGTACCC TTTTATATCG
GACTGTTGAT AAGAAGCTGT CGTACAGCAT CATGGACTCA GCCCTTCAAG AGCTGGCAGA
AGAAAAGCTC ATTCAACTGA ATGAAGACGA GTCTTATGTA GCCACTCAGC TTGGACAAGC
CGTGGTTGCT TCTGCCTTTG CGCCAGATGA CGGTCTTTTC ATGTATGAGG AGCTGAAGCG
AGCGCTCCAG GCTTTCGTGA TGGACGGCGA CATGCATGTT TTCTACATGT TTACTCCGCT
CCAAGCCGCG GCACAGACTC AGATTGATTG GCCAACATTC AGGGACTTAT TGGATACCCT
GGATGACAGT GGTATACGCG CTTTGCAGTT TGTTGGAGTA AACCCTGGCT TTGTGAACTC
AATGTACGGT TATCCATAGA TCCATCTTGA ACTTCCGATA CTGACTTCTG CAGGGTTCAA
AGTGGTGCAT CACTGAAAGA GGACACCCCG GAACAAGTGA CTCAAGCAAG GATATATCGG
CGCGCATATA CAGCCTTCCA GCTCCGTGAT CTCAGCAACG AGGTTCCATT ACCTGTGATT
TCAAGTCGGT ACAAGATTCC CCGTGGAACA ATCCAGACTC TAGCGCAGCA GTGTCATGGA
TTCGCCGCGG GAATAGTGAA GTTTTGTCAG CGCATGGGCT GGGGTATGTT AGCCGCAGTT
CTCGATCATA TGCGGGATCG GTTGGAAGCA GGTGCGCGAG CTGACCTTCT CGAAATGGCC
CAAGTGACCT ATGTCAAAGG CTGGACGGCA AGGTTACTTC GCGACAATGG ATTTCGGAAC
CTGAGAGCAT TAGCTGAGGC CGATCCCAAG GATGTTGTAC CCGTATTGAA GATGGTAAGG
CACTTAGACT GTAACTCTAT TATCAAACTA ACCCTATTTC TAAAGGTTAA TCCTCGTAAG
ACCCAGCGAA ACCAGCTTCA CCCAACTGAA GCTGAGCGCT ACGCTGGGAA GTTACTCGCT
AAAGCAGAGG TCATTGTCGC ATCGGCTAAT AGGATTTGGG GTAAGCATAC TGAAACGCTT
GATTTCGGTG TGTGAGGCTA ATCACGACCT TGTCTATCCA GAACGAGAAA TGCAGGTTGA
TCTGGATGAG TGA
 
Protein sequence
MSAHRAIQNS GFQTSVDIAR QQPFAVAPLA GQKRPPSGAL NEDTDNPSSH AINGTSRQNL 
PNGQGLDFTR PQVHLPKSRL IASEICTVAG SENEQKPRPE DPSKSQGSLQ SLNDPKFGLP
PALVANFAAA GVTSIYQWQA SCLLGEGLLK GKRHLIYTAP TGGGKSLVAD VLMLKRIIEN
PTRKAILVLP YVALVQEKLK WLRRIVQDVE KYTVDDEHPD ASHHRWRKMQ KSIRISGFFG
GSKTTASWED TDIAVCTIEK ANSLINTAIE ECSIGELGAV VLDELHMLDD ENRGYLLELM
VTKLLLLQQD IQIIGMSATI SVKNTELLAD WINARYFVST YRPVPVDEYL IYDNAIYPAA
TSRQLFQTIS KLTATGGPFL SEAVPPQRTI KPSAFRELSN PMSNAMVAMA IDTVTAGYGA
LVFCGSRVAC QVHAALISEA MPDPGTLGAE DLGKRLDLLA ELRSLPSGLD PALEKTLIKG
VGFHNAGMTT EEREAIAQAY DQGVLKVLVA TCSLAAGVNL PARRVIINGA RMGRELVGPA
MLRQMCGRAG RKGKDEAGET YLIVGKSDLQ AVCDLLEADM PAIESCLAPE KRGLKRALLE
AIATGLVSGV AAIKEYVKCT LLYRTVDKKL SYSIMDSALQ ELAEEKLIQL NEDESYVATQ
LGQAVVASAF APDDGLFMYE ELKRALQAFV MDGDMHVFYM FTPLQAAAQT QIDWPTFRDL
LDTLDDSGIR ALQFVGVNPG FVNSMVQSGA SLKEDTPEQV TQARIYRRAY TAFQLRDLSN
EVPLPVISSR YKIPRGTIQT LAQQCHGFAA GIVKFCQRMG WGWTARLLRD NGFRNLRALA
EADPKDVVPV LKMVNPRKTQ RNQLHPTEAE RYAGKLLAKA EVIVASANRI WEREMQVDLD
E
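
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The sketch below shows one way to strip that layout and compute a length and GC fraction. It is a minimal illustration: the function names are hypothetical, and it assumes the input contains only sequence characters and whitespace.

```python
def clean_blocks(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string.

    Removes the spaces between 10-character blocks and all line breaks.
    """
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from the record, used as a short example.
first_line = "ATGTCTGCTC ACAGAGCTAT CCAGAACAGT GGCTTTCAGA CGTCAGTCGA TATCGCCCGC"
joined = clean_blocks(first_line)
print(len(joined), round(gc_fraction(joined), 2))
```

Applied to the full gene sequence, the joined length should equal the listed gene length and the GC fraction can be compared with the listed GC content. The same cleaning step works for the protein sequence, where only the length check applies.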