Gene ANIA_02946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_02946 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001306 
Strand
Start bp2247097 
End bp2250157 
Gene Length3061 bp 
Protein Length906 aa 
Translation table 
GC content50% 
IMG OID 
ProductDipeptidyl aminopeptidase [Source:UniProtKB/TrEMBL;Acc:Q7SI80] 
Protein accessionCBF83695 
Protein GI259486112 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAGCT CTGAAGATCG CGAGGACTCG GAGTTGCTTC CGGCAAATCG TCCTCGTTCC 
CCGTCCAGAA GCTCTTATGA CTCGGATGAC TCTGGATTAT CCGTCGATTC CATTCTCGAA
GAACAAAAGT ATAATGCCGC GACGAACGAG ACATTGGGGC TTCCCCAAGA AATGAGATAT
CACGATGAGG AGGGCGGAGA AGCTGGGTCT AATGAAGCCC TTCATACGAA GGCTTCCAGC
AGTCGTTCGC GACGGCTTCT CTGGTTGGTG GTCTTACTAT GCTGTGGGGG CTGGGTAGTG
GCATTTGTGT TGTTTATAAC ACAAGGGCGC GCGGATTATC GAACTGCGAC AGACGAGCTG
CAGTCGGATA ACTCTGGCTC GTTCTCAGAC GGTACAAGCT CGGGGAAACC TCTGACCTTA
CAACAAGTGC TTTCGGGAGT ATTCTTGCCT CGAGGTCATG CGATCTCCTG GGTTGCTGGT
CCTGATGGTG AAGATGGGCT GTTGATAGAA AGGGGAGAGG ATGATGAAGC GGGTTACCTG
CGTATTAATG ACATTCGCCA GGATGGCAAA GTCAACCGGG TACTAATGCA GAAACCTACC
GTGGGAGTTG ATGGGAGGAC GATTAAGCCA AGTGCCACGC GGCCTAGTCC TGATCTGAAG
AAAGTCTTGA TTATATCCAA TCAAGAGAAG AACTGGAGGC ACTCCTTCAC CGCGAGTTAT
TGGATTTTCG ATGTAGAGAC TCAGACTGCC GAACCTTTGG ACCCCAACAA CATTGATGGC
CGAGTACAGC TTGCTCTTTG GTCTCCCAAA TCTGACGCCA TTGCCTTTGT TCGAGACAAC
AATTTGTACT TAAGGAAGCT TTCTTCCGAA CGCGTTGTAC CCATTACCAA GGACGGTGGG
GAACAGCTAT TCTACGGTGT TCCCGACTGG GTATATGAAG AAGAAGTCTT CTCGGGAAAC
AGTGTCACCT GGTGGTCTGA AGATGGTTCT CAAATCGCCT TCATCAGGAC GAATGAGTCG
GCGGTACCCG AATTTCCCGT TCAATATTTC CTATCTCGAC CGTCAGGAAA GAAGCCACAG
CCAGGGTTAG AGAACTATCC GGAAGTAAGA GAAATCAAGT ATCCAAAGGC TGGCGCGCCC
AATCCGTTCG TTAATCTGCA ATTTTATGAC GTCGAGCAAG GTGAAGTCTT CTCCGTTGAT
ACGCCCGACG ACTTTGATGA TGACGATCGG CTTATCATTG AGGTGATATG GGCTGCTAAG
GGCAAGGTCC TTGTCCGGAC AACGAATAGA GAAAGCGACA TTTTGAAAGT GTTCTTGGTT
GACACTGAAT CAAGAGAAAG CAAGCTTATC AGGATCCAAG ATATTTCCGA GCTTGACGGT
GGTTGGGTTG AGCCTACACA GTCCGTGAGG TTCATCCCTG CCGACCCAGA TAAGGGCCGA
CCATTTGATG GATATCTTGA CACTGTGGTT CATGAGGGAT ACGACCATCT GGCTTACTTT
ACCCCTCTCG ATAATCCAGA GCCCATCATG CTCACTTCTG GCGAATGGGA AGTTGTGGAC
GCGCCTACTG CTGTAGACCT GACTCGTGGT CTGGTGTATT TCATCGCTAC TAAAGAAGCG
CCGACTGAAC GCCATCTATA CCGTGTCCGC TTAGATGGGT CTGATCTCAC GCCGTTGACT
GACACGTCTC AACCCGGCTA TTACAGTGTC TCGTTCTCGG ATGGTGCTGG ATATGCATTG
CTTAGTTACC AGGGTCCTTC CATTCCCTGG CAATCTATCA TCAGCACCGA AGGTGAAAAG
ACAACAACAC TCAGAATTAT CGAGGACAAC ACAGATTTGT CAAAGTTGGT GGCTCAATAC
GCTTTACCAA CAGAGAACTA CCAGAACATC ACCATTGATG GCTTCACGCT GCAAGTCGTC
GAGCGACGTC CACCCCACTT TAACCCAGCG CGAAAGTACC CTGTTCTATT CCACCTATAC
GGAGGCCCAG GGTCTCAAAC TGTGGACCGC CGTTTCAATG TAGATTTCCA GTCATATGTC
GCAGCAAGTC TTGGCTACAT TGTTGTCACC GTTGACGGGA GAGGCACCGG CTTCATCGGT
CGAGCAGCAC GCTGTATCAT CCGTGGCAAC ATCGGGCACT ACGAAGCCAT TGATCAAATT
GCCACGGCCA AAAATTGGGC ACAAAAGCCG TACGTCGACG AGTCCCGAAT GGCGATCTGG
GGCTGGAGCT ACGGCGGCTT TATGACGTTG AAGACTCTCG AACAAGACGC CGGCGAGACA
TTTCAGTACG GAATGGCTGT TGCGCCTGTC ACAGACTGGC GGTTCTATGG TAAGGCGCTC
AAACCCACCC ATTTCCTACT TGGACCATGT ATAGTCAACT AACCCAAAGC CCAATAGATT
CCGTCTACAC AGAACGCTAC ATGCACACCC CGCAGCACAA CCCAACAGGC TATGACAACA
CCAGCATATC GGACATGGCC GCCCTTCATA ACAATGTCCG CTTCCTTGTC ATCCACGGTG
CCTCAGACGA CAACGTCCAT ATCCAAAACA CGCTCACCCT AATTGATAAG CTGGATCTCG
CGAGCGTCCA GAACTATGAT GTGCATTTCT ACCCTGACTC CGACCATAGC ATCTTCTTTC
ACAATGCCCA TACTATGGTT TATGAACGTA TGTCACCAAT TTTCTATCCA TAACAACCCC
TGCCCTTAAC GGGTTCAAGA CTTCAGCTAA TGGTTGATTG TAAACTAGGC CTTGCAAGCT
GGCTTGTCAA CGCGTTCAAT GGCGAGTGGC ATCGGACAGC AAACCCCGTT CCAGACGAGT
CGATGTTGAG GCGGCTTGCG AAGAGGGTTT GGCCTGGTTT TGCGCATTGA TTTGCTAAAT
ATTGGAATAG GATATCTAAA CGTGTATATG AGTATGAAGC CGGACAACCT TTAAGCTTTG
CTTTAACAGT GATCGTTTTT TACGAAGGCG TTCTTTAGGA AGCGTTTCAT GTTTGTTCTT
AATCTTTTCA AAACAAATGC GTAACACACG CGGTCTATAT GATCGGCAAA GTCCCTAAGA
G
 
Protein sequence
MRSSEDREDS ELLPANRPRS PSRSSYDSDD SGLSVDSILE EQKYNAATNE TLGLPQEMRY 
HDEEGGEAGS NEALHTKASS SRSRRLLWLV VLLCCGGWVV AFVLFITQGR ADYRTATDEL
QSDNSGSFSD GTSSGKPLTL QQVLSGVFLP RGHAISWVAG PDGEDGLLIE RGEDDEAGYL
RINDIRQDGK VNRVLMQKPT VGVDGRTIKP SATRPSPDLK KVLIISNQEK NWRHSFTASY
WIFDVETQTA EPLDPNNIDG RVQLALWSPK SDAIAFVRDN NLYLRKLSSE RVVPITKDGG
EQLFYGVPDW VYEEEVFSGN SVTWWSEDGS QIAFIRTNES AVPEFPVQYF LSRPSGKKPQ
PGLENYPEVR EIKYPKAGAP NPFVNLQFYD VEQGEVFSVD TPDDFDDDDR LIIEVIWAAK
GKVLVRTTNR ESDILKVFLV DTESRESKLI RIQDISELDG GWVEPTQSVR FIPADPDKGR
PFDGYLDTVV HEGYDHLAYF TPLDNPEPIM LTSGEWEVVD APTAVDLTRG LVYFIATKEA
PTERHLYRVR LDGSDLTPLT DTSQPGYYSV SFSDGAGYAL LSYQGPSIPW QSIISTEGEK
TTTLRIIEDN TDLSKLVAQY ALPTENYQNI TIDGFTLQVV ERRPPHFNPA RKYPVLFHLY
GGPGSQTVDR RFNVDFQSYV AASLGYIVVT VDGRGTGFIG RAARCIIRGN IGHYEAIDQI
ATAKNWAQKP YVDESRMAIW GWSYGGFMTL KTLEQDAGET FQYGMAVAPV TDWRFYDSVY
TERYMHTPQH NPTGYDNTSI SDMAALHNNV RFLVIHGASD DNVHIQNTLT LIDKLDLASV
QNYDVHFYPD SDHSIFFHNA HTMVYERLAS WLVNAFNGEW HRTANPVPDE SMLRRLAKRV
WPGFAH