Gene ANIA_05101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_05101 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001305 
Strand
Start bp955253 
End bp956453 
Gene Length1201 bp 
Protein Length311 aa 
Translation table 
GC content49% 
IMG OID 
Productmetalloprotease MEP1 (AFU_orthologue; AFUA_1G07730) 
Protein accessionCBF80862 
Protein GI259484546 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00000175375 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTCAAC TTCGTCGACT CCAGGACCTG GTTCTTATGC TGGCCTTCCT TCAGCAAACC 
TGTCTTGCTG TTCCTCGTTG GGGCAGGGGA TATTGTGCTA CTGCAGGCCC AGATGAATCG
TTGAAGGCGG AATTTAGAAA ATTGAGCGCT CTCGAAAATG ATGGCATAGT CGAGCAAGGA
AGTCGTAAGG CGCTGGAGCC CATTGAGATA GAGGTATGGT TCCATGTCGT GAGCAGCAAA
GCGAGTGGCG ACGTGGTTTC GGACGGTATG ATTGCTACTC AGGTTAGTTC TGGATCTTTA
TTCGTGTGCC AGCCCACCTA TTTTTCCATC ACATAATGTT GATTAATGGA CTAACTATAA
TAAGTTATCT TACCTCCAAG ATGCATATCA AAACGCTTCA ATAAGCTACC GTCTCGAAGG
GGTAACGCGC CATATCAACG ATAAATGGGC GCGTAATGAA GACGAGCTTA GCATGAAAGA
TGCCCTCCGT AGAGGCAGCT ACCGAACCCT CAATGTCTAC TTCCAGTCCG ATCTCCAAGT
TCTCTCAGGC TCCGAATCTC AGGGTCGTCT GCTCGGTACT TCGGAACAGT TATCAGCAAG
CGTTCTCGGC TTCTGCACTT TACCCGACCC GAGTATTAAC AGTACTAGTC TGCGTTCCAG
CTATGTGAAG GACGGATGCA ACGTGCTTGC AAAAACTATG CCAGGGGGGT CTCTAACGCA
TTATAACCGA GGCGGAACCG CCATACACGA AATTGGTCAC TGGAACGGAC TCCTGCACAC
TTTCGAGGGG GAGTCTTGCT CCCTTGACAA CGAGGGTGAT TATATAGAAG ACACACCCCA
GGAGTCTATT CCGACCGATG GATGTCCTGC TCGCAAAGAC TCATGCCCAG GAAGCCCGGG
TGTGGACCCT GTACACAACT TTATGGATTA TTCTTCTGAT GAGTGTTACG AGCACTTCAC
GCCGGCCCAG GTTAAGAGGA TGCGTGACAT GTGGTTCACG ATGAGGGAAG GGAAATGATA
AAAATAGCCT CACTGGTTCA CCATGGTACC ACCGACATGT AACAAGTAAT GACGGGGACA
GGGAATTGGT GCAAATTGAT ACATAGCGGA CCCCATGGAC AGCGCTTTGA TGCCCTATGT
GGCCGTACTG CATGATTCAT TCATATCTGT TTGACGGTCA ATGAGAAAGC CCTTTTCCTG
G
 
Protein sequence
MLQLRRLQDL VLMLAFLQQT CLAVPRWGRG YCATAGPDES LKAEFRKLSA LENDGIVEQG 
SRKALEPIEI EVWFHVVSSK ASGDVVSDGM IATQLSYLQD AYQNASISYR LEGVTRHIND
KWARNEDELS MKDALRRGSY RTLNVYFQSD LQVLSGSESQ GRLLGTSEQL SASVLGFCTL
PDPSINSTSL RSSYVKDGCN VLAKTMPGGS LTHYNRGGTA IHEIGHWNGL LHTFEGESCS
LDNEGDYIED TPQESIPTDG CPARKDSCPG SPGVDPVHNF MDYSSDECYE HFTPAQVKRM
RDMWFTMREG K