Gene ANIA_02534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_02534 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001307 
Strand
Start bp4199836 
End bp4201366 
Gene Length1531 bp 
Protein Length380 aa 
Translation table 
GC content53% 
IMG OID 
Productendoarabinanase (Eurofung) 
Protein accessionCBF87045 
Protein GI259487963 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCTGC CCACTCTTGC AGCTTCGGCG TCTCTCCTCG TAGGCGTGGC GCATGGCTAT 
GCGTCGCCCG GGGCGTGCTC GGGTGCGTGC AACATTCACG ACCCGGCTTT GATCCGCCGT
GAGTCTGATG GCAAGTATTT CCGCTTCTCA ACCGGTAACA AGATCTCTTA TGCGTCTGCT
TCCTCCATTG AGGGCCCATG GACAGCGATT GGGTCCGTCT TGCCGGGCGG TTCGTCGATC
GATCTGGATG GAAATGACGA TCTCTGGGTA AGTACCGGAG GATCGGCAGT TCGGCCTATT
TGGGCGAAAT AAGTGCTAAT GCATACTCTC GCAGGCTCCC GATGTCCAGC TCGTCAATGG
CGTATACTAT GTTCTCTATT CAGTTTCGAC CTTTGGGTCC CAGAATTCCG CGATTGGGCT
CGCGACTTCT GACACGATGG ACCTCAACAC CTGGACGGAC CACGGCTCGA CGGGCATCCG
GTCTGACTCC TCCAAGCCAT ACAATGCCAT TGATGGCAAC CTTTTCCAGG ATGATAGCGG
GACCTGGTAC ATGAACTTTG GGTCGTTCTG GAATGACATC TACCAAGCAC AGATGAAATC
TCCTCCCACA GCCGTCGCAT CGTCCTCGTA CCAGATCGCA TACCAGCCGG CTGGCGAGCA
CGCGGTTGAG GGCGCGTACT TGTACAAGTA CGGCAACTAC TACTACCTCT TCTTCTCCGA
GGGCAAATGC TGCGGCTATG ACTCTTCTAG GCCGGCTACT GGGGAAGAAT ACAAGATCAA
AGTGTGCCGT TCGACCACGG CCACTGGTAA CTTTGTAAGC TCTCCGCCTC GATAATGGAT
CGTGTTTTGG ACCCGCCTAA TTAGCCCAGG TTGATGCAAA TGGTGTTTCC TGCACTTCCG
GCGGTGGAAC AATCGTCTTG GAAAGCCACG ACAATGTCTA CGGACCTGGA GGACAGTATG
CCTCCCCAAT CCCACGAACT TTGGCAGAAA TGACTAATGT AAAACAGGGG TGTCTTCACC
GACCCGACGC TCGGCCCTGT GCTGTACTAC CACTATGTTG ATACCACTAT TGGCTACGCT
GATAGCCAGA AGCTCTTTGG ATGGAACGTT CTTGACTTCT CCAGCGGGTG GCCTGTTGTG
TAAGACTCGA TCGAGTATGC TCGAATCGCG GCGAAACTGT GTGTATTTAG TGGCTATGAA
GGTAACTGCA GGTGTCCTAT GATCCTGACT CAGCGTCCGC CAAGTAGACG ATCGTTCTCT
ATGTACGTGG TTGTAAGTGC TGCTCTGGCG TGTGTGATGA GACCACTGTA GACGGACACG
GTATAATGGG CACTGGGAGT CGTATAAAGT TTGTGTCTGC AAACCCTAAT ATAGCCCCTG
GTTACAGCGC GGCCAGAAGA AAAGAGAAAA TAGGTTTGCT CGCTGCTCTG TAGGTGATTA
TGGTAAGCGT GACCATTAGG GCAGGGGCAA CGGTGATGTC ACCGTTCTAA TGCTCTTTGC
CCCGGCCACT CCGGCTTTAT TTGTTGACTA G
 
Protein sequence
MYLPTLAASA SLLVGVAHGY ASPGACSGAC NIHDPALIRR ESDGKYFRFS TGNKISYASA 
SSIEGPWTAI GSVLPGGSSI DLDGNDDLWA PDVQLVNGVY YVLYSVSTFG SQNSAIGLAT
SDTMDLNTWT DHGSTGIRSD SSKPYNAIDG NLFQDDSGTW YMNFGSFWND IYQAQMKSPP
TAVASSSYQI AYQPAGEHAV EGAYLYKYGN YYYLFFSEGK CCGYDSSRPA TGEEYKIKVC
RSTTATGNFV DANGVSCTSG GGTIVLESHD NVYGPGGQGV FTDPTLGPVL YYHYVDTTIG
YADSQKLFGW NVLDFSSGWP VVCPMILTQR PPSRRSFSIP WLQRGQKKRE NRFARCSGRG
NGDVTVLMLF APATPALFVD