Gene ANIA_05157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_05157 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001305 
Strand
Start bp1112531 
End bp1115618 
Gene Length3088 bp 
Protein Length993 aa 
Translation table 
GC content52% 
IMG OID 
Productarmadillo repeat protein (AFU_orthologue; AFUA_1G07050) 
Protein accessionCBF80979 
Protein GI259484609 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.304489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGCG CCGAGGCGCC GCCTATCTTC CTCCAGCTAC AAAATGCGGA TTCCTTGTCA 
TCACAAGCTG CTGCTCTGAG AGCCCTGAAG AATGAGACAA TTGGCCATGA TCAAAGGAAA
GAGGCCTGGG TACGATTGGG GCTCATTCCC ATACTTTCCA ACGTGCTTGC GTCTCGGGCA
CTCGACAAGA GCGAGCTCAA TAACGGCACC AAGCAGCCCG AGTTGCCTGG CTCTAGAGAA
GAATCGGATG ATGTTTGCTT ACAGGCAATA ATTCTTGTTG GGAGCTTAGC GCAAGGTACT
AATTATTTTT TTTTGATCAC TCGGCATGCA ATCACTAACG TCCCAGCAGG AGGCACACCT
TTCCTATCGC CGATCTTATC GAGCAATATA CTTCCGATAC TCCTCTCCAT TCTATCATCC
AACTGCCCTT CTTCCTTCGT TCTTCCTATT CTCCGGGTTT TGAATAGTGT GGCGGATAGA
TTGCCTCTAC AGAGCCAGCA ACAATGGCCC AGGGATACTC GCTTGGCGGA CATTCTTTTC
TCAACGGAGC ACATTGGCTG CTTGACCCGC ATCCTTGGCC AGGATTACAG CAGCCACAGT
CGGCTGACTG CGATTGAACT GGCTGCAGGT CTTATTGGGA AGCTGTGCAC AGAGGAAAGC
CACAAGGCTG TTCTGGCTGA AAGTGGTGTT TTAGACGCTC TGGCGGTCAA AGTCGCATCG
TTTATAGTTG CGCAGGGATT CGTTTTCCCC GGCGCAGAGA GCCACCTAGA TGATGTAGGC
GCTCTGGGGT CACTGCCACC TCCTGCGCCC CGCGGGGCTA AGCTTGCGCC CATTTTACGT
GCTGTGACGG TCATCGTTGA GCATTCCAAG TGGCGAGCGG AGCATTTTCT CTCTTCTCCA
GGTATAGTTA CTGTGTTTCC ACGGCAAATA CCAGGCTTTT CCCCATCGGA TATCAAGAAG
GGCCCTTGGG GCTCCACTTA TTTTTCAGGG TCCGCGGTGC CACGGCACCT TGGAGGGACG
CCTCTAGAGT ATCTTCTTCC ATCTATTCCT TTGTCACAGT TGAAGCCCTC TGCTAGCTCA
TCCAACTTTC CACCGCTAGG TCAGTATGGG CAGCATCGCC GACAGAGCCA TTCATTTCCC
ACCCCGCTGT CCAGTTTCGA ACCGCCCACG GCTGAGGACG ATGAGAATCC GGTTGTCCCC
TGGCTGCTAT ACCTCGTCCG TGCTGAGAGC GGCATGGCTC GTCTGATGGC AGCCCGCTTT
GTGACGGTAT TATGCCGCCT GGGACTAACC AAAAAGCACA GGATCTCCAT GCTCTGCTAT
CTGTTAATCC CGATTCTGCT TCGCATGCTC GATAAGGACT ACGAGGCCTC TGACGACGGT
GTCCAATACG GTGGACTTAT TTCTTCCTCG CAACGCATTA AGGAGGAAGC TCCGGGTGTG
CTGGCCACCT TACTTGTTGA TGATCGAGAA CTGCAGAAAC ATGCGGTTGA GGGGGATGCG
ATCAAGCGAC TATCCCAGCT TCTCAAAGAA ACTTATAATC CAATCCATGA GCCAGCTCGA
ACAATGTGGC ATGCTGAAGG CCAACCGAAG GTTGAGGACC ATGACTCGCA GCCGGCGGAG
TGTCGATTAG GCCCTCCTGG ATACTCACCC CTCCGTTACC ATATCTTGAG ATATCGGGAA
AATATATTGA AAGCCTTGGC TGCACTGGTT CCTTTCAAGG ACGAGTATCG CAAGGCGGTA
TGCGAGCACG GTGTTGTGCC ATATATCATT GATTCTCTCA AACCCTTCCC AGACCAAATA
CCAGCAGAGT CCTCCGATCC AGGAAACACT GCTGCTGACG GCAACCCAAC ACCGACCCTT
CTGGCAGCCT GTGGTGCAGC CCGCATGCTC ACTCGCTCCG TTAGCGCTTT ACGAACGAGC
TTGATTGACG CCGGCGTCTC AACCCCGCTT TTCGCTTTGA TTCGACATCC TGATATTGAG
GTGCAAATTG CCGCGACCTC AGTAATCTGC AATCTTGCTC TAGATTTCAG TCCTATGAAA
GAGGTACAAT CTGCTCGGCC CTTGTGACCT GAAGCTGCTA ACGTATGTCC TATAGGCAAT
TATATCGGCC GAAATTCTTC CCATTCTGTG TGAGCATGCA CACTCATCGA ATACTAAACT
TCGGATTGAA TCATTATGGG CGCTAAAGCA CGTCGCCTAT AACTCGGCAA ATGACGTCAA
AATCAAGATC ATCGAGGGCT TGGGGCCGGA ATGGATTAAA CAAGTTATTA CTCAGGATCC
GACAAGTGTT CTCGCGAAGC GTGGGCTTGA GGACGATACA GACAGTAACA CTCCAAGCGG
GATGAGTCGG GCCAATTCAG CTGGCGAACG GGTAGACTTG CTGAATCCGA TGGATGACTT
CCGGGAGAGG GATGAGGACA TGAAAATGAC CGATCCTGTG CCATCATCCA AAGTCAGTCT
AGATATGTTC TTTCCAGACG CCACTAGACG ACGTAAGCTC GCTTTGCATG GCGATCTTGA
CCAAACCACA CAAGCCCGTC AGGATGACAT TGCGGCGCAA GAGCAAACCT TTGATCTTCT
AAGAAATGTC ATATGTGGGC CTGGTGCATC GGAAATGATT GACTATCTCT TCAAGGAACT
CGGCCAGGAT TTGCTGCTGG ATACCTTGGC CGATAAACTG CGCCCAAGGT CTATCCAGCT
GCCTCATCGG CGAGAGTCCC CAAACCATCG CGCGCTTCAG GTCCCCACTG AGATTTTGGT
CGCAGTAACG TTCGTTATCA TCCACCTCGC TGCAAGCCTT CCATGGTACC GGCAGCTCAT
AGTCTCACAC CGCGATCTCA TTCGTTATTT GATGGGTTAC TTTAACCACA GCCACCGAGA
CGTCCGTGCC AATTGCGTGT GGGTGGTAAT TAACCTCACA TATGAGGATG ATGTTCACGA
TCGAGAGGGT TGCCGGAAAC GCGCACTCGA GCTACGTTCA ATTGGGGTAC TAGATCGACT
GGCTAGCCTT GAACATGACC CGGACCTTGA CGTTCGCGAG CGAACGAAGA CGGCACTGCA
CTTGGTAAAC TCGTTGACAC ACTCTTAG
 
Protein sequence
MTRAEAPPIF LQLQNADSLS SQAAALRALK NETIGHDQRK EAWVRLGLIP ILSNVLASRA 
LDKSELNNGT KQPELPGSRE ESDDVCLQAI ILVGSLAQGG TPFLSPILSS NILPILLSIL
SSNCPSSFVL PILRVLNSVA DRLPLQSQQQ WPRDTRLADI LFSTEHIGCL TRILGQDYSS
HSRLTAIELA AGLIGKLCTE ESHKAVLAES GVLDALAVKV ASFIVAQGFV FPGAESHLDD
VGALGSLPPP APRGAKLAPI LRAVTVIVEH SKWRAEHFLS SPGIVTVFPR QIPGFSPSDI
KKGPWGSTYF SGSAVPRHLG GTPLEYLLPS IPLSQLKPSA SSSNFPPLGQ YGQHRRQSHS
FPTPLSSFEP PTAEDDENPV VPWLLYLVRA ESGMARLMAA RFVTVLCRLG LTKKHRISML
CYLLIPILLR MLDKDYEASD DGVQYGGLIS SSQRIKEEAP GVLATLLVDD RELQKHAVEG
DAIKRLSQLL KETYNPIHEP ARTMWHAEGQ PKVEDHDSQP AECRLGPPGY SPLRYHILRY
RENILKALAA LVPFKDEYRK AVCEHGVVPY IIDSLKPFPD QIPAESSDPG NTAADGNPTP
TLLAACGAAR MLTRSVSALR TSLIDAGVST PLFALIRHPD IEVQIAATSV ICNLALDFSP
MKEAIISAEI LPILCEHAHS SNTKLRIESL WALKHVAYNS ANDVKIKIIE GLGPEWIKQV
ITQDPTSVLA KRGLEDDTDS NTPSGMSRAN SAGERVDLLN PMDDFRERDE DMKMTDPVPS
SKVSLDMFFP DATRRRKLAL HGDLDQTTQA RQDDIAAQEQ TFDLLRNVIC GPGASEMIDY
LFKELGQDLL LDTLADKLRP RSIQLPHRRE SPNHRALQVP TEILVAVTFV IIHLAASLPW
YRQLIVSHRD LIRYLMGYFN HSHRDVRANC VWVVINLTYE DDVHDREGCR KRALELRSIG
VLDRLASLEH DPDLDVRERT KTALHLVNSL THS