Gene ANIA_00056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_00056 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001308 
Strand
Start bp4677347 
End bp4679234 
Gene Length1888 bp 
Protein Length521 aa 
Translation table 
GC content53% 
IMG OID 
Productarrestin (or S-antigen), N-terminal domain protein (AFU_orthologue; AFUA_5G12530) 
Protein accessionCBF90290 
Protein GI259489756 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.524817 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACGATTGCAT ACTCCCATAG TCCGCCTGAC TCTATACACC CCGCGCCTGC CAGAGCATAT 
TACTACTTAC GGCCTCTACT ACAGCCTCTA CCCAAAGGAT TATGCGTGCA ACTTCCTACT
ACATCTCACA GCTCGATCAA CACTGCATTG AGTGCTGAAA CCCTTTTCAG ATAGCACCGC
ACTGCCTTGG GTGTCTAGTG CCCACCGAGA GAGCTTCTAT AGACCTCACT CAATTTAGCT
CATCGCCACA CATTGTCTCG ATCTTGTTTC CAGCCGCTAC CGCTGCAAGA CGTCGCAATG
CCCGGCCGTC TTCTTTCCAG CCTAGTTCGC CCCTCCCTTT CTCACCACTC GATCTTCCCT
CACTCCAACT CTTCTTCGTC CTCGTCTGTA AACGAAGTCT CCCAGGCCGC ATCTTCCAAG
CACTCGCACA CTGACAGATC ACATTCGCCG GAGCGCCGTC TATCCTTCTC AATGGACCAC
TTTATCCACA CTTACCGGGA CCATCACAAT AAGGAAAAGC ATCGAAAGCA CGGACGTTCC
TCTCGTTCAA AGGAGCGTGG CAGCCATGAG GAAACGGCAG CTTCAGCTAA ACTTGATGTC
ATCGTGGAAT CTCCACCTCT TGTTTGCTAT GGAACACCGG CGAACTCTAC GGGTGCTTTG
TTCTCTGGCC GCCTCAGAAT TACCGTACCC GAAGCAACGG GCATGGTCAT CCTTGACAAG
TTCGATATGC GCTTGATGAT TAGAAAGACG ACGAAGAAGC CCGTCTCGAG GGACTGCCCC
AATTGCGCCT CCAAGACCGA GGAACTGACA AACTGGAACT TCCTGACGGA ACCTCTCCAC
CTTAGGAGCG GTGACCACGA CTTCCCTTTC AGTTATCTGT TCCCAGGTAA CCTGCCGGCG
TCGTGTAATG GTTCCCTGGG ACAGATTGAA TATTTCCTTC AAGCACACGG ACACAATGTG
AACGGTGAAG AGTACAATTT TAGAATGCCG CTACACATGC GCCGCGCCAT TCTCCCAGGG
AACGACAAAT CCTCAATTCG TATCTTTCCA CCCACCAATC TAACTGGCCG CATCGTCCTC
CCATCTGTTG TCCACCCAAT TGGGACTTTT CCCGTTCAGA TGACCTTGAG CGGTGTCTTG
GATAAAGGCG AGGAGACCCA AACGCGTTGG CGGCTACGCA AGATGATGTG GCGGATTGAA
GAGCACCAGA AGATTGTCTC AACCGCTTGC CCCAAGCACG CGCACAAGAT TGGTGGCGAA
GGAAAGGGCG TTCTGCACCA GGAGACGCGG ATCATTGGAC ACAACGAGGA GAAAGATGGC
TGGAAGACAG ACTTTGATAC CGCTGGAGGC GAGATAAGCA TGGAATTTGA AGCCAACATT
AACCCAACCG CCAACCCGGT GTGTGATCTT GAGGCACCGG GTGGACTGGA GACGAAGCAC
AATTTGGTCA TTGAACTGAT TGTTGCGGAG GAATTCTGTC CAAACCGTAA CACCAGACTC
ATCACTCCAA CAGGAGCTGC CCGCGTCCTT CGCATGCAGT TTAACCTGCA CGTTACGGAG
AGAAGCGGGC TCGGTATCAG CTGGGATGAG GAGATGCCGC CAGTATACGA AGACGTTCCC
GCTAGCCCTC CCGGATATAC CATGCTTGAC GGAAGCAGCA TCATGGAGGA TTACCACGGA
TCTCCTCTAC CGACCCCGGA GTACGAAGAA CTAGAGCGAA TGGACTCACT TCGACTTGAT
AACTCATCTA CCAACTCTTC CTGCCGTGGA CGAAGCCGGC TGACGACCGA TGATTTGACG
GCCGAACCGG CGGAATTCGA AAGTCGAAAC CGAGCGCCAT CCGCGGACTC GCATTCGTCA
TGAACAATTG ATTCAATTGA TCCGTCAT
 
Protein sequence
MPGRLLSSLV RPSLSHHSIF PHSNSSSSSS VNEVSQAASS KHSHTDRSHS PERRLSFSMD 
HFIHTYRDHH NKEKHRKHGR SSRSKERGSH EETAASAKLD VIVESPPLVC YGTPANSTGA
LFSGRLRITV PEATGMVILD KFDMRLMIRK TTKKPVSRDC PNCASKTEEL TNWNFLTEPL
HLRSGDHDFP FSYLFPGNLP ASCNGSLGQI EYFLQAHGHN VNGEEYNFRM PLHMRRAILP
GNDKSSIRIF PPTNLTGRIV LPSVVHPIGT FPVQMTLSGV LDKGEETQTR WRLRKMMWRI
EEHQKIVSTA CPKHAHKIGG EGKGVLHQET RIIGHNEEKD GWKTDFDTAG GEISMEFEAN
INPTANPVCD LEAPGGLETK HNLVIELIVA EEFCPNRNTR LITPTGAARV LRMQFNLHVT
ERSGLGISWD EEMPPVYEDV PASPPGYTML DGSSIMEDYH GSPLPTPEYE ELERMDSLRL
DNSSTNSSCR GRSRLTTDDL TAEPAEFESR NRAPSADSHS S