Gene ANIA_08666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_08666 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001303 
Strand
Start bp3145172 
End bp3146402 
Gene Length1231 bp 
Protein Length389 aa 
Translation table 
GC content62% 
IMG OID 
ProductPutative Zn(II)2Cys6 transcription factor (Eurofung) 
Protein accessionCBF78223 
Protein GI259483115 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00267711 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.115353 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACGG CTGTTGAGTG CATGACATCG ATCACCGTCT CTCAGACCGC CGTCATGGAT 
CCCTCCGCCG ACAAGAAGCG CAACAAGCTG GGATACCATC GCACCTCTGT CGCATGCGGT
CAGTGGCCCT CCCTCCCCCA TCGTGCCCCC TCCATTGCCT GTTGCTTATC CTGCTCCAGT
CCACTGTCGA CGGCGCAAGA TCCGGTGTCT TGTTGCCGCA GACGACGCCC AGGGCCGATG
CGAAAACTGC ATCCGTCTGC GCAAAGAGTG CCAGTTCTTC CCGGTGGACC AACAACCGCC
GATCGAGAAG AAATCCCGTC CCAATTCCCG CATCGAAACG ATCTCGAATG ATCCGTCGAC
GGCTTCGTCG TCGCCTCCGA CTGTCTCGGG AGACCAGACA GACGCTTACT ACCACTACCA
GCATATGCCC TTGAGCGCCG GCCAAGACGT CGCGGCGTTC AACGCCGCGC CGTATGCGAA
TCCAATGGCT CAATTCGCGC CAGGTGGGTA TCATCAGATG GAATCAGGGT CAAGTGTCTC
GCCACTGACG AGGTACGGCA CAGGTGTAGA TGCGACATCG ACCCCCAATC CGATGGACCC
ATCCGTCCCC TGGGACGAGT TCACTACGTT GCCCTCAGAC CCGCAGCTGC TGGCGACGAT
GTCTGCCGCG GGGAAGCCCA TGGTCAGCGT CAACATGCCT CACCAAGCTC ATGTCTGGAG
CCAGCCTGCG ACCCCGATCG CCGCAATGCC ACCTAATACA CAGCTCCCAG GCGCGCCCAC
CGTGCCCTCA CAACCACAGC CTTTGAGCCC GTCATCGCCC TACACAGTGC AGCCCGACGG
CTCCGTCTCG GTCTGGCAAA TGGCGCAGAC TCCCACTCGA TCGATGACGT TCCCTGCCCA
GCCGAACATG CCCGCGCAGT ATCCCAGTCC CGGCGGCTTC GCACAACCCA TGCCGGCCGA
CCTCAAGCGG CGAGTGACCA GTCCCGGTCA AGGGTACCCT ATGCACCCGC AGAGCCCGCC
GGCCGACCTC CAGGGCACAT CGGTTCCGGT TACATATGCC GCGCAGCCGA CAGGTATAGG
ATACCCCGGG TGGCAGGATA TGAGTGGCGT AACACCGGTG AATATGGTCC CATACCCGGT
ATACACAGAC GCACAGCAAG CACAGGCCGT GTACGGCAGC CCTCTGATGG GGCCTGGCGC
TGGACATGGA CACCCGGGGC AACATCAATA A
 
Protein sequence
MTTAVECMTS ITVSQTAVMD PSADKKRNKL GYHRTSVACV HCRRRKIRCL VAADDAQGRC 
ENCIRLRKEC QFFPVDQQPP IEKKSRPNSR IETISNDPST ASSSPPTVSG DQTDAYYHYQ
HMPLSAGQDV AAFNAAPYAN PMAQFAPGGY HQMESGSSVS PLTRYGTGVD ATSTPNPMDP
SVPWDEFTTL PSDPQLLATM SAAGKPMVSV NMPHQAHVWS QPATPIAAMP PNTQLPGAPT
VPSQPQPLSP SSPYTVQPDG SVSVWQMAQT PTRSMTFPAQ PNMPAQYPSP GGFAQPMPAD
LKRRVTSPGQ GYPMHPQSPP ADLQGTSVPV TYAAQPTGIG YPGWQDMSGV TPVNMVPYPV
YTDAQQAQAV YGSPLMGPGA GHGHPGQHQ