Gene ANIA_01217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_01217 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001308 
Strand
Start bp1112064 
End bp1113936 
Gene Length1873 bp 
Protein Length566 aa 
Translation table 
GC content56% 
IMG OID 
Producthomeobox transcription factor, putative (AFU_orthologue; AFUA_1G10580) 
Protein accessionCBF87915 
Protein GI259488463 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0446993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0116358 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCATC CTACCATGAT GCATCACCCA ATGGACGGCT ACTACTACGC CCAGCCCCCT 
TTTGACATGG TCGACTACTA TCACCAGCCG ATGATGGATT ACGAAGAATA TGCCGAAAAC
CTCTCACGAC CGAGGTTGAC CAAGGAACAA GTTGAGACCT TGGAGGCCCA GTTTCAGGCG
CACCCCAAGC CCAGCAGCAA CGTCAAGCGT CAATTGGCTC AACAAACGCA CCTTAGTCTC
CCTCGAGTTG CAGTATGTGA TATTCAAGCT GGTTGAAGGC TAGTTTACTG ACTCTTGCAG
AACTGGTTCC AGAACCGACG GGCCAAGGCG AAGCAACAGA AACGACAGGA GGAATATGAG
CGCATGCAAA AGGCAAAAGC GGAGGCAGAG GAGGCTGCTA AGAGAAAGTC TGAGTCTTCA
GTGCCTGAGT CTTCAGATTC TCAACGTTCA GCGGAGGCCA AAGACGAGAA GAAGCAAGAT
GACAGCAAAG CCCCTACCCC TAAACCGTCA AAGCCAGCAT CTGACGACCA AAAACAATCC
GAAGCTCCTG CTGAGTCGAA TCACCAGCAG ACCCGCAGTG AATCTAATCG CGTAGCTAGC
CTTGCGTCTC TGCAGAGGGC CATGGACGCC GCCGCTCAGT ACCAAGGTGG TCAGGGCACG
ACAAGCATGG GCGGGTCCGG ATCAGTGTCC CCAACCACAT CGCTGCCGAA TGACGCAGAC
TCGGCCGTTT GGAGTTCAGT AAACTCAACA AACGGAGAGC TCTCTGTTCC CGGGTTAGAG
AACTCGCAGT CTTTCTCTGA TTACCGTTCA GCTAGCGACG CGGGTGCTTC GTACAACTCA
ATGCAATTTG CCCTGCAAGC AGATGCGGCT AACGCGCGTC GGGGTTCGTC TGATGTGCTT
GCCGACTCGT TCGATGGTAT CGGCATCAGC GCCTCGCCTA GCCTCTCGCA GCTTGGAAAC
CGGACAGACC GCCCGGCCTG GAAAGAAACC GGCAAGGAGC TTGATCTCGC TGCCCGCCGA
AACCGACCGA GGCCTGCCGC GAGTTGGCAC GTCGAGGTCC ACTTCAATGC TCTCTACTTC
GATCATGTCG CCAACGACAC GGGGCCAGAA CTACGGCACT GTGAAGCAAT CCAAATCTGC
CCAGAACCTC GGTTCGCGCT ACGCCGGGGT GCGAAAGCCT TCTGCGCAGC GCTCACCTTT
GAACCTGTCG ACATTCGCCG AGGCCGGTGT GCTGAGCTCC GCAAAGACGG AGTTGTCGAC
CATGCTGCAG CCAGTCACGA CAAACTCCCT AGCTCCGCCC ACTCCATTGA CCCCCGAGGA
TCTGCATCAC CTGCTTCCCA CCACCCCTTC CACCGATGGC TACTGCCTGT CCGCGCAGCC
AACCGCCCAC CTCTTCCCCA CGACGCAACC AATGCAGATC AACATCGCCT CGCCTCCCGC
TACACCTCTA GGGATGGATA TAATGTCCTC ATACCCATAT CACAGCGTCG CGCCGCCCAT
GTCCGCCCCG GCCAACTTCA CATCCTTTCC AGACTACAGC TGTGACGGCA GTTTCCAGGG
AAGAAACTGG GAAGCCACTT CGATGCCATC TCCGGAAGTC CCCTTCCAGA GCCAATGCCA
CCAAATGAAT TTCTCATCGA TCCCGTACGA TCACGCCCTA GACCAAAGCC AGTCTGAGAA
CGGGCCCTCT CAGTCTCCAT TTGGCGACGC AGATATACAA GCGCCTGGTG ATGCCAGCAA
GGCCACCGAA TTTCATCTGT ATGAGTTCCC AGACCAAGAA GAAGCGCACC GGTTCGTGGC
CCAGCAGCTA CCCAACCAGA AGCCCAAAGC GTACACTTTC GCCGACAACC GGACCCCCAC
CAATTTCGGC TAA
 
Protein sequence
MVHPTMMHHP MDGYYYAQPP FDMVDYYHQP MMDYEEYAEN LSRPRLTKEQ VETLEAQFQA 
HPKPSSNVKR QLAQQTHLSL PRVANWFQNR RAKAKQQKRQ EEYERMQKAK AEAEEAAKRK
SESSVPESSD SQRSAEAKDE KKQDDSKAPT PKPSKPASDD QKQSEAPAES NHQQTRSESN
RVASLASLQR AMDAAAQYQG GQGTTSMGGS GSVSPTTSLP NDADSAVWSS VNSTNGELSV
PGLENSQSFS DYRSASDAGA SYNSMQFALQ ADAANARRGS SDVLADSFDG IGISASPSLS
QLGNRTDRPA WKETGKELDL AARRNRPRPA ASWHVEVHFN ALYFDHVAND TGPELRHCEA
IQICPEPRFA LRRAPPTPLT PEDLHHLLPT TPSTDGYCLS AQPTAHLFPT TQPMQINIAS
PPATPLGMDI MSSYPYHSVA PPMSAPANFT SFPDYSCDGS FQGRNWEATS MPSPEVPFQS
QCHQMNFSSI PYDHALDQSQ SENGPSQSPF GDADIQAPGD ASKATEFHLY EFPDQEEAHR
FVAQQLPNQK PKAYTFADNR TPTNFG