Gene ANIA_09314 details


Gene Information

Locus tag: ANIA_09314
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001308
Strand:
Start bp: 251433
End bp: 254441
Gene Length: 3009 bp
Protein Length: 928 aa
Translation table:
GC content: 49%
IMG OID:
Product: terpene synthase family protein (AFU_orthologue; AFUA_5G15060)
Protein accession: CBF87385
Protein GI: 259488149
COG category:
COG ID:
TIGRFAM ID:
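
The Gene Length, Start bp, End bp and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


span = gene_length(251433, 254441)   # values from Start bp / End bp
coding = coding_length(928)          # value from Protein Length
print(span)            # 3009, matching the Gene Length field
print(span - coding)   # 222 bases not accounted for by coding sequence
```

Under these assumptions the 222 bp difference is consistent with the record marking the gene as spliced.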


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0823306
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.708956
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCAAAT CCATACATGA GTCGATTCAA ATCACCTTTC AGCCATCCAC TGATAGTTCA 
CACCCATTTT CACTAGGGTA TTTCCAGGCA AATATGGGAT CCAAGATGGA TCACAATTCA
GTGGCTATAC TTGTCGACGT GAAGCGGAGA GCCTTGTTCG AGCGCTTCGC GGAGTCGTAT
CATCCGACCT ACGGTTTAGG TACAATGTCT GGCAACATAT ACGACACCGC TTGGGTATCA
ATGGTCAGAA AGCCTACTGA AGAGGGCAAG TCTATCTGGG CCTTTCCGGC TACTTTTCAG
GCTCTTCTAC AGCACCAGCT CCCTTGCGGC AGTTGGGGCG GGACAAATTC AAATTTGGAT
TCTATTGCTA GCACTTTGAC AGCTCTTCTT GCATTACAGA AGCATGCAAG GGAATTGAGT
GCAACTGAAT CTCAGAATGA GCTCACCTCG AGGATCCTCA AAGCCAAGCG ATGGCTGGAT
GCAGCTTTGG TACGCCTGGA CGACTTGCTA GCAACTAGTA CCTTGACTGT TGGGCTAGAA
CTTAGACTAC CGACGCTTTT TGATCTTCTC GAAGCAGAGG GACACATCTT TGACTTTGAG
CGGACCCGCT TGACCAAGTT GAAGTCCAAG AAAGTATCCA AGATCAACTT CGATACCATC
TTCAGCGGGC CCCAATCATC TCTTTTGCAT TCCTTAGAGG CCATGGTTGG GAAAATTGAC
TTTAGAACGC TGGGTCATCA TAAAGTGCTT GGCAGCATGT TGGCATCACC CTCTGCAACA
GCAGCATATC TCATGTACAA CTCCGTGTGG GATGACGAGG CCGAGGAATA TATTCGACAT
GCAATCTCCA ACGGAGCCGG TCAAGGCTCA GGGCTCGTGG CGGCGGGATA CCCAACCACG
GTATTTGAGT GGGCCTGGGT AGGTTCTTCT TGCTTGGTGT GGGCTTGATC AAATGCTAAA
TCAGCAAGGT CACAACGAAT CTTTCACGGC ATGGCATCAA AATAAGCAGT AGCCTACGAC
AAATGGACAA GCAGATTGAA GACGAAATTA AGGTAAATGG ACTTGTGGGA TTCGGTACGA
TCACCTCCAG CAAGGCTTCC TTCTATTTTG GTGACCTGCT CTGAGTTTGC AGTTCCCAAG
GCATGCCCCG ATGCAGACGA CACGGCGAAG GCTTTGATAG CCTTTCAACT CCGGGGAAGA
CGCTATTCGC CGCAAGCGTT GATTGATCAA TTCGAACGAG AACATCATTT TACGACATAC
CTATACGAGA CGCACACGAG CGTAAGCACC AATGCCAATG TTCTGACTGC ACTGGTGTTG
CTCTCAGACG ATGGACGCTA CCAACCACAG ATCGAAAAAT GCATCCGCTA CCTCTGCGAG
GCATGGTTCC ACTGTGACCG AATGGTAAAA GACAAATGGG TATGTTGAGC TCTATCAACG
GAAAGGATTC TGCTTACTGA ATGCTAGAAT ATCTCTCCTT ATTATCCAAC GATGCTGTTG
TGCGAGGGCC TGATGTCTTA CATTCATCGC TGGAGCGAAG GGCACCTGGC TGCATTGCCT
GACGAATTGA TGAATTTCCA ACTTCCCATT ACGTTGTTTC AGGCCCTGAT CCGAACATTG
CGCACTCAAA ATTCAAATGG ATCCTGGGGG AGCTCGAACT CTGCTGAAGA GACTGCATAT
GCTATATTAA TCCTGAAAAA TGTGGCATCC TTTAACTTCA CTGACGAGAT ATCGGCCGAG
CTCGAGAGTG CCATCCGGAA AGGGATCCAG TTCATTCTCT CTAAAAGCCA ACGGTCTCAG
ACAGATGACC AGCTTTGGCT GGACAAAACC TTGTTCGCCA TACCAACTGT ATCGGACTCG
TACATCGTGG CTGCTGTGCA GGCTGAGGCT ACTGACTTTG TGTCTGGCGA TACACTAAAT
AAGCTCGTCG ACACGTCAAC ACCGACGGTG CAAAAATTGA CATCATACTT TGCTCGACTA
CCATCTCAGA CAGAAACACC GAAGTGGGTG ATCCAGGCTT CCGTTATAGA GGGAATCCTC
TTCAGCTGCA GGTTGAAGAC GTTGGACATC TTTTCGACTG GAAAAGCCCT CGGCGACAGA
TATATTAAAT ATGCCGCCAT CTTTTGGACA CTGGCCAACA ACGCACGTCC TGAGTATCTT
CTCAGTACGT CGGTGATCTA TAGCATGGTG GAACTTTCTG TTGGGATATT TCAGGAAGAC
GAAGAAATGG AGAAGTCCTT GGTCAACCTA CCGGATGCTG CTACCGACGC CGTAGCTGAC
TATATCGATA AATCGTGCCA CGAGACAGCT TTTTGCAACA ATGTCGCTTC CCACGCGACA
CCGCATGGAT CGGATATCTC TGGATATGAT GTAGAAACTC GAACTCAACT CATGACCATC
CAGCACAACA TAAGGCTTTG GCTGCGTTTT GCTTTGGTAG ACAACCTCCC GGCGAATGCC
AATTCCCATG ACATATACGA TCTCAAGCAA GAAGTCAAGA TGGCAATGAT TGCTGCTCTA
CAACAAGCTA AAGCACACAA ATTCCTCAAC AGCAGCCAAA CATTCTACGC TTGGCTCCAC
ACATGTGCCG TCCACGACGC CAAAAGCGCT GTCGTCTCCA AGTCCCTGAT TTGCAAGATC
GGGCACGGGT CAAATGTCTT CCGCACAGCC AAGGAAAAAT ATCTCGCCGA AAGGTTATGG
CGGCAGGTTT CCATAGAGGG CAGACTTTGG AACGATCTTG GGAGCATTGA GCGAGACAGA
CTCACAGCAA ACCTCAATTC GGCCGACTTT CTCGAGTCCG GGCCAGCTGG TGATGTGTGG
GAACAGCTAG TTCAGCTCGC GGACTTTGAA CACAAGTACG CTCTACTGTG CTTGGACAAC
TTGACGCAGC TCCTGGAAGC ATCCGGCCGC CATAGGATCT CGCTGTACCT GCAGATGTAT
TACCGTTGCT GCGAAATTTA TAATGAAACG TGTGTAAATT ATGAGTTTGG CTCCAAGATG
GCGAAATGA
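
The gene sequence above is printed in space-separated blocks of 10 bases, 60 per line. A minimal sketch for turning that layout back into a contiguous string and computing its GC fraction is shown here. The helper names are hypothetical, and only the first line of the sequence is used as a short stand-in.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTCAAAT CCATACATGA GTCGATTCAA ATCACCTTTC AGCCATCCAC TGATAGTTCA"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC percentage of this line only
```

Passing all of the gene sequence lines to `clean_sequence` should give a string whose length equals the 3009 bp Gene Length field. The record does not say which region its GC content field was computed over, so a GC value computed this way may not match it exactly.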
 
Protein sequence
MVKSIHESIQ ITFQPSTDSS HPFSLGYFQA NMGSKMDHNS VAILVDVKRR ALFERFAESY 
HPTYGLGTMS GNIYDTAWVS MVRKPTEEGK SIWAFPATFQ ALLQHQLPCG SWGGTNSNLD
SIASTLTALL ALQKHARELS ATESQNELTS RILKAKRWLD AALVRLDDLL ATSTLTVGLE
LRLPTLFDLL EAEGHIFDFE RTRLTKLKSK KVSKINFDTI FSGPQSSLLH SLEAMVGKID
FRTLGHHKVL GSMLASPSAT AAYLMYNSVW DDEAEEYIRH AISNGAGQGS GLVAAGYPTT
VFEWAWIEDE IKVNGLVGFV PKACPDADDT AKALIAFQLR GRRYSPQALI DQFEREHHFT
TYLYETHTSV STNANVLTAL VLLSDDGRYQ PQIEKCIRYL CEAWFHCDRM VKDKWNISPY
YPTMLLCEGL MSYIHRWSEG HLAALPDELM NFQLPITLFQ ALIRTLRTQN SNGSWGSSNS
AEETAYAILI LKNVASFNFT DEISAELESA IRKGIQFILS KSQRSQTDDQ LWLDKTLFAI
PTVSDSYIVA AVQAEATDFV SGDTLNKLVD TSTPTVQKLT SYFARLPSQT ETPKWVIQAS
VIEGILFSCR LKTLDIFSTG KALGDRYIKY AAIFWTLANN ARPEYLLSTS VIYSMVELSV
GIFQEDEEME KSLVNLPDAA TDAVADYIDK SCHETAFCNN VASHATPHGS DISGYDVETR
TQLMTIQHNI RLWLRFALVD NLPANANSHD IYDLKQEVKM AMIAALQQAK AHKFLNSSQT
FYAWLHTCAV HDAKSAVVSK SLICKIGHGS NVFRTAKEKY LAERLWRQVS IEGRLWNDLG
SIERDRLTAN LNSADFLESG PAGDVWEQLV QLADFEHKYA LLCLDNLTQL LEASGRHRIS
LYLQMYYRCC EIYNETCVNY EFGSKMAK
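
As a quick consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field in the record is empty. The `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumed), amino acids listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; unknown codons become 'X', trailing bases are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGGTCAAAT CCATACATGA GTCGATTCAA"
print(translate(prefix))  # MVKSIHESIQ, the first ten residues of the protein sequence
```

Because the record marks the gene as spliced, translating the full gene sequence directly is not expected to reproduce the whole protein. The introns would have to be removed first, and their positions are not given here.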