Gene ANIA_06742 details


Gene Information

Locus tag: ANIA_06742
Symbol:
ID:
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001301
Strand:
Start bp: 2985174
End bp: 2987472
Gene Length: 2299 bp
Protein Length: 697 aa
Translation table:
GC content: 55%
IMG OID:
Product: salicylate hydroxylase, putative (AFU_orthologue; AFUA_7G00590)
Protein accession: CBF71369
Protein GI: 259480334
COG category:
COG ID:
TIGRFAM ID:
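
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the translated product includes a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_nucleotides(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
length = gene_length(2985174, 2987472)   # 2299, matches "Gene Length"
coding = coding_nucleotides(697)         # 2094 (697 codons plus one stop codon)
remainder = length - coding              # 205 bp not accounted for by codons

print(length, coding, remainder)
```

Under these assumptions, the 205 bp remainder would correspond to non-coding sequence within the gene span, which is consistent with the record marking the gene as spliced.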


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.00748094
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGGTAGTG CCGTAGAAAC CGAGCAAGGC CTCCACGTCC TAATTGCAGG CGCTGGAATT 
GGCGGCCTTT CAGCTGCTAT AGCATTACGA CAGCAAGGAC ACCGCGTCGA GGTATGTCTC
GTGAGTTGTC AGCAGGAACA TCCAGCTTAG CACGCCAGAT GAACAGCTCT TCGAGCGCTC
CCGCTTCGCC AACGAGATCG GAGCCGCAAT CCACCTTACA CCCAATGCGA ATGCCGCGCT
TCTAAAGCTC GGAATTAATG CGACGACACT CGGTGCTGTG GAGTCTGAAA AGGTTCGTGA
AAGCGTTGAC TGAAAGGTTG CTAGTGTGCT GACGCACTGC GACAGCTCCG TGTCTTCCCT
CCAAACGGGC CGGAGATATT CTCGTTGGAC ATCAAGAAGA CTGCTGGTTT CTGGAGGCAT
GTAAGTTGCA CGTTCCATAA TTCTATGCAT CAATAATGAC CCTTCAGCGC TGGCTTCTCG
TCCACCGGGC GCATCTCCAT GAGGGCCTGA AAATTGCTGC TCAAGCTCCC GGCCCTGGAG
TGCCTGCCAA ACTACACACC TCCAACAAGG TAGTAGACAT TGATCCCCAC AGCGCAACTA
TCACCCTCGA AAACGGAGAA AAGGTAACTG GCGATATTGT TCTCGGCGCA GACGGCGTAC
ACTCGGTCGC AAAGACGAAA CTTTCTGGTG GAAAGAATAT AAAAACCTTC AGCTCAGGAA
AGAATGCCTT CCGGTTCTTG ATTTCTCGGA AGGATGCACT GGATGACCCC GAGACAAAAG
AACTCGTGAA TGAGCCGGGA ACATGGTACA TGTTTGACAG CCCCGACCTT CGCGTTGTTG
TGTATCCCTG TGCCAACAAT GATCTGCTGA ATTTCGTGTG CATCCATCCG GAGTCCTTGT
CCAAGATCCA CGATGGTTCT GAGTGGGATC AGGGTGCAAG CAAGGAATCA TTGCTGGAGG
TGTACAAGGA TTTCAGCCCA CAGGTCCGCA GATTGCTAGC GAAAGCTGAC CCAACCACGT
TGAAGGTGTG GCCGTTGCTT GACACCGATG ACCTGCCTAC GTGGGTGGAA GACCGACTTG
CGGTCATGGG CGATGCTGCA CATCCTTTCC TGCCGTACCG TGCGTCAGGC GGCGCAATGG
CTATTGAGGA CGCGGTCTCA TTGGGCGTTA TGCTTCACAA GGGTGTGTCT GTGGGAAGTA
TTTCTGAGCG GCTGAAGCTA TATGAAAAAG CCCGCCGTAC CCGCGCAACG ACAATTCAGC
AGTTGACACG AAAGAGCTCC CATGGGCCTC TCCCACCGTC GGAAGGTAGA TACCCACTCC
CCCTTTGGCT TGTACTGTCC GCGCTAACTG ACCAGAAAAA TCAATGACCG AGTATATCTA
CGGCCATGAC GAACACGACC ATAGCACCCA AATCCTGCGC AAACACCTCT GGGCACAAGA
ACCGCAAAAG TATTGGCGCC AGCCCATCGT CTTCGGGCCC ATGCCCGGCC CGCGTCAAGA
TTTTATAGGC CGTAGCCGTC TGGACCGGTC CCTAAAATCT ACGTTCCAAA CAACCTCCAT
TCGGTTCAAA ACCAGTCGAA CACTCCTGCA AAACTTGCTC CCGAATGACT CGTGGTCCTT
CTGCACGCCC AGCACGGTCG CCACTGCTAC CTTCTCACAG ACCCTGCTCA ATGGTATGGA
CTGGCTCGGT GGCGGCGGAT ACCGCCATCT GGGTCTGTAC ATTGAGGGCG TGCAGTACAC
AAAGGCCAAC GGTGAGGTCG TATCAGGGAC ATACCTGCCG ATTCTATTTG AGACTCTAGC
GGACCCGATT GTCAGCGGCC GTGAAGAGCT TGGGATGCCA AAGCTGTATT CTGCCCTTGA
GGCGAATGAG CGCGAGGGCT CATATTACCT TCAAGCGAGC TGGCAGGGCG CCGTCTGGGG
GCGGTTCCAG TGGGAAGGAC TCGAGGACCA AGATCCGGCA ACAACCGCCG GGCCAGGCGA
CAGTGGTGTC AACGGAGGCT TGCTTCTTCA CCGCTACATA CCCAAAGTTG GGAGAGACTG
CAAGGGTCAG GCGGAGGTTG AGTATCCGGT GTTTGTCCCG AATGCAGATG AGAGCAAGAT
GCTGGTCTCA AAGGTAAACC GGGTTCGTAC AGCTACAAGG GCTGCATTTG AGATTGATGG
CCTTGGGTGG GAGGCTTTGC CAACGTTGCA CCACATCATC GAGCGATTGG CCGAGATTCC
AGTCGATGAG ATTCTGTCGG CGAAAATTGT CGAGGGAGAG GGGGTGCCGG ATGTGTCGTC
TGCGCGCCGG ATAGAGTGA
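
The gene sequence above is laid out in space-separated blocks of 10 bases, 60 per line. A minimal sketch of how such a block layout could be joined into one string and summarised is given here. The helper names are hypothetical, and only the final line of the sequence is used as input to keep the example short.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string with no whitespace."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Final line of the gene sequence as printed in the record.
last_line = "TGCGCGCCGG ATAGAGTGA"
seq = clean_sequence(last_line)

print(len(seq))                # number of bases in this fragment
print(seq.endswith("TGA"))     # True if the fragment ends with a TGA stop codon
print(round(gc_fraction(seq), 3))
```

Applying the same two functions to the full sequence text would give the total base count and overall GC fraction for comparison with the Gene Length and GC content fields.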
 
Protein sequence
MGSAVETEQG LHVLIAGAGI GGLSAAIALR QQGHRVELFE RSRFANEIGA AIHLTPNANA 
ALLKLGINAT TLGAVESEKL RVFPPNGPEI FSLDIKKTAG FWRHRWLLVH RAHLHEGLKI
AAQAPGPGVP AKLHTSNKVV DIDPHSATIT LENGEKVTGD IVLGADGVHS VAKTKLSGGK
NIKTFSSGKN AFRFLISRKD ALDDPETKEL VNEPGTWYMF DSPDLRVVVY PCANNDLLNF
VCIHPESLSK IHDGSEWDQG ASKESLLEVY KDFSPQVRRL LAKADPTTLK VWPLLDTDDL
PTWVEDRLAV MGDAAHPFLP YRASGGAMAI EDAVSLGVML HKGVSVGSIS ERLKLYEKAR
RTRATTIQQL TRKSSHGPLP PSEEKSMTEY IYGHDEHDHS TQILRKHLWA QEPQKYWRQP
IVFGPMPGPR QDFIGRSRLD RSLKSTFQTT SIRFKTSRTL LQNLLPNDSW SFCTPSTVAT
ATFSQTLLNG MDWLGGGGYR HLGLYIEGVQ YTKANGEVVS GTYLPILFET LADPIVSGRE
ELGMPKLYSA LEANEREGSY YLQASWQGAV WGRFQWEGLE DQDPATTAGP GDSGVNGGLL
LHRYIPKVGR DCKGQAEVEY PVFVPNADES KMLVSKVNRV RTATRAAFEI DGLGWEALPT
LHHIIERLAE IPVDEILSAK IVEGEGVPDV SSARRIE