Gene ANIA_07447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_07447 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001304 
Strand
Start bp1511245 
End bp1514271 
Gene Length3027 bp 
Protein Length941 aa 
Translation table 
GC content52% 
IMG OID 
ProductmRNA splicing factor (Prp1/Zer1), putative (AFU_orthologue; AFUA_2G06070) 
Protein accessionCBF79397 
Protein GI259483750 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.143301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.632834 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCTG GACGAAAAGA TTTCCTCAGT CAACCCGCCC CCGAGAATTA CGTTGCTGGT 
CTAGGTCGAG GAGCGACCGG CTTCACCACC CGCTCAGATC TAGGTCCTGC TCGAGAGGGT
CCAACGCCGG AGCAGATCCA AGCTGCGCTT GCGAAAAGAG CACAGTTACT TGGAGCAGCA
CCTCCCACGG CTTATGGGGC TACAAGAGAA AAGGGTAAAG GAGAGGAAAA GCCGGCCGAG
GAAGAAGATG ATGAACGGTT CCAGGACCCC GACAATGAAG TTGGGCTCTT TGCCTACGGC
CAGTTTGATC AAGAAGATGA TGAGGCGGAT CGCATCTACA GAGAGGTAGA TGAGAAGATG
GATAGACGGC GCAAAGCACG AAGGTTAGTG GACTTCGTAC ACTCTATTTT TTTATTTTCC
CTTTTTGCCT TAAGATGAGT TGTTTATCGG GTTCACCCCG TCTATTCTTG GACAGATGAC
CGACTTGTAC CTTTACCGCA GGGAAGCTCG AGAGCGTCAG GAGCGGGAAG AGTATGAACG
GAAGAATCCC AAAATTCAAC AGCAATTCGT CGATTTGAAG CGGTCTCTTG CGTCGGTCTC
GGAAGACGAA TGGGCAAACC TCCCCGAAGT CGGTGACCTT ACGGGTAGGA ATAGACGAAC
GAAGCAGAAC TTACGTATGC AACAACGTTT TTACGCGGTC CCCGATAGTG TGCTCGCGAG
TGCAAGAGAT TCATCTCAGT TCGATACAAC CGTTGCGGAC GATGGAACAG CAACAGATGC
TGGTGCTAAC GGGGCGGACG GAATGATAAC GAACTTTGCC AACATTAGTG CTGCTCGTGA
CAAAGTATTA CAGGTTAAGC TTGATCAGGC GGCAATGGGG TCCTCTGGGG ACGCGGCATC
TGGAAGTGCG ACTAGCATCG ATCCAAAGGG CTACCTCACA AGTCTTACGC AATCAGAGCT
GAAGGCAGGT GAAATCGAAG TGGGAGACGT CAAACGTGTG CGCGTCTTGC TGGAATCTGT
AACAAGGACG AATCCCAAGC ATGCTCCGGG GTGGATTGCG CTGGCGCGCC TGGAAGAGCT
GGCGGGCAGG ATAGTCACCG CTCGGAATGT GATTGCAAAA GGATGTGAGC TCTGCCCAAA
GAGTGAAGAT GCGTGGCTTG AGAACATTCG ACTTAACGAA GGTCACAATG CCAAAGTCAT
TGCTGCAAAC GCAATCAAAA ACAATGACCA CTCCACTCGG CTTTGGATCG AAGCTATGCG
ATTGGAAACA GAGCCACGTG CAAAAAAGAA CGTGTTGAGA CAAGCTATTC TGCATATTCC
GCAATCCGTC ACAATCTGGA AGGAGGCGGT TAACCTGGAA GAGGACCCCG CAGACGCACG
CCTTTTACTG GCTAAAGCAG TTGAACTGAT ACCGCTCTCG GTTGAGTTAT GGCTGGCGCT
CGCTCGTCTT GAGACACCTG AAAACGCCCA AAAAGTTTTG AACGCGGCGC GAAAGGCCGT
GCCTACCAGC CATGAGATCT GGATTGCTGC TTCTCGACTT CAGGAGCAAA TGGGAACCTT
CAACAAAGTG AATGTTATGA AGCGAGCTGT TCAATCGTTG GCGAGAGAAA ATGCTATGCT
TAAACGGGAG GAATGGATAG CGGAGGCAGA GAAGTGTGAG GAGGAAGGGG CTGTCCTCAC
TTGCGGTGCG ATCATTCGGG AGACGCTCGG ATGGGGGCTG GATGAAGATG ACGATCGGAA
AGACATCTGG ATGGATGACG CAAAGGCGAG TATTTCCAGA GGGAAATATG AGACGGCAAG
GGCTATCTAT GCGTATGCCT TGCGTGTCTT CGTCAATCGC CGATCCATAT GGGTTGCAGC
AGCGGACCTT GAACGCAACC ACGGCACCAA GGAAGCGTTA TGGCAGGTAC TTGAAAAAGC
AGTTGAGGCT TGCCCTCAAA GCGAAGAGCT ATGGCTACAG CTTGCGAAGG AGAAGTGGCA
GTCAGGAGAG ATTGACGATG CCAGACGAGT GCTCGGACGT GCATTTAACC AGAACCCTAA
TAACGAGGAT ATCTGGCTTG CTGCTGTCAA GCTGGAGGCG GATGCTCAGC AGACGGACCA
AGCCCGAGAG CTTCTTGCAA CAGCTCGACG CGAAGCAGGA ACAGATCGCG TATGGATAAA
GAGCGTCGCC TTCGAGCGGC AACTGGGTAA TGTTGACGAT GCGCTCGACC TTGTCAATCA
AGGTCTTCAG TTGTATCCCA AGGCCGATAA ACTCTGGATG ATGAAGGGCC AGATATACGA
GTCACAGAAT AAGCTCCCTC AGGCCCGCGA AGCATATGGC ACTGGTACTC GAGCATGTCC
GAAATCTGTC GCTCTATGGC TATTGGCGTC ACGACTGGAA GAAAAGGCCG GGGCAGTGGT
CAGAGCCCGA TCTGTTCTTG ATAGGGCTCG TCTAGCAGTA CCAAACAGCC CCGAACTGTG
GACAGAGAGT GTCCGAGTTG AACGGCGGGC AAATAACATC CCTCAGGCGA AGGTTCTGAT
GGCCAGAGCA TTACAGGAGG TCCCATCATC CGGCCTTCTG TGGAGCGAAA GCATTTGGCA
CCTCGAACCG CGCTCGCAGC GGAAGGCTCG CAGTCTGGAA GCTATCAAGA AGGTTGACAA
TGATCCAATC CTCTTCATCA CAGTAGCGCG AATCTTCTGG GGCGAACGTC GACTTGAGAA
GGCGATGACA TGGTTTGAAA AGGCGATCAT ATCAAACAGT GATTTCGGCG ACGCGTGGGC
CTGGTACTAC AAGTTCCTGC TGCAGCATGG TACAGATGTA AGTTTTCTCC TCTCCTGCAT
ATAATCTCTT TTTTGCCACT CAGTGGAAAG AACCCCTGTT CTAATATTCA TTCTTCATAG
GAAAAACGAG CCGACGTCAT TTCGAAATGT GTACTTTCTG AGCCTAAGCA CGGTGAAGTC
TGGCAGTCCA TAGCGAAAAA TCCCGCTAAT GCCTATAAAT CAACCGAGGA TATCCTAAAG
TTAGTTGCGG ACAGTCTTGT CCAATAA
 
Protein sequence
MASGRKDFLS QPAPENYVAG LGRGATGFTT RSDLGPAREG PTPEQIQAAL AKRAQLLGAA 
PPTAYGATRE KGKGEEKPAE EEDDERFQDP DNEVGLFAYG QFDQEDDEAD RIYREVDEKM
DRRRKARREA RERQEREEYE RKNPKIQQQF VDLKRSLASV SEDEWANLPE VGDLTGRNRR
TKQNLRMQQR FYAVPDSVLA SARDSSQFDT TVADDGTATD AGANGADGMI TNFANISAAR
DKVLQVKLDQ AAMGSSGDAA SGSATSIDPK GYLTSLTQSE LKAGEIEVGD VKRVRVLLES
VTRTNPKHAP GWIALARLEE LAGRIVTARN VIAKGCELCP KSEDAWLENI RLNEGHNAKV
IAANAIKNND HSTRLWIEAM RLETEPRAKK NVLRQAILHI PQSVTIWKEA VNLEEDPADA
RLLLAKAVEL IPLSVELWLA LARLETPENA QKVLNAARKA VPTSHEIWIA ASRLQEQMGT
FNKVNVMKRA VQSLARENAM LKREEWIAEA EKCEEEGAVL TCGAIIRETL GWGLDEDDDR
KDIWMDDAKA SISRGKYETA RAIYAYALRV FVNRRSIWVA AADLERNHGT KEALWQVLEK
AVEACPQSEE LWLQLAKEKW QSGEIDDARR VLGRAFNQNP NNEDIWLAAV KLEADAQQTD
QARELLATAR REAGTDRVWI KSVAFERQLG NVDDALDLVN QGLQLYPKAD KLWMMKGQIY
ESQNKLPQAR EAYGTGTRAC PKSVALWLLA SRLEEKAGAV VRARSVLDRA RLAVPNSPEL
WTESVRVERR ANNIPQAKVL MARALQEVPS SGLLWSESIW HLEPRSQRKA RSLEAIKKVD
NDPILFITVA RIFWGERRLE KAMTWFEKAI ISNSDFGDAW AWYYKFLLQH GTDEKRADVI
SKCVLSEPKH GEVWQSIAKN PANAYKSTED ILKLVADSLV Q