Gene ANIA_02998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_02998 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001306 
Strand
Start bp2088970 
End bp2092312 
Gene Length3343 bp 
Protein Length1031 aa 
Translation table 
GC content55% 
IMG OID 
ProductC1 tetrahydrofolate synthase C1-THFS Fragment [Source:UniProtKB/TrEMBL;Acc:Q96UN8] 
Protein accessionCBF83595 
Protein GI259486056 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATACT CCCCCGCCTA TTTGGCAACA GTACCCCGCT GTGCTGCTCC AGCAAACAGC 
CTAGGTTTAC AACCGTCGGC TCTGCGTCAG CCTCCTTCAC GCCAATTTTT CTCCTGCAAA
TCTTCCTGTC CCTCTTTATA CACTCGACTG TCCTCTGTTC GACGCGGGTT CCTTATTGAG
TCTTGCCGAA GAGTTCCCCG ACGTTCCCCG GAGAGCCTCA GAAACTTTCC CGCTTTACCG
TGTCAACGTC GTACATTCTC GAGCACAGGC GTCGCCATGG TGGCCGAAAA GATCGATGGC
ACGCAAATCG CCAAGGACAT CCGGGCGGGG TTGAAGGATG AAATCCAGAA GATCCAGGAA
ATCAACCCCA GGTTCAAGCC CAGCCTGGTG ATCTTCCAAG GTACGCGCTA TACTTCAAAT
AAACGTCTTC CGCGGGAAAG TACTGACATT GTTTTGTCAA GTCGGAGATA GGTCTGATTC
AAGTGAGTCT TCTTGCGACG ATATTTGGTG GGAGGAGAGC GCTGACATGT CATACCTTCA
GGCACATATG TGCGCATGAA GTTGAAGGCG GCTGAGGAAG TATGTTGCCC TGACCACAGA
CCACCCCTTC CTATACTTTT TAAGCCTCAG TTAACAAATT CCCTCTGTTA GGCAAACATC
CTCTGCAAGA TCGTCAACTT CCCCGAGTCC ATTACTCAGC CTGAAATTCT CCAAGAGATC
AGCCAGGCCA ACAATGACCC CTCAGTACAC GGCATCCTCG TCCAGTTACC CCTTCCCCAG
CACCTTTCCG AGCATGCGGT CACCTCCGCT GTAGCCGACG AGAAGGATGT CGATGGTTTC
GGAGCGATTA ATATTGGAGA GCTCGCCAAG CGTGGTGGTC GCCCGCTTTT TGTTCCTTGC
ACACCGAAGG CCGTAATGGT CCTTCTCAAG GCCAGTGGTG TCGACCCAGC GGGCAAAGAG
GCAGTTGTCC TTGGCCGCAG CGACATTGTT GGAAGCCCTG TTAGCTACCT TCTCAAGAAT
GCAGACGCGA CCGTCACTGT GTGCCATTCG AAGACCCCCG ATATTGCTAG CGCTGTAAAA
AAGGCGGATA TTGTTGTCGC GGCGATTGGT AAGACAGAGT TCGTCAAGGG CGACTGGATC
AAGCCAGGCG CCGTCGTTAT CGACGTTGGT ATCAACTACA AGCCTGATTC CACGAAGAAG
TCAGGACAGC GTTTGGTCGG TGACGTCGAG TACGAGTCGG CCTCCCAAGT GGCTTCAAAG
ATCACGCCTG TCCCCGGTGG TGTTGGGCCC ATGACAGTAG CTATGCTTCT GGAGAATGTT
GTTGCTTCGG CCAAAGCATA CTTTGAGAAA CAGAAGGAGC GACATATCAC CCCGCTCCCG
CTCAAGCTGG CCACCCCGGT TCCCTCAGAC ATCGCCATCT CCCGCTCGCA GTACCCTAAG
CCTATTACTC AAGTCGCGTC CGAGATCGGT ATCGCATCTC ACGAACTTGA GCCGTACGGT
CATACTAAGG CCAAAGTGAG CCTTGAAGTA CTTAATCGTT TGAGCCACCG CCGTAATGGC
CGCTACATCC TGGTCTGTGG TATCACTCCC ACTCCTCTAG GAGAGGGCAA GTCGACAACT
ACGTTGGGTC TCAGCCAGGC CCTAGGTGCA CACTTGAACC GTGTCGCTTT TGCCAACGTC
CGCCAGCCGA GCCAGGGTCC TACGTTCGGT ATCAAAGGTG GAGCCGCCGG TGGAGGCTAC
AGTCAGGTCA TTCCCATGGA TGAGTTCAAT CTGCATTTGA CTGGTGATAT TCACGCCATC
ACTGCCGCTA ACAACCTCCT TGCCGCTGCA ATCGAGACAC GTATGTTCCA CGAGGCTACC
CAGAAGGACG CCGCGCTGTA CAAGCGTCTC GTCCCAGAGA AGAAGGGCAA GCGCGAGTTC
AAGCCTATCA TGTTCAAGCG GCTAAAGAAG CTGGGAATCA ACAAGACCGA CCCCAACGAG
CTTACTGAAG AAGAAATCAA TCGGTTTGCC CGCCTTGACA TTGACCCTTC GACCATCACT
TGGCGCCGTG TTCTGGACGT CAACGATCGA CACCTTCGCG GAATCACCGT TGGACAGGCG
CCAACGGAGA AGGGACTAAC ACGTGAAACT GGGTTTGACA TCTCGGTTGC CAGTGAATGT
ATGGCAATTC TGGCCCTGAG CAGTGATCTC GCAGATATGC GGGAGAGACT TGGTCGTATG
GTTGTTGCTA CCTCGAAACG GGGAGAGCCG GTCACTTGCG ACGATATCGG TGCTGGGGGA
GCGCTTGCGG CGCTGATGAA GGACGCGATC AAGCCCAACT TGATGCAGAG TTTGGAAGGT
ACGCCTGTTC TAGTTCACGC CGGTCCCTTC GCCAACATCA GTATCGGAGC CAGTTCGGTC
CTTGCGGACC GGGTAGCACT GAAGCTGGCG GGTACCGAGC CCGAGGAGGA CCATGAAGCC
AAGACTGGTT TCGTTGTTAC AGAGGCTGGT TTCGACTTCA CCATGGGCGG AGAGCGCTTC
TTCAACATTA AGTGTCGGTC GTCTGGTCTT TCTCCTGACA CTGTAGTCAT TGTTGCTACT
GTGCGTGCCC TGAAAGTTCA CGGTGGTGGT CCTGAGATCA GCCCTGGAGC TCCACTACAC
GAGGTCTACC GCACAGAGAA CACCGAGATT CTCCGCAAGG GCTGTGTTAA CCTTAAGAAA
CACATTGAAA ATGCCCGGCA GTACGGAGTC CCCGTCGTGG TAGCTATCAA CCGCTTCGAG
ACCGACACCG AGGCGGAGAT CGCTATCATT CGCGAGGAGG CCATCTCGGC GGGTGCGGAG
GACGCAGTCT CCGCCAACCA CTGGGCCGAG GGCGGAGCCG GCGCCGTCGA CCTGGCCAAG
GCTGTCATCA TTGCTAGCTC CAAGCCAAAG GACTTTAAGC TGCTCTACGA TCTCAACGGC
AGTATCCAGG AGCGCATTGA GCGGATCGGT AAGGCCATGT ACGGTGCGGA GAAGGTGGAG
TTCAGCGAAC TCGCTCAGAA GAAGGTCGAC ACATACACTG CCCAAGGCTT CTCTAACCTC
CCGATCTGTA TCGCCAAAAC ACAGTACTCT CTCAGTCACG ACCCCGCGCT GAAGGGCGCT
CCGACTGGGT TTACCGTTCC CATCCGCGAT GTACGATTGG CTGTGGGCGG TGGATACCTG
TAAGTCCTGT CCCTAAGTTT TGTCACATAG TTTCGACTCA CCGATCATTA CTAGGTACGC
GCTCGCAGCG GACATCCAGA CGATCCCCGG GCTGCCGACC GCTCCTGGGT ACCTGAACGT
GGACATTGAC CCCGAGACCG GGGAGATCGA CGGGCTCTTC TAG
 
Protein sequence
MPYSPAYLAT VPRCAAPANS LGLQPSALRQ PPSRQFFSCK SSCPSLYTRL SSVRRGFLIE 
SCRRVPRRSP ESLRNFPALP CQRRTFSSTG VAMVAEKIDG TQIAKDIRAG LKDEIQKIQE
INPRFKPSLV IFQVGDRSDS STYVRMKLKA AEEANILCKI VNFPESITQP EILQEISQAN
NDPSVHGILV QLPLPQHLSE HAVTSAVADE KDVDGFGAIN IGELAKRGGR PLFVPCTPKA
VMVLLKASGV DPAGKEAVVL GRSDIVGSPV SYLLKNADAT VTVCHSKTPD IASAVKKADI
VVAAIGKTEF VKGDWIKPGA VVIDVGINYK PDSTKKSGQR LVGDVEYESA SQVASKITPV
PGGVGPMTVA MLLENVVASA KAYFEKQKER HITPLPLKLA TPVPSDIAIS RSQYPKPITQ
VASEIGIASH ELEPYGHTKA KVSLEVLNRL SHRRNGRYIL VCGITPTPLG EGKSTTTLGL
SQALGAHLNR VAFANVRQPS QGPTFGIKGG AAGGGYSQVI PMDEFNLHLT GDIHAITAAN
NLLAAAIETR MFHEATQKDA ALYKRLVPEK KGKREFKPIM FKRLKKLGIN KTDPNELTEE
EINRFARLDI DPSTITWRRV LDVNDRHLRG ITVGQAPTEK GLTRETGFDI SVASECMAIL
ALSSDLADMR ERLGRMVVAT SKRGEPVTCD DIGAGGALAA LMKDAIKPNL MQSLEGTPVL
VHAGPFANIS IGASSVLADR VALKLAGTEP EEDHEAKTGF VVTEAGFDFT MGGERFFNIK
CRSSGLSPDT VVIVATVRAL KVHGGGPEIS PGAPLHEVYR TENTEILRKG CVNLKKHIEN
ARQYGVPVVV AINRFETDTE AEIAIIREEA ISAGAEDAVS ANHWAEGGAG AVDLAKAVII
ASSKPKDFKL LYDLNGSIQE RIERIGKAMY GAEKVEFSEL AQKKVDTYTA QGFSNLPICI
AKTQYSLSHD PALKGAPTGF TVPIRDVRLA VGGGYLYALA ADIQTIPGLP TAPGYLNVDI
DPETGEIDGL F