Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_00797 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001308 |
Strand | + |
Start bp | 2398134 |
End bp | 2400943 |
Gene Length | 2810 bp |
Protein Length | 867 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | hypothetical histidine biosynthesis trifunctional protein (Eurofung) |
Protein accession | CBF88763 |
Protein GI | 259488921 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.342936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTTTGCCCTT GTTAAGCTCT CTCCAGTTGG GCACTGGACT CTTCTCTATC TCACCTCTAT TTTCTCTCTT TTCTCTTGAC CCCATTGACC TGATTGACCT CCGAAACATG GCCACTCCCT TCCTCGTCTC CTACGACCCC GCCTCCGCGT CCGGCGGCCT CTCCCTCCAG CAGATCGCCT ACTTCGGCCG CGTTCTGATC AAGGCTACCG ACCTCGCCCA GGCTGAGACC TTTATCCGAC AAAATTTCCG CCTACTCGAT ATCTACGTCG ACGCAACTGG CATCTCCGCA ACGGGCGATC TTGTCGACAT CCTCAACGCC GGCGCGGCCA AGATCTTCAT CTCCCTTGAC CAATTGAATG CCCTCTCCGA AGAACAATCC GTCCCCTCGT CACGGCTCGT TGTCTACACT TCTTCCAACG ACCAAGTGGA AGCGTTTCAG AAATGGGTGG TTAAGCACAT TGAGCGCGAA GAGGCCGGCC TGTGCACGGA CTCGGCCGTT GTCCACTCTA TCTCTGTGAA GCTCGGACTG AACCCGGAAG CCCAGCTTCT CTACCGTACA TATTCTGGAG ACGTGACCGA GGATGCGGTC AAGGATACAA TGAAGCAGGG AGGTGTCAGT ATTGTTCCTG CGGCCGCTCT GACTATCAGC CGCGAGGAGT CCAGTGGGAA GATCCAGGCG GGTTCTTTGA TTGCTGCGCG GGGTGTCAAG GACCAGGGTA ATGGTCTGTA TGCCACAACA GTAACGGACG AGAGGGGTAC TTGCTTGGGG TTTGTGTGGA GTAGCGACGA GAGTATCGCG GAGGCTCTGC GTACAGGCAC CGGTGTCTAC CAGAGCCGGA AGCGTGGTCT GTGGTACAAG GGTCAATCCA GCGGTGACGT ACAGGAGTTG ATCCGCATCG GATTTGACTG CGACAGCGAC TGCTTGGTTT TCATCGTGAA GCAGATCGGA AGAGGTAGGA CGTCTCGTCA CTTACGTACA CCTAGTGCTG ACTTTCAGGT TTCTGCCACC TCGGCACCGC CAGCTGTTTC GGACCTTACA CCGGTTTATC ACGCCTCCAA AAGACGCTAC AAGCCCGCAA GGCCGATGCC CCGGCCGGCT CGTACACCGC GCGACTGTTC AACGAGCCTA AGCTTACACA AGCCAAGATC ATGGAAGAGG CTGACGAGTT GTGTCGTGCG GAAACAAAGG AGGATATCGC TTTCGAAGCA GCCGATCTTT TGTACTTTGC GCTCACCCGC TGTGTTGCCG CCGGTGTCAG CCTTGAGGAT GTCGAGAGAA ACCTTGACTT GAAGAGCCTA AAGGTGAAGC GGAGAAAGGG TGACGCCAAG GGCCCTTGGG CAGAGAAGGC TGGTCTTGCC GAGAAGCCTG CTGAAGCGAA GCCTGCTCCG AAGCCAGAGG AGCCAAAGGA AGACACGTCT CGGATCGAGA TGACCCGTGT CGCTACCGCT TCGACGCCGG CGGAGAAGGT CCAGGAGTAC CTCAAGCGGC CATCGCAAAA GTCAAACGAC GCCATTGTCG GCCTTGTCAA GCCCATCATT CAGGATGTCC GTGAGCAGGG TGATGCTGGT GTTCTTAAGT ACACACATAA GTTCGAAAAG GCCACATCTT TGACCTCACC TGTCCTCAAA GCACCGTTCC CGGCCGAGCT GATGAAGCTG TCACCAGAAG TTCAGGAGGC GATTGATGTC AGTATTTCCA ACATTGCCAG ATTCCACAGC GCCCAGAAAG GCAGCAATGA TGCATTGTCG ATGGAGACCA TGCCTGGCGT AGTCTGCTCC CGCTTCTCGC GGCCCATTGA GCGTGTTGGT TGCTACATTC CCGGTGGAAC GGCCGTGCTG CCATCTACTG CAATGATGCT TGGTGTTCCC GCCATGGTTG CCGGCTGCAA GAAGATCGTC TTTGCCTCTC CACCTCGTGC CGACGGCAGC ATCACCCCCG AGATTGTCTA TGTCGCACAC AAGGTTGGAG CGGAGAGCAT CGTCCTTGCT GGAGGCGCGC AGGCTGTAGC AGCCATGGCC TACGGTACTG AGAGCGTCAG CAAGGTCGAC AAGATTTTGG GACCTGGTAA TCAGTTCGTG ACCGCCGCCA AGATGTTGGT TTCCAACGAT ACCTCCGCTG GTGTCAGCAT CGACATGCCT GCCGGACCTA GTGAGGTTCT CGTCATTGCC GACAAGGCCG CCAACCCCGC CTTCGTTGCT TCAGACCTTC TCAGCCAAGC AGAACACGGT GTCGATTCCC AGGTCATTCT CATCGCGATT GACCTGAACG AGCAGGAACT GAAAGCCATT GAAGATGAGG TAGATCGCCA CGCCCGTGCT CTTCCTCGCA TGGACATCGT CCGTGGATCC CTCGCACACT CCGTCACCTT TGTTGTTAGG GACCTTGATG AGGCAATGGC TCTGAGCAAC GATTACGCTC CTGAGCATCT CATCCTGCAA ATCCAGAATG CAGAGGCCGC TGTCGAGAAG GTCCAAAATG CGGGATCTGT TTTCATCGGA CAGTGGACGC CTGAGAGTGT GGGTGACTAC TCTGCTGGTG TCAACCACTC ATTGCGTACG TTCCCAACTC CTATGATTCT TTTACCACCG ATTCTAACTT CTCTTCCTAG CAACATATGG CTACGCCAAG CAGTACTCCG GAGTCAACCT TGGCTCCTTC CTCAAGCACA TCACCTCCTC AAACCTAACG GCGGATGGTC TTCTGCGTCT GTCCAAGACT GTCGAGACGC TCGCGGCTGT GGAGGGATTA GATGCCCACA AGCGGGCAGT GAGCATCCGT GTTGCGGCTA TGAAGCAGGA GCAGTTGTAG
|
Protein sequence | MATPFLVSYD PASASGGLSL QQIAYFGRVL IKATDLAQAE TFIRQNFRLL DIYVDATGIS ATGDLVDILN AGAAKIFISL DQLNALSEEQ SVPSSRLVVY TSSNDQVEAF QKWVVKHIER EEAGLCTDSA VVHSISVKLG LNPEAQLLYR TYSGDVTEDA VKDTMKQGGV SIVPAAALTI SREESSGKIQ AGSLIAARGV KDQGNGLYAT TVTDERGTCL GFVWSSDESI AEALRTGTGV YQSRKRGLWY KGQSSGDVQE LIRIGFDCDS DCLVFIVKQI GRGFCHLGTA SCFGPYTGLS RLQKTLQARK ADAPAGSYTA RLFNEPKLTQ AKIMEEADEL CRAETKEDIA FEAADLLYFA LTRCVAAGVS LEDVERNLDL KSLKVKRRKG DAKGPWAEKA GLAEKPAEAK PAPKPEEPKE DTSRIEMTRV ATASTPAEKV QEYLKRPSQK SNDAIVGLVK PIIQDVREQG DAGVLKYTHK FEKATSLTSP VLKAPFPAEL MKLSPEVQEA IDVSISNIAR FHSAQKGSND ALSMETMPGV VCSRFSRPIE RVGCYIPGGT AVLPSTAMML GVPAMVAGCK KIVFASPPRA DGSITPEIVY VAHKVGAESI VLAGGAQAVA AMAYGTESVS KVDKILGPGN QFVTAAKMLV SNDTSAGVSI DMPAGPSEVL VIADKAANPA FVASDLLSQA EHGVDSQVIL IAIDLNEQEL KAIEDEVDRH ARALPRMDIV RGSLAHSVTF VVRDLDEAMA LSNDYAPEHL ILQIQNAEAA VEKVQNAGSV FIGQWTPESV GDYSAGVNHS LPTYGYAKQY SGVNLGSFLK HITSSNLTAD GLLRLSKTVE TLAAVEGLDA HKRAVSIRVA AMKQEQL
|
| |