Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_00648 |
Symbol | trpC |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001308 |
Strand | + |
Start bp | 2846579 |
End bp | 2848629 |
Gene Length | 2051 bp |
Protein Length | 664 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | Anthranilate synthase component 2 (EC 4.1.3.27)(Anthranilate synthase component II) [Includes Glutamine amidotransferase;Indole-3-glycerol phosphate synthase(IGPS)(EC 4.1.1.48);N-(5'-phosphoribosyl)anthranilate isomerase(PRAI)(EC 5.3.1.24)] [Source:UniProtKB/Swiss-Prot;Acc:P06531] |
Protein accession | CBF89063 |
Protein GI | 259489085 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCAGC AATGCATCAT CCACTCGTTC GGCGGTAAAG TCGATGTTAC CGGAGAGATT CTGCATGGTA AGACGTCAGT CCTGAAACAC GACGGAAGAG GTGCCTACGA AGGTCTTCCC CCTTCGGTTA TCATTACCAG ATATCACTCT CTGGCAGGAA CGCACTCTAC CATACCCGAG TGCCTGGAAG TGAGCTCTTT TGCTCAATTG GGTGAGGATG CCGACAAGAC TGTCATTATG GGCGTGCGAC ACAAGCAGTT TGCGGTGGAA GGCGTTCAGT TTCACCCTGA GAGTATTCTG ACAGAACATG GTCAAACGAT GTTTAGAAAC TTCCTTAAGC TTACTGCCGG CACCTGGGAA GGCAATGGAA AGGATGTTGC TCAGGGAGGC AATTTCACCG CCGCCGCTCC CAACCCTCCG AAGGCTACTA AGAAGGTGTC AATCTTGGAG AAAATTTATG ATCACCGGAG AGCGGCTGTT GCTAAGCAGA AAACTATTCC GTCTCAGCGC CCATCTGATC TTCAGGCTGC CTATGAACTC AGCGTTGCTC CGCCTCAAAT ATCGTTTCCT GATCGCCTTA GACAATCAGC GTATCCTTTG TCTTTAATGG CCGAAATAAA GCGCGCGTCT CCATCAAAGG GCCTGATAGC AGAGCATGCA TGTGCTCCTG CACAAGCCCG GCAATATGCT AAGGCTGGTG CCAGTGTTAT CTCCGTGCTT ACTGAGCCCG AATGGTTCAA GGGCAGTATC GACGACCTAC GTGCGGTGCG TGCAAGTCTA GAGGGACTGA CTAACAGACC TGCTATCCTG CGAAAGGAAT TCATTTTCGA TGAATATCAG ATTTTGGAAG CTCGTCTGGC GGGCGCAGAT ACCGTCCTAC TAATTGTGAA AATGCTTGAC ACTGAGCTTC TCACAAGACT TTATCTTTAT TCCCAGAGCC TTGGGATGGA ACCCCTTGTT GAGGTCAATA CACCGGATGA GATGAAGATC GCAGTGGACC TTGGCGCTCA AGTGATTGGC GTCAACAACA GAGACCTAAC AAGCTTTGAA GTTGATCTTG GCACTACCAG CCGGCTCATG GACCAGGTAC CCGAGAGCAC TATTGTTTGT GCACTTAGCG GTATTTCTGG ACCCAAGGAT GTAGAAGCCT ACAAGAAGGA TGGTGTGAAA GCTATTCTCG TCGGAGAGGC CCTTATGCGT GCGCCAGACA CTGCAGCATT CGTCGCTGAG CTTCTGGGAG GACAATCCAA GAAGCTTCCT CTGCAGTCAC GAAATTCACC TCTGGTCAAG ATCTGTGGTA CAAGGACTGA AGAAGGGGCG CGCGCGGCTA TTGAAGCAGG CGCGGATTTA ATTGGCATTA TTCTCGTCGA AGGCCGAAAG CGCACTGTTC CCGATGATGT TGCGTTGCAA ATCTCGAAAG TTGTAAAGTC CACCCCGAGG CCCACTCCTT ATCCAACTGA GGTGCCACAA GGAGATACGG ACGCTACCTC TGTTGACTAT TTCGACCATT CCGCCACGAC TTTGCGGCAC CCCACTCGTG CTTTGTTGGT GGGTGTATTC CTCAATCAGC CCCTATCGTA TGTCCTAGCC CAGCAGCAAA AGCTGGGCCT TGACGTGGTT CAATTGCACG GTTCCGAACC GCTTGAATGG TCTAGGTTAA TACCGGTTCC CGTTATCCGC AAATTTGGGC TTGATGAGTT CGGCATCGCC CGAAGGGCAT ATCACACGCT GCCGCTGCTG GATTCCGGAG CTGGCGGCTC TGGAGAACTC TTGGATCAGA TGCGCGTTAA GCAGATTCTA AAGTCTGATG ACGGATTGCG GGTCATTTTA GCTGGTGGCC TGGATCCACT TAACGTTACT GAAATCATCA AACAGCTTGA CGAATCTGGA TATAAGATCG TTGGTGTCGA TGTCAGCTCC GGAGTTGAGA CAAATGGTGT TCAGGATCTC GATAAGATAC GTTCATTTGT CCAAGCAGCA AAGAGTGCCT TCTAGTGATT TAATAGCTCC ATGTCAACAA GAATAAAACG CGTTTCGGGT TTACCTCTTC C
|
Protein sequence | MGQQCIIHSF GGKVDVTGEI LHGKTSVLKH DGRGAYEGLP PSVIITRYHS LAGTHSTIPE CLEVSSFAQL GEDADKTVIM GVRHKQFAVE GVQFHPESIL TEHGQTMFRN FLKLTAGTWE GNGKDVAQGG NFTAAAPNPP KATKKVSILE KIYDHRRAAV AKQKTIPSQR PSDLQAAYEL SVAPPQISFP DRLRQSAYPL SLMAEIKRAS PSKGLIAEHA CAPAQARQYA KAGASVISVL TEPEWFKGSI DDLRAVRASL EGLTNRPAIL RKEFIFDEYQ ILEARLAGAD TVLLIVKMLD TELLTRLYLY SQSLGMEPLV EVNTPDEMKI AVDLGAQVIG VNNRDLTSFE VDLGTTSRLM DQVPESTIVC ALSGISGPKD VEAYKKDGVK AILVGEALMR APDTAAFVAE LLGGQSKKLP LQSRNSPLVK ICGTRTEEGA RAAIEAGADL IGIILVEGRK RTVPDDVALQ ISKVVKSTPR PTPYPTEVPQ GDTDATSVDY FDHSATTLRH PTRALLVGVF LNQPLSYVLA QQQKLGLDVV QLHGSEPLEW SRLIPVPVIR KFGLDEFGIA RRAYHTLPLL DSGAGGSGEL LDQMRVKQIL KSDDGLRVIL AGGLDPLNVT EIIKQLDESG YKIVGVDVSS GVETNGVQDL DKIRSFVQAA KSAF
|
| |