Gene Information |
Locus tag | PICST_50410 |
Symbol | CPA2 |
ID | 4841148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 742848 |
End bp | 746294 |
Gene Length | 3447 bp |
Protein Length | 1148 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640392463 |
Product | Multifunctional pyrimidine synthesis protein CAD (includes carbamoyl-phosphate synthetase, aspartate transcarbamylase, and glutamine amidotransferase) |
Protein accession | XP_001386549 |
Protein GI | 126140054 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
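The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(742848, 746294)
print(length)                  # 3447
print(protein_length(length))  # 1148
```

Both values agree with the Gene Length (3447 bp) and Protein Length (1148 aa) fields of this record.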
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.280294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.615916 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACATT TGAAATCAGT GTTAACTCGT CAGTTAAGAG CAGGCTCGGT CAAGCCATTG AAAAACTATG GCTACAGCAG ATTTTCCACA TACAACTTCT TGAGATCTCA AGCAGAACCA AAGTATGAAG GTGCTGAATT ACTAAAGAAA TTCACAGATG AACACGCCCA CAAGTTGGTC GACGTATCCA AGGTTTTGGT TATTGGTTCT GGTGGTTTGT CTATTGGTCA AGCCGGTGAG TTCGACTACT CTGGTTCACA AGCCATCAAA GCATTGAAAG AAGCCAACAA GAAGTCGATT TTGATCAATC CTAATATCGC TACCAACCAG ACTTCTCATT CTTTGGCCGA CGAAATCTAC TACTTGCCAG TTACTGCTGA ATACATTACT TACATTATAG AAAGAGAAAG ACCAGATGGT ATCTTGTTAA CTTTCGGTGG TCAAACAGGT TTGAACGTCG GTGTCAAGTT GGACAAAATG GGTGTCTTTG AAAGATATGG TGTCAAGGTG TTGGGTACTC CAATCAAGAC ATTGGAAACT TCTGAAGATC GTGATTTGTT TGCTCAAGCC TTGAAGGAAA TCAACATTCC TATCGCTGAG TCTATTGCTG TTGAAACTGT CGACGATGCC TTGGACGCTG CCAAGAGTGT CGGTTACCCT ATTATTGTTA GATCTGCTTA TTCCCTTGGT GGTTTAGGTT CTGGTTTCGC TGCTAACGAA ACCGAATTGA GAAACTTGGC CGCTCAATCT TTGTCTTTGG CTCCACAAAT CTTGGTCGAA AAGTCCTTGA AGGGTTGGAA GGAAGTCGAA TACGAAGTAG TCCGTGACCG TGTTGGTAAC TGTATCACCG TTTGTAACAT GGAAAACTTC GATCCATTGG GTATCCATAC TGGTGACTCT ATCGTCGTTG CTCCATCTCA AACTTTGTCT GATGAAGAAT ATCATATGTT AAGATCTGCT GCTATCAAGA TTATCAGACA TTTGGGTGTT GTTGGTGAAT GTAATGTTCA GTATGCTTTG CAGCCAGATG GATTGGACTA CAGAGTCATT GAAGTCAATG CTCGTTTGTC TCGTTCTTCT GCTTTGGCTT CCAAGGCTAC TGGTTATCCA TTGGCATACA CAGCTGCCAA GATTGCTTTG GGCCACACCT TGCCTGAATT GCCAAACCCT GTTACTAAGA CTACTTCTGC TAACTTTGAA CCATCTTTGG ATTACATGGT CACCAAGATC CCAAGATGGG ATTTGGCTAA GTTCCAACAT GTCAAGAGAG ATATTGGTTC TGCCATGAAG TCTGTTGGAG AAGTTATGGC TATCGGTAGA AACTTTGAAG AATCATTCCA GAAGGCTATC AGACAAATCG ACCCATCCTA CATCGGTTTC CAAGGTGACC ATTTTGAAGA CTTGGACTTT GTCTTGGCCA ACCCTACTGA CAGAAGATGG TTAGCTGTTG GACAAGCTTT GCTTCACGAA AACTACTCGG TAGATAAGGT CCATGACTTA ACCAAGATTG ACAAATGGTT CTTATATAAG TTGATGAACA TTGTCAACAT GTACAGAGAA TTGGAAGCTG CTGGATCCTT GAGCCAAATT AACAGTGACT TGATGTCTCG TGCTAAGAAG TTAGGATTTT CTGATAAACA AATTGGTCTT TGTGTTGGAT CCAAGGAATT GGACGTTAGA GCTGTTAGAA AGGCTTTTGG TATTATTCCA TATGTTAAGA AGATTGACAC TTTAGCTGCT GAATTCCCTG CCAATACCAA CTATTTGTAT ACTACATACA ACGCTACCTC TTCTGATGTG 
GAGTTCAACG AAAACGGTAC TTTGGTCTTG GGTTCTGGTG TTTACCGTAT TGGTTCCTCT GTCGAATTCG ACTGGTGTGC TGTTTCCACT GCTCGTGCTT TGAGAGACTC TGGTCGCAAG ACCATTATGA TCAACTACAA CCCGGAAACT GTATCTACTG ATTTCGATGA AGTTGACAGA TTGTACTTTG AAGAATTATC CTTAGAAAGA GTTTTGGATA TCTACGAACT CGAACACTCC GAAGGTGTTG TCGTCTCTGT TGGTGGTCAA TTACCACAAA ACATTGCCCT TAGCTTACAA AAGGAAGGTT GTAATGTATT GGGTACTAAC CCAGAAGACA TTGACAAGGC TGAAGATCGT CACAAGTTCT CTCAAATCTT AGATTCTATT GGGGTTGATC AACCACAATG GAAGGAATTG ACATCCCTCG CTGAAGCTGA AATTTTTGCT AACGAGGTTG GCTACCCAGT TTTGGTCCGT CCATCTTACG TCTTATCAGG TGCTGCTATG TCTGTTATCA ACAACCAGGC AGAGTTGGAC TCTAAATTGT CTAACGCTGC AAAGGTTTCC CAGGACCATC CAGTTGTCAT CTCCAAGTTC ATTGAAGGTG CTCAAGAAAT TGATATTGAT GGTGTTGCCA GCGAAGGTCA AGTTTTGGTA CATGCTGTTT CTGAACACGT CGAAAATGCC GGTGTCCACT CTGGTGATGC CACTTTAGTT TTGCCACCAC AAGATTTGTC TCCAGTTATC ATGGACAGAT TGAAGGTTAT TGCCGACAAG GTTGCTGAAG CCTGGAAGAT CACTGGTCCA TTCAACATGC AAATCATCAA GAACGACCAA AACGGAACCT TGGACGACGC AAACTGTGAA TTGAAGGTTA TTGAATGTAA TATCAGAGCC TCTAGATCTT TTCCATTTGT TTCCAAGGTT TTGGGTGTCA ACTTCATTGA CGTTGCTGTT AAGGCTTTGA TTAAGGAAGG TGTTCCAACT CCTGTTAATT TGATGAACAA AAAGTATGAT AGGGTTGCTA CCAAGGTTCC ACAATTCTCT TTCACCAGGT TGGCTGGTGC CGACCCATTC TTGGGTGTTG AGATGGCCTC TACTGGTGAA GTTGCCTGTT TCGGAAAGGA CAAGGTGGAA GCTTACTGGA CTTCTATGCA ATCTACGATG AACTTTAACG TTCCTCAAGC CGGACAAGGT ATCTTGTTTG GTGGTGACTT GACCAACGAC AAGTTGGGCA AGGTTGCTGA AACACTCTCT GGTTTGGGTT ACAACTTCTT CAGTTGTAGT GAGGAAGTCG CTAAGTACTT GAAGAACTTC GTTGAAGAAC AAGTTACTGT CATTGAATTC CCAAAGACAG ACAAGAGAGC TTTGCGTGAA ATCTTCCAAA AGCACAAGAT CGGTGGTGTT TTCAACTTGG CCAGAGCAAG AGCTGAAGAT TTGTTGGATG AAGACTACGT TATGAGAAGA AATGCCATCG ACTTTGCCAT TCCATTATTT AACGAGCCAA ACACCTCATT ATTATTTGCT CAATGTTTGA AGAGCAACAT CGCTAACAAG CAACCTTTTG ACGTTATTCC TGAAAACGTT GTCATTCCAT CTGAAGTCAG AAGATGGAGT GAGTTCATTG GTGGTAAGCC AGTATAA
|
Protein sequence | MIHLKSVLTR QLRAGSVKPL KNYGYSRFST YNFLRSQAEP KYEGAELLKK FTDEHAHKLV DVSKVLVIGS GGLSIGQAGE FDYSGSQAIK ALKEANKKSI LINPNIATNQ TSHSLADEIY YLPVTAEYIT YIIERERPDG ILLTFGGQTG LNVGVKLDKM GVFERYGVKV LGTPIKTLET SEDRDLFAQA LKEINIPIAE SIAVETVDDA LDAAKSVGYP IIVRSAYSLG GLGSGFAANE TELRNLAAQS LSLAPQILVE KSLKGWKEVE YEVVRDRVGN CITVCNMENF DPLGIHTGDS IVVAPSQTLS DEEYHMLRSA AIKIIRHLGV VGECNVQYAL QPDGLDYRVI EVNARLSRSS ALASKATGYP LAYTAAKIAL GHTLPELPNP VTKTTSANFE PSLDYMVTKI PRWDLAKFQH VKRDIGSAMK SVGEVMAIGR NFEESFQKAI RQIDPSYIGF QGDHFEDLDF VLANPTDRRW LAVGQALLHE NYSVDKVHDL TKIDKWFLYK LMNIVNMYRE LEAAGSLSQI NSDLMSRAKK LGFSDKQIGL CVGSKELDVR AVRKAFGIIP YVKKIDTLAA EFPANTNYLY TTYNATSSDV EFNENGTLVL GSGVYRIGSS VEFDWCAVST ARALRDSGRK TIMINYNPET VSTDFDEVDR LYFEELSLER VLDIYELEHS EGVVVSVGGQ LPQNIALSLQ KEGCNVLGTN PEDIDKAEDR HKFSQILDSI GVDQPQWKEL TSLAEAEIFA NEVGYPVLVR PSYVLSGAAM SVINNQAELD SKLSNAAKVS QDHPVVISKF IEGAQEIDID GVASEGQVLV HAVSEHVENA GVHSGDATLV LPPQDLSPVI MDRLKVIADK VAEAWKITGP FNMQIIKNDQ NGTLDDANCE LKVIECNIRA SRSFPFVSKV LGVNFIDVAV KALIKEGVPT PVNLMNKKYD RVATKVPQFS FTRLAGADPF LGVEMASTGE VACFGKDKVE AYWTSMQSTM NFNVPQAGQG ILFGGDLTND KLGKVAETLS GLGYNFFSCS EEVAKYLKNF VEEQVTVIEF PKTDKRALRE IFQKHKIGGV FNLARARAED LLDEDYVMRR NAIDFAIPLF NEPNTSLLFA QCLKSNIANK QPFDVIPENV VIPSEVRRWS EFIGGKPV
|
| |
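The gene and protein sequences above can be related through the listed translation table. The following is a minimal sketch, assuming translation table 12 refers to the NCBI alternative yeast nuclear code, which differs from the standard code in reading CTG as serine. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first three 10-base blocks and the final 15 bases of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Assumed: NCBI translation table 12, i.e. the standard code with CTG -> Ser.
# Amino acids are listed in TCAG order, third codon position varying fastest.
TABLE_12_AA = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), TABLE_12_AA)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_blocks = clean("ATGATACATT TGAAATCAGT GTTAACTCGT")
print(translate(first_blocks))               # MIHLKSVLTR
print(translate(clean("GGTAAGCC AGTATAA")))  # GKPV
```

The two outputs match the first ten and last four residues of the protein sequence in this record. Passing the complete cleaned gene sequence to `gc_percent` would give a value to compare against the GC content field.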