Gene PICST_50410 details


Gene Information

Locus tag: PICST_50410
Symbol: CPA2
ID: 4841148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 742848
End bp: 746294
Gene Length: 3447 bp
Protein Length: 1148 aa
Translation table: 12
GC content: 42%
IMG OID: 640392463
Product: Multifunctional pyrimidine synthesis protein CAD (includes carbamoyl-phosphate synthetase, aspartate transcarbamylase, and glutamine amidotransferase)
Protein accession: XP_001386549
Protein GI: 126140054
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
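
The coordinate and length fields in this table can be cross-checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon that is not counted in the protein length; both are assumptions about the record's conventions, not something the record states.

```python
# Values copied from the Gene Information table.
start_bp = 742848
end_bp = 746294
gene_length_bp = 3447
protein_length_aa = 1148

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends with one stop codon that is
# not counted in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```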


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.280294
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.615916
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATACATT TGAAATCAGT GTTAACTCGT CAGTTAAGAG CAGGCTCGGT CAAGCCATTG 
AAAAACTATG GCTACAGCAG ATTTTCCACA TACAACTTCT TGAGATCTCA AGCAGAACCA
AAGTATGAAG GTGCTGAATT ACTAAAGAAA TTCACAGATG AACACGCCCA CAAGTTGGTC
GACGTATCCA AGGTTTTGGT TATTGGTTCT GGTGGTTTGT CTATTGGTCA AGCCGGTGAG
TTCGACTACT CTGGTTCACA AGCCATCAAA GCATTGAAAG AAGCCAACAA GAAGTCGATT
TTGATCAATC CTAATATCGC TACCAACCAG ACTTCTCATT CTTTGGCCGA CGAAATCTAC
TACTTGCCAG TTACTGCTGA ATACATTACT TACATTATAG AAAGAGAAAG ACCAGATGGT
ATCTTGTTAA CTTTCGGTGG TCAAACAGGT TTGAACGTCG GTGTCAAGTT GGACAAAATG
GGTGTCTTTG AAAGATATGG TGTCAAGGTG TTGGGTACTC CAATCAAGAC ATTGGAAACT
TCTGAAGATC GTGATTTGTT TGCTCAAGCC TTGAAGGAAA TCAACATTCC TATCGCTGAG
TCTATTGCTG TTGAAACTGT CGACGATGCC TTGGACGCTG CCAAGAGTGT CGGTTACCCT
ATTATTGTTA GATCTGCTTA TTCCCTTGGT GGTTTAGGTT CTGGTTTCGC TGCTAACGAA
ACCGAATTGA GAAACTTGGC CGCTCAATCT TTGTCTTTGG CTCCACAAAT CTTGGTCGAA
AAGTCCTTGA AGGGTTGGAA GGAAGTCGAA TACGAAGTAG TCCGTGACCG TGTTGGTAAC
TGTATCACCG TTTGTAACAT GGAAAACTTC GATCCATTGG GTATCCATAC TGGTGACTCT
ATCGTCGTTG CTCCATCTCA AACTTTGTCT GATGAAGAAT ATCATATGTT AAGATCTGCT
GCTATCAAGA TTATCAGACA TTTGGGTGTT GTTGGTGAAT GTAATGTTCA GTATGCTTTG
CAGCCAGATG GATTGGACTA CAGAGTCATT GAAGTCAATG CTCGTTTGTC TCGTTCTTCT
GCTTTGGCTT CCAAGGCTAC TGGTTATCCA TTGGCATACA CAGCTGCCAA GATTGCTTTG
GGCCACACCT TGCCTGAATT GCCAAACCCT GTTACTAAGA CTACTTCTGC TAACTTTGAA
CCATCTTTGG ATTACATGGT CACCAAGATC CCAAGATGGG ATTTGGCTAA GTTCCAACAT
GTCAAGAGAG ATATTGGTTC TGCCATGAAG TCTGTTGGAG AAGTTATGGC TATCGGTAGA
AACTTTGAAG AATCATTCCA GAAGGCTATC AGACAAATCG ACCCATCCTA CATCGGTTTC
CAAGGTGACC ATTTTGAAGA CTTGGACTTT GTCTTGGCCA ACCCTACTGA CAGAAGATGG
TTAGCTGTTG GACAAGCTTT GCTTCACGAA AACTACTCGG TAGATAAGGT CCATGACTTA
ACCAAGATTG ACAAATGGTT CTTATATAAG TTGATGAACA TTGTCAACAT GTACAGAGAA
TTGGAAGCTG CTGGATCCTT GAGCCAAATT AACAGTGACT TGATGTCTCG TGCTAAGAAG
TTAGGATTTT CTGATAAACA AATTGGTCTT TGTGTTGGAT CCAAGGAATT GGACGTTAGA
GCTGTTAGAA AGGCTTTTGG TATTATTCCA TATGTTAAGA AGATTGACAC TTTAGCTGCT
GAATTCCCTG CCAATACCAA CTATTTGTAT ACTACATACA ACGCTACCTC TTCTGATGTG
GAGTTCAACG AAAACGGTAC TTTGGTCTTG GGTTCTGGTG TTTACCGTAT TGGTTCCTCT
GTCGAATTCG ACTGGTGTGC TGTTTCCACT GCTCGTGCTT TGAGAGACTC TGGTCGCAAG
ACCATTATGA TCAACTACAA CCCGGAAACT GTATCTACTG ATTTCGATGA AGTTGACAGA
TTGTACTTTG AAGAATTATC CTTAGAAAGA GTTTTGGATA TCTACGAACT CGAACACTCC
GAAGGTGTTG TCGTCTCTGT TGGTGGTCAA TTACCACAAA ACATTGCCCT TAGCTTACAA
AAGGAAGGTT GTAATGTATT GGGTACTAAC CCAGAAGACA TTGACAAGGC TGAAGATCGT
CACAAGTTCT CTCAAATCTT AGATTCTATT GGGGTTGATC AACCACAATG GAAGGAATTG
ACATCCCTCG CTGAAGCTGA AATTTTTGCT AACGAGGTTG GCTACCCAGT TTTGGTCCGT
CCATCTTACG TCTTATCAGG TGCTGCTATG TCTGTTATCA ACAACCAGGC AGAGTTGGAC
TCTAAATTGT CTAACGCTGC AAAGGTTTCC CAGGACCATC CAGTTGTCAT CTCCAAGTTC
ATTGAAGGTG CTCAAGAAAT TGATATTGAT GGTGTTGCCA GCGAAGGTCA AGTTTTGGTA
CATGCTGTTT CTGAACACGT CGAAAATGCC GGTGTCCACT CTGGTGATGC CACTTTAGTT
TTGCCACCAC AAGATTTGTC TCCAGTTATC ATGGACAGAT TGAAGGTTAT TGCCGACAAG
GTTGCTGAAG CCTGGAAGAT CACTGGTCCA TTCAACATGC AAATCATCAA GAACGACCAA
AACGGAACCT TGGACGACGC AAACTGTGAA TTGAAGGTTA TTGAATGTAA TATCAGAGCC
TCTAGATCTT TTCCATTTGT TTCCAAGGTT TTGGGTGTCA ACTTCATTGA CGTTGCTGTT
AAGGCTTTGA TTAAGGAAGG TGTTCCAACT CCTGTTAATT TGATGAACAA AAAGTATGAT
AGGGTTGCTA CCAAGGTTCC ACAATTCTCT TTCACCAGGT TGGCTGGTGC CGACCCATTC
TTGGGTGTTG AGATGGCCTC TACTGGTGAA GTTGCCTGTT TCGGAAAGGA CAAGGTGGAA
GCTTACTGGA CTTCTATGCA ATCTACGATG AACTTTAACG TTCCTCAAGC CGGACAAGGT
ATCTTGTTTG GTGGTGACTT GACCAACGAC AAGTTGGGCA AGGTTGCTGA AACACTCTCT
GGTTTGGGTT ACAACTTCTT CAGTTGTAGT GAGGAAGTCG CTAAGTACTT GAAGAACTTC
GTTGAAGAAC AAGTTACTGT CATTGAATTC CCAAAGACAG ACAAGAGAGC TTTGCGTGAA
ATCTTCCAAA AGCACAAGAT CGGTGGTGTT TTCAACTTGG CCAGAGCAAG AGCTGAAGAT
TTGTTGGATG AAGACTACGT TATGAGAAGA AATGCCATCG ACTTTGCCAT TCCATTATTT
AACGAGCCAA ACACCTCATT ATTATTTGCT CAATGTTTGA AGAGCAACAT CGCTAACAAG
CAACCTTTTG ACGTTATTCC TGAAAACGTT GTCATTCCAT CTGAAGTCAG AAGATGGAGT
GAGTTCATTG GTGGTAAGCC AGTATAA
 
Protein sequence
MIHLKSVLTR QLRAGSVKPL KNYGYSRFST YNFLRSQAEP KYEGAELLKK FTDEHAHKLV 
DVSKVLVIGS GGLSIGQAGE FDYSGSQAIK ALKEANKKSI LINPNIATNQ TSHSLADEIY
YLPVTAEYIT YIIERERPDG ILLTFGGQTG LNVGVKLDKM GVFERYGVKV LGTPIKTLET
SEDRDLFAQA LKEINIPIAE SIAVETVDDA LDAAKSVGYP IIVRSAYSLG GLGSGFAANE
TELRNLAAQS LSLAPQILVE KSLKGWKEVE YEVVRDRVGN CITVCNMENF DPLGIHTGDS
IVVAPSQTLS DEEYHMLRSA AIKIIRHLGV VGECNVQYAL QPDGLDYRVI EVNARLSRSS
ALASKATGYP LAYTAAKIAL GHTLPELPNP VTKTTSANFE PSLDYMVTKI PRWDLAKFQH
VKRDIGSAMK SVGEVMAIGR NFEESFQKAI RQIDPSYIGF QGDHFEDLDF VLANPTDRRW
LAVGQALLHE NYSVDKVHDL TKIDKWFLYK LMNIVNMYRE LEAAGSLSQI NSDLMSRAKK
LGFSDKQIGL CVGSKELDVR AVRKAFGIIP YVKKIDTLAA EFPANTNYLY TTYNATSSDV
EFNENGTLVL GSGVYRIGSS VEFDWCAVST ARALRDSGRK TIMINYNPET VSTDFDEVDR
LYFEELSLER VLDIYELEHS EGVVVSVGGQ LPQNIALSLQ KEGCNVLGTN PEDIDKAEDR
HKFSQILDSI GVDQPQWKEL TSLAEAEIFA NEVGYPVLVR PSYVLSGAAM SVINNQAELD
SKLSNAAKVS QDHPVVISKF IEGAQEIDID GVASEGQVLV HAVSEHVENA GVHSGDATLV
LPPQDLSPVI MDRLKVIADK VAEAWKITGP FNMQIIKNDQ NGTLDDANCE LKVIECNIRA
SRSFPFVSKV LGVNFIDVAV KALIKEGVPT PVNLMNKKYD RVATKVPQFS FTRLAGADPF
LGVEMASTGE VACFGKDKVE AYWTSMQSTM NFNVPQAGQG ILFGGDLTND KLGKVAETLS
GLGYNFFSCS EEVAKYLKNF VEEQVTVIEF PKTDKRALRE IFQKHKIGGV FNLARARAED
LLDEDYVMRR NAIDFAIPLF NEPNTSLLFA QCLKSNIANK QPFDVIPENV VIPSEVRRWS
EFIGGKPV
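
The protein sequence can be related back to the gene sequence through translation table 12 (NCBI's alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. The sketch below builds that table by hand and translates the first 60 bases of the gene sequence; the function name `translate` and the variable names are illustrative and not part of the record.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 12 amino acids in TCAG codon order.
# Identical to the standard code except CTG -> S (serine).
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and newlines of the 10-base display layout.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as shown in the record.
first_60 = "ATGATACATT TGAAATCAGT GTTAACTCGT CAGTTAAGAG CAGGCTCGGT CAAGCCATTG"
print(translate(first_60))  # MIHLKSVLTRQLRAGSVKPL
```

The printed residues match the first 20 residues of the protein sequence listed in the record. Pasting the full gene sequence into `translate` should give all 1148 residues under the same assumptions.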