Gene PICST_34371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_34371 
SymbolCPS1 
ID4851388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1680573 
End bp1682321 
Gene Length1749 bp 
Protein Length582 aa 
Translation table 
GC content44% 
IMG OID640393096 
ProductGly-X carboxypeptidase 
Protein accessionXP_001387964 
Protein GI126274458 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.531597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGCTT TGCCTTTGGA CAATTACAAC ACAACAACGC CCTTGTGGAA GAGAAAATCG 
GTAGGTGCAA TCGGAGCGAT TGTGGCTCTC TTATTGGTTC TTTTAACTAC TGACCTCTAC
AGCCACCTCA AGGTTGCGCT TCTTCCTTTA GATACCTCGG ACTCGTTGTG TCCCTTGTAT
GAGCCTATTT TCCCGAAGTC GTTTTCTGTG GACAATTCGA CTGTTCTCGA CATTTTGTAT
GGAGAAGACT TTAGAAATGC ATCTATTGCC AAGTTGGCAG GAGCCATTCA AGTAGATACA
CAAATCTTTG ACAACCAGCC AGATGTGCCA GACTCGCCAG AGACATGGGC AAAATTCAAG
AAGTTCCACA AATACTTGGA AAAGACGTTT CCTATCGTCT ACAAGAACTT GCAAGTTGAA
AAAGTCAACA CCTACGGTTT GGTCTACTTC TGGAAAGGTT CTGACGATAG CTTGAAACCT
TTGATGTTAA CTGCTCACCA AGACGTGGTT CCAGTTCAGC AGGATACACT TAAAGATTGG
ACTTATCCTC CATTTGAGGG CCATTACGAC GGTGAATTCA TTTATGGAAG AGGTGCTGCG
GACTGTAAGA ATGTGTTGAT CTCCATCTTG GAGACGATAG AATTGTTGTT GAAGAAGGGC
TACCAGCCAC AGAGGTCGGT TATCGCTGCA TTTGGCTTTG ATGAAGAAGC ATCAGGTGTG
GTTGGTGCTG CTAAGATTGG TCAGTACTTG GAAAAGACTT ATGGTAACGA CTCTGTGTAT
GCCATTATCG ATGAGGGTCC AGGCTTGTTG CTTGACCCTC TCACCAAGAC CATATTGGCT
GCTCCTGGTA CTGGAGAAAA GGGCTATATC GACATTGACG TTGAATTGAC TACCCCTGGT
GGCCATTCTT CCGTTCCTCC TGACCATACC TCCATTGGTA TCATTGGAGA GTTAACCTAT
TTGATAGAAA AGGATCCTTA CTCTCCACTC TTGACTTCCA AGAATCCAAT CTTGAGCTAC
ATGCAATGTG CAGCTCTTCA CGACCCTTAT GACAACATCC CTAGGTACTT GAAGGGTGCG
ATCTTGAGAG CTGCTCACGA TAGATTTGCA AATTCTCGTG TGGTCAAGAC AATGCAACAA
AGCAAGCTCA TGAAGTACTT AGTACAAACG TCTCAGGCTA TAGATATTGT GAAAGGCGGG
GAAAAGGCAA ATGCCTTGCC TGAGCACGTC AAGTTGTTGG TCAATCACAG AGTCAATATC
GACTCTTCTC TTGAAGAAGT CAAACATCGT TTCGTTTCCA GAGTCGTAGA AGTCGCAAAA
AGACATAACT TGGCTGTAGA AGCTTTTGGT GAGTTGGTTC TTGACACCAA GAATAACTCT
GGTGTGTTTG TACTCAATTC TCAAACTCCG TTGGATATTG CTCCAGTAAC TCCTTTGAAT
GACAACGTGT GGAAATATCT TGCAGGTGTT ACCAGACATG TCTTTGAAGA CTTGGTGTTC
CCTGATATCG AGTATCCAAT TGTCACTTCT CCTTCTATAA TGACTGGAAA CACTGATACC
AGGCACTACT GGAACTTGAC GAGAAACATC TTTAGATACT CACCTCTTTT TACCAGCGAT
ATGATCCATG ACACTAACAT TCATAGTGTG GATGAGAAGT TGAAGTTTGA CACTCACTTG
CAGTTGACGG CATTTTTTTA CGAGTACATC CAAGCCGTGG ATACTGCCGA GGCTGATAAT
GAGAACTAG
 
Protein sequence
MVALPLDNYN TTTPLWKRKS VGAIGAIVAL LLVLLTTDLY SHLKVALLPL DTSDSLCPLY 
EPIFPKSFSV DNSTVLDILY GEDFRNASIA KLAGAIQVDT QIFDNQPDVP DSPETWAKFK
KFHKYLEKTF PIVYKNLQVE KVNTYGLVYF WKGSDDSLKP LMLTAHQDVV PVQQDTLKDW
TYPPFEGHYD GEFIYGRGAA DCKNVLISIL ETIELLLKKG YQPQRSVIAA FGFDEEASGV
VGAAKIGQYL EKTYGNDSVY AIIDEGPGLL LDPLTKTILA APGTGEKGYI DIDVELTTPG
GHSSVPPDHT SIGIIGELTY LIEKDPYSPL LTSKNPILSY MQCAALHDPY DNIPRYLKGA
ILRAAHDRFA NSRVVKTMQQ SKLMKYLVQT SQAIDIVKGG EKANALPEHV KLLVNHRVNI
DSSLEEVKHR FVSRVVEVAK RHNLAVEAFG ELVLDTKNNS GVFVLNSQTP LDIAPVTPLN
DNVWKYLAGV TRHVFEDLVF PDIEYPIVTS PSIMTGNTDT RHYWNLTRNI FRYSPLFTSD
MIHDTNIHSV DEKLKFDTHL QLTAFFYEYI QAVDTAEADN EN