Gene PICST_62767 details


Gene Information

Locus tag: PICST_62767
Symbol:
ID: 4839886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 56712
End bp: 58139
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 12
GC content: 46%
IMG OID: 640391201
Product: X-Pro dipeptidase
Protein accession: XP_001385698
Protein GI: 150866190
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
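
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp, end_bp = 56712, 58139

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, which encodes no residue,
# so the protein is one residue shorter than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1428 bp) and Protein Length (475 aa).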


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.106037
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGGTC CTCCTTCTTT AGTCGGAAAG AAATACCCAG CCAAACAGCA TGCCCGCACG 
GTCTACTCCC ACTTGGTTCA CAAGAACGAA GTTTCCGCCA AAGGCAGTGC TTTCTTCGTT
TCCGGAGAAG ACTTGGAATT GTACCTCTAC TGCGACCAGA CCAAGCCGGT CAGACAAAAT
CGGTACTTCT TTTACTTGAC TGGATGTGAC ATTCCAGGAT CTCACGTGTT GTATAACACT
GTCAAGGATC ACTTGACCCT CTACTTGCCC GACATAGATT ACGAAGATGT CATGTGGCTG
GGGCTCCCTT TGTCGCTTGA AGCTGCAGCC GAAAAGTTTG ATGCAGACGA AATCAAATAT
GCATCGGCGT TGCATGCTGA TTTGGAAGAA TTCCACAACG ACAAGGTGAC AATTTTCACC
ACGGACATCA ACAAGTTCAA CACAAAGTAT GAGGGCTTTT TGCAGCCCGG AAACAAGGAT
TTCTTCTATG CTTTGGACGA ATCACGTTTG ATCAAGGATT GGTACGAAAT CGAATTGATG
AAGCACGCAG CCAAGATCAC CGACAACTGC CATTTCGCCG TGATGTCTGC TACTCCTATT
GAAACCAACG AAACCCACAT CCATGCTGAG TTCTTGTATC ATGCATTGAG ACAAGGTTCA
AAGTACCAGA GTTATGATCC TATTTGCTGC GCTGGCGAAA CTTGTTCGAC TTTGCACTGG
GTCAAGAACG ATGAAGAAAT CACTCCAGAC AAAAAGTCGG TATTAATAGA TGCCGGCGCC
GAATGGAGCT GTTATGCCTC GGATGTCACC AGATGTTTCC CCATTAATGG TGATTGGACC
AAAGAGCATC TTGAGATCTA CAACGCTGTA TTGAAGATGC AATCGGTGAC CAAGGAAATG
ATCAAACCTG GAGCCAGCTG GGATGTACTC CACTTAACAG CCCACAGAAT TATGATTGAA
GAGTTCTTGA AGTTGGGAAT TTTCAAAAAG GAGTATACCG TAGATGAACT CTTTGAGTCT
AAAGTCAGCG CCCGTTTCTT TCCACACGGA TTGGGCCATT TACTTGGAAT GGATACTCAC
GACGTAGGAG GATACCCCAA TTACTCCGAC CCAGATCCCT TGTTGCAGTA TTTGAGATTG
AGAAGAGATT TGCAGGCCGG TATGGTGTTG ACCGACGAGC CAGGAATTTA CTTCTCGCCT
TTCTTGTTGG AAGACACCTT GAAGGACCCA ACCAAGGTCA AGTACATCAA TAAAGATGTC
TTGGACAAGT ACTGGTACAT TGGAGGTGTT AGAATTGAAG ATGATATCTT GGTCACCGAA
GATGGATATG AAAACTTCAC TGGCATTACC TCTGATCCAG AGGAAATCTC AAAGATTGTA
AGGGCTGGGC TTGCTAAGGG CAAGGAAGGC TTCCACAATG TTGTATAG
 
Protein sequence
MSGPPSLVGK KYPAKQHART VYSHLVHKNE VSAKGSAFFV SGEDLELYLY CDQTKPVRQN 
RYFFYLTGCD IPGSHVLYNT VKDHLTLYLP DIDYEDVMWS GLPLSLEAAA EKFDADEIKY
ASALHADLEE FHNDKVTIFT TDINKFNTKY EGFLQPGNKD FFYALDESRL IKDWYEIELM
KHAAKITDNC HFAVMSATPI ETNETHIHAE FLYHALRQGS KYQSYDPICC AGETCSTLHW
VKNDEEITPD KKSVLIDAGA EWSCYASDVT RCFPINGDWT KEHLEIYNAV LKMQSVTKEM
IKPGASWDVL HLTAHRIMIE EFLKLGIFKK EYTVDELFES KVSARFFPHG LGHLLGMDTH
DVGGYPNYSD PDPLLQYLRL RRDLQAGMVL TDEPGIYFSP FLLEDTLKDP TKVKYINKDV
LDKYWYIGGV RIEDDILVTE DGYENFTGIT SDPEEISKIV RAGLAKGKEG FHNVV
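
The record lists translation table 12, in which CTG encodes serine rather than leucine. The following minimal sketch shows how the gene sequence maps to the protein sequence under that table. The `translate` helper and the constant names are hypothetical, written for illustration and not taken from the source database; the amino-acid string is the NCBI table 12 assignment in TCAG codon order.

```python
# NCBI translation table 12 (alternative yeast nuclear code): identical to the
# standard code except that CTG encodes serine (S) instead of leucine (L).
BASES = "TCAG"
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid map; codons are ordered with T, C, A, G at
# each of the three positions, matching the layout of TABLE_12_AAS.
CODON_TABLE = {
    b1 + b2 + b3: TABLE_12_AAS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Only A, C, G and T are handled; any other symbol raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCCGGTC CTCCTTCTTT AGTCGGAAAG"))  # MSGPPSLVGK

# In-frame fragment spanning bases 292-306, which contains a CTG codon.
print(translate("ATGTGGCTGGGGCTC"))  # MWSGL
```

The second call illustrates why the table matters: the CTG codon at bases 298-300 is read as S, matching the `VMWS GLPLS` stretch of the protein sequence, whereas the standard code would give L at that position.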