Gene PICST_39982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39982 
Symbol 
ID4851975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3338287 
End bp3339483 
Gene Length1197 bp 
Protein Length398 aa 
Translation table 
GC content43% 
IMG OID640393683 
Productpredicted protein 
Protein accessionXP_001387208 
Protein GI126276187 
COG category[E] Amino acid transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D 
TIGRFAM ID[TIGR00520] L-asparaginases, type II 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.551295 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATTT CCTCGTCGTC AAGCTTCTTC CCATTGCATA CTCTTAATGA GGACTTCTCT 
GTGGAGATTC AAGACGAGTC CCGTGATCTC GAACACAGTC AGTTCAAGAT CAGATACAGA
TCGAACTCGG TAGTGTCTGG AACTTCTGAA AGCTCATTGA ACTTTGATAC TTTGCCTACT
ATCAAGGTGA TTGGTACAGG AGGAACCATT GCTTCAAAAG GGTCCTCTGC ACATCAGACT
GCCGGCTATG AAGTTGATCT TACCATTGAG GACTTGATCA AGTCCATTCC CGATATTCTG
ACCACCTGCT TGTTGGAATA CGAACAGTTG TTGAACATAG ACTCTAAGGA ATTTGGCACC
AAGGAATTGA TTCAGTTGTA CTCTAAAATA ATGCTGGAAC TCCCCAAATA CGACGGCTTT
GTCATCACCC ACGGAACCGA TACAATGGAG GAAACTGCTT TCTTCTTGCA GCTCACCATC
AACACTTACA AGCCTATAGT CATGTGTGGA TCTATGAGAC CTTCCACTGC CATTTCCTCA
GATGGGCCCA TGAACTTGTA CCAGGCTTGT GTTATCGCTG CTAGTAGAGA ATCTCGAGGC
AGAGGTGTCA TGGTAGCCCT TAACGATAGA ATTGGTTCTG GCTACTATAT CACCAAGTCC
AATGCAAACT CATTGGATAC CTTCAAATCC ATAGGTCAGG GCTACGTAGG AAACTTCGTA
GATAATGAGA TTCATTACTA TTTCCCTCCA GCAAAACCAC TCGGTATGAC ATATTTTAAC
TTGAGGCTTC CGTTGGCTGA TAACGATCTT CCACTGGTTC CAATTCTTTA TGCTCACCAG
GGCTTCAACA ACAAGATAAT AGACGTGACT GTTAAAGAAC TTGAAGCTAA GGGTCTTGTT
ATTGCTACCA TGGGAGCAGG CTCTTTGGCA GATGAAACCA ATCAGTACTT ATCCGATTTG
GTGGAGAGCA TGGAAATACC ATTTCCAATT ATCTACACCA AGCGTTCTAT GGATGGAAGG
GTACCTCTTG GATCAATCCC CAAGGTCAGA ACTGAAGATC ATACTTCACT TTCTACTTTC
GAAAGTGCTA TTCCTGGAGG CTACTTGAAT CCACAGAAGG CTAGAATCCT TTTGCAGTTG
TGTCTTTATG AGGACTACAA TATGCAAGAG ATCAAAAAGG TGTTCAAGGG TGTATGA
 
Protein sequence
MSISSSSSFF PLHTLNEDFS VEIQDESRDL EHSQFKIRYR SNSVVSGTSE SSLNFDTLPT 
IKVIGTGGTI ASKGSSAHQT AGYEVDLTIE DLIKSIPDIL TTCLLEYEQL LNIDSKEFGT
KELIQLYSKI MLELPKYDGF VITHGTDTME ETAFFLQLTI NTYKPIVMCG SMRPSTAISS
DGPMNLYQAC VIAASRESRG RGVMVALNDR IGSGYYITKS NANSLDTFKS IGQGYVGNFV
DNEIHYYFPP AKPLGMTYFN LRLPLADNDL PLVPILYAHQ GFNNKIIDVT VKELEAKGLV
IATMGAGSLA DETNQYLSDL VESMEIPFPI IYTKRSMDGR VPLGSIPKVR TEDHTSLSTF
ESAIPGGYLN PQKARILLQL CLYEDYNMQE IKKVFKGV