Gene PICST_47423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47423 
SymbolMET22 
ID4839154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp183945 
End bp185042 
Gene Length1098 bp 
Protein Length365 aa 
Translation table12 
GC content44% 
IMG OID640390469 
Product3'(2')5'-bisphosphate nucleotidase 
Protein accessionXP_001384691 
Protein GI126136335 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1218] 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase 
TIGRFAM ID[TIGR01330] 3'(2'),5'-bisphosphate nucleotidase, HAL2 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCGA TTTCTGCCCA GCACCCTTAT TACAAGGAAT TGGAGATTGC TACATTGGCA 
GTCAAGCGTG CCTCGTTGCT CACTAAGAAA TTGAGCGATT CTATAGGCGT TACACAGAAA
TCTGGAACCC AGACAAAGGA CGATAAATCG CCTGTAACTG TAGGAGATTA TGCAGCTCAG
GCTATTATCA ACTATGCTAT CCAAAAAAAC TTTCCTGGTG ACGAAATTGT CGGAGAGGAA
GACTCAGACA CTTTGAGAGA AGACACAGAT GAATCTCGGA AGTTGTCGGG TCGCATTCTC
GAGATCATCG AAGATGTCCA GGACAATACT TCTACCTATA GTGACAAGAT TGGCACACTT
GAGAACTTGC AAGATATTTA TGAGAGCATA GACCTCGGTA TTTCCCAAGG TGGAGATAAA
GGTAGAATTT GGGCCCTTGA TCCAATTGAC GGCACCAAAG GATTCCTTAG AGGCGACCAG
TTTGCAGTGT GTTTAGCTCT TATTGTAGAT GGTGAGGTAG TATTGGGCGT TATTGGCTGT
CCCAACTTGC CTGAGATTAT CCTTTCCAAC GAAGATATGA CGGGTACTGT TGGAGGTTTG
TACTCGGCCG TAAAGGGCGT TGGTTCGTTT TATACAGCCT TGTTTGACTC TGACAAGTTT
GTTCCTTTGT CGAAGCAAGA GAGAATCAAA ATGACCACTA ACACTTCGCC AGCCAGTATT
AAAGTAGTGG AAGGTGTAGA AAAGGGCCAT TCTTCTCATT CAACGCAGTC AAAGATCAAA
GACATCTTGG GTTTCAACCG TGAAATCGTT CATAGACAGA CCATAAACTT GGATTCCCAA
GTCAAATATT GTGTATTGGC TAAAGGACAG GCTGACATCT ACTTGCGTTT ACCAGTCAGT
GATACCTATC GTGAAAAGAT CTGGGACCAT GCTGCTGGTA ACATCTTGGT GTATGAAAGT
GGTGGTCAAG TCGGTGATAT CAGCGGTGCC CCTCTTGACT TTGGTAAGGG CAGATTCTTG
CAATCCAAGG GTGTCATTGC TGGTAATACC CACATCTTCC CTGCTGTTAT CAAAGCAGTA
GAACAAGCCT TGAATTAA
 
Protein sequence
MSSISAQHPY YKELEIATLA VKRASLLTKK LSDSIGVTQK SGTQTKDDKS PVTVGDYAAQ 
AIINYAIQKN FPGDEIVGEE DSDTLREDTD ESRKLSGRIL EIIEDVQDNT STYSDKIGTL
ENLQDIYESI DLGISQGGDK GRIWALDPID GTKGFLRGDQ FAVCLALIVD GEVVLGVIGC
PNLPEIILSN EDMTGTVGGL YSAVKGVGSF YTALFDSDKF VPLSKQERIK MTTNTSPASI
KVVEGVEKGH SSHSTQSKIK DILGFNREIV HRQTINLDSQ VKYCVLAKGQ ADIYLRLPVS
DTYREKIWDH AAGNILVYES GGQVGDISGA PLDFGKGRFL QSKGVIAGNT HIFPAVIKAV
EQALN