Gene PICST_90976 details


Gene Information

Locus tag: PICST_90976
Symbol: KAP1
ID: 4840662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 852416
End bp: 853486
Gene Length: 1071 bp
Protein Length: 199 aa
Translation table: 12
GC content: 42%
IMG OID: 640391977
Product: Adenylyl-sulfate kinase (APS kinase) (Adenosine-5'phosphosulfate kinase) (ATP adenosine-5'-phosphosulfate 3'-phosphotransferase)
Protein accession: XP_001386353
Protein GI: 126139661
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0529] Adenylylsulfate kinase and related kinases
TIGRFAM ID: [TIGR00455] adenylylsulfate kinase (apsK)


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.342518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTACATTTTT CTTAGCATTT TGTTCCACAT TATCAAAATC ATAGCCTTTT CTCGGTACCA 
GCCACAACTA CTCCAAGCCG TGTATTCGGA TTTGCAAACG CTTTACGACT CTCATTTGCG
AGTCTCTTGT TCACATCGAA AGCCTGTCTT TCTCTCACTT CATGCTTATA ACCTTCACTG
TAGCTTTTGT AAGTTCGGCT CTTCTCAGTT CAACTTCTTA CATCAATTTT CTGCACCATA
CGCAGAAGTG CTGCAACGCT CTTATCTTCA CGTGAACTGT TCCCCATATC ACGTGGTATG
GTGAACCACA CAAATTTTTC AACCATATAG AGCTTGACAA TTGCAAATTA CCATCAATTT
TGACTATGAG TCAGGTCTTC TTTCTACTCG TTTTATTGAC TCAAAATCCC AAAAATGGCC
TCCAACATCA CATGGCATCC AAACTTGACC CACGCTGAGC GTGCCAGCTT GAGAAAGCAA
AAGGGAGTCA CTGTTTGGTT AACTGGCTTA TCAGCTAGTG GAAAATCCAC TATTGCCTGC
GCCTTGGAAC AGTCCATTCT TGCCAGAGGC TTGAATGCCT ACAGATTAGA CGGTGACAAC
GTGAGGTTCG GCTTGAACAA GGACTTGGGC TTCAGCGAAG CTGACAGAAA CGAAAACATC
AGAAGAATCT CAGAAGTAGC TAAATTGTTC ACAGACTCTT GTTGTGTTAC TTTGACCAGT
TTCATCTCTC CTTACAAACA AGATAGAGAC TTGGCAAGGC AATTGCACGA AAAGGACAAC
TTGCCATTTG TCGAAGTCTA TGTTGATGTT CCAGTTGAAG TTGCTGAGCA AAGAGACCCA
AAGGGGTTGT ACAAGAAAGC TAGAGAAGGT ATTATCAAGG AATTCACCGG TATTTCTGCT
CCATACGAAG CACCAGAAAA GCCTGAAATC CACTTGAAGA ACTATGAAGG TGTTTCGATC
GAAGAATCGG CGGAAAAGAT CATCGATTAT TTGATCGAAA ACAAGTATAT TTAAGATAGA
ACCATACATA GACGTATATA GCGAGAAGTA CATGGACTTC GTTAAAGTGG T
 
Protein sequence
MASNITWHPN LTHAERASLR KQKGVTVWLT GLSASGKSTI ACALEQSILA RGLNAYRLDG 
DNVRFGLNKD LGFSEADRNE NIRRISEVAK LFTDSCCVTL TSFISPYKQD RDLARQLHEK
DNLPFVEVYV DVPVEVAEQR DPKGLYKKAR EGIIKEFTGI SAPYEAPEKP EIHLKNYEGV
SIEESAEKII DYLIENKYI
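
The Gene Length, Protein Length and GC content fields can be re-derived from the block-formatted sequences in this record. The following is a minimal Python sketch, assuming the sequence text has been copied out of the record as a plain string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, chosen for illustration, and are not part of the source database's tooling.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence (10-letter groups, line breaks)
    into one uppercase string with all whitespace removed."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Final line of the protein sequence as printed in the record.
protein_tail = clean_sequence("SIEESAEKII DYLIENKYI")
print(len(protein_tail))  # 19 residues on the last line

# Final line of the gene sequence as printed in the record.
gene_tail = clean_sequence(
    "ACCATACATA GACGTATATA GCGAGAAGTA CATGGACTTC GTTAAAGTGG T"
)
print(len(gene_tail))  # 51 bases on the last line
print(round(gc_fraction(gene_tail), 2))
```

Passing the full gene sequence and the full protein sequence through `clean_sequence` and taking `len` of each result should give values that can be compared with the Gene Length and Protein Length fields, and `gc_fraction` of the full gene sequence can be compared with the GC content field.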