Gene PICST_61206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_61206 
SymbolUAP1 
ID4839291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp942173 
End bp943633 
Gene Length1461 bp 
Protein Length486 aa 
Translation table12 
GC content42% 
IMG OID640390606 
ProductUDP-N-acetylglucosamine pyrophosphorylase 
Protein accessionXP_001385188 
Protein GI150865818 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4284] UDP-glucose pyrophosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.561051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACAA CGTCTGCTTC TTCCATTATT GAGGCGTTTT CCAAGGCCGG TCAAAGCGCA 
CTTTTCCAGT TCTTTGATTC TCTCTCTAAG GACCAGCAAA ATGAGTTCAT TGAGCAGTTG
GCCAAAATCG AAGACCCTAT CAAATTAGTC AACACTGTTC AAGAAGCCTT GAAGTTTTCC
TCTAACAATG CTAGTTCTCG AAACTTCACC CAGTTGCCTT CAGAACAGAC AGCTTCCACG
TTAGACTTGG ATCCAGAATT GTCTGAAAAA TGGACCAGAC TCGGTTTAGA AGCCATTTCA
AAAGGTCAAG TTGGTGTAAT TTTGATGGCT GGAGGTCAGG GGACCAGATT AGGATCGAGT
GACCCAAAGG GTTGTTTCGA TATCGAATTG CCTTCTTCTA AATCGTTGTT CCAAGTCCAG
GCTGAAAAGA TCTTGAAGAT CCAACAGTTA ACCGCTCAAA AGCTCAACTT GGCTCAACAA
CCAAAGATCT ATTGGTACAT AATGACCAGT GGTCCTACAA GACTGCCAAC GGAGTCCTTT
TTCCAAAAGA ATCACTACTT TGGCTTGCAG CCAGACCAAA TCGCCTTCTT TGACCAAGGT
ACTTTACCGT GTTTCAACTT GGACGGATCG CAGATTCTCT TGGAATCGCA AAACAAATAC
TGCGAATCCC CAGATGGGAA CGGAGGTTTG TACAAGGCCA TTCAGACTAA TGGTATAATT
GACGACTTTG TGGCTAAGGG CATTGAACAT ATTCACATGT ACTGTGTAGA TAACGTGTTG
GTTAAAGTTG CTGATCCAGT ATTTTTGGGT TTTGCTATCG ACAAGAAATT CGATTTAGCT
ACCAAAGCCG TGAGAAAGAG AGATGCTAGT GAATCTGTTG GTTTGATTGT ATTAGATGAC
GACATCAAAA AGCCATGTGT TATTGAATAC AGTGAAATCA CCCAAGAATT AGCCAACAAA
ACCGAGCAAA ACGACTCGTC CAAGTTATTC CTCAGAGCTG CTAACATTGT TAACCACTAC
TATTCTGTAG ATTTGTTGAG AAGGGAAGTG CCTAATTGGA CTTCTTCTCA GAAATTCTTG
CCCTTCCACA TTGCAAAAAA GAAGATTGCA TCTATAAATC CGAAGACTGG TGAATTCTAC
AAGCCAACTG AACCTAACGG TATCAAATTG GAGCAGTTCA TTTTCGACGT TTTTCCCTCG
GTTGATTTAA ACAAGTTTGG TCTTTTGGAA GTAGAAAGAT CAGACGAATT TTCTCCATTG
AAGAACGCCG TGGGTGCCAA AAACGATACA CCAACCACAT GTAGAAGTCA TTTCCTTGCC
TTGGGTACAA GATGGGTAAA AGAAAATGGC GGTATAATTG AAGACGACGG TTATGTCGAG
GTCAGCTCAT TGACCAGTTA CGGTGGTGAA GGATTGGAGT TCGTCAAGGG TAAGCATTTC
AAGAACGGGG AACAAATCTA A
 
Protein sequence
MTTTSASSII EAFSKAGQSA LFQFFDSLSK DQQNEFIEQL AKIEDPIKLV NTVQEALKFS 
SNNASSRNFT QLPSEQTAST LDLDPELSEK WTRLGLEAIS KGQVGVILMA GGQGTRLGSS
DPKGCFDIEL PSSKSLFQVQ AEKILKIQQL TAQKLNLAQQ PKIYWYIMTS GPTRSPTESF
FQKNHYFGLQ PDQIAFFDQG TLPCFNLDGS QILLESQNKY CESPDGNGGL YKAIQTNGII
DDFVAKGIEH IHMYCVDNVL VKVADPVFLG FAIDKKFDLA TKAVRKRDAS ESVGLIVLDD
DIKKPCVIEY SEITQELANK TEQNDSSKLF LRAANIVNHY YSVDLLRREV PNWTSSQKFL
PFHIAKKKIA SINPKTGEFY KPTEPNGIKL EQFIFDVFPS VDLNKFGLLE VERSDEFSPL
KNAVGAKNDT PTTCRSHFLA LGTRWVKENG GIIEDDGYVE VSSLTSYGGE GLEFVKGKHF
KNGEQI