Gene PICST_84845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_84845 
Symbol 
ID4840319 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1639778 
End bp1641187 
Gene Length1410 bp 
Protein Length402 aa 
Translation table12 
GC content44% 
IMG OID640391634 
Productpredicted protein 
Protein accessionXP_001385675 
Protein GI150866173 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATCATTATCA AAATATATAT AAGTTCTTGA ATCATCAGAA CACTATCGAC GCATTTTTCC 
AGTAGATGAC CGGGCTTTTG TCTTATCTTC ATTATTCAGC AATTTGAATT TTTACAATCA
AATTATTCCT AAATCAAGTA CCATATTCCA CCAAATAAAA TGGACGCTTC TTCTTGTGGA
CCTTCCAGCG CTTTGGCAAA CCTTTCCAAA CACACTCAGC GTGACAACAC TCTTCAGAAC
GAGTATGCAG CCAAAAATGC CCAACTCCGG ACTGGCCCAG CCGGTTTTCG CCAGAATGGA
AATGCTGTAG ATGCCAGATT GAATGCTGAG TTCCACAACT TTCAAGGAAA TGGATTAGGA
GCCGAATTTC CAGCCTTTGC TGGAACCCCT TCTTTTCAGC AACAAGCTCA ACACTTAAAC
CAGCAGCAGT TTCAGCCAAA CAATGCTGGC TGGGTCCAGG ACTTCTCGGG CTTGTCTATC
AGTAACCAGC CACAGCAAGT AGGTCATCCT CAAAGTGACT GGCACCAGCA GTTTCTTCAA
CAACAGCAAC ATCACCATCA ACAGCAGCAA TTTGAGCAGC AGAATATTCA GCAGGGTCAG
CAGTTTGCAC CAAATTATGC CCTGAGTGCC TTTTCTATGA ATATGAGAAC CAATTTGTCT
ACGCCTTTAT ATGCCCAGCA GCAAGTTCAA ACTGGCGCAC TGCTTACGGA GCACCAGGAG
ATTCATAAAA TGGAACAGGA GAAACAGCTT TTTGATTCCC ATTTTGATCA ACTTGAGAAG
GAATTGAACC AACAGCTGCA GGAAAAGCCA GAAGTAGAGG TACAAGTAGA CAAAGTTGAA
AACGAGCAGT TTGCTGAGAC AGCTAGACAA ATCGAAAATT CCTTGCGACA ATTCGACACT
GCTGATGCTG CAACTAAAGC AAAGATAGAG AACTCAGACT TCTTGAAGTT GATGAGCTCC
ATTTCCAACA AACAAGTAGT ATTGGATGGC GACAAATTGG TAGACTCTAC AGGCCAGGAT
ATTCGCGAAA ATGTGAACGA ACCTTTGCAA CAAATCAGTA GACCTGACTA TCATGATCCT
ATTCACGACA TACCCGTTCC TGTGCGGCCG ATCACGAGAA ATCCGGCTCA GGCTGAAATA
CAGCAAGAAG CCAGACCAGA ACAGATTAAC AAATTACCGG ATCCTTTATC CCATATGCAA
GATGGACTGT TAGGCGACGT CTATGATGCC TTATCCGCAG CTAAAGTAGT CTCAGGTGGA
CAAGTCAAGA CAGGAGACTG GGTGGATGAA GATGACGAGT GGCTTGATAT GACTACTCCA
TCTATAAGCA GGCCAAAGAA GGCAAGCATC ATGGCAGACC ATTGGCAAGA AGTGTATGAC
GACTATAGAA ATGACGATGA TTTTCATTAG
 
Protein sequence
MDASSCGPSS ALANLSKHTQ RDNTLQNEYA AKNAQLRTGP AGFRQNGNAV DARLNAEFHN 
FQGNGLGAEF PAFAGTPSFQ QQAQHLNQQQ FQPNNAGWVQ DFSGLSISNQ PQQVGHPQSD
WHQQFLQQQQ HHHQQQQFEQ QNIQQGQQFA PNYASSAFSM NMRTNLSTPL YAQQQVQTGA
SLTEHQEIHK MEQEKQLFDS HFDQLEKELN QQSQEKPEVE VQVDKVENEQ FAETARQIEN
SLRQFDTADA ATKAKIENSD FLKLMSSISN KQVVLDGDKL VDSTGQDIRE NVNEPLQQIS
RPDYHDPIHD IPVPQEARPE QINKLPDPLS HMQDGSLGDV YDALSAAKVV SGGQVKTGDW
VDEDDEWLDM TTPSISRPKK ASIMADHWQE VYDDYRNDDD FH