Gene PICST_46734 details

Gene Information

Locus tag: PICST_46734
Symbol:
ID: 4839367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 141366
End bp: 142682
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 12
GC content: 45%
IMG OID: 640390682
Product: predicted protein
Protein accession: XP_001385047
Protein GI: 150865715
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID:
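
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 141366
end_bp = 142682
gene_length_bp = 1317
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes the stop codon, so codons = residues + 1.
codons, remainder = divmod(span, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions all three checks hold for this record: the span is 1317 bp, which is 439 codons, that is 438 residues plus one stop codon.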


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGTAG CTACTCATAG TGCCAGGGAC ATTTCTTCGA TTGAGCTGAT ATTGCGAAAT 
ATCGGTCCTT ATAACGAGGT TCCAGATGAA TACAATGCTG ATGAGGCTGC CGAGGCTCTC
AGAACGACCA CTGTATTGGT TATAGGTGCC GGAGGTTTGG GCTGCGAAAT CCTCAAGAAT
TTGGCCCTTA CTGGCTTCAA AAAGATCCAT GTGATAGACA TGGACACAAT CGACGTGTCC
AACTTGAACA GACAGTTTCT CTTTCGTCCC AAAGACGTGG GTCACTCAAA GGCCGAGGTG
GCTGCACGGT TTATACAAGA ACGAATCGGT GACGAAGAGT TGAAGATCAC GCCATACTTT
GGAAAGATCC AGGATAAACC TTTGGAATAC TATCGCCAGT TTGGAGTTAT TGTCTGTGGA
TTGGATAGTA TAGAAGCCCG AAGATGGATA AATGCCACTG TAGTCAGCCT TGTAGATTCC
GAGTTGAACA ACTTGATACC CATGGTTGAC GGTGGAACCG AGGGATTTCG TGGGCAGTCC
CGTGTGATTC TCCCGACATT GACATCTTGC TACGAATGTA CTCTCGATTT GCTATCGCCG
AAAACGACGT ATCCCGTTTG TACCATTGCG AATACGCCTA GATTGCCTGA ACATTGTATA
GAATTTGCTT CAGTAATCGA GTGGCCAAAA CACTTTCCTG GTCGCAAGTT CGATGCTGAC
GACCCAGAAC TGGTTCAATG GATGTATGAG ACAGCTTTGG CTAGAGCTAA GCTATTCAAC
ATCCAAGGCG TTACCAAACA ATTGACCTTA GGGGTTGTTA AGAATATAAT ACCTGCAATA
GCCTCTACTA ACGCTATCAT AGCTGCATCG TGTTGCAACG AAGCCTTCAA AATCGTCACC
AACACCAATC CCATCCTAAA CAACTATATG ATGTATGCTG GAGACGAATC CATATTCACA
TACACATATG CCCATAGCCG CAGGCCTAAC TGTCCCGTGT GTGGCAACAT GTCCAAGAAA
GTTATAGCGA AAAACTGGTG GACACTAGAT AGATTCATCG AAGAAATCTC TGGCAAACAA
GAGATCCAAA TGCTGCTGCC TTCATTAACA ACAGCTGAGA AATCTCTCTA CTTGCGCAAC
CCTCCTAATT TGGAACAAGC CACAAGACCA AACTTGGCCA AAAAGTTCAA CACCTTGGTA
AGAGCTGGAG ACGAAGTAGT GATCACAGAT CCCAACCTTC CTATCTCATT AAGGTTAACT
GTGGAGTTTA CTGGTCCTGA AGTAGAACCC GACGATGTCA ACTCCAGTCT AATGTAA
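
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as plain text in whitespace-separated blocks of 10 bases, as it is printed here; `gc_percent` is a hypothetical helper name, not something the database provides.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; pass all lines for the whole-gene value.
first_line = "ATGTCGGTAG CTACTCATAG TGCCAGGGAC ATTTCTTCGA TTGAGCTGAT ATTGCGAAAT"
print(f"{gc_percent(first_line):.1f}% GC over the first 60 bases")
```

Passing the full 1317-base sequence instead of `first_line` gives the value to compare with the GC content field.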
 
Protein sequence
MSVATHSARD ISSIESILRN IGPYNEVPDE YNADEAAEAL RTTTVLVIGA GGLGCEILKN 
LALTGFKKIH VIDMDTIDVS NLNRQFLFRP KDVGHSKAEV AARFIQERIG DEELKITPYF
GKIQDKPLEY YRQFGVIVCG LDSIEARRWI NATVVSLVDS ELNNLIPMVD GGTEGFRGQS
RVILPTLTSC YECTLDLLSP KTTYPVCTIA NTPRLPEHCI EFASVIEWPK HFPGRKFDAD
DPESVQWMYE TALARAKLFN IQGVTKQLTL GVVKNIIPAI ASTNAIIAAS CCNEAFKIVT
NTNPILNNYM MYAGDESIFT YTYAHSRRPN CPVCGNMSKK VIAKNWWTLD RFIEEISGKQ
EIQMSSPSLT TAEKSLYLRN PPNLEQATRP NLAKKFNTLV RAGDEVVITD PNLPISLRLT
VEFTGPEVEP DDVNSSLM
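
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The following is a minimal sketch of how the protein sequence relates to the gene sequence under that table. It builds the standard code and overrides the single CTG codon; `codon_table_12` and `translate` are hypothetical helper names used only for illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table_12() -> dict:
    """Build a codon table for translation table 12 (CTG -> Ser)."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table["CTG"] = "S"  # the only difference from the standard code
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGTCGGTAG CTACTCATAG TGCCAGGGAC ATTTCTTCGA TTGAGCTGAT ATTGCGAAAT"
print(translate(first_line, codon_table_12()))  # MSVATHSARDISSIESILRN
```

The sixteenth codon of the gene is CTG, so the sixteenth residue is S here, where the standard code would give L. This agrees with the start of the protein sequence listed above (MSVATHSARD ISSIESILRN).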