Gene PICST_44458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_44458 
Symbol 
ID4839007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp976398 
End bp977663 
Gene Length1266 bp 
Protein Length421 aa 
Translation table12 
GC content42% 
IMG OID640390322 
Productpredicted protein 
Protein accessionXP_001384141 
Protein GI150865076 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0219256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.240847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTGT TCCAGGTTAG TGGCTGGGAC TTAAAGAACG AAACCGTGGC TGTCGGTGGC 
ACAGGCGCAA AGAAGAAGTC CAACAGAGAA AAAAAGAGAG CCAGGCAACA AATTAAGGAA
CTTGAAAAGT CTCAGCAATC TGAAGATGTA GCCGAACAAG AAGATGAAAT CATAAAAGAA
ATAGACGAAC CAGAGAAAGA AGAGAAGAAG ATCAAAAAAG AAAAGAAAAT AAAGAAAAGA
AAACACGAAG AATCTGAAAA AAGCTCTTCA ACAACTTCTC CGGCTGCTGC TATAGTAAAT
CCTACTGTAG ATGCACCTAT ACCTATTACT ACACAGAAAC TCACTCCATT GCAACAGAAG
ATGATGGCTA AATTGTCTGG ATCCAGATTC AGATGGATAA ACGAACAATT ATATACAATC
TCGTCGGAAG AGGCTCTCAG TTTGTTAAAG CTGCAACCTT CCTTGTTCGA CGAGTACCAT
CAAGGATTCA GATCGCAAGT CCAAGCGTGG CCAGAAAACC CTGTAGATGT GTTTGTCGAC
CAGATCAAGA CTCGTGCCTC TCAGAGACCT ATTAATGCTC CCGGTGGTTT GCCTGGTTTT
CCCGACAAGA AAGTTGTTGT TGCCGATATG GGTTGTGGGG AAGCCCAGCT AGCCTTAGAT
GTGAACAACT TTGTTAAACA ATACAACGCT CAAGGGGCTA AAAAGAAATT CTCGAAAGGT
AACAATAACA AGAGATTACA AACTGGACCC AAAACATTGG AAATCGAAGT ACATAGTTTT
GACTTGAAGA AGCACAACGA CAGAATAACC GTGGCCGATA TTAAGAATGT GCCGTTGCCA
GATGGGTCAT GTACGGTGGT GATTTTCTGT TTGGCATTGA TGGGAACCAA CTTTTTAGAT
TTCATAAAAG AAGCCTACAG ATTGTTGGCT CCTCGAGGCG AGTTGTGGAT TGCCGAAATC
AAATCGAGAT TCACTGAGTC GTCCGAAAAG AAAACAGTCA AACCAGAGGA CGTCGGACAG
GAATTCGTGG ACGCCTTGAA GTTGTGTGGT TTCTTCCACA AGAAGACAGA CAACGACAAT
AAGATGTTCA CTCGTTTTGA GTTTTTCAAG CCACCTCAAG ACATTATCGC TGAGAGAAAC
GCGAAGTTGG AAAGAAGAAA GAAATTCATT GAACAGGAGT CGGAAAAGGA AGACTTGGAG
ACTAAAAGAG CACAAACTCC AGAAGGTAAA TGGCTCTTGA AGCCATGTAT TTACAAGAGA
AGATAG
 
Protein sequence
MALFQVSGWD LKNETVAVGG TGAKKKSNRE KKRARQQIKE LEKSQQSEDV AEQEDEIIKE 
IDEPEKEEKK IKKEKKIKKR KHEESEKSSS TTSPAAAIVN PTVDAPIPIT TQKLTPLQQK
MMAKLSGSRF RWINEQLYTI SSEEALSLLK SQPSLFDEYH QGFRSQVQAW PENPVDVFVD
QIKTRASQRP INAPGGLPGF PDKKVVVADM GCGEAQLALD VNNFVKQYNA QGAKKKFSKG
NNNKRLQTGP KTLEIEVHSF DLKKHNDRIT VADIKNVPLP DGSCTVVIFC LALMGTNFLD
FIKEAYRLLA PRGELWIAEI KSRFTESSEK KTVKPEDVGQ EFVDALKLCG FFHKKTDNDN
KMFTRFEFFK PPQDIIAERN AKLERRKKFI EQESEKEDLE TKRAQTPEGK WLLKPCIYKR
R