Gene PICST_58842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_58842 
Symbol 
ID4838705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1596969 
End bp1598033 
Gene Length1065 bp 
Protein Length354 aa 
Translation table12 
GC content43% 
IMG OID640390020 
Productpredicted protein 
Protein accessionXP_001384610 
Protein GI150865407 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.040839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00802664 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTCAGAT TAGCCAGAAG AAAACTTGCC ACTTCAGCTG ATACCATAGC GAAACAGCAG 
CAGTTCAACG TCTTTGACCG TTCAGCAAAG TTGCTCCAGA GATCCAGAAC ACCTTTGCTC
AACCCTGAAT TGCTGAGAAA GAAGGAATAT TTACGTGACG AGGTTGCTCT AAAAACCATT
GAACGTTTGG CATTCATCAC AAGAGACTTT ACAAATGTTC TAGATTTTGG TTCTCATCTG
GGGAACTTGT TGAAGAATCT CTGTGTTGAG ACAGAGATTC CACCAGACGC AGACTATGCC
GAAACTGAGA TCACCAAACA GTTGAATAAT GACAAGAAAA TCATCTGTAG TAAGATCAAA
GAGTTGACCA TGGTAGACTC GTCAAGGGAA TTGCTCTATA GAGATGCTGA GGAAAGTTTC
AATAGCGTAT TTCCGGGTAA GGTTATACGA AGTGTAGCGG ACGAAGAAAT TTTTTCACAC
GAAAGTCTTT CGAAGCCAGA ACATTATGAT GCTGTGATTT CTAACTTGTC GCTCCATTGG
ATCAATGATC TTCCCTCGAC GTTAGCTAAC ATCAACAGAA TCTTAAAACC TGACGGCTTA
TTCATGGGAA CGTTATTTGG AGGGGACACT TTGTACGAAT TGCGTACTTC GCTACAACTT
GCTGAGATGG AGAGAATGGG TGGAATGTCG CCCAGAGTAT CCCCTTTAGT TAATTTAAAC
GATATCGGCT CCTTGTTGAA CCGTGCTGGC TTCAGTATGT TAACCATTGA TGCAGAGGAT
ATCATTGTAG GAGGATTCCC AGATATTGTT TCTGTGATGG ACGATTTGCA GGCCATGGGA
GAACAGAATT CGGTCTTATC CAGATCGGGA TATTTGCCTC GTGATGTCTT ACTAGCAGCC
AACGAGATAT ACAAAACTAT GCATGGAGAA AAAGACGATA ACGGTGTCGT TACGTTACCT
GCTACGTTCA ATATTATCTT TATGATAGGT TGGAAGAAGA GTGAGAATCA GCCCAAGCCA
TTGGCCAGAG GCTCTGGGCA GGTCAACTTG AAGGACGTAT TGTAG
 
Protein sequence
MFRLARRKLA TSADTIAKQQ QFNVFDRSAK LLQRSRTPLL NPELSRKKEY LRDEVALKTI 
ERLAFITRDF TNVLDFGSHS GNLLKNLCVE TEIPPDADYA ETEITKQLNN DKKIICSKIK
ELTMVDSSRE LLYRDAEESF NSVFPGKVIR SVADEEIFSH ESLSKPEHYD AVISNLSLHW
INDLPSTLAN INRILKPDGL FMGTLFGGDT LYELRTSLQL AEMERMGGMS PRVSPLVNLN
DIGSLLNRAG FSMLTIDAED IIVGGFPDIV SVMDDLQAMG EQNSVLSRSG YLPRDVLLAA
NEIYKTMHGE KDDNGVVTLP ATFNIIFMIG WKKSENQPKP LARGSGQVNL KDVL