Gene PICST_89074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_89074 
Symbol 
ID4838503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1640710 
End bp1642521 
Gene Length1812 bp 
Protein Length445 aa 
Translation table12 
GC content41% 
IMG OID640389818 
Productpredicted protein 
Protein accessionXP_001384268 
Protein GI150865164 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCACCTTCTA CCAGTCAGCA GCTGTCGCCG ACACCGCCTC GCTATCCACC TGTGCCCGCA 
TCGGCCCACT GCATTGTCTT AAGACGCCAA TCTCATCTCA TTGAATCCCT CGTACTCATT
TGTACAGTAG CACTGAACAT CACGTACATT CACTGAGATC ATACAACATT ACAAACACTG
AATGGCCAAA GAACATTCAA CACCAATAAC AAACATCAGC AACTCATTCA ACACAATTAG
TCATCAGTAG TCAAAGAATA TTTAATTTTC CACGGCTTTA GAGCTTCCAT TTCATACAAC
AATCATACTA CTTCACATTA TTCCAGAGAC TCTACTCCGG TCTCATTCTC ATTAATAGCC
ACTTCACTTC TTCCTTCTAT TACTACTCAC AATGGGCTGT GGTGCTTCGA AAGAACCTAT
TCCAGAAGCC TCAGCTGGAA ACACCAACGC CAGGCGCAAT TCCCAGAGAC AGGCTCTAGC
AAACGGTACT AAACAAACGT CTAATGCCAA CAACAATACC AACAATAACA GTTCGTCCAC
AGAATCTTCA AATGTCGTAG AGAAACGTGC CAATGGCTCG GCTTCAACGG CTGCTTCAAC
TGCCATTAAT CCCGTAGACG CGGCTACGCA ATCCGATCCC AATAAGGAAG TGAAGATCTT
ATTGCTCGGC TCAGGTGAGA GTGGAAAATC TACCATCGTC AAGCAGATGA AGATTTTGCA
CCAGAATGGC TATACCAAAG AAGAACTGTA TGAATTCAAG CCTTTTGTTT ACAAGAATAT
CTTGGACTGT ATCAAGAACG TCATCAACGC CATCATCGAC TTGCAACCAG ATTTGATCAA
CAAAAGTGGC CATTTAGAAA GCGCAAATGG CACAATCGAA GAGGATAAAG ACAGCTTAGA
AGAATTTAAA GGTCAAGACT TGGAAAAAGA TAAGGACAGA AAACATATTC TAGATTACGA
TGATTTGAAT GAAGTGTTGG ACTACGAGTT CTCCATCGAT TCAGAGCAAG TTTTCAACCC
GGTCATTGCG CAAAAGATTA AAACCATATA TAACACTCCA GAAGTCAAGA ATTTCATGAA
ATTCCAACAA GCCAACTTCT ACCTCATCGA TTCTACCCAC TACTTCTTGT CAGATGTCGA
CAGAATCACA TCACCAGATT ACGTGCCCTC AGTAAGTGAC ATCTTGAGAA CTCGTAAAAA
GACTTCTGGT ATCTTTGATT TTACGTTCCA GATGAGTGGA GGCTTAAATA TCCACATGTA
CGATGTAGGT GGACAGAGAT CTGAAAGAAA AAAGTGGATC CATTGCTTCG ACAATGTTAC
CTTGATCATT TTCTGTGTAG CATTATCGGA ATACGATCAA GTACTACTTG AAGAAAATTC
TCAGAACCGC TTAGAAGAAT CTTTAGCGTT GTTTGATTCA GTCGTTAATT CTAGATGGTT
CTCCAGAACT TCCATCGTAT TGTTCTTGAA CAAAATCGAT GTCTTTGCCG AAAAATTGGC
ATACTCTCCT TTAGAGAAAC AATTCCCGGA CTACACCGGA GGTAACAATA TCAACAAAGC
TGCCAAGTAC ATTTTGTGGA GATTTACCCA GGTGAATAGA TCCGGTTTAA ACATTTACCC
CCACGTTACA CAAGCTACAG ACACTTCAAA CATCCAGTTA GTTATGGCTG CTGTCAAGGA
AACGCTCTTG GAGAACTCCT TGAGAGATAC TGGTATCTTG AACAATTGAA GCTCCTATTT
ATTAAACTGT CATATTTCCA TGACATTACT TTTAATATTT GTATGTATTC TTGGATAAAA
GTGATTCATG AT
 
Protein sequence
MGCGASKEPI PEASAGNTNA RRNSQRQALA NGTKQTSNAN NNTNNNSSST ESSNVVEKRA 
NGSASTAAST AINPVDAATQ SDPNKEVKIL LLGSGESGKS TIVKQMKILH QNGYTKEESY
EFKPFVYKNI LDCIKNVINA IIDLQPDLIN KSGHLESANG TIEEDKDSLE EFKGQDLEKD
KDRKHILDYD DLNEVLDYEF SIDSEQVFNP VIAQKIKTIY NTPEVKNFMK FQQANFYLID
STHYFLSDVD RITSPDYVPS VSDILRTRKK TSGIFDFTFQ MSGGLNIHMY DVGGQRSERK
KWIHCFDNVT LIIFCVALSE YDQVLLEENS QNRLEESLAL FDSVVNSRWF SRTSIVLFLN
KIDVFAEKLA YSPLEKQFPD YTGGNNINKA AKYILWRFTQ VNRSGLNIYP HVTQATDTSN
IQLVMAAVKE TLLENSLRDT GILNN