Gene PICST_81056 details


Gene Information

Locus tag: PICST_81056
Symbol: SIP2
ID: 4851915
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3171846
End bp: 3173982
Gene Length: 2137 bp
Protein Length: 623 aa
Translation table:
GC content: 43%
IMG OID: 640393623
Product: Sip1p-Gal83p family protein
Protein accession: XP_001386934
Protein GI: 126276009
COG category:
COG ID:
TIGRFAM ID:
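
The reported gene length is consistent with the start and end coordinates when both ends are counted inclusively. A minimal Python sketch of that arithmetic follows; the function name `span_length` is illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates as listed in the record above.
print(span_length(3171846, 3173982))  # 2137
```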


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.896245
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAACA ACACTTCAGT ATCAGCGAAT ACTTCCACGC GGAAGTCTGC TGCCGCCATT 
CTCCACCAAC AGAATCTCGA TCTGGAGTCA CAAGTGTCGG TAGCCGGTTC ACGAAAGTCG
TCTTCAGGAA GAAATGCTAC AGCTAAAGGA GACACTTCGC TAGATGAAGA TTTCAGTGAT
TTGATTCTCC ACCAAGTAAA ACGCCAGGAT CTGACGAGCT CGTCGGGATA CCTTCCCACG
ACTAATATCC TGGCCGGTGG GATCATTCCC GTCTCCTCTA ACATCGGAAG CACGCCTGCT
GTAACGAATA CTAATGCTAA CGCTAACCTT GCTCATAACA CTAATGCTGA TCCATATTAT
TCAGCTACGT ATAATCAATT TTCAAATGAC CTTATCGAAG ACGAGTTCAA TGAGTTGAAT
GATGGATTAT TGGATCAGGA ACCAGAAGTC TCTCGTAATA TCTTGCCCGA TGATGCTAGT
GAAATCACTA CGGAATCTAC AAACAGCGGA GCAAACGACG AAACTCAAGA CAATATGGAT
GTGGATGAGG ACCATTTCAA AGCTGTAACA GACCAATCCA ATACTGATTC TGACTTGGAC
ATGAATCATG CATCTGGCTT GTCGAAAGTA GACTTCACCA AAGTGACACC AGCCAACCAG
CAGCCTCATG AAGTTTCTGT AAAAGTAGAT AACTCATATG TTCACCAGAG TAGAAAACGA
CACAATCGTT CGGGGAATGC TAGTGCCAGC AACGTCACCA GCAACCTCAT TCCAGTGGAA
ATTAAATGGG TGAACTCGTC TCGAGAGGTC ATAAACAAGA TCTCTATCAT TGGCTCGTTC
ACCAACTGGA GGGACAGCAT TCCTTTGTCG CTTTCACCTT TTCATTCGAA CGAATACGTG
ACCACCTTAA ACTTACCTCT TGGTGTCCAC AAGTTGTTGT ATATCATCAA TAACGAATAC
CGAGTCAGTG ACCAGTTGCC TACTGCAACA GATCTGGAGG GAATTTTCTT CAACTGGTTC
GAAGTCATAG ACGAAGCCCA TCTCTTCAAT CATTCATTAA ATCAACCAAA TCATATCGGT
GCTTCTACAG ACTACGATGC CAACATAATC TCTCCGCCAT ACTACGACTA CAAGACAACG
TCGTCTTTCT CAGTAAACCA CCAGGTACAG CCTCAAACTG CTGGCAAGTT TGAAGTGGAC
CAAATCAACA GAAAATCTAA CAGCTTCTTG GCCAAGATCT CAAAAGAGAA CTCGTCCAAC
TTCGAACATG TAGAATACGC GGAAGACAAA AACGACGACA TGAAGGACAT ACGGATGCAC
GAAGAAGAAC AAAAGAGTTC CGAGAACTAT CCATATGGAA GTAACAAAGG CTCAGTGCCA
GGCTCAACTC AATATGTGCC ATATAGCATG TCGTCTTCTA CATCTTTGCG AACGCCCACA
ACTGAGAATG TACCAAAATT AGAATATTCC AGTGACATTC CAGAAATGTT TCAAAACTAT
GACTACTTCA AGAATAAGAG TCCAAACTAC GAACTTCCTG AACCACCACA GCTCCCAGCA
CACTTAAACA ACGTGTTATT AAACAAAATG TCGCAGACAT CGTCTCAAAG CTCCCAGAGC
CACATTAGCA ATTCACAGAC AGCTCACAGC TCTTCTTACG GTTCCACGCA TCATCAGAGC
CTCAAACCTC CCAATGCTGC ATTTGTTTCA GAATCTAGCC CAACTAACCA GTCTCATAAC
AAAAGACCCA CCTTAAGAAG AGCCGACAGC TCATACTACG CTTCAAACAA AGAATCCTAC
CACCAGTCAA TTCCCAACCA CGTGATCTTG AACCATTTGA TGACGACCTC CATTAGAAAC
GACGTCTTAA CAGTTGCTTG TATAACAAGG TACTCCGGTA AGTTCGTTAC CCAAATCATG
CATTCTCCAG CAGATAAATG AGATGAATGT TATGAATGCA AATGCTGCGA ATGCTAAGGG
AAAGTAGAAT AGTGGAATTA CTGGCTAGTG TTTTTGTATG TATGCTTATT ATTGCTTTAT
TTAACTTGTA TTTTTCTCTC GTTCTCGTTG ACTAGCTTAC GCGGTTAATT CATTCAATTC
ATAAATAAAA CATGAAATAA ATAACAATAC AAACTAG
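
The gene sequence above is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before its length or GC content can be checked against the record. A minimal Python sketch, assuming plain uppercase A/C/G/T input; the helper names `clean_sequence` and `gc_fraction` are illustrative only.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small sample.
sample = clean_sequence("ATGGGAAACA ACACTTCAGT")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.4
```

Applied to the full listing, the cleaned length can be compared with the reported gene length of 2137 bp and the GC fraction with the reported 43%.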
 
Protein sequence
MGNNTSVSAN TSTRKSAAAI LHQQNLDLES QVSVAGSRKS SSGRNATAKG DTSLDEDFSD 
LILHQVKRQD LTSSTPAVTN TNANANLAHN TNADPYYSAT YNQFSNDLIE DEFNELNDGL
LDQEPEVSRN ILPDDASEIT TESTNSGAND ETQDNMDVDE DHFKAVTDQS NTDSDLDMNH
ASGLSKVDFT KVTPANQQPH EVSVKVDNSY VHQSRKRHNR SGNASASNVT SNLIPVEIKW
VNSSREVINK ISIIGSFTNW RDSIPLSLSP FHSNEYVTTL NLPLGVHKLL YIINNEYRVS
DQLPTATDLE GIFFNWFEVI DEAHLFNHSL NQPNHIGAST DYDANIISPP YYDYKTTSSF
SVNHQVQPQT AGKFEVDQIN RKSNSFLAKI SKENSSNFEH VEYAEDKNDD MKDIRMHEEE
QKSSENYPYG SNKGSVPGST QYVPYSMSSS TSLRTPTTEN VPKLEYSSDI PEMFQNYDYF
KNKSPNYELP EPPQLPAHLN NVLLNKMSQT SSQSSQSHIS NSQTAHSSSY GSTHHQSLKP
PNAAFVSESS PTNQSHNKRP TLRRADSSYY ASNKESYHQS IPNHVILNHL MTTSIRNDVL
TVACITRYSG KFVTQIMHSP ADK
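
The record lists a spliced gene spanning 2137 bp and a protein of 623 aa. A protein of that length needs fewer coding bases than the gene span provides, which fits a gene containing non-coding sequence such as introns; the record itself gives no exon coordinates, so the sketch below only does the arithmetic. The helper names `residue_count` and `coding_bases` are hypothetical, and counting one stop codon is an assumption.

```python
def residue_count(blocked: str) -> int:
    """Count residues in a blocked protein listing, ignoring whitespace."""
    return len("".join(blocked.split()))


def coding_bases(protein_length: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally with one stop codon."""
    return 3 * (protein_length + (1 if include_stop else 0))


# Last line of the protein listing above: two full blocks plus three residues.
print(residue_count("TVACITRYSG KFVTQIMHSP ADK"))  # 23

# Reported protein length (623 aa) against the reported gene span (2137 bp).
print(coding_bases(623))         # 1872
print(2137 - coding_bases(623))  # 265
```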