Gene PICST_36589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36589 
SymbolHOL12 
ID4840307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp389444 
End bp391156 
Gene Length1713 bp 
Protein Length570 aa 
Translation table12 
GC content41% 
IMG OID640391622 
Producttransport protein 
Protein accessionXP_001385425 
Protein GI150865983 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.169894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.209561 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTCG AAGCTCCAAT TTTGTCGAAC GTTCAAATCA ACGATAAAGC CAACGACGAC 
TTTATTCCTG GTACAAAGAA TATTTATACG GATGCCATAG GTCCGGAAGA TGAGGTTACC
AAGAAGAAAG TGAAAACTCA GGGTAACATC ATTTTGATTC CACAACCTTG TAACTCTCCC
AATGATCCAC TTAATTGGTC CAAGTATAGA AAGTTTGTGA ACTTTGCTAT TCTTGCCGTT
ATCACTGGTT TTACCGCCGC TACTTCTAAC GATGCTGGTG CTATTCAAGA TTCCCTTAAC
GAGTTGTACG ATATATCGTA TGACAAAATG AACATCGGTG CGGGGGTGCT TTTCTTGGGA
ATTGGTTGGG GTACTTTTTT CATAACCCCA CTTGCATCTT TGTATGGTCG TAAATTGTCC
TACTTCATCT GTATTTTCTT GGGTTTACTT GGTGCCGTTT GGTTTGCTCT TTCCAGGCGT
ACTGCCGATG CCATCTGGTC CCAATTATTT GTAGGTATTT CCGAAAGTTG TGCTGAGGCG
CTGGTTCAGT TGAGTTTGAG TGAGTTATAT TTCCAACATC AATTAGGTTC CGTCTTAACT
GTATATATTT TGGCTACTTC CGTTGGTACT TTCTTGGGTC CTTTAATTGC TGGTTTTATT
GTTCAATATG TAGGTTTCAG ATGGGTGGGT TGGATTGCAG TCTTCATTTC CATTGGTTTG
CTCTTTATCA TTTTCTTCGT TCTCGAAGAA ACCATGTTCG ACAGAAAGAT ATTCTCCAGA
GGCGTTATAA ATGGAGTTTC TCCATCTTCG GACAAGCAAG TGCCTGAATT CGACGATATC
AAGAAGGAAC ACTCTAAGGA ATTTACTACC GTGGATGAAT CCAACTCCTC TGAGGATGAA
GACACTGATT TGATCAATAT GGGCGAAAAT GACCCACCAA AGTCCTACTG GAAAAACGTT
CAGTTGATTA CCTTAGCTCC AAACTTGCAA GGTACTGGTT TCTTACAATA TTTAACCCAA
TTGAGAATGT TGTTGAAGGT TTTCTTGTAT CCTCCAGTTA TCTTCTCTGG ATTAGTGTGG
GGTATGCAAG ATGCCCTTTT AACATTCTAC TTGACAGTTG AAGACGACGA ATACTATGAC
CCTCCTTACA GCTACGGTAA CACAGGTGTT GCCTTGATGA ATGTTCCAAC CTTAATTGGA
GCCATTATTG GGTGTTTCTT TGCTGGTCCT CTAAGCGATA AGTTCTCCAT CTGGATGGCA
AAGAGAAATA ACGGTATTCA AGAAGCTGAA TACCGTCTTT GGTTCCTTTT TGCCCCTGCA
GTAATTGCTC CAGTCGGTTT GATATTGTTT GCAGTGGGTA CTGATCAAGT TTGGGACTGG
CCACCTACTT ACGTAGGCTT AGGATTTATT GGGTTCGGTT TCGGTTGTTC TGGAGATGTT
TCCATGTCTT ATCTTATGGA TGCTTACCCA GATATGGTTA TTGAAATGAT GTGTGGTGTT
TCTGTTATCA ACAACATGAT TGGTTGCATT TTTACTTTCG CTTGTTCACC TTGGTTGGAT
GCCATGGGAA ACACACACAC CTTCATTATT TTGGCAGTCA TTGAAGCCGT TATTATGTTC
AGTGCTGCGC CAATGATCTG GTATGGCAAG AGAATCAGAA ACTGGACTAA GGATTGGTAT
ATTGAATTCT GTCAATTGCG TGACGGTATG TAA
 
Protein sequence
MNLEAPILSN VQINDKANDD FIPGTKNIYT DAIGPEDEVT KKKVKTQGNI ILIPQPCNSP 
NDPLNWSKYR KFVNFAILAV ITGFTAATSN DAGAIQDSLN ELYDISYDKM NIGAGVLFLG
IGWGTFFITP LASLYGRKLS YFICIFLGLL GAVWFALSRR TADAIWSQLF VGISESCAEA
SVQLSLSELY FQHQLGSVLT VYILATSVGT FLGPLIAGFI VQYVGFRWVG WIAVFISIGL
LFIIFFVLEE TMFDRKIFSR GVINGVSPSS DKQVPEFDDI KKEHSKEFTT VDESNSSEDE
DTDLINMGEN DPPKSYWKNV QLITLAPNLQ GTGFLQYLTQ LRMLLKVFLY PPVIFSGLVW
GMQDALLTFY LTVEDDEYYD PPYSYGNTGV ALMNVPTLIG AIIGCFFAGP LSDKFSIWMA
KRNNGIQEAE YRLWFLFAPA VIAPVGLILF AVGTDQVWDW PPTYVGLGFI GFGFGCSGDV
SMSYLMDAYP DMVIEMMCGV SVINNMIGCI FTFACSPWLD AMGNTHTFII LAVIEAVIMF
SAAPMIWYGK RIRNWTKDWY IEFCQLRDGM