Gene PICST_82351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_82351 
Symbol 
ID4837305 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2347872 
End bp2350318 
Gene Length2447 bp 
Protein Length667 aa 
Translation table12 
GC content45% 
IMG OID640388620 
Productconserved hypothetical protein 
Protein accessionXP_001383205 
Protein GI150864411 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.18371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.103561 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AAATATAACG AGACATAAAT CTCGTGTGTC TGGCTGAATA TATTGTGAAG CATATATAGC 
TAAGAGGTTG TTTGTGATTT TCCCTACTGC TGAGACTCTC AACATATCTA GTTTTGCATT
TTATGCTGGA TACTCATCTG ATTTCATCTG GCAAATTTTG TTGAACCACA ATTCATTCTT
ACGTTATCAT TTGCGACACT CTTTTAGGAA CCCTCAGACC CAAAGATATT ACTATACGAA
TAGGTGTTTC ATTAGAAGAC ATTTCATATA AGAGACTAAA AGATACCTAA AGCATGGCTC
TTCTTTCGAG GGTACTTCAT CACCACTCCG ACCGACCGGA CAGGCGTGAG AGATCGTCTT
CGTTCCGATC TAATAGCAGC AGCAGCAGTT TTTCCAACTT GACACCAACT AGATCCGGAC
CACCATCTAT AAGTCATTAC TCGACATCTC CCTCTTTTTC CATATCGGTA ACGTTGGAAT
CTCCTCCGAT TGTGTTGTAC GGACAGCCTT CGGAGTCCAC TGGTTCCATT ATATCCGGAC
TACTTAAGTT ACACATCAGA CCTCCAAATT CCAAACCTTC TCGTAGTGAC AGTTCTAGTC
TGTCGTCGGA AGCAGAGATC GATGTGGAGT CGGTTACGTT ATCGTTGGTT CAAACCATCA
GATACACCAA ACCTTTCTTG CTAGCCTCAT CTACGATCTC AAATTGTAAG GATTGTACTA
CCAAGACGAC CACTTTGGCC AGATGGGATG TGTTGACCAC ACGGTGCTCG TTTGCTGTAG
GGAGTCACGC CTATCCATTC TCGCATTTGT TGCCTGGATC TTTACCAGCT TCATGTAAAT
TGGGATCACC CAATTCTCAG TCATACATCA AGTACGACTT GATTGCCGTA GCCAAAAAAT
CTAATTCAAA TAGCGAAACC ATAGTCAAAT TGCCCATCAA CATCTCGCGT CTGATTCTCC
GAGGTCCAGA TAGGAACTCT CTTAGAGTGT TCCCACCTAC GGAGGTCACT GCCGCAGCAG
TATTGCCCAA TGTAGTATAC CCGAAGTCAA CTTTTCCTGT TGAGTTGAAG TTGAATTATG
TAGTTAATGC TGCCGGGGAT CGGAGATGGA GAATGCGGAA ATTAACATGG AAGATCGAAG
AAACGATTAA GACTAGAGCA TCGTCATGTG CCAAACACGA GCAAAAAAGA AGGGCGCTTG
AAGCCCATCA GAAGAAGATA CCTCCTCCAA AGGACAATCC GGTAAACAGG GCAAGTAACC
ACCATAGCAA TATTCACACG AGTATGTCTT TTCTAGCGCA TCCCAGAAAC AACGGTCACC
ACAATTTAAA CGGTCCTGAC AATCTGGCTC CCGACGACAG CGAGCCTGTG ACAGACGATG
TAGTGAACGA TGTCGAAGAA ATGATTCGGA GTGGTCCGTC GCATGCACAC GAGAACTTCA
TAGAAGATTT TGGTTCGAAC GGTAATGTCG AAAGTTCGAG ATTGAGATCG AATCAATTGC
CAACAAGTAA CAATATTACA CCTGAGCCTA CGCAGCAGGA CTTAGGAGAG TATTTGTATA
TCGAGGAAAC TCGAGCTGTA TCCCATGGTG AGATCAAGTC TGGTTGGAAA TCAGATTTCT
CTGGCCGTGG AACCATTGAG TTGGTTGCTG ATATTAGCGC TTACAATTGT TCTTCTGGGT
TCAGTAAGAA TGTCACAAAG ATGAGCAGTG ATGATGACAG AGTCGACGAC CTTCAAATAG
GATTACGTAA CGGTGCCAAT ATCTGCTGTG ATATGGACGA CCCGTGCCTT GGAGTATTTG
TGTTCCACAC TTTGATCTTA GAAGTGATTG TAGCTGAAGA AATAGTTCAC AATGTACTTG
GCAAGTTGCA GCACAAGAGA ACAGAAAGAT CCGAGGGAAT TCTAAACTTG ACACCAGTAG
ACTCGAGTTC TTCAGCTCTT GTAGACGAAA ATGGAGCTCC TACCCCAACA ACTGAATTCA
TACCCTCCAC TACACCGTCA CAGATGGGTG TTCCTACAGG TGCTGCCAGA GTGTTGAGAA
TGCAATTCAA GTTGTGTGTA ACTGAACGAT CTGGTTTGGG TATTGCCTGG GATGATGAAG
TTCCACCAAC ATACGAAGAT GTGAGAACCT TGAGTCCTCC AACATACACT CCAAAGGGTA
CACCGGTGAC AGGTCCAGTA ACTGGATCGG TTGTATCCGC ATCCCAAACG CCAGGACTGG
GACGTGGTAC ACCCAACGTC CTTTACGGTG TTGGGGAGAC TCCACTAATG GGAAGCTTAG
GACTCAACCG TGAGGTGCGC CCCTTGAACC TCGACGAGCT CGCGGATCTT GATGAGAGGA
TCCAGGAGTT GTCATTATAG ACTGAGGGGT TCATTATGAC TATTTATGTA TATATTAACA
TTACACCAGC ATATGACACA TTTCAGTAAT ATATGGTATA TGAGTGG
 
Protein sequence
MALLSRVLHH HSDRPDRRER SSSFRSNSSS SSFSNLTPTR SGPPSISHYS TSPSFSISVT 
LESPPIVLYG QPSESTGSII SGLLNSSSEA EIDVESVTLS LVQTIRYTKP FLLASSTISN
CKDCTTKTTT LARWDVLTTR CSFAVGSHAY PFSHLLPGSL PASCKLGSPN SQSYIKYDLI
AVAKKSNSNS ETIVKLPINI SRSILRGPDR NSLRVFPPTE VTAAAVLPNV VYPKSTFPVE
LKLNYVVNAA GDRRWRMRKL TWKIEETIKT RASSCAKHEQ KRRALEAHQK KIPPPKDNPV
NRASNHHSNI HTSMSFLAHP RNNGHHNLNG PDNSAPDDSE PVTDDVVNDV EEMIRSGPSH
AHENFIEDFG SNGNVESSRL RSNQLPTSNN ITPEPTQQDL GEYLYIEETR AVSHGEIKSG
WKSDFSGRGT IELVADISAY NCSSGFSKNV TKMSSDDDRV DDLQIGLRNG ANICCDMDDP
CLGVFVFHTL ILEVIVAEEI VHNLQHKRTE RSEGILNLTP VDSSSSALVD ENGAPTPTTE
FIPSTTPSQM GVPTGAARVL RMQFKLCVTE RSGLGIAWDD EVPPTYEDVR TLSPPTYTPK
GTPVTGPVTG SVVSASQTPG SGRGTPNVLY GVGETPLMGS LGLNREVRPL NLDELADLDE
RIQELSL