Gene PICST_30676 details


Gene Information

Locus tag: PICST_30676
Symbol:
ID: 4837933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 654706
End bp: 656874
Gene Length: 2169 bp
Protein Length: 722 aa
Translation table: 12
GC content: 44%
IMG OID: 640389248
Product: predicted protein
Protein accession: XP_001383760
Protein GI: 150864787
COG category: [S] Function unknown
COG ID: [COG1297] Predicted membrane protein
TIGRFAM ID: [TIGR00728] oligopeptide transporters, OPT superfamily
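
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length(654706, 656874)
print(length)                  # 2169
print(protein_length(length))  # 722
```

Both results agree with the Gene Length and Protein Length fields listed in the record.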


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.105769
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0372054
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACATT CGCCGGGAAA TGCTTCACAA ACAACGCTTG AGCCAGTGAG CCACACCTAT 
GGCGAGAAAC TAGCCCTCCC CCAGGTCACT CTACGTGCTA CTCTTGTAGG CTTGGTTATA
GGGTCCATAG TGCTCGTGCT GAACTTCCAG TTTGGACTCC AGACAGGCTG GGTGCTGATG
ATGTCACTTC CAGCTGCCTT GTTGGGTTTC ACGATCTTCA AATTGTCACC ATATTCTGAA
GATTTTACTG ACGTAGAGAA TGTCTATGTC CAAAGTGTAG CTGTAGCTGT AGGGACAGGT
CCTCTATGCT ACGGACTTAT AGGGATTGTA CCGGCTATCG AAAAGTTCCT CACAGCAGAA
GAGTCTGGCT TGGGCGACAA ATTTTCGTTT TCGTTGTTTG ACTTGATGGT GTGGTCGCTT
GGCTTGGGTC TCTTTGGAGT CTTTTTTGCT GTTCCACTCC GAAAGCAAGT GATTATCAAG
GAGAAGCTTC CCTTTCCCTC AGGAAGTGCT ACAGCCACTT TGATCTCAGT AATGCATGGA
ACCGAAATCT ACGACGACGA ACAGGAAGCA AAAGCCATCA AAAAACATGA AAATGCCTTG
TTAAACAAAA CGTCCATGTC TCAGGATATG GAAAACTATC TTGCCAACGA CGAAACAGAA
CAAGAAGAAG ACTCACCAGT AAAGCAAAAT AGTCAAGAAC CTTCAGTCTA CTCTAAAAAT
ATCCATTCGT TATTGATTAC ATTTCTGGTG TCTGCCACTT ATACATTGGT ATCGTATTTT
GTTCCTGTCT TGAAAAACTT GCCCATATTT GGCACAAATC TTTCCAACAA GTACAAGTGG
AATTTCCAGC CTTCTCCAGC GTATATTGGG CAGGGTATAA TCATGGGTTT ACCAACTGTA
TCGTACATGT TGTTTGGTGC GATTTTGGGA TGGGGCATCC TTGCACCCTT AGCCAAGAGT
AAGAACTGGG CCCCAGGAGC TATTGACGAT TGGATTCATG GAGGCCAAGG ATGGATTCTA
TGGATAAGTT TGAGTGTGAT GATCAGCGAC TCACTTGTTT CTTTTCTAAT AGTGACGTTC
AAATCGATCA GAAGCGGAAT CAAGATCATG AAAGATTTGA ACACCCAAAG ACATTATACA
GAAGCAACTC TGCAAACTTT AGAATTGCAA CAGTCGTTAC TCAACGAAGA CAGCGACAGA
AACTCGAATG AGGACCCCTC TAACAGCACA TACTCTGGTC AACGAGCTAA AACAGCCGAG
ACGTCTTCTA TTAGTCCTCT TTCCAGAGCC GAGATTCAGA AGGACTACTT GGTGAGCAAC
ACGATAACGT TTTCTGGAAT CATCCTTTCA TCCATTCTCT GTCTTATTTC TATCAGAATC
GTGTTTGGTT CCATTGTTCC GTACTATGCT GCCGTTGTAG CAATTGTTCT TGCCTTGTTC
TTCTCTGTTC TTGGAGTTAG AGCTCTTGGA GAAACAGACT TGAACCCAGT TTCTGGAATT
GGAAAGTTGT CGCAGTTGAT CTTTGCAGTG GTCATTCCAG CCAACCACCC ATCCAAAATC
TTGATCAACT TGGTAGCAGG AGGTGTTGCG GAAGCTGGGG CGCAACAGGC CGGAGACTTG
ATGCAAGACT TGAAGACAGG TCATCTTATA GGAGCCTCGC CCAAGGCACA ATTCATAGCA
CAAATCATCG GTACATTATA TTCGGTTTTC CTCAGCAGTA TAATGTACAA GGTCTACAAT
GCAGTATACA CCATTCCTGG AGATTTGTTC AGAATACCGA CAGCCGTGAT CTGGATCGAC
TGTTCTCGGT TGGTAACTGG ACAAGGCTTA CCACCTATGG CATTTGAATT TTCGATGATC
TTTGGAGCAA TCTTTGGGTT TATTGCATTG CTCAAAAATA CCATTCCTTC AAGTTCTCGT
TTCCACAAGT ATCTTGTATA CTTGCCCAGT GGTATAGCCG TAGGTATCGG TATCTACAAT
ACGCCGAACT TCACACTTGC ACGGTTTGTT GGCGGTGTTA TTGCTTACTG GTGGATAAAC
TTTGGAAATA AAACAGCTGG AAACAACAGA ATCGCCATGA TCATTTTCAG CAGTGGTCTT
GTTCTTGGAG AAGGGTTGTT GAGCGTGGTT ACGATGTTAT TGACGAGTTT GGGAGTAAAA
CATTTTTAG
 
Protein sequence
MSHSPGNASQ TTLEPVSHTY GEKLALPQVT LRATLVGLVI GSIVLVSNFQ FGLQTGWVSM 
MSLPAALLGF TIFKLSPYSE DFTDVENVYV QSVAVAVGTG PLCYGLIGIV PAIEKFLTAE
ESGLGDKFSF SLFDLMVWSL GLGLFGVFFA VPLRKQVIIK EKLPFPSGSA TATLISVMHG
TEIYDDEQEA KAIKKHENAL LNKTSMSQDM ENYLANDETE QEEDSPVKQN SQEPSVYSKN
IHSLLITFSV SATYTLVSYF VPVLKNLPIF GTNLSNKYKW NFQPSPAYIG QGIIMGLPTV
SYMLFGAILG WGILAPLAKS KNWAPGAIDD WIHGGQGWIL WISLSVMISD SLVSFLIVTF
KSIRSGIKIM KDLNTQRHYT EATSQTLELQ QSLLNEDSDR NSNEDPSNST YSGQRAKTAE
TSSISPLSRA EIQKDYLVSN TITFSGIILS SILCLISIRI VFGSIVPYYA AVVAIVLALF
FSVLGVRALG ETDLNPVSGI GKLSQLIFAV VIPANHPSKI LINLVAGGVA EAGAQQAGDL
MQDLKTGHLI GASPKAQFIA QIIGTLYSVF LSSIMYKVYN AVYTIPGDLF RIPTAVIWID
CSRLVTGQGL PPMAFEFSMI FGAIFGFIAL LKNTIPSSSR FHKYLVYLPS GIAVGIGIYN
TPNFTLARFV GGVIAYWWIN FGNKTAGNNR IAMIIFSSGL VLGEGLLSVV TMLLTSLGVK
HF
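
The record lists translation table 12, NCBI's Alternative Yeast Nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below shows how the gene sequence maps to the protein sequence under that table and how a GC percentage can be computed. It uses only the Python standard library; the names `build_table_12`, `translate`, and `gc_percent` are hypothetical helpers written for illustration, not the pipeline that produced this record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in NCBI order (third base varies fastest: TTT, TTC, TTA, TTG, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table_12() -> dict:
    """Codon table for translation table 12: the standard code with CTG -> Ser."""
    table = {"".join(codon): aa
             for codon, aa in zip(product(BASES, repeat=3), STANDARD_AAS)}
    table["CTG"] = "S"  # the only sense-codon change relative to the standard code
    return table


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    table = build_table_12()
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGTCACATT CGCCGGGAAA TGCTTCACAA"))  # MSHSPGNASQ
# A CTG codon from the third sequence line is read as serine under table 12.
print(translate("GTGCTGAAC"))                         # VSN
# The final nine bases give the last two residues, then the TAG stop.
print(translate("CATTTTTAG"))                         # HF
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and the reported GC content.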