Gene PICST_31980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31980 
Symbol 
ID4839142 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp388400 
End bp390511 
Gene Length2112 bp 
Protein Length690 aa 
Translation table12 
GC content36% 
IMG OID640390457 
Productpredicted protein 
Protein accessionXP_001384732 
Protein GI150865493 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.25184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCCG TTGTTCATTT TGATTTGTAT CCAGAGCATT TCTATCAAAA AATAATAGAT 
GAGTTACCAT TTCTTATTGT TCTAGTACTT GCCTGCAACT CCAGTTCACC CTACCAGAGG
TACCTTCTCA ATTCCATCTA CAAAGAAATC GAAATTCAAT ATATTTCTCC TCTGCCAGGA
CTAAGACTCG AATTTTTAAA GCAATTTTAT CTTCTAAGAT ATAATGTTAT AAATCCTAGT
GATGAGAAAG ATTATGTTCC CTGTAAGGTA GTTGGAAAAA ATGTAGATCT GTTGGTAACT
TTTCTTAACG AGAATCCCCT GGTTGCTGTT CGTCATCTAC GATTGGTTGC GATACCAAGT
GAAGAAGTAA TTGCAAAAGT TAGAAGTCTT TCTGACAGAA TTGGTAAGAT TACTTTTATT
TCTGCTCGTG TAATTGATGA ACAACAAGAT GCCAGCATGA ATTGGCCAAC AGAAAGAGAC
GCAGCATACT TGTGTGGACT TCGCAGTAAG TTTGTTGAAT CAATGTATTA TCCGGAATCC
TTAGTGCATC TAGATTTAAA CTTTTCAGAT CCAACAAATT ACAGTGATAC CCTTCTGGAA
CTTCTTTTCA AGCTTCGGTA TCCACCCAGA CTTGAATTTC TAAGTTTGTC CGGTGGCAAG
GAAGCGATTG ACTCTAAAGT ATTTTCTCGC TTTCCACGAA CTATTAAATC CCTCATCTTG
GACCTCTATG ATATTCAATG TGATGGATTT TTGAAATTGA ACTTGCCTCC TTTCTTGAAA
TTTTTCTCGT GTACAGTAAT TGTTGATAAG AATAACAGAT GTTTTGATAT TTCCCATCTT
TCCCATCTTA CAGAGGTGAA ATTGTTTTTC TACTATCATC TAATTCCATT GTCTATCTTC
AGATTTCCTC GTCTGTTGCA GACATTAAGT GTGGATAGTG GCCTTTCATC TTCGGGTATG
GAACAACTTG CAGAATTAGA CCAATTAAAA CAAGTAACCA TACGTGTTTA TCGTAGTCCG
AACAATCCGG TACCAATCTT AAGGGAAGCA GTCCATTTAC CAAATTCTAT TGAAGATCTC
CTGATACAAG ATTACACTAT TGACGGCGAT GTTGATGGTG CATATTTTAT TCCAAAAAGC
ACAAAGAAGC TCCAACTTCA AAATAGCTAC GGACTTTACT TTATTCTGGA ATTGAGCGTC
TTAGAATCAC TACACATTAG CTATACTTCG TGTAGGATTC CTCATCTACA AAATTTGAAT
ACTTTGGTAA TAAAGAGTGT GGAAATTGAT TCTGTATCCA TGTGGAAAGA TGTACATCGC
CTTACAAACT TGAAACATAT GAGCATAAAT GATTGCGAAC TTGATTGCTT GAATTGCACT
CTTCCTAGTT TCCTTGAAAC TCTCGATGTT TCACGAAACA ATATCGAAGA AGCTGATATC
ATACTTCCTG CAAATTTCAA GAGTTTGGAT ATCTCTCATA ATGAAATATG CAAATTCAGT
GCTAAAGGCA GATTGTTGAC ATTGAATCTT GATACTAATC GCATGTCCGA ATTATCGAAT
TCAACTCTCT GTATCCCCTG TACCGTTTGT GAATTGAACA TGAGTAATAA CGATACGATA
TCAATTTCAA GTGACTTTTC TTTTCCAGAA TCTGTGAAGG TGTTACGCTT AGATTACAAC
TTCTTTTCTG ATTATACGGT ATTATTCAAG ATGCCTTCCC AGATCTTGTT GCTACTGTTG
GACAGTTCTT TTTTTTTATA TCCAGAAACT AAAGAACCAA CTCCAGTAAT AATGAATTAT
CCGAAGCTCT GGCATTTCAG TATGACATCC TCCATAGGTA CTGAGTATCT TGACTTCAAT
TGGAATGGTT GTCCGAATCT AGAAAGTATC TTGATGAATG GCTGCAAGTT TGAAATAATT
AAACTTGAAA ATCTTCCGCC TTCAGTCAAG ATTGTTGATT TCAGTGATTG TAAAGTTCGA
AAAGTTGAAG GAAGATTTGA GAGATTACCT CATTTGATCG AGTTCAATCT CGAAGACAAC
CGATTGGCTC CAGGAGTAGA AACTTTCGGA AAAATGGGAA TGGGTTACGT CTCCCAAGCA
TTGCGTTGGT GA
 
Protein sequence
MDSVVHFDLY PEHFYQKIID ELPFLIVLVL ACNSSSPYQR YLLNSIYKEI EIQYISPSPG 
LRLEFLKQFY LLRYNVINPS DEKDYVPCKV VGKNVDSLVT FLNENPSVAV RHLRLVAIPS
EEVIAKVRSL SDRIGKITFI SARVIDEQQD ASMNWPTERD AAYLCGLRMH LDLNFSDPTN
YSDTLSELLF KLRYPPRLEF LSLSGGKEAI DSKVFSRFPR TIKSLILDLY DIQCDGFLKL
NLPPFLKFFS CTVIVDKNNR CFDISHLSHL TEVKLFFYYH LIPLSIFRFP RSLQTLSVDS
GLSSSGMEQL AELDQLKQVT IRVYRSPNNP VPILREAVHL PNSIEDLSIQ DYTIDGDVDG
AYFIPKSTKK LQLQNSYGLY FISELSVLES LHISYTSCRI PHLQNLNTLV IKSVEIDSVS
MWKDVHRLTN LKHMSINDCE LDCLNCTLPS FLETLDVSRN NIEEADIILP ANFKSLDISH
NEICKFSAKG RLLTLNLDTN RMSELSNSTL CIPCTVCELN MSNNDTISIS SDFSFPESVK
VLRLDYNFFS DYTVLFKMPS QILLLSLDSS FFLYPETKEP TPVIMNYPKL WHFSMTSSIG
TEYLDFNWNG CPNLESILMN GCKFEIIKLE NLPPSVKIVD FSDCKVRKVE GRFERLPHLI
EFNLEDNRLA PGVETFGKMG MGYVSQALRW