Gene PICST_31835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31835 
Symbol 
ID4838765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1797158 
End bp1798561 
Gene Length1404 bp 
Protein Length467 aa 
Translation table12 
GC content40% 
IMG OID640390080 
Productpredicted protein 
Protein accessionXP_001384656 
Protein GI126136264 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTA TAGTAGAAAT TTGGAGAAGA CCCATATCCC AGGTGATTTT GGTTGGTTGT 
GTGCTATTCA CGCAACCAGG AATGTTCGAT GCCATAACAG CAATAGGTGC CGGTGGCCAG
AAGGCTACTC TTTCGTGGCT AACAAATCAG GCCCTTGCTA CCTTGTACGG ATGCTTTGCT
GTAGTCGGTT TCATGGGAGG ATCATTTGTC AACACATTAG GTACAAGGAT TACCTTCTTC
CTTGGGACCA TAGGTTACAC ATTGTATATC GGCTCCTTGT GGTGTCTTGA TGAAACGGGA
AACACTGGAT TTGTTGTTGC TGGAGGAGCA TTGTGTGGAA TATCAGCTGG TCTATTATGG
TCTGTGCATG GTATGGTTAT TATGTCTTAC CCAGAAGAAA AAGACAAGGC AAAATGTTTT
GCGTTGACTT GGAGTTTACT ATCTGTAGGG GCTACTCTTG GTGGGTTGAT CAGTTTATTA
CAAAATGCTC AGCATGCAGA TACTTCAGGT GTTGCTACAG GAACATATGT TGCATTCATG
TGTATTATGC TTGTGGGATT GCTCATTTCT CTTTTGTTAT TGAATCCAAA AGACATTCGC
AGAAGTGATG GATCTAAATT GGAAAATTTC AAACAGACTT CATTCAAAAG AGAAATTGTA
GATACTTGCA AGCTTTTAGG GGATTCACGT TTGGTGATGT TGTTCCCAGC ATTCTTTGCT
AGTAATTTCT TCTATTCGTA TCAATTTGGT ATCAATGCTT TCTACTTCTC TCTTAGGACT
AGATCTTTGA ATTCTATGGT ATATTGGCTA ACTCAGATTA TCGGCACATT TGGACTTGGT
CTAATTCTCG ACAATACTAA GCTTGAAAGA AAGCAAAGAG GAATAATTGG CCTTGCTGTT
ACTTGTGTTG TTGTTATCGC AACTTGGATT GGTGGTGCTG TTTTCCAAAC TCAATTTACA
AGGTCTTCTT CGCCTCCAAA TGTTGACTGG ACTGATTCAA ATTTCGGTGG TCCATTTGTA
CTATATTTCA TGTATGGAAT TTCAGATGCT ATGTGGCAAT GCTGGTGTTA TTGGATAATG
GGCTCCTTAT CGAACGAATC GTATAAACTT GCTCGTTATG CTGGGTTTTA CAAAGGTGTT
CAATCGGCAG GTTCTGCTAT CTCTTTTGGA TTAGATTCCT TGCAAATACC CTTTAATAGA
GAATTGGGAG CAAATTTTGG TATGACTCTC TTTAGTATGC CATTCATGCT TTATGTTGCC
ACAAAGTTGA CTAAGACCAA CTATGATCAA GAGCAAGAAG TTATACCACC AGCACATGTT
CAAGAGGAGC TTGGTGTTGA AGGTGCTCGT GATTCACAAA GTAGTGTAAT TCTTATTGAA
GAAATTAATA CACTAAAGGT ATAG
 
Protein sequence
MKFIVEIWRR PISQVILVGC VLFTQPGMFD AITAIGAGGQ KATLSWLTNQ ALATLYGCFA 
VVGFMGGSFV NTLGTRITFF LGTIGYTLYI GSLWCLDETG NTGFVVAGGA LCGISAGLLW
SVHGMVIMSY PEEKDKAKCF ALTWSLLSVG ATLGGLISLL QNAQHADTSG VATGTYVAFM
CIMLVGLLIS LLLLNPKDIR RSDGSKLENF KQTSFKREIV DTCKLLGDSR LVMLFPAFFA
SNFFYSYQFG INAFYFSLRT RSLNSMVYWL TQIIGTFGLG LILDNTKLER KQRGIIGLAV
TCVVVIATWI GGAVFQTQFT RSSSPPNVDW TDSNFGGPFV LYFMYGISDA MWQCWCYWIM
GSLSNESYKL ARYAGFYKGV QSAGSAISFG LDSLQIPFNR ELGANFGMTL FSMPFMLYVA
TKLTKTNYDQ EQEVIPPAHV QEELGVEGAR DSQSSVILIE EINTLKV