Gene PICST_83452 details


Gene Information

Locus tag: PICST_83452
Symbol:
ID: 4838642
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 226615
End bp: 227853
Gene Length: 1239 bp
Protein Length: 403 aa
Translation table: 12
GC content: 48%
IMG OID: 640389957
Product: predicted protein
Protein accession: XP_001384004
Protein GI: 126134960
COG category:
COG ID:
TIGRFAM ID:
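
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though the numbers agree with it), and the helper names `span_length` and `coding_length` are hypothetical. It compares the genomic span with the bases needed to encode the reported protein plus one stop codon.

```python
# Values copied from the record above.
START_BP = 226615
END_BP = 227853
GENE_LENGTH_BP = 1239
PROTEIN_LENGTH_AA = 403


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


if __name__ == "__main__":
    length = span_length(START_BP, END_BP)
    # Does the span agree with the reported gene length?
    print(length == GENE_LENGTH_BP)
    # Bases in the span that are not accounted for by the protein.
    print(length - coding_length(PROTEIN_LENGTH_AA))
```

The span works out to 1239 bp, matching the reported gene length, while the 403 aa protein accounts for 1212 bp. The remaining 27 bp are consistent with the "Is gene spliced: Yes" field, although the record itself does not list intron coordinates.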


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCCAG GCTCCGGTCA CAACACCTAC GGAGGATACC CTCCTCCACA AGGTCCTCCT 
CCTAACAATA ATGGCTACAA CTCTGGCCCC AATAATAGCT ACAGGCAACA GGGCTATTCT
CGTCCACAGG GACCTCCTCC AGGTCAGTAT GATCAACAAT CCCAGTACTC TCAGCAATCT
CAGTACTCTC AACAGCCTCA ATACTCTAGA CCATCTGCTC CTCCTCAAGG AGGCACTGGC
TATGGCGACC AGAGTCAATG GGGACGTCCT ACAGGGCCCC CTCCATCTGG ATCTCAGTCC
TTCGGTCAGA ATTCTGGCTA CACGTTCCAA TATTCGAACT GTAGTGGGCG TAAAAAGGCG
CTTTTGGTAG GAGTAAACTA CTTTGGCTCA CCAAACGAAT TGCGGGGCTG CATCAACGAC
GTCAAGAACA TGAGTTCGTT TCTTGTTGAC CATTGGGGCT ACCAGTGGAA CGACATTGTC
ATTTTGACAG ATGACCAGAA CGATATATCT CGAGTTCCAA CCAAGAACAA CATCATCAGG
GCGATGCAAT GGCTTGTTAA GGATGCACGT CCTAATGACT CGTTGGTATT CCACTATTCT
GGTCACGGGG GTACAACAGC GGACACGGAT GGAGACGAAG AATCTGGTTA CGATGACGTT
ATCTACCCTG TTGATTTCCA GCAAGCTGGT CATATAGTGG ATGATGACAT GCATGCAATT
ATGGTAAGAC CTCTTCCTCC TGGTTGTCGT TTGACGGCTT TGTACGACTC TTGCCATTCT
GGAACCGCTC TCGACTTACC CTATGTGTAC TCCACTAAGG GAGTAGTCAA GGAACCTAAT
TTGTTGAAGG ATGCAGGCTC GGATGCACTT AATGCATTCA TTAGTTATGA GCGAGGCAAC
ATTGGAGGTG CCATTTCGTC GCTTACTGGA TTGGTTAAGA AAGTAGCCCG CCAAGGCTCT
ACCAACCAGG ACCAGGTAAG ACAAGCTAAG TTCTCTGCAG CTGATGTGAT CTCGATTTCT
GGGTGTAAGG ATGACCAGAC TTCTGCTGAT GCAAAGGAAA ACGGCCGAGC CACCGGTGCT
ATGTCGTGGT CGTTCATCAA AGTGTTGAAC GAGCTCCCCA ACCAGTCGTA CTTGTCTCTT
TTGAACAATA TGAGAACGAT CTTGGCGGCC AAGTACTCGC AAAAGCCGCA ATTGAGTTGT
TCTCATCCTC AGGATATGAA CATTCAATTC ATCATGTAA
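
The gene sequence is printed in space-separated blocks of ten bases, so it needs light cleaning before any computation. As a minimal sketch, the hypothetical helper `gc_content` below strips whitespace and reports the fraction of G and C bases; how the database itself computes and rounds its "GC content" field is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a sequence, ignoring spaces and line breaks."""
    # Join the whitespace-separated ten-base blocks into one uppercase string.
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


if __name__ == "__main__":
    # Toy input in the same blocked layout as the record, not the real gene.
    print(gc_content("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence text to `gc_content` gives a value that can be compared with the reported 48%.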
 
Protein sequence
MFPGSGHNTY GGYPPPQGPP PNNNGYNSGP NNSYRQQGYS RPQGPPPGQY DQQSQYSQQS 
QPSAPPQGGT GYGDQSQWGR PTGPPPSGSQ SFGQNSGYTF QYSNCSGRKK ALLVGVNYFG
SPNELRGCIN DVKNMSSFLV DHWGYQWNDI VILTDDQNDI SRVPTKNNII RAMQWLVKDA
RPNDSLVFHY SGHGGTTADT DGDEESGYDD VIYPVDFQQA GHIVDDDMHA IMVRPLPPGC
RLTALYDSCH SGTALDLPYV YSTKGVVKEP NLLKDAGSDA LNAFISYERG NIGGAISSLT
GLVKKVARQG STNQDQVRQA KFSAADVISI SGCKDDQTSA DAKENGRATG AMSWSFIKVL
NELPNQSYLS LLNNMRTILA AKYSQKPQLS CSHPQDMNIQ FIM
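
The record lists translation table 12, the alternative yeast nuclear code, which differs from the standard code in reading CTG as serine rather than leucine. The sketch below builds that table by hand and translates the first 30 bases of the gene sequence; the names `TABLE_12` and `translate` are hypothetical. Because the record flags the gene as spliced, translating the full genomic sequence is not expected to reproduce the whole 403 aa protein, so only the first ten codons are used here.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order, with CTG (index 19) changed from L to S
# as in translation table 12.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string with table 12, ignoring whitespace."""
    dna = "".join(sequence.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(TABLE_12[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence in this record.
    print(translate("ATGTTTCCAG GCTCCGGTCA CAACACCTAC"))  # MFPGSGHNTY
```

The result matches the first ten residues of the protein sequence above.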