Gene PICST_83067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_83067 
Symbol 
ID4838900 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1444140 
End bp1445430 
Gene Length1291 bp 
Protein Length363 aa 
Translation table12 
GC content44% 
IMG OID640390215 
Productpredicted protein 
Protein accessionXP_001384575 
Protein GI126136102 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00421755 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAA AAGAAAGACT TGCCGAATTG GAGCGGGATG CCCAAGAGTC GTATTCGTTT 
TTGGTATCTG ATGAGAAGTC ATCGACATTC CGCAGTCAGG AGGAGTTCAA AGAATTTGTA
GTAGACAAGA AGCAATCTCC AGTAGAAAAG ACACCAGTGA AGCCATGGGT CCATTTTGTA
GCTGGTGGTA TCGGGGGAAT GGTCGGTGCC ATAGTAACGT GCCCTTTAGA TGTGGTGAAA
ACGAGATTGC AATCAGACGT CTACCATGCC ATGTACAACA AGACACCTAA GTCTGCGAAC
CCTGTAATCA AGATGTTTCA GCATTTGAAG GAAACAGGCT CCGTTATTAG GGAATTGTAT
GTGAGCGAAG GTTCTAGGGC CTTGTTCAAA GGTTTGGGAC CAAATTTGGT CGGTGTGATA
CCTGCTCGTT CTATCAACTT CTTCACATAC GGCTCTACCA AAGAGTTCTT GACCAGCAAC
TTCAACCAGG GCCAGGAAGC CACCTGGATT CATTTGGCAG CCGGTATAAA CGCCGGTTTT
GTCACCTCGA CAGCTACCAA TCCAATCTGG TTGATCAAGA CCAGATTACA GTTGGACAAA
ACTAAGGGCA AACACTATAA AAGCTCTTGG GATTGCCTCA CTCATGTGAT CAAGCACGAA
GGATTCAGTG GCCTTTACAA GGGTTTGAGT GCTTCATATT TGGGAGGTGT AGAATCGACG
TTGCAATGGG TGTTGTACGA ACAGATGCGG ATGTTTATCC ACAGAAGATC GTTGGCTCTA
CATGGAGATG ATCCTAGTAG TAAAACTACT AGAGACCACA TCATAGAATG GTCTGCCCGA
TCTGGTGCTG CCGGTGCTGC CAAGTTCATA GCATCTTTAA TTACGTATCC TCATGAAGTG
GTCAGAACTC GTTTGAGACA AGCTCCGTTG GAGTCCACAG GTAAGCCGAA GTACACGGGC
TTGATCCAAT GCTTCAAATT GGTGTTGAAG GAAGAGGGTC TTGCCAGCAT GTATGGAGGT
TTGACTCCAC ACTTGTTGAG AACAGTGCCC AACTCCATCA TCATGTTTGG CACCTGGGAG
CTTGTAGTTC GTTTATTGTC ATGAGCATTT GAGAGACATG TTGCTCTTTC TAACGTAATA
TCTTGTACAT ATTTGCTTGG GTTCGTTCTT CATGAAGGTT GGTTTTGTTA TATATATCTA
TATATATCTA TCGTTTCTTG TTCATAAGAG ATTATCATCT CTTTGTATAT AATTCATACA
AGTTTCGGTT TCTTCGACTG TTTTTCTGTT A
 
Protein sequence
MTQKERLAEL ERDAQESYSF LVSDEKSSTF RSQEEFKEFK QSPVEKTPVK PWVHFVAGGI 
GGMVGAIVTC PLDVVKTRLQ SDVYHAMYNK TPKSANPVIK MFQHLKETGS VIRELYVSEG
SRALFKGLGP NLVGVIPARS INFFTYGSTK EFLTSNFNQG QEATWIHLAA GINAGFVTST
ATNPIWLIKT RLQLDKTKGK HYKSSWDCLT HVIKHEGFSG LYKGLSASYL GGVESTLQWV
LYEQMRMFIH RRSLALHGDD PSSKTTRDHI IEWSARSGAA GAAKFIASLI TYPHEVVRTR
LRQAPLESTG KPKYTGLIQC FKLVLKEEGL ASMYGGLTPH LLRTVPNSII MFGTWELVVR
LLS