Gene PICST_43969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_43969 
SymbolECM4 
ID4838219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp306695 
End bp307663 
Gene Length969 bp 
Protein Length322 aa 
Translation table12 
GC content46% 
IMG OID640389534 
ProductExtra Cellular Matrix protein 
Protein accessionXP_001383688 
Protein GI150864731 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0435] Predicted glutathione S-transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGTTTG CTGCTAAAGA TGGAGCTTTC CACAGAAGAC CTTCGCTGTT CAGAGATTTT 
ATCAGCAACA AGCCTGGTTC CAAGTTCTTA GCCGAGGCTA ACCGTTACCA TTTATATGTT
TCGTTTGCGT GTCCTTGGGC TCACAGGACG TTGATCACCA GAGTGTTGAA GGGCTTGAGT
TCCGTGATCT CTGTGTCAGT AGTGCATTGG CATATGGATG ACAAGGGATG GAGATTCATC
AACGATGAGG AATTGAAGAC AATTGACCCC AAGACAGATG TTTCTTTGGG AACAGTAGAC
CATTTGTACA ACTTCAAGCG TATCAGGGAG TTGTACTTCA AGGCGGAGCC TGATTATGTT
GGCAGATTCA CCGTTCCAGT CTTGTGGGAC AAAAAGTTGG AGACCATTGT GAACAACGAA
TCGAGTGAAA TCATCCGGAT GTTAAACCTG GAGTTCAACG AGCTTGCCAC AAAGGAAGGA
GCTGCCATTG ATATCTACCC TAAGGAGTTG CAAACAGAGA TTGACGACAT TAACTCGTGG
ATCTACGACA ACATTAACAA TGGGGTGTAC AAGTCTGGGT TCTCCACCAA ACAGGAAGTG
TATGACAAAG AAGTCAAAAA CGTGTTCACT CATTTGGACA AGGTTGAGGA GATCTTGAAG
AAAAACCATG CGGCCGACAA GCCGTACTTG CTAGGTAACA CCTTGACCGA AGCAGACGTG
CGTTTGTTCA CCACCATAAT CAGATTTGAC CCTGTGTATG TTCAGCACTT CAAGTGCAAC
ATTGGTATGA TCAGACACGA TTATCCTCAC ATCCACCAGT GGGTCAGGGA ATTGTATTGG
AAGGTGCCTG GCTTCAAGGA GACCACCGAC TTCGACCACA TCAAATACCA CTACACGAAG
TCGCATATTG CCATCAATCC TCATTCAATC ACTCCAGCTG GTCCTATCCC CAACATCTTG
CCATTGTAG
 
Protein sequence
LKFAAKDGAF HRRPSSFRDF ISNKPGSKFL AEANRYHLYV SFACPWAHRT LITRVLKGLS 
SVISVSVVHW HMDDKGWRFI NDEELKTIDP KTDVSLGTVD HLYNFKRIRE LYFKAEPDYV
GRFTVPVLWD KKLETIVNNE SSEIIRMLNS EFNELATKEG AAIDIYPKEL QTEIDDINSW
IYDNINNGVY KSGFSTKQEV YDKEVKNVFT HLDKVEEILK KNHAADKPYL LGNTLTEADV
RLFTTIIRFD PVYVQHFKCN IGMIRHDYPH IHQWVRELYW KVPGFKETTD FDHIKYHYTK
SHIAINPHSI TPAGPIPNIL PL