Gene PICST_74925 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_74925 
SymbolSEC21 
ID4851221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1233929 
End bp1236928 
Gene Length3000 bp 
Protein Length935 aa 
Translation table 
GC content42% 
IMG OID640392929 
Productcoatomer gamma non-clathrin coat protein involved in transport between ER and Golgi 
Protein accessionXP_001387882 
Protein GI126274207 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5240] Vesicle coat complex COPI, gamma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0538338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0907114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACTG TCTCTTACAA GAACAAGGAT GCGTACCTGT CGCTGTCGGG CTTGCCTGAC 
AAGATGGCCG TGTTCCAAGA ATGTCTCCAA CAGTTCAATG CAACACCAGT GAAGACGAAA
AAATGTCGTC AACTCCTAGC CAAATTGTTA AGGCTCATTT ATCATGGTGA AGAGTTCCCA
CCACTGGAGT CTACCACTTT GTTTTTCTCC ATCTCAAAGT TATTCCAGCA CAAGGACTCG
TCGTTGAGAC AATTGGTTTA CTTGACTATC AAGGAGTTGC TGTCGCTGTC AGACGACATC
TTGATGGTCA CGTCTTCCAT TATGAAGGAT ATTCAAGAGG GAGATGTTGT CTACAAGCCC
AATGCCATCA GAACATTGGC CAAAGTGTTG GATGCTACAA CAGTGTTTTC GGCCGAAAGA
TTGTTCAAGA ACGCCATTGT GGACAAGAAC CCTATTGTGT CAACAGCTGC CCTTATTTCA
TCGTACAACA TGTTGCCCAA TGCTAAAGAG GTGGTCAAGA GATTCACCAA CGAGACGTTG
GAGACAATCC AGAGCTACAA ACAGTTTCCC AAAGACCAGT TCCAGTTGCA TGAGTACTAT
GGTAGCTCTA CCTCCAACTT GCCAGCGACT TCTTACATGT ACCAATACCA TGCCTTGGGC
TTGTTGTACC ATTTGAAAAA CCACGACAAG ATGGCTCTTA TGAAGTTAAT CACAACGTTG
TCTGAGGGTT CGTCTTTAAA GAACTCGTTG TCGATCATCC AATTGATCAG ATATATCAAC
AAGATTTTGA TTGATGACGA ATCTCTCATT ACCCACTTGT ACCCCATCTT GTCTGGCTTG
TTGAAGCATA AGTCAGACAT GGTAGAGTTG GAAGCCTGTA AGACATTGAT CAACTTACAA
CACTTGATCA AGGACGACCA ATTCATGTCA ATTGTCACCA CATTGCAGAA GTTGTTGGGT
GTACCTAGAA CGGCTACTAG GTTCGCTGCC ATCAGATTGA TCAACAAGAT CTCTGCTAAA
CATCCAGAAA AGATCATTGT CGTCAACATC GAGTTGGAAG GCTTGATCAA CGACTCCAAC
AGATCAATTT CCACCTTGGC CATCACCACA TTGTTGAAAA CTATGGGAGC AGGTACCATT
GACTCTGGTG CCGGAGGTGA AAACGTAGAC AGATTGATCT CCAAGATGAC CTCGTTGATG
GACGAGATTA CGGAAGACTT CAAGATCGTG ATCATTGAAG CAATTGAAAA CTTAGCATTA
AAGTTCCCCT CGAAGCACAA GAAGTTGGTT GCATTTTTGA CCGATTTGTT GAGAGACGAC
GGTTCGCTTC AGTTGAAGAC AAGTATTGTA GATGCCTTGT TCGACTTGAT CAAGTTCTTG
CCTGAAGCCA GTGCCAAACA GTTGATATTG ATGAACTTGT GTGAATTCAT TGAAGATTGT
GAGTTCACCG AGTTGTCGGT TCGTATTTTG CACTTGTTAG GAGACGAAGG TCCAAACACA
TCCAATCCTT CTTACTACAT TAGACACATT TACAACAGAT TGGTTTTGGA AAACTCCATT
GTGAGATCGT CTGCTGTCAT TTCTTTGGCC AAGTTCGCTG CTGTTTGTGG TGGTGACGTT
TCTAAGAACA TCAAGATCTT GTTGAGCAGA TGTTTGAACG ATGTAGACGA TGAAGTTAGA
GACAGAACAG CCTTGTCATT GAAGTTCATC AACAGTGACC ATAAGAAGTT GATTGTTTCC
GGATCCAAGT ACGATTTAGC TGCTTTGGAA AGCAAATTAA CTCATTACTT GAACGAGACT
GATTTCGCTT CTTCATTTGA CATCAATGAA GTTCCACTTC TCAGCAGTGA AGAGTTGAAG
TCTATCGAGT ACAACAAGAA GATCAATAAG TTGGAGTCTT CCAATGCTGA CGCCAGTGAA
TCTAACGACA ACGTCAAGGG TTCCAAGACC GAAGACGACA GATCTGGTTC AGACAATTTG
GCCAACGACT TGTTGAAGCA ACAAGAATAC GCACAGGAAT TGTCTCAGGT TCCAGAGTTC
GCCGACTACG GCAAATTGTC CAAGTCGACC CTTACTCCAA AGTACTTGAC CGACAAGGAA
AACGAAGTTG TAGTCACTGT AGTCAAGCAC TTCTTCATCG AATCGCAAAA GTTGGTGTTG
CAATACGACG TCACCAACAC TTTACCTCGA TCCCTTATAC AAGACTTTTC TGTTATTGCC
GTTCCCGATA ACGAGTTATA CGAAGAAGAC TTCATTATTC CATTGGCTGA ATTGAAGCCA
GAACAAACCG GTACAGTTTA CATCTCGTTT AGTACCCCAA GTATAGAAGA CGAAGATTTG
CTTGCGGCCT TTGGCAACAC CATAAACTTT ATAAACAGAG AAATCATTGA CGATGAAGGC
AATGTCGATG AAGCCGATGA AGGATACACG GAAGAATTCG GCATCGAAGA CTTGGAAGTA
TTGCCAGGAG ACTTCCTTGC ACCTTTGTAC AACTCAAATT TCAGTGCAGC CTACGATCAG
TTGCCACACC ACGAGAGCTC GGTTGTTACG ATCTCTGGAG TCAACTCTTT AGAGAATGCT
GTCAGCAGCT TGAGAAGCAG CTTGAATTTG TTGCCATTAG ATGGATCTGA CTATGTTCCA
AGTGACACCA ATTCTCATGT GTTGAAGTTG TTTGGTAAAG ACGTTTGGGG CGGAAAAGTT
GGTGTGTTGA TCAGATTGGC TTTGACTGGC GGTAAGGTTG TTGCTAAGCT TGAAGTGAGA
GCAGAAACAG ACAATTTCAG CACTGCTGTA GCCAACGGAG CATACTGAAC TTGTAAATTT
AGTTTCTTCA GTTTTACATT TTTCTTTTCT TTGTGTACGA ATCAAAAGCC ATGTATAGAG
AAATTGAAAC TGTAGACACT CAAGTAGTTG ATACTACTAA AGGAAATTGG AATATTAGGC
TGGTATAGTT GATATTGAAA TGCGGTTGTG CTATTTTACG AGAATAGCCT ATAATTATAG
 
Protein sequence
MSTVSYKNKD AYLSLSGLPD KMAVFQECLQ QFNATPVKTK KCRQLLAKLL RLIYHGEEFP 
PLESTTLFFS ISKLFQHKDS SLRQLVYLTI KELLSLSDDI LMVTSSIMKD IQEGDVVYKP
NAIRTLAKVL DATTVFSAER LFKNAIVDKN PIVSTAALIS SYNMLPNAKE VVKRFTNETL
ETIQSYKQFP KDQFQLHEYY GSSTSNLPAT SYMYQYHALG LLYHLKNHDK MALMKLITTL
SEGSSLKNSL SIIQLIRYIN KILIDDESLI THLYPILSGL LKHKSDMVEL EACKTLINLQ
HLIKDDQFMS IVTTLQKLLG VPRTATRFAA IRLINKISAK HPEKIIVVNI ELEGLINDSN
RSISTLAITT LLKTMGAGTI DSGAGGENVD RLISKMTSLM DEITEDFKIV IIEAIENLAL
KFPSKHKKLV AFLTDLLRDD GSLQLKTSIV DALFDLIKFL PEASAKQLIL MNLCEFIEDC
EFTELSVRIL HLLGDEGPNT SNPSYYIRHI YNRLVLENSI VRSSAVISLA KFAAVCGGDV
SKNIKILLSR CLNDVDDEVR DRTALSLKFI NSDHKKLIVS GSKYDLAALE SKLTHYLNET
DFASSFDINE VPLLSSEELK SIEYNKKINK LESSNADASE SNDNVKGSKT EDDRSGSDNL
ANDLLKQQEY AQELSQVPEF ADYGKLSKST LTPKYLTDKE NEVVVTVVKH FFIESQKLVL
QYDVTNTLPR SLIQDFSVIA VPDNELYEED FIIPLAELKP EQTGTVYISF STPSIEDEDL
LAAFGNTINF INREIIDDEG NVDEADEGYT EEFGIEDLEV LPGDFLAPLY NSNFSAAYDQ
LPHHESSVVT ISGVNSLENA VSSLRSSLNL LPLDGSDYVP SDTNSHVLKL FGKDVWGGKV
GVLIRLALTG GKVVAKLEVR AETDNFSTAV ANGAY