Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_74925 |
Symbol | SEC21 |
ID | 4851221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1233929 |
End bp | 1236928 |
Gene Length | 3000 bp |
Protein Length | 935 aa |
Translation table | |
GC content | 42% |
IMG OID | 640392929 |
Product | coatomer gamma non-clathrin coat protein involved in transport between ER and Golgi |
Protein accession | XP_001387882 |
Protein GI | 126274207 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5240] Vesicle coat complex COPI, gamma subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0538338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0907114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACTG TCTCTTACAA GAACAAGGAT GCGTACCTGT CGCTGTCGGG CTTGCCTGAC AAGATGGCCG TGTTCCAAGA ATGTCTCCAA CAGTTCAATG CAACACCAGT GAAGACGAAA AAATGTCGTC AACTCCTAGC CAAATTGTTA AGGCTCATTT ATCATGGTGA AGAGTTCCCA CCACTGGAGT CTACCACTTT GTTTTTCTCC ATCTCAAAGT TATTCCAGCA CAAGGACTCG TCGTTGAGAC AATTGGTTTA CTTGACTATC AAGGAGTTGC TGTCGCTGTC AGACGACATC TTGATGGTCA CGTCTTCCAT TATGAAGGAT ATTCAAGAGG GAGATGTTGT CTACAAGCCC AATGCCATCA GAACATTGGC CAAAGTGTTG GATGCTACAA CAGTGTTTTC GGCCGAAAGA TTGTTCAAGA ACGCCATTGT GGACAAGAAC CCTATTGTGT CAACAGCTGC CCTTATTTCA TCGTACAACA TGTTGCCCAA TGCTAAAGAG GTGGTCAAGA GATTCACCAA CGAGACGTTG GAGACAATCC AGAGCTACAA ACAGTTTCCC AAAGACCAGT TCCAGTTGCA TGAGTACTAT GGTAGCTCTA CCTCCAACTT GCCAGCGACT TCTTACATGT ACCAATACCA TGCCTTGGGC TTGTTGTACC ATTTGAAAAA CCACGACAAG ATGGCTCTTA TGAAGTTAAT CACAACGTTG TCTGAGGGTT CGTCTTTAAA GAACTCGTTG TCGATCATCC AATTGATCAG ATATATCAAC AAGATTTTGA TTGATGACGA ATCTCTCATT ACCCACTTGT ACCCCATCTT GTCTGGCTTG TTGAAGCATA AGTCAGACAT GGTAGAGTTG GAAGCCTGTA AGACATTGAT CAACTTACAA CACTTGATCA AGGACGACCA ATTCATGTCA ATTGTCACCA CATTGCAGAA GTTGTTGGGT GTACCTAGAA CGGCTACTAG GTTCGCTGCC ATCAGATTGA TCAACAAGAT CTCTGCTAAA CATCCAGAAA AGATCATTGT CGTCAACATC GAGTTGGAAG GCTTGATCAA CGACTCCAAC AGATCAATTT CCACCTTGGC CATCACCACA TTGTTGAAAA CTATGGGAGC AGGTACCATT GACTCTGGTG CCGGAGGTGA AAACGTAGAC AGATTGATCT CCAAGATGAC CTCGTTGATG GACGAGATTA CGGAAGACTT CAAGATCGTG ATCATTGAAG CAATTGAAAA CTTAGCATTA AAGTTCCCCT CGAAGCACAA GAAGTTGGTT GCATTTTTGA CCGATTTGTT GAGAGACGAC GGTTCGCTTC AGTTGAAGAC AAGTATTGTA GATGCCTTGT TCGACTTGAT CAAGTTCTTG CCTGAAGCCA GTGCCAAACA GTTGATATTG ATGAACTTGT GTGAATTCAT TGAAGATTGT GAGTTCACCG AGTTGTCGGT TCGTATTTTG CACTTGTTAG GAGACGAAGG TCCAAACACA TCCAATCCTT CTTACTACAT TAGACACATT TACAACAGAT TGGTTTTGGA AAACTCCATT GTGAGATCGT CTGCTGTCAT TTCTTTGGCC AAGTTCGCTG CTGTTTGTGG TGGTGACGTT TCTAAGAACA TCAAGATCTT GTTGAGCAGA TGTTTGAACG ATGTAGACGA TGAAGTTAGA GACAGAACAG CCTTGTCATT GAAGTTCATC AACAGTGACC ATAAGAAGTT GATTGTTTCC GGATCCAAGT ACGATTTAGC TGCTTTGGAA AGCAAATTAA CTCATTACTT GAACGAGACT GATTTCGCTT CTTCATTTGA CATCAATGAA GTTCCACTTC TCAGCAGTGA AGAGTTGAAG TCTATCGAGT ACAACAAGAA GATCAATAAG TTGGAGTCTT CCAATGCTGA CGCCAGTGAA TCTAACGACA ACGTCAAGGG TTCCAAGACC GAAGACGACA GATCTGGTTC AGACAATTTG GCCAACGACT TGTTGAAGCA ACAAGAATAC GCACAGGAAT TGTCTCAGGT TCCAGAGTTC GCCGACTACG GCAAATTGTC CAAGTCGACC CTTACTCCAA AGTACTTGAC CGACAAGGAA AACGAAGTTG TAGTCACTGT AGTCAAGCAC TTCTTCATCG AATCGCAAAA GTTGGTGTTG CAATACGACG TCACCAACAC TTTACCTCGA TCCCTTATAC AAGACTTTTC TGTTATTGCC GTTCCCGATA ACGAGTTATA CGAAGAAGAC TTCATTATTC CATTGGCTGA ATTGAAGCCA GAACAAACCG GTACAGTTTA CATCTCGTTT AGTACCCCAA GTATAGAAGA CGAAGATTTG CTTGCGGCCT TTGGCAACAC CATAAACTTT ATAAACAGAG AAATCATTGA CGATGAAGGC AATGTCGATG AAGCCGATGA AGGATACACG GAAGAATTCG GCATCGAAGA CTTGGAAGTA TTGCCAGGAG ACTTCCTTGC ACCTTTGTAC AACTCAAATT TCAGTGCAGC CTACGATCAG TTGCCACACC ACGAGAGCTC GGTTGTTACG ATCTCTGGAG TCAACTCTTT AGAGAATGCT GTCAGCAGCT TGAGAAGCAG CTTGAATTTG TTGCCATTAG ATGGATCTGA CTATGTTCCA AGTGACACCA ATTCTCATGT GTTGAAGTTG TTTGGTAAAG ACGTTTGGGG CGGAAAAGTT GGTGTGTTGA TCAGATTGGC TTTGACTGGC GGTAAGGTTG TTGCTAAGCT TGAAGTGAGA GCAGAAACAG ACAATTTCAG CACTGCTGTA GCCAACGGAG CATACTGAAC TTGTAAATTT AGTTTCTTCA GTTTTACATT TTTCTTTTCT TTGTGTACGA ATCAAAAGCC ATGTATAGAG AAATTGAAAC TGTAGACACT CAAGTAGTTG ATACTACTAA AGGAAATTGG AATATTAGGC TGGTATAGTT GATATTGAAA TGCGGTTGTG CTATTTTACG AGAATAGCCT ATAATTATAG
|
Protein sequence | MSTVSYKNKD AYLSLSGLPD KMAVFQECLQ QFNATPVKTK KCRQLLAKLL RLIYHGEEFP PLESTTLFFS ISKLFQHKDS SLRQLVYLTI KELLSLSDDI LMVTSSIMKD IQEGDVVYKP NAIRTLAKVL DATTVFSAER LFKNAIVDKN PIVSTAALIS SYNMLPNAKE VVKRFTNETL ETIQSYKQFP KDQFQLHEYY GSSTSNLPAT SYMYQYHALG LLYHLKNHDK MALMKLITTL SEGSSLKNSL SIIQLIRYIN KILIDDESLI THLYPILSGL LKHKSDMVEL EACKTLINLQ HLIKDDQFMS IVTTLQKLLG VPRTATRFAA IRLINKISAK HPEKIIVVNI ELEGLINDSN RSISTLAITT LLKTMGAGTI DSGAGGENVD RLISKMTSLM DEITEDFKIV IIEAIENLAL KFPSKHKKLV AFLTDLLRDD GSLQLKTSIV DALFDLIKFL PEASAKQLIL MNLCEFIEDC EFTELSVRIL HLLGDEGPNT SNPSYYIRHI YNRLVLENSI VRSSAVISLA KFAAVCGGDV SKNIKILLSR CLNDVDDEVR DRTALSLKFI NSDHKKLIVS GSKYDLAALE SKLTHYLNET DFASSFDINE VPLLSSEELK SIEYNKKINK LESSNADASE SNDNVKGSKT EDDRSGSDNL ANDLLKQQEY AQELSQVPEF ADYGKLSKST LTPKYLTDKE NEVVVTVVKH FFIESQKLVL QYDVTNTLPR SLIQDFSVIA VPDNELYEED FIIPLAELKP EQTGTVYISF STPSIEDEDL LAAFGNTINF INREIIDDEG NVDEADEGYT EEFGIEDLEV LPGDFLAPLY NSNFSAAYDQ LPHHESSVVT ISGVNSLENA VSSLRSSLNL LPLDGSDYVP SDTNSHVLKL FGKDVWGGKV GVLIRLALTG GKVVAKLEVR AETDNFSTAV ANGAY
|
| |