Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_62474 |
Symbol | ATG21 |
ID | 4840049 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 764828 |
End bp | 766609 |
Gene Length | 1782 bp |
Protein Length | 552 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391364 |
Product | Autophagy-related protein 21 |
Protein accession | XP_001385850 |
Protein GI | 150866303 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.949866 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA TCAATGACTT GACCTTCAAC CAGGACTACT CGTGCATTTC TATTTCTACC TCCAACTACC ATCGCATCTT CAACTGCGAG CCATTTGGTC AGTTCTACTC TTCGTCTCAC GGAAACATTA AAAAGACCCT CTCCAACTCC ATAGAAACCA ATACTGTCAA CGCCAATGTT GATAATAATA CTAGGAATAG TAGAGCAAGT CTTGGTGACA ACACTATTCT GTTAGATGGA GCTCCTGAAG AAACTAAATG TCCGACTGCT TACTTGAAGA TGCTATTCTC TACTTCGCTA ACTATCATAG TTCCTCAGTC TCAGAACAAG CTCGGAAACA GGCTTCTCAA GATCTACAAC CTCAAACAAA ACTTGAAGAT CTGCGAGCTT TCGTTCCCGT CAAATATTAT CAACATCAAG CTTAACCGCA AACGTTTACT TGTCTTTTTG GAGATCGGCC ATATCTATAT CTACGACTTG AGTTGTGTCA GACTCATCAA AATCTTGGAA GTCAACTCAT ACTTGAATGA AACGGTATCT ACTGCAAATA ATTTAACCGA TTCTGGTCAA ACAGCAAGAG TATCGACCAA TATGAACAAC TCCTTCCACC AGCTTATTGG AGACTTGAGT GCTGACGACA ACTCGTTCTT GGTGTTGCCA CTTTCAGCCA TCAATGACCA GACTGATCTC TTCAACCATG AACATTCGTC GGCTTCTCCA CTGAGAAAGT CGTCCCAACC TTCAACACCA TTGTTGAAGC CCAGTGACTC GACCATAATA GCGAACTCTC TTGACTCCTT GATTGAGTAC ACACACAAAG ATACCCATCA CTTGCACAAA AAGGGCAGTA TCACCCTCGA CGATCTCAAA AAGGACAGCA ATGGATGGGT TATCGTGTAT GACACCATTG AATTGAGACC TCGGTTGATA TTCAAAGCTC ATAATTCAAG TATAGGCAAG ATTACTGTCT CGAATGATAA CTCCAAGATA GCTACTGCAT CAGTAAAGGG CACTATCATA AGAGTATTTC TGATTGATAG CAACAGTTTC TCCAGTGACA AGCTTAAAAT TTCGCAAGTG ACGAACTTGA GAAGAGGCCA TAATCTCGCG AGGATCAACA CGTTAAGCTT CAATGCGGAC AATCTGATTT TGGGCTGTGG TTCTGAAAGT AATACAATCC ATTTCTTTAG ATTGAATGAA AAAGCAGAAG CTACGTCTCC TGGCAATTCG GATGAGGGAA ATACTGAAGA CTACGAGGCA AACGACCACG ATAGCGAAGT CGAAGGCGAG AGTAGCAAGT CTTCAGAAGA CTTGAATGAG AACTTGGCCA ATTTGCTAGT ATCCAATCCA GCACCTCCCG TGGATGCAGA GGAGAACCAT AAACAGAGCA AGTCTTATTT CAGTTTCTCT AATCTAAAGA GTACTACAAA ATTGATTAAC AACCAATACA CCAAGTCTAT CATAAAGAAG TTACCTTACA AGGATTACTT TGAAAACTTG ATATGGGAGC CTCCGAGAAG GTCGTTTGCC TACATTAAGC TTCCAGAATA TACTCCACCC CATCACTATG GAGGACAACA TTTCACCTCT GAATCCACCA GTCCAGAGAA TAGAGTGGAG ATTGGCTTCA GCAATTCGTT GATCTTGTTG GCATCGTACC AAACAGGAAT CTTCTACCAC TATCAGTTGC CCAAGCCCGT GGGAAGCACC AGAGTTGGGC TGCCATCGGA AGAGGAAAAG AGAGAGGAAT GCTATCTTAT CAACCAGTAT AGTTTGGTCT GA
|
Protein sequence | MTTINDLTFN QDYSCISIST SNYHRIFNCE PFGQFYSSSH GNIKKTLSNS IETNTVNANV DNNTRNSRAS LETKCPTAYL KMLFSTSLTI IVPQSQNKLG NRLLKIYNLK QNLKICELSF PSNIINIKLN RKRLLVFLEI GHIYIYDLSC VRLIKILEVN SYLNETVSTA NNLTDSGQTA RLIGDLSADD NSFLVLPLSA INDQTDLFNH EHSSASPSRK SDSTIIANSL DSLIEYTHKD THHLHKKGSI TLDDLKKDSN GWVIVYDTIE LRPRLIFKAH NSSIGKITVS NDNSKIATAS VKGTIIRVFS IDSNSFSSDK LKISQVTNLR RGHNLARINT LSFNADNSIL GCGSESNTIH FFRLNEKAEA TSPGNSDEGN TEDYEANDHD SESSEDLNEN LANLLVSNPA PPVDAEENHK QSKSYFSFSN LKSTTKLINN QYTKSIIKKL PYKDYFENLI WEPPRRSFAY IKLPEYTPPH HYGGQHFTSE STSPENRVEI GFSNSLILLA SYQTGIFYHY QLPKPVGSTR VGSPSEEEKR EECYLINQYS LV
|
| |