Gene PICST_42095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_42095 
SymbolATG7 
ID4836974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1395895 
End bp1397853 
Gene Length1959 bp 
Protein Length652 aa 
Translation table12 
GC content45% 
IMG OID640388289 
ProductAutophagy-related protein 7 (Autophagy-related E1-like activating enzyme ATG7) 
Protein accessionXP_001383035 
Protein GI150864282 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID[TIGR01381] E1-like protein-activating enzyme Gsa7p/Apg7p 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0582833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATA GCGATAAGAC GGCCGCAAGA GTCGCACCAA AATATGTGCC CATTCTGTCG 
TTTGTAGAAC TGTCTTTCTT TACCAAATTA TCTGAACTCA AGCTTAACGA GTTCAAGCTT
GATTCGTCCA AAAGAGACAT CCACGGGTTT ATCACCTCCC CCAGACGGCT CAATAAGTTC
AATGACCAGC CCACTTTGAA TTTGGACTTG CAAAGCTTTG ATATTGCAGA AAAGGAAGCT
AACAACCTTC ATATAAGCGG AGAGCTCTAC AATGTGAACA CCATAGAAGA GTTCAAAAAC
ATCAACAAGC TGGACTTGCT CAACGACTGG GGCAAAGAGG TGTACACACG CCTTATACAG
ACAGAGTCGT TGGATTACAA GGCGTTCAAT TGGTTTTTCA TCTTAACTTT TTCTGACTTG
AAGAAGTACA AGTTCTATTA CTGGGTTGCC TTCCCCACAT TGAATGCTCC GTGGTTTGTG
ACTTCGACCA GAGATGATTC CTTGGTAGAA AAACATACTA AAAACATCAC TAGACTCCTA
GAGAATGATG GAGATTCCGA AAATCTTGCG TTCTCCCAAT TGTACCAAGT AGTAGGAGAG
TCTTACTTAG ATTTGAATTC TATTCGTAGT AGTCGTAACG GTGTGTTTGT GTTTCTCGAT
GGCTGTTTGA ACAAAGAAAC CAAACCCTCG GTCCAGTTGA AGAACTACCT CTACTTCTTG
GCCTACAAAG GCTTTGAAGA CGTCGATGTG ATCGTATATA GAAACGACGG GTCCAGTTTT
CAGGTTCATT ACGAGCTAGA CACTGATTCC TTCAACAAGA ACGTCCAGCC AAAGATAACC
GGCTGGGAGA GAACGAGTCA GGGTAAGCTT GGGCCAAAAT TGGCAGATCT TGGCTCGTTG
ATCAACCCGC ACCAATTAGC TGATCAGGCT GTGGATCTCA ATTTGAAGTT GATGAAATGG
CGTATTGCTC CGGAACTCAA CCTAGACATC GTCAAGGAAC AGCGGGTACT TCTTCTTGGC
GCAGGTACTT TGGGAAGTTA TGTGGCAAGG GCGTTAATGG GCTGGGGCGT GAGAAAGATT
ACGTTTGTCG ATAATGGGCG TATCTCATAT TCTAACCCTG TACGGCAACC TTTGTTCAGT
TTTAAGGATT GTTTCAGCGA TAATGGACAA GGTGAAATGA AGGCTGCACG AGCTGCTGAA
GCCCTTAAGG AGATATTTCC TGGTGTAAGT TCTGAGGGTA TTAGTTTGGA AGTACCCATG
ATTGGGCATC CGGTGAGCGA CGAAGCCAAA CTGAAGAGTA ATTTCGGAAC GCTTTCACAA
TTGTTTGACG ACCATGACAT CATCTACTTA TTGATGGACT CGCGTGAATC GCGCTGGCTT
CCCACGGTTC TTGGCTATGC TAAAAACAAG ATTGTCATCA ACGCTGCATT GGGGTTTGAT
AGTTATTTGG TGATGAGACA CGGGAATTTA AGCCAACCAG AAGAGTCCAG GCTTGGTTGT
TATTATTGTA ATGATGTTGT TGCACCCAAC GACAGTCTTA CCGACAGAAC GTTGGACCAG
ATGTGTACCG TAACCAGACC CGGAGTTGCC CTTATGGCTT CTGCTTTGGC TGTAGAGTTG
CTTGTTTCCA TCCTACAACA TCCTGATGGA AGCAAAGCTG CTCAGGATGA GAGCACCAAG
TTTGGTGGTG TTCCTCACCA AATTAGAGGC TTCTTGCACA ACTTCCAGCA GACAAAGCTT
TATGCTCCTA ACTACAAGCA CTGTTCAGCT TGTTCACACA CGGTGATCAG TAAGTTCGAA
GAAGAGGGCT GGGAGTTTGT CAAGAAGTGT TTGAACGACT CGGGATACTT GGAGGAAATT
TGTGGGTTGA AACAGGTCCA GGAAGAGGCC GAGAAGGCTA CGGAGGATTT GATGAAGGAT
ATGGACTTAG ACGATGAAGA TTCTGAGTGG TTGGACTAG
 
Protein sequence
MSDSDKTAAR VAPKYVPISS FVESSFFTKL SELKLNEFKL DSSKRDIHGF ITSPRRLNKF 
NDQPTLNLDL QSFDIAEKEA NNLHISGELY NVNTIEEFKN INKSDLLNDW GKEVYTRLIQ
TESLDYKAFN WFFILTFSDL KKYKFYYWVA FPTLNAPWFV TSTRDDSLVE KHTKNITRLL
ENDGDSENLA FSQLYQVVGE SYLDLNSIRS SRNGVFVFLD GCLNKETKPS VQLKNYLYFL
AYKGFEDVDV IVYRNDGSSF QVHYELDTDS FNKNVQPKIT GWERTSQGKL GPKLADLGSL
INPHQLADQA VDLNLKLMKW RIAPELNLDI VKEQRVLLLG AGTLGSYVAR ALMGWGVRKI
TFVDNGRISY SNPVRQPLFS FKDCFSDNGQ GEMKAARAAE ALKEIFPGVS SEGISLEVPM
IGHPVSDEAK SKSNFGTLSQ LFDDHDIIYL LMDSRESRWL PTVLGYAKNK IVINAALGFD
SYLVMRHGNL SQPEESRLGC YYCNDVVAPN DSLTDRTLDQ MCTVTRPGVA LMASALAVEL
LVSILQHPDG SKAAQDESTK FGGVPHQIRG FLHNFQQTKL YAPNYKHCSA CSHTVISKFE
EEGWEFVKKC LNDSGYLEEI CGLKQVQEEA EKATEDLMKD MDLDDEDSEW LD