Gene PICST_62474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_62474 
SymbolATG21 
ID4840049 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp764828 
End bp766609 
Gene Length1782 bp 
Protein Length552 aa 
Translation table12 
GC content42% 
IMG OID640391364 
ProductAutophagy-related protein 21 
Protein accessionXP_001385850 
Protein GI150866303 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.949866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCA TCAATGACTT GACCTTCAAC CAGGACTACT CGTGCATTTC TATTTCTACC 
TCCAACTACC ATCGCATCTT CAACTGCGAG CCATTTGGTC AGTTCTACTC TTCGTCTCAC
GGAAACATTA AAAAGACCCT CTCCAACTCC ATAGAAACCA ATACTGTCAA CGCCAATGTT
GATAATAATA CTAGGAATAG TAGAGCAAGT CTTGGTGACA ACACTATTCT GTTAGATGGA
GCTCCTGAAG AAACTAAATG TCCGACTGCT TACTTGAAGA TGCTATTCTC TACTTCGCTA
ACTATCATAG TTCCTCAGTC TCAGAACAAG CTCGGAAACA GGCTTCTCAA GATCTACAAC
CTCAAACAAA ACTTGAAGAT CTGCGAGCTT TCGTTCCCGT CAAATATTAT CAACATCAAG
CTTAACCGCA AACGTTTACT TGTCTTTTTG GAGATCGGCC ATATCTATAT CTACGACTTG
AGTTGTGTCA GACTCATCAA AATCTTGGAA GTCAACTCAT ACTTGAATGA AACGGTATCT
ACTGCAAATA ATTTAACCGA TTCTGGTCAA ACAGCAAGAG TATCGACCAA TATGAACAAC
TCCTTCCACC AGCTTATTGG AGACTTGAGT GCTGACGACA ACTCGTTCTT GGTGTTGCCA
CTTTCAGCCA TCAATGACCA GACTGATCTC TTCAACCATG AACATTCGTC GGCTTCTCCA
CTGAGAAAGT CGTCCCAACC TTCAACACCA TTGTTGAAGC CCAGTGACTC GACCATAATA
GCGAACTCTC TTGACTCCTT GATTGAGTAC ACACACAAAG ATACCCATCA CTTGCACAAA
AAGGGCAGTA TCACCCTCGA CGATCTCAAA AAGGACAGCA ATGGATGGGT TATCGTGTAT
GACACCATTG AATTGAGACC TCGGTTGATA TTCAAAGCTC ATAATTCAAG TATAGGCAAG
ATTACTGTCT CGAATGATAA CTCCAAGATA GCTACTGCAT CAGTAAAGGG CACTATCATA
AGAGTATTTC TGATTGATAG CAACAGTTTC TCCAGTGACA AGCTTAAAAT TTCGCAAGTG
ACGAACTTGA GAAGAGGCCA TAATCTCGCG AGGATCAACA CGTTAAGCTT CAATGCGGAC
AATCTGATTT TGGGCTGTGG TTCTGAAAGT AATACAATCC ATTTCTTTAG ATTGAATGAA
AAAGCAGAAG CTACGTCTCC TGGCAATTCG GATGAGGGAA ATACTGAAGA CTACGAGGCA
AACGACCACG ATAGCGAAGT CGAAGGCGAG AGTAGCAAGT CTTCAGAAGA CTTGAATGAG
AACTTGGCCA ATTTGCTAGT ATCCAATCCA GCACCTCCCG TGGATGCAGA GGAGAACCAT
AAACAGAGCA AGTCTTATTT CAGTTTCTCT AATCTAAAGA GTACTACAAA ATTGATTAAC
AACCAATACA CCAAGTCTAT CATAAAGAAG TTACCTTACA AGGATTACTT TGAAAACTTG
ATATGGGAGC CTCCGAGAAG GTCGTTTGCC TACATTAAGC TTCCAGAATA TACTCCACCC
CATCACTATG GAGGACAACA TTTCACCTCT GAATCCACCA GTCCAGAGAA TAGAGTGGAG
ATTGGCTTCA GCAATTCGTT GATCTTGTTG GCATCGTACC AAACAGGAAT CTTCTACCAC
TATCAGTTGC CCAAGCCCGT GGGAAGCACC AGAGTTGGGC TGCCATCGGA AGAGGAAAAG
AGAGAGGAAT GCTATCTTAT CAACCAGTAT AGTTTGGTCT GA
 
Protein sequence
MTTINDLTFN QDYSCISIST SNYHRIFNCE PFGQFYSSSH GNIKKTLSNS IETNTVNANV 
DNNTRNSRAS LETKCPTAYL KMLFSTSLTI IVPQSQNKLG NRLLKIYNLK QNLKICELSF
PSNIINIKLN RKRLLVFLEI GHIYIYDLSC VRLIKILEVN SYLNETVSTA NNLTDSGQTA
RLIGDLSADD NSFLVLPLSA INDQTDLFNH EHSSASPSRK SDSTIIANSL DSLIEYTHKD
THHLHKKGSI TLDDLKKDSN GWVIVYDTIE LRPRLIFKAH NSSIGKITVS NDNSKIATAS
VKGTIIRVFS IDSNSFSSDK LKISQVTNLR RGHNLARINT LSFNADNSIL GCGSESNTIH
FFRLNEKAEA TSPGNSDEGN TEDYEANDHD SESSEDLNEN LANLLVSNPA PPVDAEENHK
QSKSYFSFSN LKSTTKLINN QYTKSIIKKL PYKDYFENLI WEPPRRSFAY IKLPEYTPPH
HYGGQHFTSE STSPENRVEI GFSNSLILLA SYQTGIFYHY QLPKPVGSTR VGSPSEEEKR
EECYLINQYS LV