Gene PICST_58795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_58795 
Symbol 
ID4838910 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp627998 
End bp632281 
Gene Length4284 bp 
Protein Length1408 aa 
Translation table12 
GC content40% 
IMG OID640390225 
Productpredicted protein 
Protein accessionXP_001384087 
Protein GI150865037 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.658129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAATA TTCTGAGCCA ACCGTTCAAA GTAAGCCATC AGAGAGTTAA CATTGATGTA 
GATATGCTGC GGAATCGTAT TGATGGTTTC ACGGAATTAA CTCTAGTTCC ATTTACCAAC
ACTTTAAAAG TAGTGAGATT AGATTGTAGA GAAATGAAGA TCACCAGAGT CACCATCAAC
AACATGAAGC CATGCAATTA CATACATAAC GACATTTTGT ACATCAATGA TGGCAAGTAC
TTCGACGAAG AAATCGTACT GGCCTATGAT GTCAACTTGT TTGACTTGTA TTCAGATGAA
GTGTCTATTC ATCAACATCA TATGATCAAG CACAAGCTAG GGTATATCTT TGGGGAGAGC
AATTACGATC CCAGAGATCC CCATGCCGAT GTATTCTCAA CTATTGTCAA TACTGAGGAG
CTTTCAGTGA TGCTTCCTGA TAATTTGCGA CTTGAATTGA CGGACATCAA CTCGATCCAT
ACTCCTGGGA GCCAACCGGG AACACTTACT CCTTTGCATT TGAAACTGAA AGCCACAAAC
AGCGACATTT ATACACCTAT ACAGCTTCGG ATCGAGTACG AGCTTGTGAA TCCCAAGAAT
GGAGTCAACT TTGTCTCTGA TAGTATAGAG AAGCGTAACT GGCACTCGTA CACAACTAAT
AACACGTACA ATCTTTCGAC TTCGTCATGG GTTCCTTGCA TAGACAATTT ATGGGACAGA
AGTACGTGGT CTCTTGAAGT AATCATCCCC AGAACAGTGC GAGATATAGG GAATCCTCGT
ATCATAGGCT CAGAAGAAGC TATGCGAGGG TCGCGCAACC AGAAGAAGAA ACGTAGACTA
AATAGAAACG ATGATTCAGA CATCGAGGAC GATGAAGACA ACGAAGATGA TGATAACGAA
AACCATGATC TAGTGGTTTG CTCAGGAGAT TTCAACAATG TTAAGGAAAC ACCGCATTCG
ATCGATTTGT CAAAGAAAGT AGTCTCATGG TCCATCTTTA ATCCTGTTTG TGCACATCAT
GTAGGCTGGG CTCTCGGATG TTTTGAGAAC TTTGTCATTT CCGAGTCGAC AGAGTCAAGA
GAAGTAGACG ACGAGATCAA AGAGAACTTT GAAGATATAG ACAAGGATGG AACCAGTTCT
CCTATAACCA TTTACTGCCT TCCCGGTCAG ATTGAACAGG CTAGAAACAC TTGTATTTTC
ACTATGAGAG CTATGGACTT TTTTCTGAAG GAATTTGGTT CTTTCCCGTT TAGTTCCTAC
GGAATAGTTT TTGTTCAGGA CAGTGTAGTC GACACCAACA ACTTTGCTGG TTTATCGATA
TTTTCAGATT CCATTTTGTA TCCTTCTGAT ATCATAGAGC CCATGTTTAC CAGCACGGAG
GTGATTCTTG AAGCCATTTC CAGTCAGTGG TCGGGAATCA GCATCACTCC TCAAACAGTT
AACGACATCT GGTGCACCAT AGGTATAGCA AAGTTCATGG TCTTTCAGTA CTTGAAGGAT
TTGATGGGAA CCAACGAATA CAGATTCAAG ATCAAGAAGA TGATGAATAG AATTGTTGAA
GAAGATAGAG GAAAGAAACC ATTGGGATAC CATTATTTCA GATTTCCTGT CTCTGACTCA
GATTTAGACT TCATCAAGTT AAAGGCTCCG ATTGTGTTGT TCATTTTGGA TAAGCGTATG
ACCAAGACCG ACAAATCATT TGGTTTATCA AGAGTGTTGC CCAAGTTGTT TCTTCAGGCG
ATGTCTGGAG ATCTTCAGAA CGGTACCCTT TCTACGCAGC ATTTTCAGTA TGTTTGTGAG
AAAGTCAATA GAAACAAACT AGAAAACTTC TTTAAGCAAT GGGTCTATGG AGTCGGAGCT
CCAATTTTTA ATATCACGCA AAGATTTAAC AAGAAGCGTG GAGTAATCGA AATGAGCATT
CGTCAAATTC AACATCAAGT TACTAGGAAA AGTGGAACAA ACGCTGAATC GTTTATAAAC
GATTCCATAG CATATTTGGA AGACGAGCCT ACTTTTCCAG TTCAGTCTAT TTTCACTGGA
CCTATGACTA TCAGAGTGCA TGAAGCTGAT GGGACTCCTT ATGAGCATAT AGTAGATTTA
AAAGAGGGAA ACACAAAGCT TGATGTTCAA TACAATTCAA AATTTAGACG TATGAAGAAG
AATCGTGATG AAACAAGTGA AAATGCAGTG ACTTTCAGCA GACTTGGAGA TGTTTTAGAA
TCAGAAAAGG AGATGGAAGA ATGGAACTTG GCTGACTGGG CAAAAGTGGA TGATGATCCT
ATGAATATAG AAGCGTTTGA GTGGATCAGA GTGGATGTAG ATTTCGAGTG GATTGCGAGA
TTTGATGTCA AACAACCTGA CTACATGTTT GGTTCACAAT TACAACATGA CAGAGACGTT
GAAGCCCAAT TTGACGCCGT CCGCTACTTG GGAAACATAG AAAAACCTTC TACGATCCAC
TGCACTGCAC TAACTCGTAC GGTAGTTGAT GAAAGGTATT ACTATGGGGT AAGGATTGCT
GCTGCTGAGG CATTGGCAAA CTTCTCCAAT TCTGTGACAA ACTTCATTGG TGTTCCTTAC
TTGGTCAAAA TTTATAGAGA GCTCTACTGT TTTCCAGGCA GTTCCATTCC TTTGAGTAAT
GATTTCAACG ACTTTGGTAG ATTCTTTTTA CAGAAAGAAA TCCCTAAGCA ATTGTGTAAA
ATTAGAGATA GTGATGATGA GGTTCCCGTT GTCATCAGAA ATTTGATACT CAATTTGATC
AAATTCAATG ACAATACCAA CAACAATTTC CAGGACAGTT TCTACATATC AGAATTGGTT
CAATCATTGA CAACTTGTGC TGTTAATTCC AGTTTTCCCA ATTCGCCGAA GGATATATTT
CCAAAGTCGC ATCCTCACGG GAGCCTGGAG AAGAAGAAGT TTGTTGCCAA TGTGATTACT
GAAATCAATA GACTTCAGAA GTTAGACGAA TGGATACCTT CGTATCATAA TGTAGTTTCG
GTGACTTGTT TGACTCAGAA AATTCGTTTG GCATTACATG GGCATTTAGA TCTTTCGTTC
GAAGACTTGT TATACTTCAC TGTTGAAAAG TTTCCAATTC AGATAAGAGT GGAAGCTTTC
CGTGGTCTTT TTGTGCTTGG TGGTTTGAAG AATAGACATA TCTTGAACTA TTTCTTGAAG
GCATGCCTCT TGGATGTTCG ATCAGCTGCT TACAGAGAGC TACTCATTTC GGCTCTAATT
GATTCGATAT GTGTCGCTGC TGTCAGTGGC ACTCCTTCTA CGTTAGACGA TCCCGAGTTC
AAGCCATTTG AAAAATTAAG CGAATCTAAA ACTGGGGCCA GTACAGCATT GGCCAATATG
ATTATTGTTG AAGATGGATC ACACAATGAG ATGGACGAAA AAAGAGATGT GTTTGCGAGA
GCCACTGTAA GTGGAGCTAT TGATATATTG AGAAGAGATT ATTCCATTGG AAAAGGCTTA
AAGCGTACTA TATGGGAGCT ACTTCATACG TCCTTATTGA GCATCCGTGA GAAGCGTAAC
TTGTTTTTGA TTTGTCAGAT CTTGTACAAA GAGATAGATC TGTTCGTAGT TAAGATTCCT
ATTCCGAATG TTCCTTTGAA TGAATTGACG AAAAAGATCG TTACGAAGAA CTTGGGCAAC
GGAAAGGTTG TATTTAAGAG ACAGGGTAGA TTCAAGATTC AGTTGGCATC ACAGAAACTT
CTTATTGAAA AGCCTAAGGC CAAGGAACCT AAAGTTTCTC ACAAGAAGCC TGTCGACAGA
ACAGTGAACG AGGTAGCCCT TAATCCTGAT GAGTCAAAGG AACCTAAATT GAAGTTGAAC
CTTAGTATCA AGCCAAGCGC TCCTCCGGCA GCTCCACCTC CAACGGAATC TCCTACTAAG
AAACAAAAGA AGCAAGCTCC AGCACCTAAA AAGCATATTC CGGTTGTATC TATTGGAACG
CACAATTCGA GAAAATTCGA ATTGTCGTTC AAGTTCCAAG ATCACAGTTT GCCAGAGTTG
AAGCCAGTGT CTGGAATTTC AGTGAAGGTA GGTAGAAGTC AAGTGTCTGT AGACGGAGGC
AAAGTAAAGT TCAACTTTAA GGGTGCTTTC AAATCGAGGT TCAGAGATTT GATGGCCAAG
CCAGTGGAAC CAGCACCACC ACAAAAGCCC AAACCCACAT TACCAGCGGT GACTACCGTA
GAGCCTAATG CACGGTATGT TAAGATTTTA ACCAAAGAGA AGAAAGTTTT GATATCGTCT
ACGCCTTTTG AAGTTTCAAA GGAG
 
Protein sequence
MRNISSQPFK VSHQRVNIDV DMSRNRIDGF TELTLVPFTN TLKVVRLDCR EMKITRVTIN 
NMKPCNYIHN DILYINDGKY FDEEIVSAYD VNLFDLYSDE VSIHQHHMIK HKLGYIFGES
NYDPRDPHAD VFSTIVNTEE LSVMLPDNLR LELTDINSIH TPGSQPGTLT PLHLKSKATN
SDIYTPIQLR IEYELVNPKN GVNFVSDSIE KRNWHSYTTN NTYNLSTSSW VPCIDNLWDR
STWSLEVIIP RTVRDIGNPR IIGSEEAMRG SRNQKKKRRL NRNDDSDIED DEDNEDDDNE
NHDLVVCSGD FNNVKETPHS IDLSKKVVSW SIFNPVCAHH VGWALGCFEN FVISESTESR
EVDDEIKENF EDIDKDGTSS PITIYCLPGQ IEQARNTCIF TMRAMDFFSK EFGSFPFSSY
GIVFVQDSVV DTNNFAGLSI FSDSILYPSD IIEPMFTSTE VILEAISSQW SGISITPQTV
NDIWCTIGIA KFMVFQYLKD LMGTNEYRFK IKKMMNRIVE EDRGKKPLGY HYFRFPVSDS
DLDFIKLKAP IVLFILDKRM TKTDKSFGLS RVLPKLFLQA MSGDLQNGTL STQHFQYVCE
KVNRNKLENF FKQWVYGVGA PIFNITQRFN KKRGVIEMSI RQIQHQVTRK SGTNAESFIN
DSIAYLEDEP TFPVQSIFTG PMTIRVHEAD GTPYEHIVDL KEGNTKLDVQ YNSKFRRMKK
NRDETSENAV TFSRLGDVLE SEKEMEEWNL ADWAKVDDDP MNIEAFEWIR VDVDFEWIAR
FDVKQPDYMF GSQLQHDRDV EAQFDAVRYL GNIEKPSTIH CTALTRTVVD ERYYYGVRIA
AAEALANFSN SVTNFIGVPY LVKIYRELYC FPGSSIPLSN DFNDFGRFFL QKEIPKQLCK
IRDSDDEVPV VIRNLILNLI KFNDNTNNNF QDSFYISELV QSLTTCAVNS SFPNSPKDIF
PKSHPHGSSE KKKFVANVIT EINRLQKLDE WIPSYHNVVS VTCLTQKIRL ALHGHLDLSF
EDLLYFTVEK FPIQIRVEAF RGLFVLGGLK NRHILNYFLK ACLLDVRSAA YRELLISALI
DSICVAAVSG TPSTLDDPEF KPFEKLSESK TGASTALANM IIVEDGSHNE MDEKRDVFAR
ATVSGAIDIL RRDYSIGKGL KRTIWELLHT SLLSIREKRN LFLICQILYK EIDSFVVKIP
IPNVPLNELT KKIVTKNLGN GKVVFKRQGR FKIQLASQKL LIEKPKAKEP KVSHKKPVDR
TVNEVALNPD ESKEPKLKLN LSIKPSAPPA APPPTESPTK KQKKQAPAPK KHIPVVSIGT
HNSRKFELSF KFQDHSLPEL KPVSGISVKV GRSQVSVDGG KVKFNFKGAF KSRFRDLMAK
PPNARYVKIL TKEKKVLISS TPFEVSKE