Gene Information |
Locus tag | PICST_43257 |
Symbol | |
ID | 4837980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 795603 |
End bp | 798449 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640389295 |
Product | predicted protein |
Protein accession | XP_001383787 |
Protein GI | 150864810 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
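The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions, and that the unspliced CDS includes its terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(795603, 798449)
print(length)                           # 2847
print(expected_protein_length(length))  # 948
```

Both values agree with the Gene Length (2847 bp) and Protein Length (948 aa) fields of this record.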
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.393184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.929698 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGAAC AATTACATAC GTTACAATCA ATAGCTATAG AGGACGAGTC CGCTCCATCT TTGGCGGTTT CCAACCGTTT TGTTCCGGCA TTATACGAAC TCAAATTGGA CATTAACCAT ACAAAACCAA ACTTCCAAGG TCAATTAGAC ATTCTTTTGA AAGAGAACGA CGTTTATAAT GTTGAAAGAC TGCAAACATC CACTAAATTC TCATTATCTT TACATGCATC CAAATTAGTC ATTACCAAGG CAGTTTTGAA TACCACTTCT GAAGTCAAAC TTACTGTTAA ATACGACAGA ATCAACAGTC AGGTGACTCT TTCTTCGTCA GAAGATGTCG AAATTGTAGA TGTCGCTAAT CTGAAAGTCT CCATTACGTA CATGGGGCAG ATCAATAGCA TCAAAACGTA CCAAGATAAA ACCCATGGTT TGTTCAAGAC TAATTATTTG GACAGTGTCT CAGGAAAGTC TAACAATTAT ATTCTTTCTA CTCACTTCCA GCCTCATTCT GCCAAGTTGG TCTTTCCCTT GATCGAAGAA TTGCACGTCA AGACTCCGAT CAAGTTGACC ATCACTACTC TATCCAAATT TAAAGTAATT TCAGTTGGTA AACTTCTACT GAATCAGCCA TTGGAGATGA GTGAAAACTC TACTTTCAGC TTTGAAACTT CTCCTCCAAT TGCGCCTTCT GTATTTGGAT TTGTAATCGG TGATTTTGAA TACTTGGAAG ACAAATATGG CGATCTTCCT CTTCGTGTAT ACACAAGTAT TGGTGAAAGC AGATATGCTA TTCGTGCTCT TAAACTGATG AAGAAATTGC TTCCTATTCT TGAAAGTTTG CTTGATGTCA AGTATCCTTT GGAAAAGCTT GATTTCGTTT CTATTCCATT CTTAAACGAT GGAGCCATGG AAAATTGGGG CTTGGTAACT GTTCTTTCTA ACCAATTGCT TGTGGATGAA AGCACAGCCA GCCCATCCAC TTTAAGACAA ATTGACCAAA TAGTTGCACA TGAACTTGTC CACCAATGGA TAGGAAACTT GGTTACATTT GATGACTGGA AATACTTGTG GTTGAACGAA TCCTTCGCTA CATGGCTCGG TAACTATATT CTTGACGTTG CTGATAACTC TCACCCAAAA GACAAAGAAT TTGAAAAATT CCTTGACCAA GACTGCTTCT ATGCTGAAGA CAAGTTCTCA ATCCCAAGCA TCAATACCTA TATGTCCAAA ATAGATACCG GATTAAACTC TTTGACATCG ACAATTTTTG ATACACATGC TTACGAGAAG GGTATAATTT TATTAAGAAT GATTGGAAAT ATCATACATG TAGATGGGGA TCCAACATCA GCTAGTGAGG GTGATGACTA CACGAGAATG TTGCGTGGAA TCGGCGCACT TATCAAAAAG TATCAGTACA AGTCAATCAA GGCATTTGAA ATTTGGAATA CATTGAACGA GTTGACATCC ATTGACTTGC AGAGTTTCGT CCATTCTTGG TTAAGATACC CTGGATTTCC ATTGGTTCAA GTTACTACCA ATGACGACAA CTCCAAGTTG TTATTTGAAC AACATCAATG TTTGTACAAT TTGAGGGCAG ACCAAGTTAA TTTAGAAGAC CATCCTTTTC ATGTCCCATT GTTTATCAAG GTGATAGATG ACAAGGGCTC ATCGAAAGTA TTGAATATCA TAATGACAGA CCGTACTTTG GAATTGGATA TTTCCTTGGC TCAATTAGTC AATATTAACC ACAACCATAG TGGTTACTAC CGTGTCAAGT ATTCCCCAAA ACTAATTGCC 
AATATCATTG AGAACATTGA TAAGGTGTCC CTGACTGATT TGATTACCAT CATCAATGAC TATGGAAAGC TTTTAGGAAG TGTTGGCACT ACAAAAGAAG ATCTCATTTC TTTAGTACAG ATCATTGAAG CTGTTTGCAA GAGATCACAC ATTGACTACG ATTTATTACA AGTTGCGATG ACCTATTTAG AAACAATCAA CTCAAACTTG ATGCATTTCA GTAAATATAC TGAGTTCCAG GTATGGGTTG ACAACTTAGT TAGCACTTTG TTCCAAAGAA TTGGTGGATG GGACAAATTG CAGTCTTTCA AAGATAGTCA TTGCTACGAT CCTGTAGAAA TGGAAGTTAG AAACGCCATT CTTCAAATTG GAGTATACCG TTCAGACTTC CAAGAAGTTG GCAAAAAGTT ATTCAAGAAT TTCGTTAATT CGGGAATAAA CAAATCGTTT ACTCCCAAAC AATTGGCAAC ATCCATGTTC AATACAATGA TCTACAATGC CCCACAGAAG GACTACAAGC GAGTCTTAGA GTTTGTCAAG AATTCCAACA ATTCTCTTTT GGAGCATACC GACCTTACGA ACCTGGATTT GCAAACCACA GCAGTATCGT CGTTGTCGTT TGTACACAAG GACGATTTGC TTCATAAGAC CTTGAACTTT GTCATGACCA ACATCGATGC TAAAATGATA GAGTTAGGCT TGATTGGATT CCAGTACAAG AGCTCCAAAC AAGACAAGCT TAAATTGTTC CAATGGTATA AGCTCCATTA CGACCAGTGG GTATTACGTT CGTTAAGAAA GGGTTCCGAT TGGTCAAAGC AAATCGGTAT TACGGTTTCA AACATTTCGA AGATGGTCTT AGGAACCATA ATGCAATTCG ATCCCGAACT TGTCGAATTG AGAGAAAAAT TTGTTAAGGA CAAACTTGCG ACGTTACCTC CTCACGGTTT ACAAGAATTG CTTGAAGCGG TACAGGATGA AAACGAAGAG AAGGTGTTGA TCGGCGGTTA TTATGACGAC TTGGTCCTGC AAGTGCTTAG AGCTTGA
|
Protein sequence | MVEQLHTLQS IAIEDESAPS LAVSNRFVPA LYELKLDINH TKPNFQGQLD ILLKENDVYN VERSQTSTKF SLSLHASKLV ITKAVLNTTS EVKLTVKYDR INSQVTLSSS EDVEIVDVAN SKVSITYMGQ INSIKTYQDK THGLFKTNYL DSVSGKSNNY ILSTHFQPHS AKLVFPLIEE LHVKTPIKLT ITTLSKFKVI SVGKLLSNQP LEMSENSTFS FETSPPIAPS VFGFVIGDFE YLEDKYGDLP LRVYTSIGES RYAIRALKSM KKLLPILESL LDVKYPLEKL DFVSIPFLND GAMENWGLVT VLSNQLLVDE STASPSTLRQ IDQIVAHELV HQWIGNLVTF DDWKYLWLNE SFATWLGNYI LDVADNSHPK DKEFEKFLDQ DCFYAEDKFS IPSINTYMSK IDTGLNSLTS TIFDTHAYEK GIILLRMIGN IIHVDGDPTS ASEGDDYTRM LRGIGALIKK YQYKSIKAFE IWNTLNELTS IDLQSFVHSW LRYPGFPLVQ VTTNDDNSKL LFEQHQCLYN LRADQVNLED HPFHVPLFIK VIDDKGSSKV LNIIMTDRTL ELDISLAQLV NINHNHSGYY RVKYSPKLIA NIIENIDKVS STDLITIIND YGKLLGSVGT TKEDLISLVQ IIEAVCKRSH IDYDLLQVAM TYLETINSNL MHFSKYTEFQ VWVDNLVSTL FQRIGGWDKL QSFKDSHCYD PVEMEVRNAI LQIGVYRSDF QEVGKKLFKN FVNSGINKSF TPKQLATSMF NTMIYNAPQK DYKRVLEFVK NSNNSLLEHT DLTNSDLQTT AVSSLSFVHK DDLLHKTLNF VMTNIDAKMI ELGLIGFQYK SSKQDKLKLF QWYKLHYDQW VLRSLRKGSD WSKQIGITVS NISKMVLGTI MQFDPELVEL REKFVKDKLA TLPPHGLQEL LEAVQDENEE KVLIGGYYDD LVSQVLRA
|
| |
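Translation table 12 (the alternative yeast nuclear code) differs from the standard code in reading CTG as serine rather than leucine, which is why the protein sequence above shows S where the gene sequence has an in-frame CTG. The following is a minimal sketch of that translation, not the database's own pipeline. The helper names are hypothetical, only tables 1 and 12 are handled, and the input is assumed to be the coding strand in frame, as the Gene sequence field appears to be.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest), standard code.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(table_id: int = 1) -> dict:
    """Return a codon -> amino acid map for table 1 or table 12."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if table_id == 12:
        table["CTG"] = "S"  # the single sense-codon change in table 12
    elif table_id != 1:
        raise ValueError("only tables 1 and 12 are sketched here")
    return table


def translate(dna: str, table_id: int = 1) -> str:
    """Translate an in-frame coding sequence; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    table = codon_table(table_id)
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    # Codons with ambiguous bases are reported as X.
    return "".join(table.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 bases of the gene sequence in this record.
print(translate("ATGGTTGAAC AATTACATAC GTTACAATCA", 12))  # MVEQLHTLQS

# Bases 181-200 of the gene sequence, which contain an in-frame CTG codon.
fragment = "GTTGAAAGAC TGCAAACATC"
print(translate(fragment, 12))  # VERSQT
print(translate(fragment, 1))   # VERLQT
```

The table 12 output matches the corresponding stretches of the protein sequence above (MVEQLHTLQS and VERSQT), while the standard code would place a leucine at the CTG position.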