Gene Information |
Locus tag | PICST_43205 |
Symbol | |
ID | 4838007 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 13815 |
End bp | 16973 |
Gene Length | 3159 bp |
Protein Length | 411 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640389322 |
Product | hypothetical protein |
Protein accession | XP_001383276 |
Protein GI | 150864453 |
COG category | |
COG ID | |
TIGRFAM ID | |
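The Gene Length value follows from the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That convention is an assumption here; the record does not state it. A minimal sketch of the check, with a hypothetical helper name `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp / End bp fields of this record.
length = gene_length(13815, 16973)
print(length)  # 3159
```

The result agrees with the listed Gene Length of 3159 bp.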
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCTTA GTAAATACAC TCTTGCGGCG TTCAGCTTTA TATCCGCAGT GACGGCTGTC ACCATCACAC AAGACACTAT CGATCGTGGC GCTATCTCCC TTGGGCTTGG TGATACCATC ATTGAAGATG GGGTATATTG GTCTATCATT GATAACGTCA TTACAGCTTT TGCAGGTAAT GTTGATGTGG GTACAGGGTC TGGGTTGTAC ATCACAGGCC TCAATCCCTT GCTTGCCTTA CAGGTGACTC TTTTGCTGGG GTCCCTTACT AATGACGGTA TCATTTCCTT CAATGCTGTT CAATCTTTGC TTGCTCCAAC ATACAATCTT GTGGGAATCT CATTTACCAA CAATGGTGAA ATGTATTTAG GTGCCGATGG TTCTGTTGGT ATCCCCAGCA TCCTGATTAC CACCCCAGTT TGGAACAATA ACGGTTTGTT GGTTTTTTAC CAAAATACTA GAACCACGGG TCCTGTCAAT TTGGGTACAA TTGGTCTTAC TATTAACAAT AACGGACAAA TTTGCTTTTA CAACGAATTG TATACCCAGA CAACAAACAT TGCTGGTACT GGATGTATGA CTTTAAATGA CGATTCAAGT ATTTTTTTCT CTAATACCCT TTTGAATATC GATAGCAACC AGGTCTTTTA CTTGGCAGAT TCTGCCTCCT CGATCAGAGC AACTGCCATA AGTGCCCCCA AGACCTACAA TGTTGCTGGG TTCGGTAATG GCAACAAGAT TGGTTTAGAT ATTCCACTTG TTAACATTCC CCCTTTACTC AATGCTTACT CATACGATAC CACCACTGGT ATTTTGACAC TTCGTGGTGC TGGATTGTTA TCAATGAATT TCAACATTGG TTTAGGCTAT AACCCATCTC TTTTCTCCAT TGTAACCGAT GACAACGTTG GTTTGCTTTC GGTTCCGTTC GGTGCGCTTA CCTATTCTGG TCCTCCTCCA AACGTAGTGC CGTCGGTATG TCAACCATGT AAGCTACTTC CACCCGCACC AGGCACTAGT GCTACTGAGT TCACTACAAC TGCAACTTCC ACCAATTCTG CTGGGTTTAC TTGTACTGAG GTTGATGATA TCATCATTTC CACTGACACC AGTTACTCCT GGTTTACCAG TACTTCCACC ATTACTGCAG GATGTGCTTC CAACCCTACT ACCACTGTTA CTTCTACTTG GACTGGTACT GATACTACCA CCCTCACAGT TACTGATACT ATCGGCGGTA CTGACACCGT CATCGTCGAA GTTCCATCCA ACGATCAAAC CACCATCACT TCTACTTGGA CTGGTACCGA GACAACTACT GTGACTTTAA CAGATACCCA AGGTGGTACC GACACCGTCA TCGTCGAAGT TCCATCTAAC GATCAAACCA CCATCACTTC TACTTGGACT GGTACCGAGA CAACTACTGT GACTTTAACA GATACCCAAG GTGGTACCGA CACCGTCATC GTCGAAGTTC CATCCAACGA TCAAACCACC ATCACTTCTA CTTGGACTGG TACCGAGACA ACTACTGTGA CTTTAACAGA TACCCAAGGC GGTACTGACA CCGTCATCGT CGAAGTTCCA TCCAACGATG AAACCACCAT CACTTCTACT TGGACCGGTA CTGAGACAAC TACTGTGACT TTAACAGATA CCCAAGGTGG TACTGACACC GTCATCGTCG AAGTTCCCTC TACAGCCAAC AGTCAAACCA CCATCACTTC TACTTGGACC GGTACTGAGA CAACTACTGT GACTTTAACA GACACTGTTG GTGGTACTGA CACCGTCATT 
GTCGAAGTTC CCTCTACAGC CAACAGTCAA ACCACCATCA CTTCTACTTG GACCGGTACC GAGACAACTA CTGTGACTTT AACAGACACC CAAGGCGGTA CTGACACCGT CATCGTCGAA GTTCCCTCTA CAGCCAACAG TCAAACCACC ATCACTTCTA CTTGGACCGG TACTGAGACA ACTACTGTGA CTTTAACAGA CACTGTTGGT GGTACTGACA CCGTCATCGT CGAAGTTCCC TCTACAGCCA ACAGTCAAAC CACCATCACT TCTACTTGGA CTGGTACCGA GACAACTACT GTGACTTTAA CAGACACTGT TGGTGGTACT GACACCGTCA TCGTCGAAGT TCCTTCTACA GCCAACAGCC AAACCACCAT CACTTCTACT TGGACTGGTA CCGAGACAAC TACTGTGACT TTAACAGACA CCCAAGGCGG TACTGACACC GTCATCGTCG AAGTTCCCTC TACAGCCAAC AGTCAAACCA CCATCACTTC TACTTGGACC GGTACTGAGA CAACTACTGT GACTTTAACA GACACTGTTG GTGGTACTGA CACCGTCATC GTCGAAGTTC CCTCTACAGC CAACAGTCAA ACCACCATCA CTTCTACTTG GACCGGTACT GAGACAACTA CTGTGACTTT AACAGACACT GTTGGTGGTA CTGACACCGT CATTGTCGAA GTTCCCTCTA CAGCCAACAG TCAAACCACC ATCACTTCTA CTTGGACCGG TACCGAGACA ACTACTGTGA CTTTAACAGA CACCCAAGGC GGTACTGACA CCGTCATCGT CGAAGTTCCC TCTACAGCCA ACAGTCAAAC CACCATCACT TCTACTTGGA CCGGTACTGA GACAACTACT GTGACTTTAA CAGACACTGT TGGTGGTACT GACACCGTCA TCGTCGAAGT TCCCTCTACA GCCAACAGTC AAACCACCAT CACTTCTACT TGGACCGGTA CCGAGACAAC TACTGTGACT TTAACAGACA CTGTTGGTGG TACTGACACC GTCATCGTCG AAGTTCCTTC TACAGCCAAC AGCCAAACCA CCATCACTTC TACTTGGACT GGTACCGAGA CAACTACTGT GACTTTAACA GACACTGTTG GTGGTACTGA CACCGTCATC GTTGAAGTTC CAACCAGCTA CCCATCTTCT GAAAGTTCAT CTTCTGAAAG TTCATCTTCT GAAAGTTCCT CTTCGAGTGA GACTTCATCG ACTGAAAGTT CGTCTTCTGA AAGTTCCTCT TCGAGTGAGA CTTCATCGAC TGAAAGTTCG TCTTCTGAAA GTTCCTCATC GAGTGAGACT TCATCGACTG AAAGTTCGTC TTCTGAAACT TCATCTTCC
|
Protein sequence | MLLSKYTLAA FSFISAVTAV TITQDTIDRG AISLGLGDTI IEDGVYWSII DNVITAFAGN VDVGTGSGLY ITGLNPLLAL QVTLLSGSLT NDGIISFNAV QSLLAPTYNL VGISFTNNGE MYLGADGSVG IPSISITTPV WNNNGLLVFY QNTRTTGPVN LGTIGLTINN NGQICFYNEL YTQTTNIAGT GCMTLNDDSS IFFSNTLLNI DSNQVFYLAD SASSIRATAI SAPKTYNVAG FGNGNKIGLD IPLVNIPPLL NAYSYDTTTG ILTLRGAGLL SMNFNIGLGY NPSLFSIVTD DNVGLLSVPF GALTYSGPPP NVVPSVCQPC KLLPPAPGTS ATEFTTTATS TNSAGSSSES SSSSETSSTE SSSSESSSSS ETSSTESSSS ESSSSSETSS TESSSSETSS S
|
| |
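Translation table 12 in the NCBI numbering is the alternative yeast nuclear code, in which the codon CTG (CUG) is read as serine rather than leucine. The sketch below shows the effect on a 30-base stretch copied from the gene sequence above (bases 241 to 270). The helper names `codon_table` and `translate` are hypothetical, and table 12 is approximated here by overriding a single codon of the standard table; differences in start codons are ignored.

```python
from itertools import product

# NCBI codon ordering: bases in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(aas: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    return {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), aas)}


STANDARD = codon_table(STANDARD_AAS)
# Table 12 differs from the standard sense-codon assignments only at CTG.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Bases 241-270 of the gene sequence listed in this record.
fragment = "CAGGTGACTC TTTTGCTGGG GTCCCTTACT"

standard_peptide = translate(fragment, STANDARD)
table12_peptide = translate(fragment, TABLE_12)
print(standard_peptide)  # QVTLLLGSLT
print(table12_peptide)   # QVTLLSGSLT
```

Under table 12 the fragment reproduces residues 81 to 90 of the protein sequence above (QVTLLSGSLT), whereas the standard code would place a leucine at the CTG position.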