Gene Information |
Locus tag | PICST_31744 |
Symbol | |
ID | 4838703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1578458 |
End bp | 1580476 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | 12 |
GC content | 36% |
IMG OID | 640390018 |
Product | predicted protein |
Protein accession | XP_001384257 |
Protein GI | 150865155 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
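The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative and not part of the record. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the stated gene length, and that the final codon of this unspliced CDS is a stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 1578458
end_bp = 1580476
gene_length_bp = 2019
protein_length_aa = 672

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and is not counted in the protein.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 2019 0 672
```

Under these assumptions the span matches the Gene Length field, the length is divisible by three, and the codon count minus the stop matches the Protein Length field.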
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.743293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.568196 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTGT TATCAATCGA GGATCTACTT CGAACTCTTC CCGAATACTA TGTAGAAGAG ATAGTGAATG GTATTCCTTT TTCCACTGTA TGTGCTCTCG TTTCCAACGG AACATCGCCG TACCAGAAGT TCTTTCTCAA TAGAGTCTTC CGAGATGTTA TGGTTTTAGA GGCGATGCCT GACGCTTCTC CAGTTTGTGA TATAGAATCT CTATATTTCG ACTTTTTCTT CAAAGACTTT GACGATAGAA GACAAAATAC AGTTTCAATT GGTGGGTACG ATACCTTGGT TCAATTTGTT GCCAGTTACC CAAATATTCG CCTTGGAAAG GTTGAAATTC AAACTGACCA CAACACGGAA GACATTTTAA ATCTATTGGA AGCAAATAAT ATAACCATAA ATAAATCACC TCGACACGCT GGAATCGATA TTAGTTTCCC GGAAACCCTA TATGGACTAC ATCATTTTGA AATAATACAA AACATTCCAG AGAACCTACA GGAGCTTGTA CTCCAACATG TTAACTTTCA AAATATTGAT ATATTGGAAT ATATTAAATC ATTACCTTCA AGACTAAAAA TTCTAGAAAT TGACTGTGTG TCCACATCGA TAAATCCAAG GCACCTAGAA TTGTTACCAA AGTCTTTACG AAAGCTTTGT TTGAGAAGTA CTAGTTTAGA AGGGGATGGA ATAATCAAGA CACACATGCC ACCGTTGTTA GAATCTATTG AACTACATTT GCAGAATGTT GGTGATGTAT GTCTAGATAT ATCTCATTTG AAGAAATTAA TAGAAGTCAA ACTCCTGACT TGCCTTTTAT TCAAGTTACC GGAGCAGTTA CAGAGGCTCT TCTTGGAGAA TTTAGACGGT TTCAAAGATT TAAATCGGCT ATGTGAACTA AAGAAATTAC GCTATTTATC AGTCACTGCG TTGTCATCCA ATATCCTTGA TGAAATTGTT CTTCCCAAAT CCTTGCAAGA ATTAGAAGTC CAGAATCCTT TTCAAGAATA CGAATTACCA ATGAGTACTC TAAATCTAAA ATTTGACAAC CTCCATGAAT CTGTAAGTTT CAAAGACCTT TCGCATATCA CAATGTCTGC AGCTGACTAT TCTAAATTTG TACTGGGATC ATTGGCCGAA AAATTGACCT CTTTAACTAT ACTCAATCAG CATAGTTTAC CGAAAAAGTT TTGGACAGAT ATTGAAAAAT TAAAAGAGTT GAGGAAATTA TCAATAACTA AGTGTGAAGT TGAATCAACT CCAAAGTACT TGCCACCCAA ATTAATAATT TTAGACCTTT CTCAAAATAG GATTTCCAGT ATTGCCATTT CTGGAACGTT GAAGAAACTA GTTCTAGACA GAAACGAGTT CACCACTATA TCCAATGCTA CCCTAGCGTT ACCTTCTACT ATTTGCGAGT TGTCGCTACA GTCCAATCGT ATATCATCAC TAGAGGAAGG GTATGCATTT CCTAAGTGTC TTCAATTATT TGATTTACTT GATAATGAAC ATTGTCCAAT TGAAGACATT CTTACAAACT TACCTCCCCG AATAGTGCAA TTGAGGTTAT CCACTAATAA GAAGAAAAGC TTTCCAAATA AAACAAGCCC CAGTACTGCT GAATTACAAT CTGCCACTAA AAACAAATAC TATAGAAGAG AAAAGCCTTT ACTTAATGTA ACAAGTAGTA CTTTATGGAA AGTCTATCTA GGTGGAGTAA GAGATCATTA TTTGGACTCA GAGTTGGTGT GGACTGGTTG CCCGAATTTG CAGTGCCTTG AAATCAGAAG TATTGATTTG 
GGAAGTATTC TGCTTAAGAA TTATCCATCT TCTTTGAAAA AATTAGTGAT GCTCAACACT AACATAAGTC AAGTAGAAGG GGATTTCCTC ACTTTACCGA GCTTGATTGT TGCCAGCTTA GTGGATAACC CATTGGAGGA ATGGTTAGAG AAAAACAAAG ACCACATTCC CCTGAATGTG AAGTTTAGTT ATTTCCAAGA TATATCATCA TATTGGTAA
|
Protein sequence | MNLLSIEDLL RTLPEYYVEE IVNGIPFSTV CALVSNGTSP YQKFFLNRVF RDVMVLEAMP DASPVCDIES LYFDFFFKDF DDRRQNTVSI GGYDTLVQFV ASYPNIRLGK VEIQTDHNTE DILNLLEANN ITINKSPRHA GIDISFPETL YGLHHFEIIQ NIPENLQELV LQHVNFQNID ILEYIKSLPS RLKILEIDCV STSINPRHLE LLPKSLRKLC LRSTSLEGDG IIKTHMPPLL ESIELHLQNV GDVCLDISHL KKLIEVKLST CLLFKLPEQL QRLFLENLDG FKDLNRLCEL KKLRYLSVTA LSSNILDEIV LPKSLQELEV QNPFQEYELP MSTLNLKFDN LHESVSFKDL SHITMSAADY SKFVSGSLAE KLTSLTILNQ HSLPKKFWTD IEKLKELRKL SITKCEVEST PKYLPPKLII LDLSQNRISS IAISGTLKKL VLDRNEFTTI SNATLALPST ICELSLQSNR ISSLEEGYAF PKCLQLFDLL DNEHCPIEDI LTNLPPRIVQ LRLSTNKKKS FPNKTSPSTA ELQSATKNKY YRREKPLLNV TSSTLWKVYL GGVRDHYLDS ELVWTGCPNL QCLEIRSIDL GSISLKNYPS SLKKLVMLNT NISQVEGDFL TLPSLIVASL VDNPLEEWLE KNKDHIPSNV KFSYFQDISS YW
|
| |
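The record lists translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading CTG as serine instead of leucine. The following minimal sketch shows how the gene sequence maps to the protein sequence under that table. It is a standard-library illustration, not the pipeline that produced this record; `codon_table` and `translate` are hypothetical helper names, and only table IDs 1 and 12 are handled.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(table_id=12):
    """Return a codon -> amino acid dict for table 1 or table 12."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if table_id == 12:
        # Table 12 differs from the standard code at one codon: CTG encodes Ser.
        table["CTG"] = "S"
    return table


def translate(dna, table_id=12):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    table = codon_table(table_id)
    # Drop the spaces that separate the 10-base blocks in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAATTTGT TATCAATCGA GGATCTACTT"))  # MNLLSIEDLL

# An in-frame fragment of the gene sequence that contains a CTG codon.
print(translate("AAACTCCTGACTTGC"))              # KLSTC
print(translate("AAACTCCTGACTTGC", table_id=1))  # KLLTC
```

The last two lines show why the table matters for this organism: the same fragment yields the KLSTC found in the protein sequence only when CTG is read as serine.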