Gene Information |
Locus tag | PICST_63214 |
Symbol | |
ID | 4840529 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 412966 |
End bp | 414609 |
Gene Length | 1644 bp |
Protein Length | 533 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391844 |
Product | predicted protein |
Protein accession | XP_001386281 |
Protein GI | 150866622 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.220985 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0880707 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAAG CTCCATATTC CAACGACTCC GATGATGCCG AATCTCTTCT TGAGTTTGAA AGATTGGAGT CGACCCTTCC CCCAGCCTTG AAAAGGTCAG CCTCTTCACT CTTCGAAGCC ATTCAGCTGC AGAATCCTTC AGGACATTCG CTGGTTTCGT CACCTCAGCC AAACCGGGAA AATCACTTCA ATTCTTCCTG GATCAACACA GGAAATCCAA CTCCGAGAAC CCCTTCTCAG CATGCCGTGT TTCAACGGAA CAATCCCCAG TTGAGAAATG CTCTTCTTTC CAAGCTTTCC CACTCTGAAA ATGGTCGTGA ATCTTCTCCA ATTCCAGTTC CAAGACCCAA ATTCAACGAA GATATTGTAA AGCCTGCCAT GAATACTAAG CAGTCTTCTA CCGGTGTCTT ATGGGTAACC GAAAGAGCCA CTGAATATGG ATTCGATGCC GAAAACCAAA CTGATTGGGC CAACTTGGGC CAAGGTGCGC CAGAACACGG AGATACCGTT CCAGGCTCGT TCATTAGACC GAAAACGATT CCAGTGCCGG ATTATTCTAA GGAATATGCT CCTACTGCTG GCATCAAGAA ATTGAGAGAA GCTGTTGCCA ACTACTATAA CGAAACGTAT CGTCAGAACA AACTCAGCAA ATACACATAC AAGAACGTTT GTATTGTGCC TGGAGGAAGG GCCGGACTTA CGAGAATTGC ATCCATCATC AGTGATTGTT ACTTGAGTTT CTTCTTGCCA GACTATACTG CATATGCTGA AATGTTGTCG TTGTTCAAGA ACTTCTCCCC TATACCGGTG CCATTGGATG AAGCAGACAA CTACGAAATG CACTTGGAAA TGATCAAGAA CGAATTGACA CGTGGTGTCA GTGCTTTATT GACTTCCAAC CCAAGAAACC CTACAGGAAG GTGTATGACG CCACTACACT TGAAGCAATT GCACGATTTG TGTCGTGAGA AATGTTTACT TATCATGGAT GAATTCTACT CGCACTACTA CTACGACGAT GGTTGCACTG GATCATCAAT TTCTTCAGCC CAGTTCGTGG ACGATATCAA CCAGGACCCA GTGCTCATTC TCAATGGGTT GACCAAGGCA TTCAGATTGC CAGGATGGAG AATCTGTTGG ATATTGGGAC CGGAGGAGTA CATCAGTGCC TTGAGTAGTG CTGGTTCGTT CTTGGATGGA GGATCCAATG CCCCATTCCA ATTCACTGCT GTTGATTTCT TGGAACCATT GAAGGTGAGA GCCGAAATGA AGGCACTTCA AATCCACTTC AAGATGAAAC GTGACTACAT TATTGGCAGA CTTTCAAAGA TGGGATTCAC GTTCACTGAA AAGAATATCC CCAACTCCAC CTTCTATTTG TGGTTGAATT TGTCTCATCT TCCAGGGAAG TTGAGTAACT GTCTTGGATT TTTCCACGAA TGTCTTCATG AGAAGGTAAT TGTTGTTCCT GGATTTTTCT TCTTGATCAA TCCACAGAAC TTGTCACGCT TGGAAGATGT TATCTGGTAT AACTACGTAA GATTGAGTTA CGGCCCCGAA TTCAACCTGT TGGTGTTGGG CATGGACGGA ATAGAAAGGA TATTGCATCG TTTCGGCTGT CTCCCATACG ATCCAAACCA GTAG
|
Protein sequence | MSQAPYSNDS DDAESLLEFE RLESTLPPAL KRSASSLFEA IQSQNPSGHS SPNRENHFNS SWINTGNPTP RTPSQHAVNA LLSKLSHSEN GRESSPIPVP RPKFNEDIVK PAMNTKQSST GVLWVTERAT EYGFDAENQT DWANLGQGAP EHGDTVPGSF IRPKTIPVPD YSKEYAPTAG IKKLREAVAN YYNETYRQNK LSKYTYKNVC IVPGGRAGLT RIASIISDCY LSFFLPDYTA YAEMLSLFKN FSPIPVPLDE ADNYEMHLEM IKNELTRGVS ALLTSNPRNP TGRCMTPLHL KQLHDLCREK CLLIMDEFYS HYYYDDGCTG SSISSAQFVD DINQDPVLIL NGLTKAFRLP GWRICWILGP EEYISALSSA GSFLDGGSNA PFQFTAVDFL EPLKVRAEMK ALQIHFKMKR DYIIGRLSKM GFTFTEKNIP NSTFYLWLNL SHLPGKLSNC LGFFHECLHE KVIVVPGFFF LINPQNLSRL EDVIWYNYVR LSYGPEFNSL VLGMDGIERI LHRFGCLPYD PNQ
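The record's fields can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the helper names `clean_sequence`, `span_length`, and `gc_fraction` are hypothetical, and the code assumes that Start bp and End bp are 1-based, inclusive coordinates (which matches the listed gene length of 1644 bp). Only the first three 10-base blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Illustration only: the first three blocks of the "Gene sequence" field.
example = clean_sequence("ATGTCTCAAG CTCCATATTC CAACGACTCC")

print(span_length(412966, 414609))  # 1644, matching "Gene Length"
print(len(example))                 # 30
print(round(gc_fraction(example), 3))
```

Applying `clean_sequence` and `gc_fraction` to the full gene sequence field would give a value to compare against the listed GC content. Because the gene is marked as spliced, the protein length is not expected to equal the gene length divided by three.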