Gene Information |
Locus tag | PICST_31331 |
Symbol | |
ID | 4838844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 533066 |
End bp | 535024 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390159 |
Product | predicted protein |
Protein accession | XP_001384399 |
Protein GI | 150865257 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
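The coordinates and lengths reported in the table above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 533066
end_bp = 535024

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends in one
# stop codon, which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1959 652
```

Both results match the Gene Length (1959 bp) and Protein Length (652 aa) fields.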
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATGA ACTTCAATCA AACTGCTTCC AGTGATCGAG ACCAGATTTC TGCATTTTAC GACCAAACGC ATGTAGAAGC TTACAGCCAT GATCCAGAAT TCACACAGAT ATTCTCGACT GATGTACTTC CAAACTCTGA AGATGTGGAA TTAGCTAATT ATCTATTATA CCTTGACAAC GACTACAACT CGTTCATTCA GATCGCAAAC GCAATTCTGA TAAACGACTC CAAGTTTCAG ATAGAAGACT CCAACTCGCA ATTAGCTGAC TCGACACCCC AGACAGAAGC ATTGTCAATG ATGAGCTCTA CAGAAATCGA ACAACTTGTG AACTACAAAG TTAGCAATAA CAATACCAAT AATAATTTCG ATACTAATGG TACTTATGGA ACTAACTTCC AGGATCAAAA TCAAAGCGAA AATGGATTTC CTCACTTAGT CAGTCACCAA ACTCAAGATA CTCCAAGGAT TACCGTCACG GATGAGACAG GAAGAGAGAT TTCCCAATGG CAACAACCTA CTTACGAATC AAGTACCTCT TTGCAGGACT TCTACTCGTA TGATGTGGCA TTGCAAGACC CTATCCCATC TTCAAGTAGT AGTCTTATTC TTGGGAACTT GGATGAAACT GCAAGGCAAA ATCATGCTGC AAACTACCCG TTTTCTGAGA GTGATCAGCC CGAATTTCAT ATTCCAAAAA TAGAAGAACT GATTCCAGAT CCAGACGACT TTGCTCAATA TTCTCTCCTC ATATACCCAC AAGCCACAAC AGTAGCAAAA AACATACCAA TCTTCAATGT CCCCACTAGC AGTCAAGGAT CTTCAAGAGC AGTTGCTCCT ACGGTTCATT TCCAGCTCGA AACAAAAGAT CATACTGATG CAAACTTACA ACCCCGTACG AATAACTCGC TGAGCAGCTC CTTCAGCCTT TCTACTTCCC CTGCAATGGT ATTTGACGAA AGCAACCCTC TAAGGAGTTC CAGAAGTTCA AGAAGCTCCA GAAGTACACT AAATTCGATG AATTCTTTGG ATAGCAAAGG TCCCAGTTTT ACTGAGGTAG CTAGCCAAGG CGAAGTTGCC AGCTACTTCG AAGTAGTCAG ACATAGTGAC CAAGTCAGAG AATCTGAGGT AGACAGAGAA ACTATGGAAG TCGGAAATTC CAGATTCAGG GTTCTGCCCT TGGCTAAGAT CAGCATTTCC ATTTCGCCCG AGAGTGTTTC ATGTCTCAAC TGTAATATTG ACTACGGAAA CTACACTGGA AAAATCACCA ACAAAATCGA AGGTCACTTA GCATTGGCGA ACGTGTATGT TGCCCCTGGT GCTCATGAAA GTCAGATAAC CGACGCAACA CTTATTAAGC TATACACAGA GACAAAAGAA TTGTCAGCCT CATCATGTAT CCATCAGAGA ACTAAGTACG ACAGGGAAAT CGATGAGTTA CACAATTCCA TTAGCAATAT TGTCTACAAA ACTAGTAGCA AATATTCGTT AGATATGCCC TACGAGCCAC AGTACCTAAG GTTTGAAGTT GGTCCCGATA GTAGGTTGGT TATGGCGAGC AAGAGTGGTT TGTGTCCCTA TTGCGAAGAA GTGAGATTCC TTCCGTTCAA GAACTCGAGT TACTTGTCTC ATTTGACATT GGAACACGGA GTGTTCTCCA ACGGTTATCT CACGCCTGAT GGGCTCTATT TTGGATCTTA CAAGTTGAAG AAGAATAGCA GTAGGAATAA CCAGGAGCAC ACTCCATCTG GCAGAGAACG ACAAGTAGAA GCCTTGATGT GCCCCTTGTG TTTCGATATG 
GTGGAATTTG GATGTTGGGA AGGAAAGAAA AACAAGCTCT TGTCTTACTT CAGACACTTT AAAAATATCC ATGGTCAACA TACGATCAAG GCGAGAAGCT CACAGATTCC ACCCATTCAA GACCGGGGCC GTACGCTCCA TATTCTACCA GATCCTTAG
|
Protein sequence | MNMNFNQTAS SDRDQISAFY DQTHVEAYSH DPEFTQIFST DVLPNSEDVE LANYLLYLDN DYNSFIQIAN AISINDSKFQ IEDSNSQLAD STPQTEALSM MSSTEIEQLV NYKVSNNNTN NNFDTNGTYG TNFQDQNQSE NGFPHLVSHQ TQDTPRITVT DETGREISQW QQPTYESSTS LQDFYSYDVA LQDPIPSSSS SLILGNLDET ARQNHAANYP FSESDQPEFH IPKIEESIPD PDDFAQYSLL IYPQATTVAK NIPIFNVPTS SQGSSRAVAP TVHFQLETKD HTDANLQPRT NNSSSSSFSL STSPAMVFDE SNPLRSSRSS RSSRSTLNSM NSLDSKGPSF TEVASQGEVA SYFEVVRHSD QVRESEVDRE TMEVGNSRFR VSPLAKISIS ISPESVSCLN CNIDYGNYTG KITNKIEGHL ALANVYVAPG AHESQITDAT LIKLYTETKE LSASSCIHQR TKYDREIDEL HNSISNIVYK TSSKYSLDMP YEPQYLRFEV GPDSRLVMAS KSGLCPYCEE VRFLPFKNSS YLSHLTLEHG VFSNGYLTPD GLYFGSYKLK KNSSRNNQEH TPSGRERQVE ALMCPLCFDM VEFGCWEGKK NKLLSYFRHF KNIHGQHTIK ARSSQIPPIQ DRGRTLHILP DP
|
| |
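Translation table 12 is the NCBI alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below illustrates the effect on an in-frame fragment taken from the gene sequence above (bases 211 to 228, codons 71 to 76). The helper names `build_table` and `translate` are hypothetical, and the table is built by hand rather than taken from any library.

```python
from itertools import product

BASES = "TCAG"
# Standard-code amino acids in NCBI order: first, second, and third base
# each cycle through T, C, A, G, with the first base varying slowest.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas):
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


STANDARD = build_table(STANDARD_AAS)
# Table 12 differs from the standard code in one amino acid assignment.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna, table):
    """Translate an in-frame DNA string codon by codon."""
    return "".join(table[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Bases 211-228 of the gene sequence in this record.
fragment = "GCAATTCTGATAAACGAC"
print(translate(fragment, STANDARD))  # AILIND
print(translate(fragment, TABLE_12))  # AISIND
```

Only the table 12 reading agrees with residues 71 to 76 of the protein sequence in this record (AISIND), so translating the gene with the standard code would introduce a leucine at every in-frame CTG.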