Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_46734 |
Symbol | |
ID | 4839367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 141366 |
End bp | 142682 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390682 |
Product | predicted protein |
Protein accession | XP_001385047 |
Protein GI | 150865715 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGTAG CTACTCATAG TGCCAGGGAC ATTTCTTCGA TTGAGCTGAT ATTGCGAAAT ATCGGTCCTT ATAACGAGGT TCCAGATGAA TACAATGCTG ATGAGGCTGC CGAGGCTCTC AGAACGACCA CTGTATTGGT TATAGGTGCC GGAGGTTTGG GCTGCGAAAT CCTCAAGAAT TTGGCCCTTA CTGGCTTCAA AAAGATCCAT GTGATAGACA TGGACACAAT CGACGTGTCC AACTTGAACA GACAGTTTCT CTTTCGTCCC AAAGACGTGG GTCACTCAAA GGCCGAGGTG GCTGCACGGT TTATACAAGA ACGAATCGGT GACGAAGAGT TGAAGATCAC GCCATACTTT GGAAAGATCC AGGATAAACC TTTGGAATAC TATCGCCAGT TTGGAGTTAT TGTCTGTGGA TTGGATAGTA TAGAAGCCCG AAGATGGATA AATGCCACTG TAGTCAGCCT TGTAGATTCC GAGTTGAACA ACTTGATACC CATGGTTGAC GGTGGAACCG AGGGATTTCG TGGGCAGTCC CGTGTGATTC TCCCGACATT GACATCTTGC TACGAATGTA CTCTCGATTT GCTATCGCCG AAAACGACGT ATCCCGTTTG TACCATTGCG AATACGCCTA GATTGCCTGA ACATTGTATA GAATTTGCTT CAGTAATCGA GTGGCCAAAA CACTTTCCTG GTCGCAAGTT CGATGCTGAC GACCCAGAAC TGGTTCAATG GATGTATGAG ACAGCTTTGG CTAGAGCTAA GCTATTCAAC ATCCAAGGCG TTACCAAACA ATTGACCTTA GGGGTTGTTA AGAATATAAT ACCTGCAATA GCCTCTACTA ACGCTATCAT AGCTGCATCG TGTTGCAACG AAGCCTTCAA AATCGTCACC AACACCAATC CCATCCTAAA CAACTATATG ATGTATGCTG GAGACGAATC CATATTCACA TACACATATG CCCATAGCCG CAGGCCTAAC TGTCCCGTGT GTGGCAACAT GTCCAAGAAA GTTATAGCGA AAAACTGGTG GACACTAGAT AGATTCATCG AAGAAATCTC TGGCAAACAA GAGATCCAAA TGCTGCTGCC TTCATTAACA ACAGCTGAGA AATCTCTCTA CTTGCGCAAC CCTCCTAATT TGGAACAAGC CACAAGACCA AACTTGGCCA AAAAGTTCAA CACCTTGGTA AGAGCTGGAG ACGAAGTAGT GATCACAGAT CCCAACCTTC CTATCTCATT AAGGTTAACT GTGGAGTTTA CTGGTCCTGA AGTAGAACCC GACGATGTCA ACTCCAGTCT AATGTAA
|
Protein sequence | MSVATHSARD ISSIESILRN IGPYNEVPDE YNADEAAEAL RTTTVLVIGA GGLGCEILKN LALTGFKKIH VIDMDTIDVS NLNRQFLFRP KDVGHSKAEV AARFIQERIG DEELKITPYF GKIQDKPLEY YRQFGVIVCG LDSIEARRWI NATVVSLVDS ELNNLIPMVD GGTEGFRGQS RVILPTLTSC YECTLDLLSP KTTYPVCTIA NTPRLPEHCI EFASVIEWPK HFPGRKFDAD DPESVQWMYE TALARAKLFN IQGVTKQLTL GVVKNIIPAI ASTNAIIAAS CCNEAFKIVT NTNPILNNYM MYAGDESIFT YTYAHSRRPN CPVCGNMSKK VIAKNWWTLD RFIEEISGKQ EIQMSSPSLT TAEKSLYLRN PPNLEQATRP NLAKKFNTLV RAGDEVVITD PNLPISLRLT VEFTGPEVEP DDVNSSLM
|
| |