Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30036 |
Symbol | LAC4 |
ID | 4837088 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1783793 |
End bp | 1786915 |
Gene Length | 3123 bp |
Protein Length | 1021 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388403 |
Product | Beta-galactosidase (Lactase) |
Protein accession | XP_001382569 |
Protein GI | 150863923 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0855272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGACT ATACAAAGAA TTCGCTAGTA AAAGTACTTT CAGATCCACA GACTGTTCAT ACCAACAGAT TACCAACAAG GGCATACTAC CTCCCGTCTG AATCGACTTT GTCTTTGAAT GGAGACTGGG ATTTTAGTTA TTTCGAAACT CCTCAGGAAG CCCCAATACC AGGAGACAAT TTCGAAGACT TTAAAAAGAT TCGAGTTCCT GGCCATTGGC AATTACAAGG ATACGGAAGA CCTCATTACA CCAACGTCGT CTATCCATTC CCTGTAACAC CCCCAAATCC ACCTTCGAAA AATCCAACTG GGGTTTATCG CCATTCATTT GAAGTTCCAG AAGATTGGTC AAAAAAAGAT TATGAATACA GACTCCGGTT CGAAGGAGTC GACAACTCAT ATCACTTATT TCTCAATGGT AAACTCATTG GATACAACGA AGGAAGTAGA AATGCTGCTG AATTCGATGT ATCCGACTGC ATCCACAAAA CAGGCAAGAA TGACTTGGTC ATTAGAGTTT ATCAATGGTC TAGCTCATCT TACATTGAAG ATCAGGACCA ATGGTGGTTA AGTGGAATAT TTAGAGATGT CTACTTACTA GGATTCAATA AGAAGGGCTA TATCAAGAAC TTTCAAGTTG CTACTGATTT GGACAAAGAG TACAAGAATG CCGAATTGAG AATTAACTTG CAATTGAACA CTACTTCAGA TGTAAAGATA TCTTTGCATG ACCCCACAAA GAATTTGATA TTTGAACAGA AGTTTGACAA ATTGGTTCCA TCTTCTGAAC TCAAATTTCC AGTTTCAGAG CCTTTGAAAT GGACAGCGGA GTCGCCTTAC TTGTACCTTC TTCGAATTGA GATTGTCGAC GAATTAGAGG CAAAGATTTC TTGTGTTGAA CAACAGATCG GGTTCAGAAC AGTAGAAATG AAGAAAGGCT TGATCTGCGT CAATGGAGTT CCAATTTTGA TAAGAGGAGT TAATAGACAT GAACACCATC CAAAGTTTGG CAGATCAGTT CCTTTTGACT TTGTAGAAAG AGACTTAAAA CTTATGAAAG CTCACAATAT CAACGCCATT AGAACTGCTC ATTACCCAAA TCATCCCAAG TTTTATGAAT TGGCAAATCA GCTAGGATTT TGGGTTTTAG ACGAAGCCGA CTTAGAATGT CATGGATTCG TCGAAGCTGT ACGTATTCCA CAGAATAAGG AGACACAAAT TCTGTACGAC GAAACCACTC GTCAACTATT CAAAGAAGCA GCGGAATTCA CATCAAATAA CCCGTTATGG GAGAATGCTT ATATAGACCG TGCAAATCAA TTGGTGCATA GAGATTGTAA TCAACCCTGC ATCATCATAT GGTCTCTAGG AAATGAAGCA TTTTTCGGAC GGAATCATGC TAAAATGGCG AAAGAAATCA GGAGAAATGA TATTCAGAAC AGACCAATTC ATTACGAAGG AGATTTGAAC GCTGAAGTTG CCGACATGTT TAGTAGAATG TACATCACTC CTGATGAAGT TCTTGAGTAT ACTAAGCAGA AAGCAAAACC CTTGATTCTA TGTGAATATG CTCATGCAAT GGGAAATGGA CCCGGACTTT TAAGACAATA CCAGGACTTA TTCTATGAGC ATGAAATTCT TCAAGGAGGA TTTGTCTGGG AATGGGCAAA CCACGGATTG GAAGATGTTG ATTCCAAAGG AAATGTAGTA TATAAATACG GCGGTGACTT CGGCGAGTCT CCTCACGACG GTGTTTTTAT TTTGGACGGA CTTACGAATT CTGTCCATGA CCCAACTCCT GGATTGGTGG AATATAAGAA GGTGATCGAG CCTGTAGTTA TTTTAATTGG AGAAGAAGAA GTTTCCATCA AGAATACTTT TGATTTCATC GACTTGAATG AGTACACGGC TGAATACACA TTTCTCGAAA TCATTGGATT AGATAGACAT GTTTTGCAAT CCGGAGATTT AGACATCTCT AATTTGCAGC CCAAACAAAC AAGAAAACTA GCACTCCCAA CTTTGGAATC CAAACCTGAG CCTGGGTCCA CTGTTATATT TCATATCATT ATCAAAACAA AGAAAGAAAC TCGCGGTTTA CGTCGAGACC ATATTATCTC TTGGGCACAG AGAAAGATAC AACAGGGAAG CTCCAAAATT CTCAAGCAAC CTGGTGCAAC TTTAAAATGC AAACAAGAAG GGAACAGTTT GCAGATCGAT TCCGAAGGAT CCAAGTTGGT GTTTGATTTG GTTAAAGGGA GAATTAATTA TTGGGGATCA AGCAAAGAAC TTTTCTTGAG CGATGAAATG GACCAAGGAA GTTTGACATT CTGGAGACCA AGCATCAATA ATGATGCTAC CAAAGATGCA CCATACTGGA AATCATTTGG CTTGGACAAG ATGCAAAACC ATGTTCGTGA TGTTAGAGTT CAAAAACAAA ACCAATTTCA AGTAACCATC GAAGTAGATT CCTTTGTGGC TCCTCCTATA TTAGCCTGGG GATTTGAAGT GAAACAGGTT TACGAAGTGC TTGACAAAAA GATCAAATTA ACCACTTCTT TGAAGCCAAT TGGCCACAAG GATGAGTTCA TTCCCAAGAC AATTCCTCGT CTAGGTTATC AGTTCATTAT TTCTGACAAA CTAGGATCTA ACGTGAGATG GTTTGGGCGA GGTCCTGGAG AAAGCTATAG TGACAAAAAG GAAGGACAAT GGTTCGATGT TCATAGACTT CCGCTTGACA AATTGGATTA CAGCTACGAT TACCCACAAG AAAACGGAAA CCATGAAGAT ACCGATTGGG TTCTTCTTGA ATCAAAAGAA GAAGTCAAAG GCACACAAGC AGAAAATGGA GCTAAGAGTG AGTGTGGTGC AGCCCCAAAT GTGTCTAATG CAGTGTTAAT TAGCTCATCT AGAGCTTTCG GATTCAAGGC GTCGGATAGC TGGCGAGTAG ACGAGGCACA GCATCCATCT GACATAGTTC ATGATAGACG GTTTATTCGG TTAGACTACA AGCAACATGG AGTTGGAACC GAGGCTTGTG GTCCTGGTCC ATTAGCCGAA TATCAATTTA GACTTAACGG TCCAATAGAG TTCGAGTTCA CACTAGACAT GATAACGAAC TAG
|
Protein sequence | MIDYTKNSLV KVLSDPQTVH TNRLPTRAYY LPSESTLSLN GDWDFSYFET PQEAPIPGDN FEDFKKIRVP GHWQLQGYGR PHYTNVVYPF PVTPPNPPSK NPTGVYRHSF EVPEDWSKKD YEYRLRFEGV DNSYHLFLNG KLIGYNEGSR NAAEFDVSDC IHKTGKNDLV IRVYQWSSSS YIEDQDQWWL SGIFRDVYLL GFNKKGYIKN FQVATDLDKE YKNAELRINL QLNTTSDVKI SLHDPTKNLI FEQKFDKLVP SSELKFPVSE PLKWTAESPY LYLLRIEIVD ELEAKISCVE QQIGFRTVEM KKGLICVNGV PILIRGVNRH EHHPKFGRSV PFDFVERDLK LMKAHNINAI RTAHYPNHPK FYELANQLGF WVLDEADLEC HGFVEAVRIP QNKETQISYD ETTRQLFKEA AEFTSNNPLW ENAYIDRANQ LVHRDCNQPC IIIWSLGNEA FFGRNHAKMA KEIRRNDIQN RPIHYEGDLN AEVADMFSRM YITPDEVLEY TKQKAKPLIL CEYAHAMGNG PGLLRQYQDL FYEHEILQGG FVWEWANHGL EDVDSKGNVV YKYGGDFGES PHDGVFILDG LTNSVHDPTP GLVEYKKVIE PVVILIGEEE VSIKNTFDFI DLNEYTAEYT FLEIIGLDRH VLQSGDLDIS NLQPKQTRKL ALPTLESKPE PGSTVIFHII IKTKKETRGL RRDHIISWAQ RKIQQGSSKI LKQPGATLKC KQEGNSLQID SEGSKLVFDL VKGRINYWGS SKELFLSDEM DQGSLTFWRP SINNDATKDA PYWKSFGLDK MQNHVRDVRV QKQNQFQVTI EVDSFVAPPI LAWGFEVKQV YEVLDKKIKL TTSLKPIGHK DEFIPKTIPR LGYQFIISDK LGSNVRWFGR GPGESYSDKK EGQWFDVHRL PLDKLDYSYD YPQENGNHED TDWVLLESKE EVKGTQAENG AKTFGFKASD SWRVDEAQHP SDIVHDRRFI RLDYKQHGVG TEACGPGPLA EYQFRLNGPI EFEFTLDMIT N
|
| |