Gene Information |
Locus tag | PICST_30411 |
Symbol | ALS4 |
ID | 4836722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2731415 |
End bp | 2733235 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388037 |
Product | Agglutinin-Like Serine rich hypothetical protein |
Protein accession | XP_001382757 |
Protein GI | 150864068 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
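The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the CDS ends in a single stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2731415
end_bp = 2733235
protein_length_aa = 606

# Assumption: coordinates are inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has three bases per residue plus one stop codon.
expected_cds_bp = protein_length_aa * 3 + 3

print(gene_length_bp)   # 1821
print(expected_cds_bp)  # 1821
```

Under these assumptions both values agree with the reported Gene Length of 1821 bp.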
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAGTAA ACAAGTACTT GGCTGCATCC ACCTCGCTTT TATCTATGGT AGTGGCCCTC GCCATCGACC AAGATATCCC TCAAACCACT GTCGTCTCTA CCTGGACTGG TACAAGAACA GCTACAGACA CCGAAGATGA TTATGTTGGC GGTGGAACTA TAACTGTAAT CGATGAAATA AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TTACCACCAC TGAAACTATC ACCGATACTG TTGGTGGAAC TGATACTGTC GTAGTTGAAG TTCCAACGCT GTATGGACCA AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TCACTACTTC TGAAACATTG ACTGACACCC CAGGTGGAAC TGATACTGTT GTAGTTGAAG TTCCAACGCT GTATGGACCA AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TCACTACTTC TGAAACATTG ACTGACACCC CAGGTGGAAC TGATACTGTT GTAGTTGAAG TTCCAACGCT GTATGGACCA AATCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TTACCACCAC TGAAACTATC ACCGATACTG TTGGTGGAAC TGATACTGTC GTAGTTGAAG TTCCAACGCT GTATGGACCA AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TTACCACCAC TGAAACTATC ACCGATACTG TTGGTGGAAC TGATACTGTC GTAGTTGAAG TTCCAACGCT GTATGGACCA AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TTACCACCAC TGAAACTATC ACCGATACTG TTGGTGGAAC TGATACTGTC GTAGTTGAAG TTCCAACGCT GTATGGACCA AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TTACCACCAC TGAAACTATC ACCGATACTG TTGGTGGAAC TGACACTGTC GTAGTTGAAG TTCCAACTAG CTACCCAACC TCCAAAAATA CGACTAGCAT AGAACCCTCT TACATTACTT CTATTATCAC AAAAACTGAA GCAAATGGAT ATTTAACAAC CTACACCACA ACTGACACCA CCACAGAATT TGTTACTGCG CCATGCATTA CGACCACAAT TGTAACTACT GGTCCGCAAG GTGAGGTAAC CACGTACATA ACTACTACCA CTGGTTCCAC TGTTACTGAA ATCATTAGTG AACCATGTGT AACCACTACA ATTGTTACAA CTGGTCTACA GGGAGAAGTT ACCACCTATG TTACCACTAC AGCTCCTGGT TACACTACAA CTGAATCCGG TCCTACAGCT ACTGTTACCA TTACTGGTCC AAATGGTTCA ATTAGTACAT ATGTAACTAC CTCTCCATCT ACTGTTACAG CACCAGGTCA TAACTCAACT GTCACAATTA CTGGACCAAA CGGCAATAAC ACTACTGTCG TTCCAACTGT TCAAGCACCA GAGCCTACCA CGTTATTCAC CACAGGTCCA AATGGAACCA CGTCCACGAT TGTTACATTT GCCTCAGTAG TGACTGATTC TCCTTCGGCT GAAGTCTACA CCATTGCCCA CACAGTAACT GGAACTAATG ATATTCTTTC GACAATCCTC ACTGCTATCA CCATTTCACC TGCCTCTCCA GCCCCATCCT CCCCAGCTCC AGATTTTGCC TCCAACTACA CCATCCCCTC TGTCCCAGTT TATGAGGGTT CTGCTAACTT CATGAGATCC ACATTCTCGG TTGCCACTCT TTTATGTTTA 
GCTGTGACAT TCATTGTTTA G
|
Protein sequence | MLVNKYLAAS TSLLSMVVAL AIDQDIPQTT VVSTWTGTRT ATDTEDDYVG GGTITVIDEI NPTTTITSTW TGTFTTTETI TDTVGGTDTV VVEVPTSYGP NPTTTITSTW TGTFTTSETL TDTPGGTDTV VVEVPTSYGP NPTTTITSTW TGTFTTSETL TDTPGGTDTV VVEVPTSYGP NPTTTITSTW TGTFTTTETI TDTVGGTDTV VVEVPTSYGP NPTTTITSTW TGTFTTTETI TDTVGGTDTV VVEVPTSYGP NPTTTITSTW TGTFTTTETI TDTVGGTDTV VVEVPTSYGP NPTTTITSTW TGTFTTTETI TDTVGGTDTV VVEVPTSYPT SKNTTSIEPS YITSIITKTE ANGYLTTYTT TDTTTEFVTA PCITTTIVTT GPQGEVTTYI TTTTGSTVTE IISEPCVTTT IVTTGLQGEV TTYVTTTAPG YTTTESGPTA TVTITGPNGS ISTYVTTSPS TVTAPGHNST VTITGPNGNN TTVVPTVQAP EPTTLFTTGP NGTTSTIVTF ASVVTDSPSA EVYTIAHTVT GTNDILSTIL TAITISPASP APSSPAPDFA SNYTIPSVPV YEGSANFMRS TFSVATLLCL AVTFIV
|
| |
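The record lists Translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below is a minimal illustration and not the tool that produced this record: it builds the standard codon table, overrides CTG, and translates bases 271 to 300 of the gene sequence above. The names `standard`, `table12`, and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in NCBI order (first, second, third base cycle through TCAG).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

standard = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AAS)}

# Table 12 differs from the standard code only at CTG (Ser instead of Leu).
table12 = dict(standard)
table12["CTG"] = "S"


def translate(dna, table):
    """Translate an in-frame DNA string, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Bases 271-300 of the gene sequence (in frame, contains one CTG codon).
fragment = "GTAGTTGAAG TTCCAACGCT GTATGGACCA"

print(translate(fragment, table12))   # VVEVPTSYGP
print(translate(fragment, standard))  # VVEVPTLYGP
```

The table 12 result matches residues 91 to 100 of the protein sequence above, whereas the standard code would place a leucine at the CTG position.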