Gene PICST_30411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30411 
SymbolALS4 
ID4836722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2731415 
End bp2733235 
Gene Length1821 bp 
Protein Length606 aa 
Translation table12 
GC content44% 
IMG OID640388037 
ProductAglutinin-Like Serine rich hypothetical protein 
Protein accessionXP_001382757 
Protein GI150864068 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGTAA ACAAGTACTT GGCTGCATCC ACCTCGCTTT TATCTATGGT AGTGGCCCTC 
GCCATCGACC AAGATATCCC TCAAACCACT GTCGTCTCTA CCTGGACTGG TACAAGAACA
GCTACAGACA CCGAAGATGA TTATGTTGGC GGTGGAACTA TAACTGTAAT CGATGAAATA
AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TTACCACCAC TGAAACTATC
ACCGATACTG TTGGTGGAAC TGATACTGTC GTAGTTGAAG TTCCAACGCT GTATGGACCA
AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TCACTACTTC TGAAACATTG
ACTGACACCC CAGGTGGAAC TGATACTGTT GTAGTTGAAG TTCCAACGCT GTATGGACCA
AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TCACTACTTC TGAAACATTG
ACTGACACCC CAGGTGGAAC TGATACTGTT GTAGTTGAAG TTCCAACGCT GTATGGACCA
AATCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TTACCACCAC TGAAACTATC
ACCGATACTG TTGGTGGAAC TGATACTGTC GTAGTTGAAG TTCCAACGCT GTATGGACCA
AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TTACCACCAC TGAAACTATC
ACCGATACTG TTGGTGGAAC TGATACTGTC GTAGTTGAAG TTCCAACGCT GTATGGACCA
AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TTACCACCAC TGAAACTATC
ACCGATACTG TTGGTGGAAC TGATACTGTC GTAGTTGAAG TTCCAACGCT GTATGGACCA
AACCCTACTA CTACTATCAC TTCAACTTGG ACCGGTACTT TTACCACCAC TGAAACTATC
ACCGATACTG TTGGTGGAAC TGACACTGTC GTAGTTGAAG TTCCAACTAG CTACCCAACC
TCCAAAAATA CGACTAGCAT AGAACCCTCT TACATTACTT CTATTATCAC AAAAACTGAA
GCAAATGGAT ATTTAACAAC CTACACCACA ACTGACACCA CCACAGAATT TGTTACTGCG
CCATGCATTA CGACCACAAT TGTAACTACT GGTCCGCAAG GTGAGGTAAC CACGTACATA
ACTACTACCA CTGGTTCCAC TGTTACTGAA ATCATTAGTG AACCATGTGT AACCACTACA
ATTGTTACAA CTGGTCTACA GGGAGAAGTT ACCACCTATG TTACCACTAC AGCTCCTGGT
TACACTACAA CTGAATCCGG TCCTACAGCT ACTGTTACCA TTACTGGTCC AAATGGTTCA
ATTAGTACAT ATGTAACTAC CTCTCCATCT ACTGTTACAG CACCAGGTCA TAACTCAACT
GTCACAATTA CTGGACCAAA CGGCAATAAC ACTACTGTCG TTCCAACTGT TCAAGCACCA
GAGCCTACCA CGTTATTCAC CACAGGTCCA AATGGAACCA CGTCCACGAT TGTTACATTT
GCCTCAGTAG TGACTGATTC TCCTTCGGCT GAAGTCTACA CCATTGCCCA CACAGTAACT
GGAACTAATG ATATTCTTTC GACAATCCTC ACTGCTATCA CCATTTCACC TGCCTCTCCA
GCCCCATCCT CCCCAGCTCC AGATTTTGCC TCCAACTACA CCATCCCCTC TGTCCCAGTT
TATGAGGGTT CTGCTAACTT CATGAGATCC ACATTCTCGG TTGCCACTCT TTTATGTTTA
GCTGTGACAT TCATTGTTTA G
 
Protein sequence
MLVNKYLAAS TSLLSMVVAL AIDQDIPQTT VVSTWTGTRT ATDTEDDYVG GGTITVIDEI 
NPTTTITSTW TGTFTTTETI TDTVGGTDTV VVEVPTSYGP NPTTTITSTW TGTFTTSETL
TDTPGGTDTV VVEVPTSYGP NPTTTITSTW TGTFTTSETL TDTPGGTDTV VVEVPTSYGP
NPTTTITSTW TGTFTTTETI TDTVGGTDTV VVEVPTSYGP NPTTTITSTW TGTFTTTETI
TDTVGGTDTV VVEVPTSYGP NPTTTITSTW TGTFTTTETI TDTVGGTDTV VVEVPTSYGP
NPTTTITSTW TGTFTTTETI TDTVGGTDTV VVEVPTSYPT SKNTTSIEPS YITSIITKTE
ANGYLTTYTT TDTTTEFVTA PCITTTIVTT GPQGEVTTYI TTTTGSTVTE IISEPCVTTT
IVTTGLQGEV TTYVTTTAPG YTTTESGPTA TVTITGPNGS ISTYVTTSPS TVTAPGHNST
VTITGPNGNN TTVVPTVQAP EPTTLFTTGP NGTTSTIVTF ASVVTDSPSA EVYTIAHTVT
GTNDILSTIL TAITISPASP APSSPAPDFA SNYTIPSVPV YEGSANFMRS TFSVATLLCL
AVTFIV