Gene PICST_28367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28367 
SymbolMLP2 
ID4851144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1038085 
End bp1039371 
Gene Length1287 bp 
Protein Length428 aa 
Translation table 
GC content40% 
IMG OID640392852 
Productmyosin-like protein 
Protein accessionXP_001387843 
Protein GI126274130 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.525338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0889168 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGTC TCATAGACGA CTTGGTGAAT ATCTGTTGGA GTCGAATCTC ACGCAGCTCT 
CTGTCTGATT CAATATTTGT CTCTCAGGTA TTAGGACTTC TTTCCGAGAT AGAATCTACG
CTTGGGGTCA ATTCTCTTTT GAAAAATGAA GAGCTCAAAT TATTGAAGCA GATGATTCAG
GCGACGCCTC TGATGCGCTT ACACAAGAAA GAGTTCCAAG AGTTTATCAT GCGGTTGGTG
AAATATCCCA ACTTTGAAGT CTTTCTCTAC GAGCGATGTA GAATCTCGAT GGACGATTTG
CGTAGAATCA TGAATGTTCC GTTTAAAGGC CCAAATCCAC CAACTCTTTC TCCGCTTGCT
CCACGAGAAA TCAAAAATAC AGCTAATGAG GCAAGAATAT CACATTCTCG TTACTTTGAT
CACAAAGAAA ATGTATCTCC CAACCACACC AACAAACAAT TGAAATCTCC TCCTGAATCA
CCCAGCCTTG ACTACAGATA CACTAAATTG CAATCAGAAC TTAATTTCAA AGATGAACAG
CTCAGAACGA AAGAAAGCGA GTACACAAGG GCGAACCTAG AGTATAGAAA GCTAGTGGAT
ACCAACTCAA CACAACTTAA AAGGATACGA GATCTCGAAA GTGAAGTCAG TTCCATCAAC
AAATATGTCC AGTCTTTGGA AGAACAGCTT TCAAGACAGC TGGGAGATAG AAACTCGAAT
TCTTTGGCTC TGAAAATTAA AGATAGGGAC AGAACTATAC GTAGTCTTGA GCAATTAAGC
AACGAATACA GAAACGAACT CAAGAACTTA GAAGAAGATA AATTGAAATC CGAGAATTCA
CTTGCGGAAT TGGTCACTAG TCTACGGGAG CAGGATAACT TGATCAAAAA TCTACAATTG
AAGCTTTCGC TAACCGGAGA ATCATTGAAA ATACAGTCTC AGAAAGCAGA CCCGGTTCGA
GTTAACTCAC AACTTCAAGA CTTTCTACTA AACTTACCAT TCCTCAAACA GTATTACTAC
TTCTACAAGT ATAAAAACAA CACACGCAGA TTGTTTATTG TGAACATGTT TGCGATGATA
CTAGCGACCA TCATAGTGTT GCATGTTGCG GAATGTGTAC TATATTTCTC CATCTGGTTT
TTCACTTCGA AACCAAACTC TTCCATGTAC TTATATAACA ACTTTGACAA CGAGTGGTAC
AGCACCGAAT CCACCTTTGT TTGGTGGAAA GAAATAGAAA CCTTAGAATA CTTCGTGTCC
ACGATCAGTG AATGGTTCAC TACATAG
 
Protein sequence
MSSLIDDLVN ICWSRISRSS LSDSIFVSQV LGLLSEIEST LGVNSLLKNE ELKLLKQMIQ 
ATPLMRLHKK EFQEFIMRLV KYPNFEVFLY ERCRISMDDL RRIMNVPFKG PNPPTLSPLA
PREIKNTANE ARISHSRYFD HKENVSPNHT NKQLKSPPES PSLDYRYTKL QSELNFKDEQ
LRTKESEYTR ANLEYRKLVD TNSTQLKRIR DLESEVSSIN KYVQSLEEQL SRQLGDRNSN
SLALKIKDRD RTIRSLEQLS NEYRNELKNL EEDKLKSENS LAELVTSLRE QDNLIKNLQL
KLSLTGESLK IQSQKADPVR VNSQLQDFLL NLPFLKQYYY FYKYKNNTRR LFIVNMFAMI
LATIIVLHVA ECVLYFSIWF FTSKPNSSMY LYNNFDNEWY STESTFVWWK EIETLEYFVS
TISEWFTT