Gene PICST_61009 details

Gene Information

Locus tag: PICST_61009
Symbol: MID1
ID: 4839272
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 761697
End bp: 763334
Gene Length: 1638 bp
Protein Length: 532 aa
Translation table: 12
GC content: 43%
IMG OID: 640390587
Product: integral plasma membrane protein
Protein accession: XP_001385156
Protein GI: 150865796
COG category:
COG ID:
TIGRFAM ID:
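
The coordinate and length fields above are related by simple arithmetic, which a short check can make explicit. The sketch below is illustrative only: the function and variable names are hypothetical, and it assumes 1-based inclusive coordinates and a protein length that excludes the stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 761697
end_bp = 763334
protein_length_aa = 532


def gene_length(start: int, end: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Coding nucleotides for a protein, assuming one trailing stop codon."""
    return 3 * (protein_aa + 1)


span = gene_length(start_bp, end_bp)
cds = coding_length(protein_length_aa)
non_coding = span - cds  # remainder not accounted for by the coding sequence

print(span, cds, non_coding)  # 1638 1599 39
```

Under these assumptions the span matches the listed gene length of 1638 bp, and the 39 bp left over after the 1599 coding nucleotides is consistent with the record marking the gene as spliced.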


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0502869
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0930816
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATC TTGTGGTGGC ATTCGTGCTG TTCTTTCTCG CCACTGGCGC GGTGGCCATA 
AACTTCGAAT TCGATATCGA CGCTGGACTT GCACTCCCTG TTGAGAATCC ATACGAAGAA
ATTGCACAAG AATATGAAAT GAACCAAATA GAATTCGGAA GCAGCCTCAA CAAAAAGCCA
AGTATAGATG GCAATGTCAG TAAACGATTT GACATAGGAG CCGTTAAGGT GTCCAACGTC
AATTCCCACG GACTTCATGA GTTTACACCT ATTAGCGATA CTATTGTCCA GAGTGATACA
AAGTACTACT CTTTCAGTGT GAACACCACC TCTGGGTTAG GAGAGTTCTA CGAATTGTTG
ATCTTTATCA CGGGTAATAT TTGTACGCAG CCATCAAACG TAGGAGCCAA CCAGACCAGT
TTAGCTGTGT ATTATTCATT TAATTCGTCT ATATTCACTA ATATAGAATA CTCAACTATG
GTGTTATTTG AAAATGGCTA TGTCCAGTCA TTGGCCGATG TGGCTGTGAA CTCTAACAAT
AACGAGTCGG TGCTATATAT TGTAGTGCGG GCGCCTGAAA ACGTCAACAA AACGGCTACA
TGGACATATC AGATCGGAGT TTCCCAGAAC GATTTAGTAT TCCAGTGGGA TGATCAGACC
TGGGCACAAT TCATAGATTC AGACGATGAT TCGGCTCTTG TAGTTACCGG TAATTTGACC
AATGTTCAGG GACTTAACAT CACGGAGTTG AATGCAACAA GATCGCAGTT TTCGTTGTAT
GTCTATTCAA ATGACTATAG ACACTATTTC GATACCCTCA ACAGCAGTTG GTGTGCAGTT
AGAAACGGCC CAGCTCTTAT GAATCCTGCT ACTATAGAAA GTAGCTATAC CAGCAGACAG
GGCTCGCTTC ACCAGCAATT CCATCTCACG GGACTTAACA AATCGACGAA ATACATAGCG
TACTTGATCT CCGACTTCCA CGGAAGCGAC TTTGGAGGAG CGGTGTATCG TCCCTTTGAG
TTTGAAACGT TGGACACTGA AGCCTGTGAA CTAATCTATA ACTTAGAATT TTGCAACCAG
ATTGCCTATT CGGTACCGGC TACACCAGGT GGTTCTAAGG AAGAAGTTCG ATCTCTCTAC
GATAATCAAG CAAGAAACCT CTTCACCAAC TTCAGTAAGG CTATCCAGCA AATTCTGTGT
GATACTGAGG ATACGGCCCA ATTCTCTCCT ATTAAAACCT GCAGTGATTG TATATCTTCA
TATAAGGACT GGCTCTGTGC TATCACCATT CCTCGATGTT CAACCAGAAA CATAACCGGA
TACACCGAAA GAAAGCCAGG TGAATCTCGT AATAGTTTTA TCAATGACAT TGTTATGCCC
AACTTGTCTT ACTACGAAGT TATGCCCTGT GTCAATATCT GTGAAGCTAT AGTGAGAGAC
TGTCCGGCTC AGTTTGGATT CATGTGTCCC ACCACCAACG AAACTATACG ACAATCGTAC
TACTGGGATA ACGGGGGACA ATGGCCTACT TGTAACTATG TCGGCAAGTT GACCGTCGTG
ACTAATGCTG CCTTCAGGGC ATCCATGGTT AATTGGTTTA TGTTGGTTCT CCTGGTAGCT
TTAACAGTAT TGGTGTAG
 
Protein sequence
MNNLVVAFVS FFLATGAVAI NFEFDIDAGL ALPVENPYEE IAQEYEMNQI EFGSSLNKKP 
TVKVSNVNSH GLHEFTPISD TIVQSDTKYY SFSVNTTSGL GEFYELLIFI TGNICTQPSN
VGANQTSLAV YYSFNSSIFT NIEYSTMVLF ENGYVQSLAD VAVNSNNNES VLYIVVRAPE
NVNKTATWTY QIGVSQNDLV FQWDDQTWAQ FIDSDDDSAL VVTGNLTNVQ GLNITELNAT
RSQFSLYVYS NDYRHYFDTL NSSWCAVRNG PALMNPATIE SSYTSRQGSL HQQFHLTGLN
KSTKYIAYLI SDFHGSDFGG AVYRPFEFET LDTEACELIY NLEFCNQIAY SVPATPGGSK
EEVRSLYDNQ ARNLFTNFSK AIQQISCDTE DTAQFSPIKT CSDCISSYKD WLCAITIPRC
STRNITGYTE RKPGESRNSF INDIVMPNLS YYEVMPCVNI CEAIVRDCPA QFGFMCPTTN
ETIRQSYYWD NGGQWPTCNY VGKLTVVTNA AFRASMVNWF MLVLSVALTV LV
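
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. As an illustration, the sketch below translates the first 60 nucleotides of the gene sequence under both the standard code and table 12. The helper names are hypothetical, and the codon table is built by hand here rather than taken from any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(overrides=None):
    """Return a codon -> amino acid dict, with optional codon reassignments."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    table.update(overrides or {})
    return table


def translate(dna, table):
    """Translate complete codons in frame 0; spaces in the input are ignored."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


standard = build_table()
table_12 = build_table({"CTG": "S"})  # the only change from the standard code

# First line of the gene sequence as listed in the record.
first_60 = "ATGAATAATC TTGTGGTGGC ATTCGTGCTG TTCTTTCTCG CCACTGGCGC GGTGGCCATA"

print(translate(first_60, standard))  # MNNLVVAFVLFFLATGAVAI
print(translate(first_60, table_12))  # MNNLVVAFVSFFLATGAVAI
```

The table 12 result agrees with the first 20 residues of the listed protein sequence, where residue 10 is S. A translation made with the standard code would give L at that position.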