Gene PICST_41725 details


Gene Information

Locus tag: PICST_41725
Symbol:
ID: 4837545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 91371
End bp: 93086
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 12
GC content: 44%
IMG OID: 640388860
Product: predicted protein
Protein accession: XP_001382779
Protein GI: 126132508
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID: [TIGR00907] amino acid permease (GABA permease)
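
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the reported gene length includes the stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 91371
end_bp = 93086
gene_length_bp = 1716
protein_length_aa = 571

# Assumption: coordinates are inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS uses one codon per residue plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(span_bp, expected_cds_bp)  # prints "1716 1716"
```

Both values agree with the reported gene length of 1716 bp, which fits the record's statement that the gene is not spliced.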


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.817066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00692909
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTAAAA TAGACTCCGA AAAGAGCACT GAGATACATA ACGTGCCCAG CGTTGGCTAC 
GGAGAAATAC AGAACTATGT CTCCAATCGT ACCGCCCAAG GTATGCCAGC CTTGGATGCT
CTTGCCCAGG CTGATGGCAA GAATGCTGAG AAGTTGATGG AAGAAGCTCA GGCCAACTTG
GAATTAGTTC AGGAGACCGG TTATGCTCCA GAATTGAGAC GTAACTTTGG TGTGATCTCA
TTGTTAGGTG TGGGGTTTGG GTTAACCAAC TCTTGGTTCG GTATCTCGGC CTCTTTGGTC
ACAGGTATCA GTTCTGGTGG TCCCATGATG ATCATCTACG GTATTCTCAT TGTTGCCTGT
ATTTCCATGT GTGTAGCCAT CAGTTTGAGT GAGTTGATCA GTGCCATGCC TAATGCTGGT
GGCCAATACT ACTGGACAAT GAAGTTGGCT CCCAAGAAAT ACGCTCCTTT CTGGGCTTAT
ATGTGTGGTG CTTTTGCATG GGCTGGTTCC GTCTTCACAA GTGCTTCCGT TACTCTTTCC
ATTGCTTCCT CGGCTGTCGG GATGTACATG TTGTACCATC CAGACAAGAC CATCCAAACA
TGGCATGTGT TTGTAACTTA TGAAATCGCC AACATCTTAT TAGTATTCTT CAACCTCTGG
GAAAAACCTC TACCAGCCAT CTCAAAGAGT TCGTTGTATA TCTCTCTTTT GTCGTTCTTG
ATCATCACTA TTGTGGTGTT GGCCAAATCT GGAGGAGAAT TCCAATCGGC CAACTTCGTG
TTTGTGGAAT TTACTAACGG TACTGGTTGG AGTTCCAGTG GTATTGCTTT CATTGTTGGT
TTGATCAACC CCAACTGGTC CTTCAGTTGT TTGGATGCTG CCACCCATCT TGCTGAAGAA
TTACTTGAAC CAAGAAAGCA AATTCCAATT GCAATTATCG GCACTGTTAT TATTGGATTC
ATCACCTCGT TCTCCTACTC CATTGCCATG TTCTTCTGCA TCAAGGATTT GGACGCCATC
TACAACTCCA ACACTGGTGT GCCAATCATG GATATCTTCT ACCAGGTATT GAACAATAAG
GCTGGTGCTG TCATCTTGGA ATTCCTAATT TTCTTGACTG CCATCGGTTG TAACATTGCC
TCTCACACTT GGCAGGCTAG ATTATGTTGG TCTTTTGCTA GAGACAATGG TTTGCCAGGA
TCCAGATATT GGTCCAAAGT CAACCCAAGA ACTGGTGTTC CAGTGAATGC CCATCTTATG
TCTTGTGTGT GGTGTGCTAT CATTGGTTGT ATCTACATGG GCTCTACTAC TGCCTACAAT
GCCATGGTCA TTGGGTGTAT TATCTTTTTA TTGATGTCAT ACGCTGTGCC AGTTGTTTTC
TTGTTAATGA AGGGAAGAGA CAACATTAAG CATGGTCCAT TCTGGTTAGG TAAAATTGGA
CTTTTCGCCA ACATTGTTCT TCTCGTCTGG ACTGTATTCA CTACTATTTT CTACAGTTTC
CCACCTGTCA TGCCAGTCAC CGCAGGTAAC ATGAACTACG TCTCTGTCGT AGTTGGTGTC
TTTGGAGCAT ACTGTATTAT CTATTGGTTT GCTAGAGGCA AAAAGAAGTT CATCACTGCA
GAAGACAGAG AAGCAAAGAT TGACGAGTTG ACACACCAAT TGTCGCAACA AATATCACAC
ATAGAAGTCG TTCTCTCCCA CAAGAACGAC GTGTAA
 
Protein sequence
MSKIDSEKST EIHNVPSVGY GEIQNYVSNR TAQGMPALDA LAQADGKNAE KLMEEAQANL 
ELVQETGYAP ELRRNFGVIS LLGVGFGLTN SWFGISASLV TGISSGGPMM IIYGILIVAC
ISMCVAISLS ELISAMPNAG GQYYWTMKLA PKKYAPFWAY MCGAFAWAGS VFTSASVTLS
IASSAVGMYM LYHPDKTIQT WHVFVTYEIA NILLVFFNLW EKPLPAISKS SLYISLLSFL
IITIVVLAKS GGEFQSANFV FVEFTNGTGW SSSGIAFIVG LINPNWSFSC LDAATHLAEE
LLEPRKQIPI AIIGTVIIGF ITSFSYSIAM FFCIKDLDAI YNSNTGVPIM DIFYQVLNNK
AGAVILEFLI FLTAIGCNIA SHTWQARLCW SFARDNGLPG SRYWSKVNPR TGVPVNAHLM
SCVWCAIIGC IYMGSTTAYN AMVIGCIIFL LMSYAVPVVF LLMKGRDNIK HGPFWLGKIG
LFANIVLLVW TVFTTIFYSF PPVMPVTAGN MNYVSVVVGV FGAYCIIYWF ARGKKKFITA
EDREAKIDEL THQLSQQISH IEVVLSHKND V
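
The record lists translation table 12, the NCBI Alternative Yeast Nuclear Code, which differs from the standard code in reading the codon CTG as serine rather than leucine. The following is a minimal sketch of how the gene sequence relates to the protein sequence under that table. The function name `translate_table12` and the constants are hypothetical, and the code is not the database's own translation pipeline.

```python
# Codon table built in the conventional TCAG order.
# Identical to the standard code except that CTG maps to S (serine).
BASES = "TCAG"
AMINO_ACIDS_TABLE12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS_TABLE12[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_table12(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_block = "ATGTCTAAAA TAGACTCCGA AAAGAGCACT"
print(translate_table12(first_block))  # prints "MSKIDSEKST"
```

The output matches the first ten residues of the protein sequence listed in this record. Passing the full gene sequence to the same function should give the full protein, up to the terminal TAA stop codon.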