Gene PICST_39044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39044 
SymbolDIP5.1 
ID4850778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp37424 
End bp38983 
Gene Length1560 bp 
Protein Length519 aa 
Translation table 
GC content40% 
IMG OID640392486 
Productdicarboxylic amino acid permease 
Protein accessionXP_001387660 
Protein GI126273512 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0833] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.115951 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AATGAAATAA CTGATGAAGA TTCAATCTTT AAAGAAGACA GGGAAAGATT GTCCAAAGAC 
CTTCATTCTA GACACTTGCA AATGATTGCG ATCTGTGGAG TATTTGGTAC TGGTATCTTT
CTTAGTTCAG GAAAGGTTTT TGCACTCACA GGTGCTGGAG GTACTTTCCT CGCCTATGCT
TTGATGGCAA TTATAGTTGG AATTAACCAG ATAGCTATTG CGGAAGTTGC AGCTTTAATG
CCGACTTCTT CTGCCACTGT TAGACACTTA GAACATTTTG TTGATCCGGC TCTAGGATTC
GCCTATGGTT GGATTTCTGT CTGGCAAAAT GTCATGCCTG GTGAAATTGC TGCAGCCTCA
GTTATTATAA CATTCTGGAC AGATATCAAT TCTGCTGCTT GGATTAGTAT TATTATTGTT
GCACTTATCG CCGTTAATTC ATATTCGATG AAGTTATATG GAGAAATTGA GTTTTCATTT
GCCATACTTA AACTTACTTT GTTGACAGGA TTGATTATAG TTTCAATAGT CATTACAGCT
GGTGGAGGCC CAAATCATGA GTCTATTGGA TTTAGATATT GGAGGGATCC GGCACCATTT
CTTTCTTATT TGACAACAGG AAGTCTTGGC AGATTTGCAG CCTTTTGGTC CTCGTTGAAT
TCTGTAGTTT ACTCGTTTGG TGGAGTGCAA TCAGTTCCAA TATTAGCTAG TGAGGTCAAA
TACCCTAGAA GAGCAGTTTT CAAAGCTTGC AAAAGAATCT TCTTTAGGGT TTCGATTTTG
ATGACCTTGG CAGTGTTGTG TTTGACCTTA ATTGTTTCTC CAAGGGACAA GAACATCACT
TCAGGTTCAG GAAATGCAAA ATCATCACCT TATGTTGTGG CTATCCAAAA TGCTGGTATT
CCCGCATTAC CCCATATTGT GAATGCTGTT GTCTTTACTT CGGCTTTTTC TGCTGCTAAT
GCTGGTGTTG TCCAGGCTTC TAGAGTTCTT TTCGCTTTGG CTGTCAAACG TCAAGCCCCA
TCTTTCTTCT TGAAGACCAC CAAGAGAGGA ATACCTATCT ATGGTTTGGC ACTTGTTGCT
GTATTCATGC CCTTGTCTTA CATGTCAGTG TCCAAAACTG CAGCAACGGT TTTCAATTGG
TTCCAAAGTT TGACCTCTTC AAATTTGTTA TTAGGATGGA TTTTGATTGG GGTCAACCAT
GCTTCACTTC ATAGAGCTCT CAGAGCTCAA GGCTACTCCA GAAGCAATTT ACCTCATACA
GTGCCAGGAG GTGGTTATGC AGGTTACTTT TCAGTAGTCG TATGTTCCAT TTTGCTTTTG
ACCAATGGGT ACACAAATTT TGTACATGGA CATTTTGACA TCGCCAGCTT CTTTTCTTCC
TACTTTATCT TGCCATTGTT CTTTGGCTTG TACGTGTTTT GGAAATTTTT CAAAAGAACT
GAATTCATTA CACCCGACAA AGTTGATTTA CACTCTTTGT TCCTTGACGT TGAGAGGAAC
CCTGAGCCTC CACAAGTGCC ATTACGTGGA TGGAAATGGA TAACAATATT ATGGGATTGA
 
Protein sequence
NEITDEDSIF KEDRERLSKD LHSRHLQMIA ICGVFGTGIF LSSGKVFALT GAGGTFLAYA 
LMAIIVGINQ IAIAEVAALM PTSSATVRHL EHFVDPALGF AYGWISVWQN VMPGEIAAAS
VIITFWTDIN SAAWISIIIV ALIAVNSYSM KLYGEIEFSF AILKLTLLTG LIIVSIVITA
GGGPNHESIG FRYWRDPAPF LSYLTTGSLG RFAAFWSSLN SVVYSFGGVQ SVPILASEVK
YPRRAVFKAC KRIFFRVSIL MTLAVLCLTL IVSPRDKNIT SGSGNAKSSP YVVAIQNAGI
PALPHIVNAV VFTSAFSAAN AGVVQASRVL FALAVKRQAP SFFLKTTKRG IPIYGLALVA
VFMPLSYMSV SKTAATVFNW FQSLTSSNLL LGWILIGVNH ASLHRALRAQ GYSRSNLPHT
VPGGGYAGYF SVVVCSILLL TNGYTNFVHG HFDIASFFSS YFILPLFFGL YVFWKFFKRT
EFITPDKVDL HSLFLDVERN PEPPQVPLRG WKWITILWD