Gene PICST_60304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_60304 
SymbolDUR8 
ID4839462 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp721694 
End bp723663 
Gene Length1970 bp 
Protein Length520 aa 
Translation table12 
GC content42% 
IMG OID640390777 
Producturea transport protein 
Protein accessionXP_001385148 
Protein GI126137249 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.087993 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGT TATCTTCTCA GGGTAACAAC GCCATCATAT ACTTGACGTA TGCCTTCTTG 
CTCTTTACGG GTTTGGCTTT GGCCTGGAAA TTCTCCTCAG CTAAGTCTTT CCTCTCTTCC
AATGGAACGC AGCGTGGTCT TCCGTTGGCC TTGAACTTCA TTGCTTCTGG TATGTTATGA
TTTACTCGTT TATCTTGACT TTTTTTGACA ATTCAACCTT TTCCGATGGA ATATCATTGA
ATAACTGCGA CTTGTAGTTC TAGTCGCTCT TCTAGTTTAA CAATTGAAAC TTGCGGGTGT
CCCCCCATTT CTTATTTGAA ATAGGTGGAC TGAGAACATA CCGTTTGAAC TCGATCAAGC
TAACACTTGC GTGAGGAAGT TTTATTGTTC AGTAAATCGA AAGATAAATC CATGGACGTC
CCCAGGTTGT GGAGTCTGTG TCAGTTTGTG CTGAAATTTT CTACTTATTC AGACTATCTG
AACGAAAGAA GATGAACCAT GAACTATGAC ATAGCACTCC ATAATGAGGT CTCGTTCGTT
TGCCCTATGA CGGGATATTA ACGTTCTTTA TTTCAGCTAT GGGTGTTGGT ATTCTTACTA
CCTACTCACA AATCGCAAAC ATCTCGGGGT TGCACGGCCT CTTGGTGTAT ACTCTCTGTG
GTGCTATCCC AATTGTAGGC TTTGCCTTAG TGGGTCCTCT CATCAGAAAG AAGTGTCCTG
ATGGGTTTAT TCTTACCGAA TGGGTTCGTG CCAGATTTGG AATTGTCACT GCCTTATACC
TATCTTTCTT TACTTGTTTG ACCATGTTCT TGTTCATGGT TGGTGAATTG TCTGCCATCA
GAGGTGCGAT TGAAACTTTG ACTGGTTTGG ATGCTCTTGG AGCAGTTATT GTAGAATGTA
TTGTCACCAC TATCTATACC TCTATCGGTG GTTTCAAAAT CAGTTTCATC ACAGACAACT
TCCAAGGTGC TCTTGTTATT ATTCTTATCA TTATATGTGG TGCAGGTATG GGTTCCTATA
TCGATATCGA CACCTCTAAG GTTGGCCCAA GTGGATTATT GAAGGCTAAC AAGTTGGGCT
GGCAATTGGT GTACATATTA TTCGTTGCAA TTGTTACCAA CGATTGTTTC TTGTCTGGTT
TCTGGTTGAG AACTTTTGCC TCTCGTACCG ATAAGGACTT GTGGATTGGT ACTTCTATAG
CAGCATTTGC TACCTTTGCC ATCTGTACCT TAGTAGGAAC CACTGGGTTC TTGGCTGTTT
GGTCCGGAGA CTTGGAAGTT TCTGATGACA ACGGTTACAA TGCCTTCTTC ATCTTGTTGG
CCAAGATGCC CCGTTGGATT GTTGCCTTTG TGTTGATCTT CTGTATTTGT TTGTCTACTT
GTACTTTTGA CTCGTTGCAA TCGGGTATGG TTTCTACAAT CTCTAACGAT GTTTTCAGAA
ACAAGTTACA CATTAACTAC ACCAGAGGAT TAGTCGTTCT CGTAATGGTT CCAATTGTCG
TGTTGGCTGT AAAAGTTGCA GACAATATCT TGCAGATCTA CTTGATTGCT GACTTGGTTT
CCGCAGCTAT CATTCCAGCT GTGTTCCTAG GTTTAAGTGA TACTTGTTTC TGGTACATTC
GCGGATTTGA CGTTATGTGT GGTGGTTTGG GTGCATTGAT TGGTGTCTTT ATTTTTGGAA
CCGTCTACTA CGGTACTGCT AGAGAAGGTG CTAAACTTTT GTTGATCTGG AATGGTATCT
ACGATCCAGA AGACTGGGGT GCCTTTGGAG CTTTTGTCAT TGCTCCATTT GGAGGTTTGG
TTATTACTTT TGCAGCCGCT GCTTTGAGAA TTGGAGCCGC TTATGTCTAT GCTAAGGTTA
GCGGGAACAC ATTCTCTGCT TTGGATAAGC CGGAGATTGT TGAAGTCGTT TCTACGGAAG
TCGAATCTCA ATACGGGTCT ACTGATGACT CTAAGAAGAA TAGCGTTTAA
 
Protein sequence
MAQLSSQGNN AIIYLTYAFL LFTGLALAWK FSSAKSFLSS NGTQRGLPLA LNFIASAMGV 
GILTTYSQIA NISGLHGLLV YTLCGAIPIV GFALVGPLIR KKCPDGFILT EWVRARFGIV
TALYLSFFTC LTMFLFMVGE LSAIRGAIET LTGLDALGAV IVECIVTTIY TSIGGFKISF
ITDNFQGALV IILIIICGAG MGSYIDIDTS KVGPSGLLKA NKLGWQLVYI LFVAIVTNDC
FLSGFWLRTF ASRTDKDLWI GTSIAAFATF AICTLVGTTG FLAVWSGDLE VSDDNGYNAF
FILLAKMPRW IVAFVLIFCI CLSTCTFDSL QSGMVSTISN DVFRNKLHIN YTRGLVVLVM
VPIVVLAVKV ADNILQIYLI ADLVSAAIIP AVFLGLSDTC FWYIRGFDVM CGGLGALIGV
FIFGTVYYGT AREGAKLLLI WNGIYDPEDW GAFGAFVIAP FGGLVITFAA AALRIGAAYV
YAKVSGNTFS ALDKPEIVEV VSTEVESQYG STDDSKKNSV