Gene PICST_52351 details


Gene Information

Locus tag: PICST_52351
Symbol: DUR3.1
ID: 4851229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1257430
End bp: 1259604
Gene Length: 2175 bp
Protein Length: 724 aa
Translation table:
GC content: 43%
IMG OID: 640392937
Product: urea active transport protein
Protein accession: XP_001387472
Protein GI: 126274215
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
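
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1257430, 1259604)
print(length)                  # 2175
print(protein_length(length))  # 724
```

Under these assumptions the computed values agree with the reported gene length (2175 bp) and protein length (724 aa).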


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0516586
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.828998
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCTTC TTGCTGTTAC TTTGGACCAA GGTGCCGGAT ATGGTGTCGT GGTTGGCATT 
GGTGCCCTTT TTGCTTTTGG TATGATTTTC ACCACTTATG TATTAAGAAG ATACAATAAA
GAAATCATTA CTGCTGAAGA ATTCGCAACT GCAGGTAGAT CGATCAAAAC TGGGTTGATT
GCTGCTGCCG TTGTTTCTTC ATGGACTTGG GCTGCTACCT TGTTGACTTC CACAACTATG
GTCTACAACA ATGGTATTTC TGGAGGATTT TTCTATGCTG CTGGTGCTAC TTGTCAAATT
ACTCTTTTCG CCTGTTTGGC CATTAAGGGT AAGGAAAGAG CCCCTGGTGC GCATACTTAC
TTGGAAATCG TTAAAGCTAG ATACGGCACC ACCTGTCACT TGGTCTATGT TTTCTGGGGT
CTTGCTACCA ACATCTTGGT CACGGCCATG TTGTTGACTG GTGGTTCTGC CGTGGTCAAT
GACTTGACTG GAATGCACGT TGTTGCAGCT ATTTTATTGT TACCACTTGG TGTTGTCGCT
TATACACTTT TCGGTGGTCT TAAGGCTACC TTCTTGACAG ATTACGCCCA CACCGTCATT
ATGGTTGTAA TTATTTTGAT CTTTGCCTTC ACCACCTGGG CTACTTCTGA TGTTTTGGGT
TCTCCAGGTG CTGTCTGGGA AGCTGTTACT GCATTGGCTG AAACCCAACC AAGAGATGGT
AATGCTGGAG GTTCTTACTT GACCTTGCAC TCCAGATCTG GTGGTATTTT TTTCGTCATC
AATATCGTTG GTAATTTTGG TACCGTGTTC TTGGATAACG GTTACTTCAA CAAGGCTTTC
GCTGCCAACC CTGGAGCTGC CTTACCTGGA TATGTTCTTG GTTCTCTTGC CTGGTTCGCT
ATTCCATGTT TCACTTCTTT GACCATGGGA TTGGCTGCTT TGGCTTTGGA AGGTACAGAT
GCTTGGCCAA CAGACCACAA GATGACTCCC CAAGAAGTCT CTGCTGGTCT CGTTCTTCCA
AATGCCGCCG TTGCCTTGTT GGGTAAGGGT GGTGCTGCCT GCTCGTTACT TATGGTTTTC
ATGGCTTGTA CATCTGCTAT GTCTGCTGAA TTAATTGCTA CTTCATCTAT CTTCACTTAT
GATATTTACA GAACTTACAT CAACCCTGAA GCTACCGGTA AGAAATTGAT CTGGGTTTCT
CACATTTCCG TCATTGGGTA CGCTTACGTC ATGGCTGGTT TTGCTATTGG CTTATACTAC
GCTGGTGTAT CTATGGGTTA CTTGTACGAG TTAATGGGTA TCATTATTGG TGGTGCTGTA
TTGTCATCTG CCTTGTGCTT GCTTTCAAAG AGACAGAATG TTCAAGCTGC TATTTTCACT
CCTCCTATCG CCACAGCTCT TGCTATTATG TCTTGGTTGG TTTGCACCAA GAAGATGTAC
GGTTCCATTA ACCTTACAAC CACCTTCATG GACGATCCTA TGTTGACTGG TAACGTTGTT
GCCTTGTGCT CTCCATTGAT CTTTGTTCCT TTACTCACCA TCATCTTCAA GCCACAAAAC
TTCGACTGGC AAATCTTGAA ATCCATTCGT AGAGTTGATG AAGAGGAAGA AATCTTGGAA
GCTGAACATG TAGCAGTTGA TCACGAAAAG GTTCATCCAG TTAAGTCCCA AGTGTCAGTT
ATTGCTAGCG AATTGGTGGA TCTCGAAAAG GACAAATACG CTGAAGAAGA ATTGATGTTG
CATAACTCTT TCAAGAAGGC TGTCATTATT TGTGTTGGTT TGACTCTTTG TCTTTTGATT
CTTTGGCCAA TGCCAATGTA CGGTACTTCC TACATTTTTT CCAAGCGTTT CTTCACCGGT
TGGGTGGTAG TTATGTTCAT TTGGATTTTC TTTACTGTGG GTATGGTTAT CATTTATCCT
ATCTACGAAG GTAGATTCGC TCTCTACAAT ACTTTCAGAG GTATGTACTG GGATTTGACT
GGTCAAACTT GGAAGTTGAG AGCATGGCAA CAAGAACATC CCGAGAAGAT GCATGCTGTT
GTTTCGCAAG TGAGTAACCA AATATTAGCT GCTACTCAAT CTCAAATCTA CGAAGGTAAG
ACAGTGTTCA ATGGAGCCAT TACTCCTCGT AACATAGACG ACGAAATTAG TGATCTCAAG
AAGGACTCCA ATTAG
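
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before any downstream use. The sketch below is one possible way to do that and to compute a GC fraction for comparison with the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the short `block` string is just the first three groups of the sequence above.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three ten-base groups of the gene sequence, as displayed.
block = "ATGTCCCTTC TTGCTGTTAC\nTTTGGACCAA"
seq = clean_sequence(block)
print(len(seq))  # 30
print(round(gc_fraction(seq), 2))
```

Applying the same two functions to the full gene sequence should give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.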
 
Protein sequence
MSLLAVTLDQ GAGYGVVVGI GALFAFGMIF TTYVLRRYNK EIITAEEFAT AGRSIKTGLI 
AAAVVSSWTW AATLLTSTTM VYNNGISGGF FYAAGATCQI TLFACLAIKG KERAPGAHTY
LEIVKARYGT TCHLVYVFWG LATNILVTAM LLTGGSAVVN DLTGMHVVAA ILLLPLGVVA
YTLFGGLKAT FLTDYAHTVI MVVIILIFAF TTWATSDVLG SPGAVWEAVT ALAETQPRDG
NAGGSYLTLH SRSGGIFFVI NIVGNFGTVF LDNGYFNKAF AANPGAALPG YVLGSLAWFA
IPCFTSLTMG LAALALEGTD AWPTDHKMTP QEVSAGLVLP NAAVALLGKG GAACSLLMVF
MACTSAMSAE LIATSSIFTY DIYRTYINPE ATGKKLIWVS HISVIGYAYV MAGFAIGLYY
AGVSMGYLYE LMGIIIGGAV LSSALCLLSK RQNVQAAIFT PPIATALAIM SWLVCTKKMY
GSINLTTTFM DDPMLTGNVV ALCSPLIFVP LLTIIFKPQN FDWQILKSIR RVDEEEEILE
AEHVAVDHEK VHPVKSQVSV IASELVDLEK DKYAEEELML HNSFKKAVII CVGLTLCLLI
LWPMPMYGTS YIFSKRFFTG WVVVMFIWIF FTVGMVIIYP IYEGRFALYN TFRGMYWDLT
GQTWKLRAWQ QEHPEKMHAV VSQVSNQILA ATQSQIYEGK TVFNGAITPR NIDDEISDLK
KDSN