Gene PICST_32520 details

Gene Information

Locus tag: PICST_32520
Symbol: DUR3.2
ID: 4840130
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1198
End bp: 3875
Gene Length: 2678 bp
Protein Length: 658 aa
Translation table: 12
GC content: 38%
IMG OID: 640391445
Product: Urea active transport protein
Protein accession: XP_001385686
Protein GI: 150866181
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
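
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates (the listed values are consistent with that convention) and a single stop codon after the last amino acid. The variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record above.
start_bp = 1198
end_bp = 3875
gene_length_bp = 2678
protein_length_aa = 658

# With 1-based inclusive coordinates, the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Bases needed to encode the protein plus one stop codon (assumption).
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span not accounted for by the coding sequence.
# A non-zero value is consistent with the gene being marked as spliced.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # prints: 2678 1977 701
```

The span matches the listed gene length, and the 701 bp left over after the coding bases is what one would expect for a spliced gene, although the record itself does not list intron coordinates.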


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAT CTTTGCTCAA CGAAGGCTAT GGCTACGGCT TCGTTCTTGG CCTTGGTGCC 
GCATTCGCAC TTATAATGTC AATCATCACT AAGGTGTTAT CCAAATATGG GGGCCAGGTT
CAAAATTCCG AAAGATTCTC AACTGCATCC AGAAATGTCA ACTCAGGTTT AATCGCCTCA
AGCACAGTTT CTGCTTGGAC TTGGCCAGCT ACTCTACTCT CAGCTGGAAC ATGGAGTTAT
TCTCATGGTA TCATGGGTGG ATTCATGTAT GGTATTGGGG GAACTATTCA AATTACTTTG
TTTGTCTTCT TAGCAATTCA AATTAAACTC AGAGCCCCGT CAGCACACAC TGTATCAGAA
TGTTTTTCGA TAAGATATGG TAAGGCAGGT CACTGTGTAT TTCTTTGCTA CTGCATTGCT
ACCAACATTT TGGTGTCCTC TCTTCTTCTA CTTGGAGGTT CGCAAGCTTT TTCCGCTACC
ACAGGCATGC ATGCAGTTGC TGCAAGTTTC CTTTTGCCTC TAGGAGTAAT GGTGTATACT
GCTCTTGGTG GATTGAAGGC TACATTTATG TCCGATTGGA TCCACACTGT CGTTATATAT
GTAATTTTAA TGGTTTCTTG TTTTACTATT TATTGCAGCT CAAGTCTAAT TGGCTCTCCA
TCAAGAATGT ATGATCTCTT AAGAGAGGTT CAGGAACAGT TTCCCTCAGA AACTGGTACA
AGTTATCTAT CATTCAGAGA CAAGGAAATG ATGCTTTTGA CTTGGACAGT TATGCTTGGA
GGACTTAGCT CAGTTTTTGG TGACCCTGGG TATTCTCAGC GAGCAATCGC GTCTGACGCC
AAAAGTGTGT TCCAAGGATA TATAATGGGT GGTATCTGTT GGTGGATCAT TCCTGCAGCT
CTTGGATCAT CGGCAGGTCT TGCATGCAGA GCATTATTAA CCAACCCTGC TTCGGTTACA
TATCCGAGGG CACTTTCCAC TTCCGAAGTA GACTCGGCTT TGCCAGTTAT CTACGGTTTG
ACTGCAATTT TTGGTCGGAG TGGTGCAGCC GCAGGTTTAG TTATGGTATT TATGTCAGTT
ACTTCAGCAA CTTCAGCAGA ATTAATTGCA TTTTCATCTG TCACTACCTA TGACGTTTAC
AGAACTTACA TCAATCCTTC TGCAACAGGA GTGCAGTTAG TTAGGGTTGG TCACATCGCT
GTAATTGGAT TCAGTTTATT CATGGCTGTT TTGTCTACTA TTTTCAACTA TGTCGGAGTT
ACTTCGGGTT GGCTATTGAG TTTCGTTGGT ATTGTTCTTT CTCCAGAAGT CAGTGCTGTA
TTGTTGACCT TATTCTGGAA TAAAATGACA AAGACGGCAT TGATAATAGG GGCTCCTTTG
GGCACTATTT CAGGTATCGC ATGTTGGATT GGAGCTACAT ATTCATTTGC TAATGGAGTC
GTTGATAAAA ACAGTGTAAT GATGAGCGAG GCAACCTTCG TGGGAAACAT TGTGGCTTTA
GCCTCAACTC CAATTTATAT TTTTCTCATT AGTTTCCTTT ATCCGGAAAA ATCGTTTGAT
CTAGCATCAT ATAATCGGAT TGAGCTAGGT GATGATCTTG ATGAAGAGGA AAAAATGGCA
CTTATTGTGG ATACGAGCGA CAAGAGAACA CTTAGGATAC AATCGTGGTG GTGTTTGGGT
ATCAACGTCT TCATATTATT TGGTTGTTAC ATAATAATTC CTTGTGCATT GTATGGAAGT
GGCCATGATC TAAGTAAAAA ATCATTCACA GCACTAATTG TTGTACTACT CATATGGTTG
GTGTTAGCAG CATCATATAT TATTATTTTC CCACTATGGC AAGGTAGAGA AGCAATCAGG
TTGATTATAA AGAGCATGCT TGAGCAGGAT GAATATCAAA CTGAAAGTTT GGATGGCTTA
GATGGTACGG GTGTTGTAGA GATTGTTTCC ATCAAATATG AAGCAAAGAA ATGATGATAT
CCCTTTGTAA ATGTTTTGAA AAATATATAA TTGAACCTTT TCTTTAACTA AAATTCAAAG
GTGATGAATT CTTACTTGAT TGAATGAAGG TAATCGAATG GTATTACTTT TAGGACAGGT
GCTATTTTTT GCAGCAATTA GTATTTTTAT AGTATATTTC GCATTCTTTC TAGAGAGACA
AGCAGTACTT ACTGTACAGT AATTGAGAAG AGTTATAAAA GCAAAGTAAG CGTGTAAAAT
ATTAATAACA AGGATTGAAT ATGTTAACAG TTAAAAATAC CATGGTTTAG GATTGTTTGG
ATGTGTTATC GTTGTACTAT ATTGTAGTGA TTTGAGGAAT TGGAGGGAGA GTTATGTATA
TAGAATCACA TGACTAAATC TCATGGGAAA TGGAATTATA TACCGTCCAT ATAGAGGTGA
ATGTAATATA GTGCGTGTAT TACTAAAATG AGAACTGAGA GGTATATAAG AGAGGCGAGA
TCCTCAAAAT TTGAAAATTA ATATCAAGTT TATAATTAAA CAATTTATTA TTCATTTGAA
GGAGGATATT CTTGATTTTT GGGGAATAAC ATAATGTTCA ATAGTAGGAT TCTATTATCA
ACTTAAGACC AGGTGGGCAG CTTGTCTAAG CAGTTGTCTA AACAGTTGTA CCAAAGCAGC
AAAGAATTGC AGTAGGTGAA AGCCTGACCA GAACGTAG
 
Protein sequence
MAESLLNEGY GYGFVLGLGA AFALIMSIIT KVLSKYGGQV QNSERFSTAS RNVNSGLIAS 
STVSAWTWPA TLLSAGTWSY SHGIMGGFMY GIGGTIQITL FVFLAIQIKL RAPSAHTVSE
CFSIRYGKAG HCVFLCYCIA TNILVSSLLL LGGSQAFSAT TGMHAVAASF LLPLGVMVYT
ALGGLKATFM SDWIHTVVIY VILMVSCFTI YCSSSLIGSP SRMYDLLREV QEQFPSETGT
SYLSFRDKEM MLLTWTVMLG GLSSVFGDPG YSQRAIASDA KSVFQGYIMG GICWWIIPAA
LGSSAGLACR ALLTNPASVT YPRALSTSEV DSALPVIYGL TAIFGRSGAA AGLVMVFMSV
TSATSAELIA FSSVTTYDVY RTYINPSATG VQLVRVGHIA VIGFSLFMAV LSTIFNYVGV
TSGWLLSFVG IVLSPEVSAV LLTLFWNKMT KTALIIGAPL GTISGIACWI GATYSFANGV
VDKNSVMMSE ATFVGNIVAL ASTPIYIFLI SFLYPEKSFD LASYNRIELG DDLDEEEKMA
LIVDTSDKRT LRIQSWWCLG INVFILFGCY IIIPCALYGS GHDLSKKSFT ALIVVLLIWL
VLAASYIIIF PLWQGREAIR LIIKSMLEQD EYQTESLDGL DVVPKQQRIA VGESSTRT
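
The sequences above are displayed in blocks of ten residues, so the spacing and line breaks have to be removed before any downstream use. The following is a minimal Python sketch, assuming the sequence text has been copied into a string. `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample is only the first displayed row of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned nucleotide string."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed row of the gene sequence, kept in its ten-base grouping.
sample = "ATGGCTGAAT CTTTGCTCAA CGAAGGCTAT GGCTACGGCT TCGTTCTTGG CCTTGGTGCC"

seq = clean_sequence(sample)
print(len(seq))                    # number of bases after removing spacing
print(round(gc_percent(seq), 1))   # GC percentage of this row only
```

Applied to the full gene sequence, the same helpers should give a length matching the listed gene length and a GC percentage comparable to the listed GC content, though the record does not state how its value was computed or rounded.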