Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_52351 |
Symbol | DUR3.1 |
ID | 4851229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1257430 |
End bp | 1259604 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | |
GC content | 43% |
IMG OID | 640392937 |
Product | urea active transport protein |
Protein accession | XP_001387472 |
Protein GI | 126274215 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0516586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.828998 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTTC TTGCTGTTAC TTTGGACCAA GGTGCCGGAT ATGGTGTCGT GGTTGGCATT GGTGCCCTTT TTGCTTTTGG TATGATTTTC ACCACTTATG TATTAAGAAG ATACAATAAA GAAATCATTA CTGCTGAAGA ATTCGCAACT GCAGGTAGAT CGATCAAAAC TGGGTTGATT GCTGCTGCCG TTGTTTCTTC ATGGACTTGG GCTGCTACCT TGTTGACTTC CACAACTATG GTCTACAACA ATGGTATTTC TGGAGGATTT TTCTATGCTG CTGGTGCTAC TTGTCAAATT ACTCTTTTCG CCTGTTTGGC CATTAAGGGT AAGGAAAGAG CCCCTGGTGC GCATACTTAC TTGGAAATCG TTAAAGCTAG ATACGGCACC ACCTGTCACT TGGTCTATGT TTTCTGGGGT CTTGCTACCA ACATCTTGGT CACGGCCATG TTGTTGACTG GTGGTTCTGC CGTGGTCAAT GACTTGACTG GAATGCACGT TGTTGCAGCT ATTTTATTGT TACCACTTGG TGTTGTCGCT TATACACTTT TCGGTGGTCT TAAGGCTACC TTCTTGACAG ATTACGCCCA CACCGTCATT ATGGTTGTAA TTATTTTGAT CTTTGCCTTC ACCACCTGGG CTACTTCTGA TGTTTTGGGT TCTCCAGGTG CTGTCTGGGA AGCTGTTACT GCATTGGCTG AAACCCAACC AAGAGATGGT AATGCTGGAG GTTCTTACTT GACCTTGCAC TCCAGATCTG GTGGTATTTT TTTCGTCATC AATATCGTTG GTAATTTTGG TACCGTGTTC TTGGATAACG GTTACTTCAA CAAGGCTTTC GCTGCCAACC CTGGAGCTGC CTTACCTGGA TATGTTCTTG GTTCTCTTGC CTGGTTCGCT ATTCCATGTT TCACTTCTTT GACCATGGGA TTGGCTGCTT TGGCTTTGGA AGGTACAGAT GCTTGGCCAA CAGACCACAA GATGACTCCC CAAGAAGTCT CTGCTGGTCT CGTTCTTCCA AATGCCGCCG TTGCCTTGTT GGGTAAGGGT GGTGCTGCCT GCTCGTTACT TATGGTTTTC ATGGCTTGTA CATCTGCTAT GTCTGCTGAA TTAATTGCTA CTTCATCTAT CTTCACTTAT GATATTTACA GAACTTACAT CAACCCTGAA GCTACCGGTA AGAAATTGAT CTGGGTTTCT CACATTTCCG TCATTGGGTA CGCTTACGTC ATGGCTGGTT TTGCTATTGG CTTATACTAC GCTGGTGTAT CTATGGGTTA CTTGTACGAG TTAATGGGTA TCATTATTGG TGGTGCTGTA TTGTCATCTG CCTTGTGCTT GCTTTCAAAG AGACAGAATG TTCAAGCTGC TATTTTCACT CCTCCTATCG CCACAGCTCT TGCTATTATG TCTTGGTTGG TTTGCACCAA GAAGATGTAC GGTTCCATTA ACCTTACAAC CACCTTCATG GACGATCCTA TGTTGACTGG TAACGTTGTT GCCTTGTGCT CTCCATTGAT CTTTGTTCCT TTACTCACCA TCATCTTCAA GCCACAAAAC TTCGACTGGC AAATCTTGAA ATCCATTCGT AGAGTTGATG AAGAGGAAGA AATCTTGGAA GCTGAACATG TAGCAGTTGA TCACGAAAAG GTTCATCCAG TTAAGTCCCA AGTGTCAGTT ATTGCTAGCG AATTGGTGGA TCTCGAAAAG GACAAATACG CTGAAGAAGA ATTGATGTTG CATAACTCTT TCAAGAAGGC TGTCATTATT TGTGTTGGTT TGACTCTTTG TCTTTTGATT CTTTGGCCAA TGCCAATGTA CGGTACTTCC TACATTTTTT CCAAGCGTTT CTTCACCGGT TGGGTGGTAG TTATGTTCAT TTGGATTTTC TTTACTGTGG GTATGGTTAT CATTTATCCT ATCTACGAAG GTAGATTCGC TCTCTACAAT ACTTTCAGAG GTATGTACTG GGATTTGACT GGTCAAACTT GGAAGTTGAG AGCATGGCAA CAAGAACATC CCGAGAAGAT GCATGCTGTT GTTTCGCAAG TGAGTAACCA AATATTAGCT GCTACTCAAT CTCAAATCTA CGAAGGTAAG ACAGTGTTCA ATGGAGCCAT TACTCCTCGT AACATAGACG ACGAAATTAG TGATCTCAAG AAGGACTCCA ATTAG
|
Protein sequence | MSLLAVTLDQ GAGYGVVVGI GALFAFGMIF TTYVLRRYNK EIITAEEFAT AGRSIKTGLI AAAVVSSWTW AATLLTSTTM VYNNGISGGF FYAAGATCQI TLFACLAIKG KERAPGAHTY LEIVKARYGT TCHLVYVFWG LATNILVTAM LLTGGSAVVN DLTGMHVVAA ILLLPLGVVA YTLFGGLKAT FLTDYAHTVI MVVIILIFAF TTWATSDVLG SPGAVWEAVT ALAETQPRDG NAGGSYLTLH SRSGGIFFVI NIVGNFGTVF LDNGYFNKAF AANPGAALPG YVLGSLAWFA IPCFTSLTMG LAALALEGTD AWPTDHKMTP QEVSAGLVLP NAAVALLGKG GAACSLLMVF MACTSAMSAE LIATSSIFTY DIYRTYINPE ATGKKLIWVS HISVIGYAYV MAGFAIGLYY AGVSMGYLYE LMGIIIGGAV LSSALCLLSK RQNVQAAIFT PPIATALAIM SWLVCTKKMY GSINLTTTFM DDPMLTGNVV ALCSPLIFVP LLTIIFKPQN FDWQILKSIR RVDEEEEILE AEHVAVDHEK VHPVKSQVSV IASELVDLEK DKYAEEELML HNSFKKAVII CVGLTLCLLI LWPMPMYGTS YIFSKRFFTG WVVVMFIWIF FTVGMVIIYP IYEGRFALYN TFRGMYWDLT GQTWKLRAWQ QEHPEKMHAV VSQVSNQILA ATQSQIYEGK TVFNGAITPR NIDDEISDLK KDSN
|
| |