Gene PICST_39042 details


Gene Information

Locus tag: PICST_39042
Symbol: UGA4
ID: 4851142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1033567
End bp: 1035279
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table:
GC content: 44%
IMG OID: 640392850
Product: GABA-specific high-affinity permease
Protein accession: XP_001387842
Protein GI: 126274128
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
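
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length(1033567, 1035279)
print(length)                  # 1713
print(protein_length(length))  # 570
```

Under these assumptions the computed values agree with the Gene Length (1713 bp) and Protein Length (570 aa) fields.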


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.108543
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCTCC ATCCAGTTCT CTCCAGAGAG CAGATTCTTG AGCAGGTTGT CTCTAACAAG 
GCCTACTACA ACAATACCCA GAATGTTGAA GATCCAAAGA TTGAAGCCAT CTTCTCCAAC
TCAGATGAAC AGTTGTTGGC ACAGATTGGG TATAAACCGG AATTGAAAAG ACATTTTCTG
ACACTCCAGG CGTTTGGGGT TGCCTTTTCC ATCATGGGAT TGTTACCATC TATTGCTTCT
ATCTTGGCCC AGGGATTGGT AGCCGGTCCT GCTGGATGTT TGTGGGGATG GGTAATTTCA
TCCATTCTCA TCTTGACTAT TGCAATTTCG ATGTCTGAAA ACGGTCTGAG TATTCCCACT
TCTGGTGGTT TATACTACTG GACAAATTAC TACGCCCCTC CTCGTTTTAA GACTATTTTC
TCATACCTTA TCGGCAACAC CAACTCCATT GCCTTGATCG GTGCCTTGTG CTCTGTAGAT
TACGGATTCG CAGTGGAAGT ATTGTCTGTT GTAGTCATTG CCAAAGATGG AGACTTTGAA
ATAACGCCAG CAAAGACTTA CGGTATCTTC GTAGCCTGTG TCTTGCTACA CATCGCCATC
ACCTGCTTTT CTTCCAAAAA CTGTGCTTAT CTCCAGACTA CGTCTATCGT GGTAAATCTT
GCTATCATTG TATTATACAT TATTGCGTTA CCAATTGGAG CAAAAGGGAA CTTCAAGCCT
GCCAAGTTTG TCTTTGGGGA GTTTGATAAT ATTTCCAATT GGCCCATCGG CTGGACCCAA
CTTAGTGCAG CTTGGCTACC AGCAATCTGG ACCATTGGAG CCTTTGACTC CGTCATCCAC
ATGAGTGAGG AAGTTAAAGA CGCAGAACAT ACCATCCCAA TCGGAATATT GGGCTCCGTT
ATCGCCTGTG GTTCTATCGG TACTGTGATC TTGATTATTA CTTTCTTCTG TATCCAAACC
AACGATATTG AAACTGATAT CTTGGGCTCC AAATTCGGTC AACCATTGGC TCAGATCATC
TTTGATGTCT TAGGTAAGAA GTGGGCTTTG ACATTTATGG TTCTTATTGC TTTTGCCCAA
TTCTTGATGG GTGCCTCGAT CTTAACCGCC ATTTCTCGTC AAATCTGGGC TTTTGCCAGA
GACAATGGCT TGCCATTCTC TCGTATAATC AAGAAGGTCA ACAAGAAGTT GTCCGTGCCT
ATCAACGCTG TATGGTTCGG TGGTATTATG TCCATTATCA TCGGGTTATT AGTGTTGATT
GGAACTGTCG CTGCCAATGC ACTCTTCACA TTGTACATTG CTGGTAACTA CGTAGCGTGG
GGAACTCCAA CTTTCTTGAG ATTAACCACT GGCAGAAAGA AATTCAAGCC AGGAAAGTTC
TGGTTGGGCC CAGTGTTCTC ACCCTTGATC GGGTGGACCT CCACCATCTT CATTGTCTTT
ACTTTCTTCA TGGTGATGTT CCCAGCTAAC ACCAACCCAG ACAAAGATAG CATGAATTAT
ACCTGTGTGA TCACACCCAG TGTGTGGATC TTTTCTTTGA TATACTATTA CGTCTATGCC
CACAAGATTT ACCATGGTCC TTGTAAGACT ATTGATGATG TGGATGAAAC TTCTCTGGAA
GCTGGAATAG ATGCTGTCAT AGATGGAGTT GATCCTCAGG GAGACAGCGA TAAGGTATCT
CAGACTAATG TCAATGTTTT GGAGAAGGTC TAA
 
Protein sequence
MVLHPVLSRE QILEQVVSNK AYYNNTQNVE DPKIEAIFSN SDEQLLAQIG YKPELKRHFL 
TLQAFGVAFS IMGLLPSIAS ILAQGLVAGP AGCLWGWVIS SILILTIAIS MSENGLSIPT
SGGLYYWTNY YAPPRFKTIF SYLIGNTNSI ALIGALCSVD YGFAVEVLSV VVIAKDGDFE
ITPAKTYGIF VACVLLHIAI TCFSSKNCAY LQTTSIVVNL AIIVLYIIAL PIGAKGNFKP
AKFVFGEFDN ISNWPIGWTQ LSAAWLPAIW TIGAFDSVIH MSEEVKDAEH TIPIGILGSV
IACGSIGTVI LIITFFCIQT NDIETDILGS KFGQPLAQII FDVLGKKWAL TFMVLIAFAQ
FLMGASILTA ISRQIWAFAR DNGLPFSRII KKVNKKLSVP INAVWFGGIM SIIIGLLVLI
GTVAANALFT LYIAGNYVAW GTPTFLRLTT GRKKFKPGKF WLGPVFSPLI GWTSTIFIVF
TFFMVMFPAN TNPDKDSMNY TCVITPSVWI FSLIYYYVYA HKIYHGPCKT IDDVDETSLE
AGIDAVIDGV DPQGDSDKVS QTNVNVLEKV
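
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that in Python and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGGTTCTCC ATCCAGTTCT CTCCAGAGAG CAGATTCTTG AGCAGGTTGT CTCTAACAAG"
last_line = "CAGACTAATG TCAATGTTTT GGAGAAGGTC TAA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first))               # 60
print(first.startswith("ATG"))  # True: the CDS begins with a start codon
print(last.endswith("TAA"))     # True: the CDS ends with a stop codon
```

Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.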