Gene PICST_41966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_41966 
SymbolGAP1.7 
ID4836951 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp973558 
End bp975486 
Gene Length1929 bp 
Protein Length621 aa 
Translation table12 
GC content46% 
IMG OID640388266 
Productgeneral amino acid permease 
Protein accessionXP_001382961 
Protein GI150864226 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0833] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.668272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTATCTT CACTAGCATC GGGCCAGTCC TCCGATACCG GCTCGGAGTA TTCAGAATAC 
TCCGTCTACT CCAGAAGTTA CCATCCCAGG GACTTGTTCT ACGACTTCAT CGACTCGTTC
AAGCCCTTCG ACTACTCCGT CTTACCACCT ATATACAATA CTGAGTCAGG AACTCATCAG
GGGACCCACG ATATCGAAAA TCGCGATGTT CTGAACCTTG TAGCCAGAGA CACCCTTGTG
TCAGACAAAC TGGTCCTTCA CCCCAACCAC CCACAATTTG ATTATTCACA TCTCACAGAG
CTTGAGAGAG CGGCTTTGGT CACGGCAACA TCGCCCCTTT CAAAGCACTT GAAAACTCGT
CATTTGACCC TTATCTCCCT TGGTGGTGCT ATCGGTACCG GGCTCTTCAT AGGTCTGGCT
CTGTCCTTGA GCCATGCTGG CCCTCTAGGA ATGCTCTTGG TGTGGATCTT TATCGGCTCG
ATCACCTTCA CGACGATGTC GTCACTTTCA GAATTAGCAA CTGCTTTTCC TATTGCTGGT
TCCTTTGTCA CATTCACCAC ATTATTCATA GATGCCTCCT GTGGCTTCGC TATAGCTTGG
AACTATGCCT TACAGTGGTT GGTAACCATG CCTCTAGAAC TTGTTGCAGT CTCGATGACA
TTCTCCTATT GGAATACAGA TGTTCATCCG GCTGTGTATG TTGCCATCTT CTACGTAGTC
ATTGTAGTGA TCAACTTGTT CGGTGTAAAA GGATACGGTG AGGCAGAGTC CTTGTTCAGT
ATTATCAAAA TTATAGCCGT CATCGGCTTC AACATCTTGT CCATCATTAT AGTCACCGGG
GGTGTCCCTG GACAACCCTA CATCGGAGGC AAGTACTGGC ATAAACCAGA AGGAGGGTTA
TTTAACACTG TGGAGCCATT TAAACAATGC TGCTACATCA TTGCTAATGC CTCGTTTGCC
TACGCTGGGG TAGAGATCTT TGCCCTTGCG GCAGTAGAAA GTAAGCAGCC TAAGAAATCG
ATCAACAGCG CCAGAAAGCA GATTTTCTAC CGTATTCTTG TTTTTTACAT CTGTTCGCTT
GTAATGATTG GTTTGCTTGT GCCCTACACA GACGAGAGAT TGTTGGGAAC TAACACTAAG
GCCGGCAACA TTGGAGTTGA TATCAACACC TCTCCATTTG TAATTGCAAT CAAAAACGCA
AACATCAGAG CATTGCCTAC TATCATGAAC ATCGTGATTA TAATCACTGT TGTATCAGTT
GGAAATGCCA GTGTGTATGG CTCTTCTCGT GCCTTGTGTG CACTTGGAGC CTTGAGACAG
GGTCCTTCTA TTTTGAACTT TATAGATCGT AGAGGTAGAC CAATGGCTGG ACTTCTAGTC
CAGTTTGCCT TTGGATTGTT AGCCTTCTTG GTAGCAATTC CTGGCCCCAG TGTAACGACT
CAGATCTTCA ACTGGCTCTT AAGTTTGTCT GGTCTCAGTA TCTTGTTCAC GTATTTGTCG
ATTAACATCT GCCACTTGAG ATTCCGCAGG GCTTTGAATG TCAGAGCCAG ACTTCCTCAG
GATGAACTTG TCTACACTTC ACCAGTGTGG GTATCGTGGT ATGCCATCAT CTGCATCATA
ACTGTGTTGG GATTGCAGTT CTGGGCAGCT TTGTTCCCAC CAGGCAACCA CGCAGCAGAT
TGGGAGAGCT TCTTGACCAT ATACTTGGGG TTGCCTGTGT TGATTTTGTT CTACATATGT
CACAAGATCT ATGCCAAAAT CTTCTTGAAG GTGCCATTGA CTAAGTTTTG GCTCACTGCT
GAAGAAATAG ATATCGACAC TGGAAGAAGA CAGATTGACA TGGAAGCATT GAAGCAAGAG
ATTGCTGAAG AGAGATTGAG CTTCCAATCC AAACCATTGT ATTATAAGGT GTTCCGGTTT
TTCTGCTAG
 
Protein sequence
MVSSLASGQS SDTGSEYSEY SVYSRSYHPR DLFYDFIDSF KPFDYSVLPP IYNTEDTLVS 
DKSVLHPNHP QFDYSHLTEL ERAALVTATS PLSKHLKTRH LTLISLGGAI GTGLFIGSAS
SLSHAGPLGM LLVWIFIGSI TFTTMSSLSE LATAFPIAGS FVTFTTLFID ASCGFAIAWN
YALQWLVTMP LELVAVSMTF SYWNTDVHPA VYVAIFYVVI VVINLFGVKG YGEAESLFSI
IKIIAVIGFN ILSIIIVTGG VPGQPYIGGK YWHKPEGGLF NTVEPFKQCC YIIANASFAY
AGVEIFALAA VESKQPKKSI NSARKQIFYR ILVFYICSLV MIGLLVPYTD ERLLGTNTKA
GNIGVDINTS PFVIAIKNAN IRALPTIMNI VIIITVVSVG NASVYGSSRA LCALGALRQG
PSILNFIDRR GRPMAGLLVQ FAFGLLAFLV AIPGPSVTTQ IFNWLLSLSG LSILFTYLSI
NICHLRFRRA LNVRARLPQD ELVYTSPVWV SWYAIICIIT VLGLQFWAAL FPPGNHAADW
ESFLTIYLGL PVLILFYICH KIYAKIFLKV PLTKFWLTAE EIDIDTGRRQ IDMEALKQEI
AEERLSFQSK PLYYKVFRFF C