Gene Information |
Locus tag | PICST_41725 |
Symbol | |
ID | 4837545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 91371 |
End bp | 93086 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388860 |
Product | predicted protein |
Protein accession | XP_001382779 |
Protein GI | 126132508 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | [TIGR00907] amino acid permease (GABA permease) |
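The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the reported gene length includes the stop codon. Both assumptions seem consistent with the values in this record, but the record does not state them. The function names are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp / End bp fields of this record.
length = gene_length(91371, 93086)
protein = expected_protein_length(length)
print(length, protein)  # 1716 571
```

Both results agree with the Gene Length (1716 bp) and Protein Length (571 aa) fields, which is what one would expect for a gene flagged as not spliced.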
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.817066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00692909 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTAAAA TAGACTCCGA AAAGAGCACT GAGATACATA ACGTGCCCAG CGTTGGCTAC GGAGAAATAC AGAACTATGT CTCCAATCGT ACCGCCCAAG GTATGCCAGC CTTGGATGCT CTTGCCCAGG CTGATGGCAA GAATGCTGAG AAGTTGATGG AAGAAGCTCA GGCCAACTTG GAATTAGTTC AGGAGACCGG TTATGCTCCA GAATTGAGAC GTAACTTTGG TGTGATCTCA TTGTTAGGTG TGGGGTTTGG GTTAACCAAC TCTTGGTTCG GTATCTCGGC CTCTTTGGTC ACAGGTATCA GTTCTGGTGG TCCCATGATG ATCATCTACG GTATTCTCAT TGTTGCCTGT ATTTCCATGT GTGTAGCCAT CAGTTTGAGT GAGTTGATCA GTGCCATGCC TAATGCTGGT GGCCAATACT ACTGGACAAT GAAGTTGGCT CCCAAGAAAT ACGCTCCTTT CTGGGCTTAT ATGTGTGGTG CTTTTGCATG GGCTGGTTCC GTCTTCACAA GTGCTTCCGT TACTCTTTCC ATTGCTTCCT CGGCTGTCGG GATGTACATG TTGTACCATC CAGACAAGAC CATCCAAACA TGGCATGTGT TTGTAACTTA TGAAATCGCC AACATCTTAT TAGTATTCTT CAACCTCTGG GAAAAACCTC TACCAGCCAT CTCAAAGAGT TCGTTGTATA TCTCTCTTTT GTCGTTCTTG ATCATCACTA TTGTGGTGTT GGCCAAATCT GGAGGAGAAT TCCAATCGGC CAACTTCGTG TTTGTGGAAT TTACTAACGG TACTGGTTGG AGTTCCAGTG GTATTGCTTT CATTGTTGGT TTGATCAACC CCAACTGGTC CTTCAGTTGT TTGGATGCTG CCACCCATCT TGCTGAAGAA TTACTTGAAC CAAGAAAGCA AATTCCAATT GCAATTATCG GCACTGTTAT TATTGGATTC ATCACCTCGT TCTCCTACTC CATTGCCATG TTCTTCTGCA TCAAGGATTT GGACGCCATC TACAACTCCA ACACTGGTGT GCCAATCATG GATATCTTCT ACCAGGTATT GAACAATAAG GCTGGTGCTG TCATCTTGGA ATTCCTAATT TTCTTGACTG CCATCGGTTG TAACATTGCC TCTCACACTT GGCAGGCTAG ATTATGTTGG TCTTTTGCTA GAGACAATGG TTTGCCAGGA TCCAGATATT GGTCCAAAGT CAACCCAAGA ACTGGTGTTC CAGTGAATGC CCATCTTATG TCTTGTGTGT GGTGTGCTAT CATTGGTTGT ATCTACATGG GCTCTACTAC TGCCTACAAT GCCATGGTCA TTGGGTGTAT TATCTTTTTA TTGATGTCAT ACGCTGTGCC AGTTGTTTTC TTGTTAATGA AGGGAAGAGA CAACATTAAG CATGGTCCAT TCTGGTTAGG TAAAATTGGA CTTTTCGCCA ACATTGTTCT TCTCGTCTGG ACTGTATTCA CTACTATTTT CTACAGTTTC CCACCTGTCA TGCCAGTCAC CGCAGGTAAC ATGAACTACG TCTCTGTCGT AGTTGGTGTC TTTGGAGCAT ACTGTATTAT CTATTGGTTT GCTAGAGGCA AAAAGAAGTT CATCACTGCA GAAGACAGAG AAGCAAAGAT TGACGAGTTG ACACACCAAT TGTCGCAACA AATATCACAC ATAGAAGTCG TTCTCTCCCA CAAGAACGAC GTGTAA
|
Protein sequence | MSKIDSEKST EIHNVPSVGY GEIQNYVSNR TAQGMPALDA LAQADGKNAE KLMEEAQANL ELVQETGYAP ELRRNFGVIS LLGVGFGLTN SWFGISASLV TGISSGGPMM IIYGILIVAC ISMCVAISLS ELISAMPNAG GQYYWTMKLA PKKYAPFWAY MCGAFAWAGS VFTSASVTLS IASSAVGMYM LYHPDKTIQT WHVFVTYEIA NILLVFFNLW EKPLPAISKS SLYISLLSFL IITIVVLAKS GGEFQSANFV FVEFTNGTGW SSSGIAFIVG LINPNWSFSC LDAATHLAEE LLEPRKQIPI AIIGTVIIGF ITSFSYSIAM FFCIKDLDAI YNSNTGVPIM DIFYQVLNNK AGAVILEFLI FLTAIGCNIA SHTWQARLCW SFARDNGLPG SRYWSKVNPR TGVPVNAHLM SCVWCAIIGC IYMGSTTAYN AMVIGCIIFL LMSYAVPVVF LLMKGRDNIK HGPFWLGKIG LFANIVLLVW TVFTTIFYSF PPVMPVTAGN MNYVSVVVGV FGAYCIIYWF ARGKKKFITA EDREAKIDEL THQLSQQISH IEVVLSHKND V
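The sequence fields are displayed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The following is a minimal sketch, assuming the field value has already been extracted as a string. The helper names and the short toy sequence are hypothetical and used only for illustration. The stop codons listed are TAA, TAG and TGA, which translation table 12 shares with the standard code.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 12


def normalize(seq_field: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def check_cds(seq: str) -> dict:
    """Basic sanity checks for an unspliced coding sequence."""
    return {
        "length": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }


# Toy input in the same block layout as the record (NOT the real gene sequence).
toy = normalize("ATGTCTAAAA TAGACGTGTA A")
report = check_cds(toy)
print(report, round(gc_fraction(toy), 3))
```

Applied to the full Gene sequence field, the same checks should report a length matching the Gene Length field, and `gc_fraction` gives a value that can be compared with the reported GC content.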