Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_72031 |
Symbol | GCP1 |
ID | 4838629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 125121 |
End bp | 127797 |
Gene Length | 2677 bp |
Protein Length | 847 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389944 |
Product | glutamated carboxypeptidase |
Protein accession | XP_001383985 |
Protein GI | 150864957 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.585457 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTTCGTTTTC TTTGTACTTT TTGCTGTTTC CCATCGTTTC TCAGCCTCTC TTCATTCTGC AACACAACCT GCAGCCATGG CATCCCATAT GCGTTACACT CCGCTCACCG CAGACCCCAG TCATACGTTG CCCCTGTTCC CACCGTCCTA CGACGATTTG GATGACAACA ACGACGACTA CACGTATTCG TCGTCGGCCA CTCCGGCTAT CGAGCAATTT GAGTTGGACG ACGGCATCGA TGGCATCGAC GACGGCATCA CACGACGAAC ACAACGTGAA GGGCTCCTTG TACGAGCTTC GCTCATGACC AAGAAGTTCG CTAACAACTT CAACAACTCC ATCATCCACC CGGTGCTGCA AATAATAGAC CCCATCTACG AAGGCTACAA ATACTTCCAG CAGAAATATG AGCAGAATAT CCTCAAGTTG GGCAACCCGT TGGTGGTGAA ACGGTTGCTC TACGTGTTGT TCATCATGAT AGTAATCTTC TTTGTCACCA AGCATAACGT CAACGACGGC GTTAATGGCA CTTCCGGAGG TACTTTTTCG GCAGGAAAGT TCTACAATAT AGATAAGCTC AGTTCTTCGG TCGATGACTT CATCTCAGCC AAGTTAATGA AAGAAAACTT GGAGTACTTT TCGTCTATGT CTCACATCAC CGGCTCCAAC GGCGACTTGA CTCTTGCCCG TTATATCGAA ACCTACATGC ACAATAACGG CATCCGCATC ATGGACATGA ACCAGCTCCA GAGCTATACT AACTACCCCG TTTACAATGA GAAAGACACC TATCTCAAGC TTCTGGACAA CTCTTTCTCT GCTCATTTGT ATGAGATGAA TAACAAGACG ATGGAACACT TGGCATATAA CCCCAATGCG CTCAACACCA ATGGCCCCGT GGAGTCCCAT TTTATTTACG GCAACTACGG AACACAAGAG GATTACCAGA AATTGATTAG TAGCGGCATA GATCTTACCG ATGCCATTCT ATTGATCAAG TACGGTGGAA GTATTCCCGA GCCAAACAAA GTCAGCTTTG GCCAGCAGTC TAAAGTCAAG GCGATTGTCT TTATAACACC GAAATTCGAG TTCGGAACTG GTGATAGCAA GCAAGAATTC GTAGACGTAA TCCAAAAAGC AAATGTGGGC TTAACCCGAG TAGATCCTGG TGACGTACTT ACCCCTGGCT GGTCTTCTGA AGATGGGTAT GTCACAAGAT TGCCATGGTT TAGGTCTTCG ACGACTCCCA AGATCCCTAC AATTCCAATC TCGTGGGAAG ATGGTGAGAA ATTGCTTTCT AAATTGGAAG GTTCTGGTGT CAAGTTCGAT GATGGCTACT TTTCTGGAAA GGGTAAATCA TCTTCAGTCC CCACCATGGT GTTGAAAATA GCCAACGAAG AAAGAGCTAC CCATCAGATA TGGAATGTAG TTGGTTCTAT TCAAGGAAGA GAACAGAATG AAAAGGGGAT AATCATTGGT TCTAGTCGTG ACTCTACTTG TTATGGTACG ATTTCCTCGA ATACTGGTAC GGTGGTGATG CTTGAAATGA TCAAAGTATT CACCTCTTTA CAACGAAAAT ATAACTGGAG TCCTTCCAGG TCTATATATT TTGTCTCATT TGATGCTACA GAATATAATT TGGCTGGTTC AGCTGAATGG ATCGAAAACA GAAAGGACCT GTTGCGAAAG GAAGGCTATG CATACATTGA TCTCAGTGAT GCTATCAGCG GCGATGATTT GTCTATCAAG GCAAGTCCTT TTGTAGAAGA CGTTATAAAA AAGGCCCTCA AGCAGGTGGA GACCGATGTC ACCAAGAATG ATAATGGCAA CGAAAAGCTC AGTTTGTACG ACCTATATCT CAGACAACAA GGAAACGATG GTATTTCAAA CGACTTGATT GAGCTGAAAA ACTATATTCC TTTCATTAAT CTTGTGAACA TGCCATCGTT AGAACTTAAA TTCTCAGGCA AGAAATATCC CAAGAACTCT TGCTATGACA ATTTTGAAAA TTTCGAGAAA TCGGCTATCG ATCCTGATAT GAACAAACAT CGGGAACTTG TAAAAGTGCT ATCATTGATT GCCTTGAGGT TAGCTGAGAG CCCCATGATA CCGTTTGACT TCCAGAAGTT GGCCAGCAAG TTAACACATT ATGTAGATGA CTTGGAGAAG TATTCGCGCG ACATTATTCT GACACTTGAG CAGGAAAACA AGCCTGTATT GCACTTCAAG TCCCTACGTG ATTCTATCGA AATCTTAAGA AATGCGGCAA ATTCACTCCA AAGTTGGGGC GATAGCTGGA AGCAGTATGT CGAAAACTCT GCTGAAATCG AGCCGTCAAT GTTGGCAATG AACAGATGGA AGCGTAATGA GAATATGGTT GCATTCAACC AGAAATTCCT TGTAAGGAAT CACATGAAGG ACACGAGACC CGGTTTTGCC AATGTTCTCT TCGGAGTACC ATTCATGGCT CCTGAAGTCA GCGATGGTAA ATATGAGTGG AACACCTTCC CACGTATCAG AGACAGCCTT TACTTACATG ATTTTAACGC TGCCCAAGAC CAGATTAATA AGTTGGCATC GTTGGTCCAG GAGGCATGCC AGGAATTAGA TAGCCAATAG AATATGTTCG TAGACATTAA TGGGTATAGC GGTATAGGGA TACAAATTTA TCGACTT
|
Protein sequence | MASHMRYTPL TADPSHTLPS FPPSYDDLDD NNDDYTYSSS ATPAIEQFEL DDGIDGIDDG ITRRTQREGL LVRASLMTKK FANNFNNSII HPVSQIIDPI YEGYKYFQQK YEQNILKLGN PLVVKRLLYV LFIMIVIFFV TKHNVNDGVN GTSGGTFSAG KFYNIDKLSS SVDDFISAKL MKENLEYFSS MSHITGSNGD LTLARYIETY MHNNGIRIMD MNQLQSYTNY PVYNEKDTYL KLSDNSFSAH LYEMNNKTME HLAYNPNALN TNGPVESHFI YGNYGTQEDY QKLISSGIDL TDAILLIKYG GSIPEPNKVS FGQQSKVKAI VFITPKFEFG TGDSKQEFVD VIQKANVGLT RVDPGDVLTP GWSSEDGYVT RLPWFRSSTT PKIPTIPISW EDGEKLLSKL EGSGVKFDDG YFSGKGKSSS VPTMVLKIAN EERATHQIWN VVGSIQGREQ NEKGIIIGSS RDSTCYGTIS SNTGTVVMLE MIKVFTSLQR KYNWSPSRSI YFVSFDATEY NLAGSAEWIE NRKDSLRKEG YAYIDLSDAI SGDDLSIKAS PFVEDVIKKA LKQVETDVTK NDNGNEKLSL YDLYLRQQGN DGISNDLIES KNYIPFINLV NMPSLELKFS GKKYPKNSCY DNFENFEKSA IDPDMNKHRE LVKVLSLIAL RLAESPMIPF DFQKLASKLT HYVDDLEKYS RDIISTLEQE NKPVLHFKSL RDSIEILRNA ANSLQSWGDS WKQYVENSAE IEPSMLAMNR WKRNENMVAF NQKFLVRNHM KDTRPGFANV LFGVPFMAPE VSDGKYEWNT FPRIRDSLYL HDFNAAQDQI NKLASLVQEA CQELDSQ
|
| |