Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_65905 |
Symbol | APE2.2 |
ID | 4839938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 560790 |
End bp | 563487 |
Gene Length | 2698 bp |
Protein Length | 870 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640391253 |
Product | alanine/arginine aminopeptidase |
Protein accession | XP_001385797 |
Protein GI | 150866260 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.167791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTTACTCACA CCGTACTGCA ATGTGTCGCC ATTCGTCCTC CGACTCGTCG CTGGTTGTTC CTGCTGACAG AGAAGTGTTA CCAACCAACG TTAAGCCTCT CCACTACGAC TTGACCTTAG AACCAATCTT CTCCACCTTC AAGTTTAATG GTCAAGAAAC AATTGATTTC CATGTGAACG AAGACACCGA CTACATCACA TTGAACTCGT TGGAAATCGA AATCCAAGAA GCAATTATCA ACGGCTCGGC TGTATCTGAC ATTTCCTTCA ATGTAGACAA ACAGACTGTT ACCTTCAAGT TGCCACAGCC ATTGGCTCAG GGAAGCAACG CCAAATTAGC CCTCAAGTTC ACCGGTGACT TGAACAACAA GATGGCTGGT TTCTACCGTT CCTCTTACCA AGAAAACGGG GAAACAAAGT ACTTGGCTAC CACCCAGATG GAGCCTACCG ACTGTAGAAG AGCTTTCCCA TCGTATGATG AACCTTCTGC TAAAGCCAAG TTCACAATTT CGCTCATTGC CGAAAAGAGC TTGGTAGCCT TGTCCAACAT GGACGAAGCC TCCACAGTAG AATTGGCTGA CAACAAGAAG AAGGTCACTT TCAACACCAC ACCCTTGATG TCTACCTACT TGGTAGCTTT CATTGTAGGA GACTTGAAAT ATGTAGAGAA CAACGACTAC AGAGTCCCCA TCAAGGTTTG GGCAACTCCT GGCTCAGAGC ACTTGGGTCA GTACTCGGCT GATATCGCTG CTAAGACTTT GAGCTTCTTT GACAAGAAGT TCGACATCCC ATACCCTTTG CCCAAATGTG ACATGGTGGC TATTCACGAT TTCTCTGCTG GTGCCATGGA GAACTTTGGT TTGATCACAT ACAGAACCAT CGACTTATTA TTGGACCCAC TGAACACCAA CATTGTAACG AAGCAGAGAG TCACTGAAGT CGTGATGCAC GAGTTGGCCC ATCAGTGGTT TGGTAACTTG GTCACCATGG ATTTCTGGGA TGGCTTGTGG TTGAACGAAG GTTTCGCTAC GTGGATGTCG TGGTACGCCT GTGACTCGTT GTACCCTGAC TGGAAAGTCT GGGAATCGTA TGTTTCAGAC TCGTTGCAAC ACGCTTTGAC GTTGGATGCT TTGAGAGCTT CTCACCCAAT CGAAGTACCT GTAAAGAGAG CTGATGAGAT CAACCAGATC TTTGACGCCA TTTCCTATTC AAAGGGCTCT TCATTGTTGA AGATGATCTC CAGATGGTTG GGCGAAGACG TCTTCATCAA GGGTGTATCC AACTACTTGA AAAAGCACAA GTGGGGAAAT ACCAAGACTT CTGACTTATG GGAAGCCTTG AGCGATGTAT CTGGCCAGGA CGTAGTCAAG GTCATGGACA TCTGGACCAA GAATGTCGGT TTCCCTATCG TTCACGTTGA AGAAGCTGGT TCTGACATCA AAGTCACCCA ACACCGTTTC TTGGCTACTG GAGATGTCAA GCCCGAAGAG GACTCTATCT TGTACCCAGT GTTTTTAGGG TTGAAGACTT CTTCCGGACT CGACGAGACT GCTGTCTTGG ACTCTAGATC GACTACTCTT ACCCTTCCTA CTTCTGATGG CTTCTTCAAG ATCAACGGAG ACCAAGCTGG TATCTACCGT ACTGCCTACA CCTCGTCTCG TTGGATTAAG TTGGGTCAAG CTGGTGTTGA AGGCAAACTC TCTGTTGAAG ATCGTGTTGG TTTAGTTGCC GATGCTGGTT CCTTGGCTTC TTCTGGTTTC ATTGAAACCA CCAGTTTCTT GAACTTGATC AAATCCTGGA GTAAGGAGTC CAACTTCGTC GTCTGGGATC AAATATTGTC TGACATCGGC TCTGTCAAGA GTGCTTTCAT CTTTGAAGCC GAAGAATTCA AGGATGCCTT GAACTTGTTC ACCGTTGACT TGATCAGCGA AAAGTTGAAA TCTATCGGCT GGGAATTCTC GGACAACGAT TCTTTCGCTG ACCAACAGTT GAAGGGTTCC TTATTTGCTT CTGCTGCAAA CGCTGGCCAT GCCGAAGTCA TTGACTTCTC ACAGAAGTCC TTTGCTGCTT ACGTTGCTGG TGACAAAAAG GCTATCAATC CAAACTTGAG AGCTACCATC TTTAATGTCG TAGCTAAGTT AGGTGATGAA CACACGTTTG AACAGTTATT GAACATCTAC AAGAACCCAC AGAGCAACGA AGAAAAGATT GCTGCTTTGA GATCCTTCGG TAGATTCACT AAACCAGAAA TCTTGGACAA GGTCACCGCA TTGCTTTTGC AAACTGACAT CGTCAAGCAA CAAGATATCT ACATTCCAAT GCAAGGCTTG AGAGCACACA AACTAGGCGT TGAGAAGTTG TGGGCTTGGT TGACTGAAAA CTGGGACAAG GTCTACGAAA TCTTGCCTCC AGGATTGTCG ATGTTGGGTT CTGTGGTCAC TATTGCTACT TCTGGTTTCA CCAAGAAGGA ACAAAGGGAT GCAGTTGAAA AGTTCTTTGC TACCAAGAAC ACCAAGGGGT TCGATCAGGG TCTTGCTAGA TCCTTGGACA TCATTGCTTC CAAGGGCAAT TGGGCCAGCC GTGATGGTCA AGTTATTTCA GAATGGTTGT CTGAAAACGG CTACTCCAAG TAACTTTTGC GAGCCAATAG TTTTAATGTG AAATCTACTT GATGAAATAC TAATATATAT CGTTTCAT
|
Protein sequence | MCRHSSSDSS SVVPADREVL PTNVKPLHYD LTLEPIFSTF KFNGQETIDF HVNEDTDYIT LNSLEIEIQE AIINGSAVSD ISFNVDKQTV TFKLPQPLAQ GSNAKLALKF TGDLNNKMAG FYRSSYQENG ETKYLATTQM EPTDCRRAFP SYDEPSAKAK FTISLIAEKS LVALSNMDEA STVELADNKK KVTFNTTPLM STYLVAFIVG DLKYVENNDY RVPIKVWATP GSEHLGQYSA DIAAKTLSFF DKKFDIPYPL PKCDMVAIHD FSAGAMENFG LITYRTIDLL LDPSNTNIVT KQRVTEVVMH ELAHQWFGNL VTMDFWDGLW LNEGFATWMS WYACDSLYPD WKVWESYVSD SLQHALTLDA LRASHPIEVP VKRADEINQI FDAISYSKGS SLLKMISRWL GEDVFIKGVS NYLKKHKWGN TKTSDLWEAL SDVSGQDVVK VMDIWTKNVG FPIVHVEEAG SDIKVTQHRF LATGDVKPEE DSILYPVFLG LKTSSGLDET AVLDSRSTTL TLPTSDGFFK INGDQAGIYR TAYTSSRWIK LGQAGVEGKL SVEDRVGLVA DAGSLASSGF IETTSFLNLI KSWSKESNFV VWDQILSDIG SVKSAFIFEA EEFKDALNLF TVDLISEKLK SIGWEFSDND SFADQQLKGS LFASAANAGH AEVIDFSQKS FAAYVAGDKK AINPNLRATI FNVVAKLGDE HTFEQLLNIY KNPQSNEEKI AALRSFGRFT KPEILDKVTA LLLQTDIVKQ QDIYIPMQGL RAHKLGVEKL WAWLTENWDK VYEILPPGLS MLGSVVTIAT SGFTKKEQRD AVEKFFATKN TKGFDQGLAR SLDIIASKGN WASRDGQVIS EWLSENGYSK
|
| |