Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNJ00580 |
Symbol | |
ID | 3254304 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006679 |
Strand | + |
Start bp | 161353 |
End bp | 164240 |
Gene Length | 2888 bp |
Protein Length | 803 aa |
Translation table | |
GC content | 50% |
IMG OID | 638253215 |
Product | Prolyl endopeptidase, putative |
Protein accession | XP_567311 |
Protein GI | 58259797 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGTC AACAAGCAAG TCACTCTTTC AGCACAGACA AGACGGGCCA TGGCACTCTT AAGAACGTCC ATGCGTCTGA TTTTACAATC AGCCCCGGAC AGTGGAAGAA AAATGTTAAC TTTTCACCAT ACCCAGTTCC TCCTCAACAT GGGGGCATTA CTGAAATTAT CCACGGAATT GAAATCGAGG ACCCATGGCG AGCTCTTGAA GACCCCGATT CCGAGGTGAC AAAGAAGTTC GTCAAGGAAC AAAATGATGT GAGTGCTTTC AATGTAAGGA AGGACTCTAC CTGACCAGCA AACACAGTTC TCTGTTCCCA GACTTACCAA CCACCCCCTT CGAAAAGAGC TCGAAGCCGC CGTTGAGCAA TGTTACAATC ATGAACGTAT GACCAGTCCC GAACTTCAGG GCGATGGATA CTATTATTGG AAGTTTAACC CTGGTACCTC TCCTCGGGAC GTCATCGTTC GATCGAAGGA TCTCAAACGC GACTTTGGGA AGGCTCCTGG CGGGAGTGGT CCCGAAATTT TCTATGACTT GAACAAGGAG GAGAATATCT CTCTTTATGC CCATAGCTTT AGTCCTAGCG GGAAACTCTG GTGTGCTGTT CTGCAGTATG CAGGGTAAGC GAAATAAATA TGTGAAGGAA GGCTATTGCT TACCAGTTAT GTAGGAGTGA CTGGCAAAGG ATTCGAGTCA TCGACACCGA GAGCAAAGCT GTCCTGGAAA AGGACTTGGG AGGATCGAAG TTCACTTTCG GCGTTACTTG GGTAGGCGAG AAGGTGAATG CTTTTGATGT TCCTCTCTTC GTTAACTGAC ATGGATTACA GGGTTTTATT TACAAGCGGT CAATCGACTA CGATGCCACT AGTGACGGTT ACGACGGTAT CGACGGCTCC TTCGGCATGT TCTACCACGC AGTCGGCCAA CATCAGTCCA CCGATGTTAT CGTTTGGAGT CCCCCGCCTG GAGAGTTTCA ATTCATTGGT AAAGCCAAGG TCGTTGCCGT TGATGAGAAG GAGGAGAACA ACAAAAGGGC ATTCTTGGCT CTCGACATCT ACAAGAATAC CAGTCCTGAG ACTGAGCTGC TGCTGGTCGA GTTGCCCGGC GGCACTGCAG GCCCTGCTGG CGTTCTTCTT CCAGAACTGG TTACCAAGGA GATGAAGTGG GTATCCAGAG GTTTTACTGG AGAAACTCAT TGTGAGTATC GAATTCGATT ATATTACTTC CATTGTTAAT GAAAGGTAGA TATTGGTTCG TCCAGTGCCG AACGTCACTT CTTCACTTCT TTTACGGACG GCGTCTCTAC CGGCCGTATC ATTGCCTTCG ACTCCGCCGA CTGGGATGCC ACAGACATCG ACAGCCCCTT ACCTATGCAA GAGATTGTAC CCGCGGATCC CGAAGGCCAC CAACTTCAAA GCGCCTACTT CATCGGCGAC CGACTGCTCG CTCTCATCTA CCTCAAACAC GCTTGCGCCT CTGTTGTCTT CATTGACGCT CGGACGGGCA AGCCTCTGGG TTCTGCCGAT GCCCAAGGTA CCCATGGTAA CGTTGCTGCC GACCCAGAGA CTCAAGTGCC CGTTCCAGAG GAAGAGGTCC AGCACGCAAA GGAAGGGCAA GTCGTCATTC CAGAGCACGG TGCTATCACC AGCATTTCTT GCCGACCTGA CGCCAACGAC TTTTACTTTA CCGTTGACAC CTGGGTTGCG CCTTCATACG TACTCAAGGG TGAGCTCATC AAGAACAAGG CTGGTCGGTA CGAGGTAGAC ATTAGTAGTG TCAACTCTTC TGAGACCGCT GCTCAAGAGA CGTTGGTTTG TTCTCAAGTA TTCTATACCT CACATGACGG TACCAGGATT CCCATGTTCA TCTGTCACCC TCATGACCTT GACCTCACAC GCCCTCATCC TCTGCTTCTC CATGCTTATG GGGGCTTCTG CTCGCCTCTT ATTCCCCACT TTGACCCAAT GTTTGCCGTT TTCATGCGTA ATCTCCGAGG AGTGTAAGCT TCATTTCTTC GCCCGACAGA CTAATTCCGC TGACTCCTTT CAGGGTTGCC ATCGCTGGTA TTCGAGGAGG TGGTGAATAC GGCAAGGCGT GGCATGAAGC TGCTATCGGT ATCAAGCGCT CTGTCGGCTG GGATGACTTT GCTGCCGCCG CTCGATATGT TCAGTCTCGA GGACTTACCA CCCCTTCTCT CACCGCAATC TACGGTAGCT CCAACGGTGG TCTCCTTGTT TCTGCTGCCA CTGTTCGAAA CCCAGAGCTT TACTCTGTCG TGTTTGCTGA TGTGGCTATC ACAGACTTGA TCAGATACCA CAAATTTGTG AGTGTCATGT TTCGCTTTCT GTTTGACTTT CTCGCTTATG CCTACAACCT CATTTGCAGA CACTCGGACG AATGTGGATG ACTGAATATG GCTCCCCAGA AGAACCTGAA ACCCTCGCGG TTCTTCGCGC TAATTCCCCT CTTCACAATA TCAGCCGCGA TCCTTCTGTC CAATATCCTG CTATGCTCCT CACCACCGGT GACCATGATA CACGAGTGGT ACCCGGTCAT TCGCTCAAGC TACTTGCAGA GCTGCAGAGT GAGTTACTTC TCAATCAAGC CATGATAACT ACTGACATGT ATAAACTAGC TCTCAAGGCT AAGAACCACG GGGCAATGTG AGATTTATAG TATTGATGCT AGAAACTTTT TGCTGATCAC TTGTTCTAGC CTTGGTCGAG TGTACATAAA CGCAGGGCAC GAACGTACGT CTACCATATA ATTGATCCAT TCCAGCCACT GATATTAGAC CGTTGCAGAA TCAACAAAGT CAACCGAGAA GAAAGTTGAG GAGGCGGTTG ACCGTTTGGT ATTTGCACTT GACAACATCA AGATTTGA
|
Protein sequence | MSGQQASHSF STDKTGHGTL KNVHASDFTI SPGQWKKNVN FSPYPVPPQH GGITEIIHGI EIEDPWRALE DPDSEVTKKF VKEQNDFSVP RLTNHPLRKE LEAAVEQCYN HERMTSPELQ GDGYYYWKFN PGTSPRDVIV RSKDLKRDFG KAPGGSGPEI FYDLNKEENI SLYAHSFSPS GKLWCAVLQY AGSDWQRIRV IDTESKAVLE KDLGGSKFTF GVTWGFIYKR SIDYDATSDG YDGIDGSFGM FYHAVGQHQS TDVIVWSPPP GEFQFIGKAK VVAVDEKEEN NKRAFLALDI YKNTSPETEL LLVELPGGTA GPAGVLLPEL VTKEMKWVSR GFTGETHYIG SSSAERHFFT SFTDGVSTGR IIAFDSADWD ATDIDSPLPM QEIVPADPEG HQLQSAYFIG DRLLALIYLK HACASVVFID ARTGKPLGSA DAQGTHGNVA ADPETQVPVP EEEVQHAKEG QVVIPEHGAI TSISCRPDAN DFYFTVDTWV APSYVLKGEL IKNKAGRYEV DISSVNSSET AAQETLVCSQ VFYTSHDGTR IPMFICHPHD LDLTRPHPLL LHAYGGFCSP LIPHFDPMFA VFMRNLRGVV AIAGIRGGGE YGKAWHEAAI GIKRSVGWDD FAAAARYVQS RGLTTPSLTA IYGSSNGGLL VSAATVRNPE LYSVVFADVA ITDLIRYHKF TLGRMWMTEY GSPEEPETLA VLRANSPLHN ISRDPSVQYP AMLLTTGDHD TRVVPGHSLK LLAELQTLKA KNHGAILGRV YINAGHEQST KSTEKKVEEA VDRLVFALDN IKI
|
| |