Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNJ00620 |
Symbol | |
ID | 3254300 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006679 |
Strand | - |
Start bp | 179459 |
End bp | 181621 |
Gene Length | 2163 bp |
Protein Length | 518 aa |
Translation table | |
GC content | 55% |
IMG OID | 638253219 |
Product | serine-type endopeptidase, putative |
Protein accession | XP_567315 |
Protein GI | 58259805 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACTACCCCG CATTCCCATA CTACCTACAC TCACAATGCG CTTCACCGCA GCCTCCCTGC TGCTCCTCCC CCTCGTGGCT CTCGCCTCGC CCATCGCCCA GCCCACCCTG GCGCCCCTCG AGGCCCCGTC AGGCGGCGAC CACATCGACG ACGCTTACAT CGTCGTCTTC AAGAAGGGCG TTGATGCCAG CCAGATCGCC CTCCATATCG GAGAGGTCCA GGAGCTGCAT GCTGCCAGCG TGAGTATCGC CATCTCCCTT TGCCACTTGA TGGCGATCAT CGGCCGAAGG CGTCCGTCTC TCCGCGGTCT CCTACGTGGA CCCATCCTGA TGGACGATAG CTGACATCTG TACAGCCTCT TCACAGCTCC TTGACTGACG ATGGCGAAGT CGATGAAGGA GGTTTGAGGC ACGTCTACTC CCCTCCCTCC CCAAGCTCCT CCTCTGGTTT CTTCGGTTAT GGTACGTCAT ATATCTCATC GCGGCCACCC ATCAGCCGGA CGGGCCCTCG GCAGAATCCG TGCTGATCAT TTGCCACCCA CCCATTGCCC TGTCCTACAA CAGCCGGCAA GTTTTCCCCT AGCACCCTTG ACGCCATCCG AGCTGCTCCC GAGGTGGCCT ATGTCGAGAA GGACCAGATC ATGACCACTC TCGAGGTTCC CCGCGGCGTC GATGACGACG CGTCTGACGA GCAGGTCGAG CAGGTCGACG TTTCCGCCTC TGGTATTACC ACTGAACTCG GTGCCCCCTG GGGTCTCGCC CGTATTTCCC ACAGAGATAG GCTTAGCCTC TCCACTTTCC AAAAGTACCT TTACTATTCT GGGGGTGGCG ACGGTGTTGT CGCTTATGTC ATTGACACTG GTATTAACAC TGACCACGTC GAATTCGAGG GTCGAGCCGT TTGGGGCAAG ACCATTCCCA AAAATGATGT TGACAAGGAT GGAAACGGTC ACGGAACCCA TTGCGCTGGT ACTATTGGTT CTAGGAAGTA CGGTGTCGCC AAGGATGTCA AGCTTGTTGC TGTCAAAGTT CTTGGTTCCA ACGGTTCTGG TACTATGAGC GACGTCATCG CCGGTGTTCT GTGAGTCCAG TTATTTCTGC CTGCTTAGAT CGTATGCTGA TTATGCTTGT AGCTGGGCTG CTGAGCAGGC GACCGAGGAC GCCGAGAGGG CTGCCCAGGA AGTCGCCACT ACTGGCAAGA CCAAACATAA GGGTTCCGTT GCCAACATGT CCCTTGGAGG TGGCAGGGCC AAGACTCTCG ACGACGCCGT TGACGCCGCT GTCGAGGCCG GCCTCCACTT TGCCGTTGCC GCCGGAAAGT AAGTCGGTCG TCTTTTTTGT CCGTTGTATT TCTAACGAAT TCTCGCAGCG ACAACAAGGA CGCCTGTAAT TACTCCCCCG CTGCCTCCAA GTTCGCCGTC ACTGTCGGTG CTTCCACCCT CGGCGATGAG CGTGCTTACT TCTCTAATCA CGGCAAGTGT GTCGACATCT TTGCTCCAGG TCTCAACATT CTCTCCACCT GGATCGGCGG CAACTCCACC ACAAACACCA TCTCTGGTAC TTCCATGGCT TCGCCCCACG TCTGCGGTCT TCTCGCTTAC TTCGTCTCTA TTTACGGGAC TGAGTCTTTC CCTCATATTA AGAGCGACTT TAACGAGTCT CTTGCCGCTA CTCGACCTTC TATCGTATCC ATGGCCTATG CTCAGGCTTA CGCCATGCTC CCTTCCCTAG CTCAGACCGT CCTCCCCAAG CCGGAGCTCC TACCTGTGCC CCCCAAGAAT GATACTCTTA CCCCCGACAT GCTCAAGAAG GCGCTCGTTG GTCTTGCTAC CAAAGATATG CTTAGTGACC TCCCCGAGGG GTCTCCCAAC CTTTTGGCAT ACAACAACGC CACTCTTCCC AAGAGGAAGT AAATGATTCT CCCTGTGAAT CAAGTAGGAA CGGAACTTTT TTTTTGCTAT TTTAAGAAAG AGACATTACA ATAGAAGGGC AGAGAAGGGC GTGTGATGGA ATGGCAGTGG GATTTGTTTC TCAATATACC CCGTTGCTTA AACCCAGACC GTCTTCACTT TTTTACCCGC TAGTGTACGC ATCGTGTTTG AGAGTTAAGA CGATTTGTAT GTTATGCTAA GAATCGGAAG AGCTTATGTA TCACTTTGCA ATT
|
Protein sequence | MRFTAASLLL LPLVALASPI AQPTLAPLEA PSGGDHIDDA YIVVFKKGVD ASQIALHIGE VQELHAASPL HSSLTDDGEV DEGGLRHVYS PPSPSSSSGF FGYAGKFSPS TLDAIRAAPE VAYVEKDQIM TTLEVPRGVD DDASDEQVEQ VDVSASGITT ELGAPWGLAR ISHRDRLSLS TFQKYLYYSG GGDGVVAYVI DTGINTDHVE FEGRAVWGKT IPKNDVDKDG NGHGTHCAGT IGSRKYGVAK DVKLVAVKVL GSNGSGTMSD VIAGVLWAAE QATEDAERAA QEVATTGKTK HKGSVANMSL GGGRAKTLDD AVDAAVEAGL HFAVAAGNDN KDACNYSPAA SKFAVTVGAS TLGDERAYFS NHGKCVDIFA PGLNILSTWI GGNSTTNTIS GTSMASPHVC GLLAYFVSIY GTESFPHIKS DFNESLAATR PSIVSMAYAQ AYAMLPSLAQ TVLPKPELLP VPPKNDTLTP DMLKKALVGL ATKDMLSDLP EGSPNLLAYN NATLPKRK
|
| |