Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA03390 |
Symbol | |
ID | 3254010 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 881843 |
End bp | 883350 |
Gene Length | 1508 bp |
Protein Length | 398 aa |
Translation table | |
GC content | 47% |
IMG OID | 638252670 |
Product | O-sialoglycoprotein endopeptidase, putative |
Protein accession | XP_566670 |
Protein GI | 58258515 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.313169 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGTAC TTTTAATTGT CTTCTTTAAT AGTTATTATA CTCATTGTTC TCACCATTCT CTCTATTCGT TCCATTGCAC TCCCCCGGAT CCAGCCATGA AGCAATCACC ACTCCACAGA CCATGTAAGT GTCCTCGCTC ACGCTCTGTC TGACATTCCT AGCCCGTCCT CTCCTCGCTC TCGGCATAGA AGGCTCAGCA AATAAGCTCG GATGCGGCAT CATATCACAT TCTCCTTCAC CCACAGGTGG ACCTACTTTA GTCATGGTAC TCTCAAACGT TCGGCATACG TACATCACTC CTCCTGGCGA AGGTTTCCTG CCATCAGATA CAGCCAGACA TCATAGAGAA TGGGTTGTTA AAGTCATCGA AGAGGCTGTT CGAAAGGCGG GTGTCAGGAT GGGCGATCTC GATTGCATTG CCTTTACCAA AGGTATTACT ATTCATATTC ATCACAAACA CGTGCTGATG AACATGAAAT AAGGCCCGGG CATGGGTACC CCTCTCCAAG TGGGAGCGCT CGTCGCCCGT ACGCTATCTT TACTTCACAA CATCCCCCTT GTCGGCGTCA ATCACTGTGT TGGCCGTAAG TGACGTACTG GATTTGAAAC AAGATGCCAG CTAACATATC TCCAGACATT GAAATGGGTC GCCAAATAAC GTCTTCTCAT AACCCCATCG TCCTATATGT TTCGGGCGGC AACACCCAGG TCATCGCGTA CTCTCAGCAA CGCTATCGCA TCTTCGGCGA GACATTAGAT ATAGCTATCG GGAACTGTCT AGATCGCTTT GCCAGAGTTA TCGGCCTGAG AAACGATCCA AGCCCTGGGT ATAACATTGA AAAAGAGGCA AAAAAGTGAG TACATTAGGT TTGTATGAGG TAACACCACA CACGTATATA CTGATTCAGC ATGACCAATA GGGGCAAGCG TCTAGTCCAG CTCCCATACG GTACGAAGGG TATGGATGTA TCTTTAGCAG GTATCTTACA CTCCGTTGAG GCCTATACAA AAGACAAACG CTACCGCTCT TGGGATCAAG TCAACGATGT CGAAGAAGAT ATAATTACGC CATACGATCT TTGTTTTTCT CTGCAGGAGA CCACTTTTGC GATGCTGGTG GAGATAACTG AAAGAGCAAT GGCTCATGTG GGAGCGAAGG ACGTCTTGAT TGTTGGCGGT GTTGGTTGTG AGTTCTGATC CTTTGTAAAA GTTCACAATG ATTAATCGAT CGGTTGTAAT CAGGTAATTT GAGATTACAG GAGATGATGG GTATCATGGC CAGTGAAAGG GGAGGACGCG TATTCGCAAC TGATGAGAGG TACGCTTTGA TTCTACTGTT TGAACTTGCA GCGATTGATC GATATCTAGT TTCTGTATCG ATAACGGAAT AATGATTGCC CAAGCAGGAT TACTGGCCTT CAGAATGGGG AATACCATGC CATTAGAAAA GACAGGTGTT ACTCAGCGAT ATCGGACCGA CGCCGTCCAC GTGGCTTGGC GAGCGTGA
|
Protein sequence | MLVLLIVFFN SYYTHCSHHS LYSFHCTPPD PAMKQSPLHR PSRPLLALGI EGSANKLGCG IISHSPSPTG GPTLVMVLSN VRHTYITPPG EGFLPSDTAR HHREWVVKVI EEAVRKAGVR MGDLDCIAFT KGPGMGTPLQ VGALVARTLS LLHNIPLVGV NHCVGHIEMG RQITSSHNPI VLYVSGGNTQ VIAYSQQRYR IFGETLDIAI GNCLDRFARV IGLRNDPSPG YNIEKEAKKG KRLVQLPYGT KGMDVSLAGI LHSVEAYTKD KRYRSWDQVN DVEEDIITPY DLCFSLQETT FAMLVEITER AMAHVGAKDV LIVGGVGCNL RLQEMMGIMA SERGGRVFAT DESFCIDNGI MIAQAGLLAF RMGNTMPLEK TGVTQRYRTD AVHVAWRA
|
| |