Gene CNA03390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA03390 
Symbol 
ID3254010 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp881843 
End bp883350 
Gene Length1508 bp 
Protein Length398 aa 
Translation table 
GC content47% 
IMG OID638252670 
ProductO-sialoglycoprotein endopeptidase, putative 
Protein accessionXP_566670 
Protein GI58258515 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.313169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGTAC TTTTAATTGT CTTCTTTAAT AGTTATTATA CTCATTGTTC TCACCATTCT 
CTCTATTCGT TCCATTGCAC TCCCCCGGAT CCAGCCATGA AGCAATCACC ACTCCACAGA
CCATGTAAGT GTCCTCGCTC ACGCTCTGTC TGACATTCCT AGCCCGTCCT CTCCTCGCTC
TCGGCATAGA AGGCTCAGCA AATAAGCTCG GATGCGGCAT CATATCACAT TCTCCTTCAC
CCACAGGTGG ACCTACTTTA GTCATGGTAC TCTCAAACGT TCGGCATACG TACATCACTC
CTCCTGGCGA AGGTTTCCTG CCATCAGATA CAGCCAGACA TCATAGAGAA TGGGTTGTTA
AAGTCATCGA AGAGGCTGTT CGAAAGGCGG GTGTCAGGAT GGGCGATCTC GATTGCATTG
CCTTTACCAA AGGTATTACT ATTCATATTC ATCACAAACA CGTGCTGATG AACATGAAAT
AAGGCCCGGG CATGGGTACC CCTCTCCAAG TGGGAGCGCT CGTCGCCCGT ACGCTATCTT
TACTTCACAA CATCCCCCTT GTCGGCGTCA ATCACTGTGT TGGCCGTAAG TGACGTACTG
GATTTGAAAC AAGATGCCAG CTAACATATC TCCAGACATT GAAATGGGTC GCCAAATAAC
GTCTTCTCAT AACCCCATCG TCCTATATGT TTCGGGCGGC AACACCCAGG TCATCGCGTA
CTCTCAGCAA CGCTATCGCA TCTTCGGCGA GACATTAGAT ATAGCTATCG GGAACTGTCT
AGATCGCTTT GCCAGAGTTA TCGGCCTGAG AAACGATCCA AGCCCTGGGT ATAACATTGA
AAAAGAGGCA AAAAAGTGAG TACATTAGGT TTGTATGAGG TAACACCACA CACGTATATA
CTGATTCAGC ATGACCAATA GGGGCAAGCG TCTAGTCCAG CTCCCATACG GTACGAAGGG
TATGGATGTA TCTTTAGCAG GTATCTTACA CTCCGTTGAG GCCTATACAA AAGACAAACG
CTACCGCTCT TGGGATCAAG TCAACGATGT CGAAGAAGAT ATAATTACGC CATACGATCT
TTGTTTTTCT CTGCAGGAGA CCACTTTTGC GATGCTGGTG GAGATAACTG AAAGAGCAAT
GGCTCATGTG GGAGCGAAGG ACGTCTTGAT TGTTGGCGGT GTTGGTTGTG AGTTCTGATC
CTTTGTAAAA GTTCACAATG ATTAATCGAT CGGTTGTAAT CAGGTAATTT GAGATTACAG
GAGATGATGG GTATCATGGC CAGTGAAAGG GGAGGACGCG TATTCGCAAC TGATGAGAGG
TACGCTTTGA TTCTACTGTT TGAACTTGCA GCGATTGATC GATATCTAGT TTCTGTATCG
ATAACGGAAT AATGATTGCC CAAGCAGGAT TACTGGCCTT CAGAATGGGG AATACCATGC
CATTAGAAAA GACAGGTGTT ACTCAGCGAT ATCGGACCGA CGCCGTCCAC GTGGCTTGGC
GAGCGTGA
 
Protein sequence
MLVLLIVFFN SYYTHCSHHS LYSFHCTPPD PAMKQSPLHR PSRPLLALGI EGSANKLGCG 
IISHSPSPTG GPTLVMVLSN VRHTYITPPG EGFLPSDTAR HHREWVVKVI EEAVRKAGVR
MGDLDCIAFT KGPGMGTPLQ VGALVARTLS LLHNIPLVGV NHCVGHIEMG RQITSSHNPI
VLYVSGGNTQ VIAYSQQRYR IFGETLDIAI GNCLDRFARV IGLRNDPSPG YNIEKEAKKG
KRLVQLPYGT KGMDVSLAGI LHSVEAYTKD KRYRSWDQVN DVEEDIITPY DLCFSLQETT
FAMLVEITER AMAHVGAKDV LIVGGVGCNL RLQEMMGIMA SERGGRVFAT DESFCIDNGI
MIAQAGLLAF RMGNTMPLEK TGVTQRYRTD AVHVAWRA