Gene CNI04140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI04140 
Symbol 
ID3259792 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp1099507 
End bp1100686 
Gene Length1180 bp 
Protein Length180 aa 
Translation table 
GC content48% 
IMG OID638258909 
Productmetalloendopeptidase, putative 
Protein accessionXP_572606 
Protein GI58270900 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.563575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAATTGGCCG CCCAAATCGC AAGAACTGTC CCGGCTGATC TCTGAACGAG AAGCCTTGAT 
AGGAGAAGGA GACAGATATT ACCTACCGAG TGGTACAGCT AAGAGTACAT ACGTACCCTA
CAGACCGCCT ACCAACAACC CTTTGAAACA GTTTGAGTCG CCAGACTGGC GCGTGTATGT
GATAGATTCA GTAAGCTCAT AAACCTTTTC GGGAAACTGT TGTTTCTAAA GTCTGACAAT
CCCTTTCGGT GTTAGCCTGA AGTGAATGCC TTCGCCCTAC CAAGCAGAGA TGTTTTTGTT
TACACCGGTC TCCTTGACAC ACTGCCCGGG GATGATGTCA TGCTGTCTGC AATCTTAGCC
CATGAGATCG CTCATGTCGT AGAAAGACAT ACGGTTGAAA ATCTAGGAGT AAGTCTTACA
GTTATTTGAG ATGGATCTTC CAGCGCTGAC TTGAATGATC TCAGTTCTTG AATCTGGCGA
CTGTGGGATT TGACGTCTTG CGAGGATTGG CCTTTGCATT TACCATCTCC TTCCCATTGT
ACGTACTGTT GAAACTGCCT GGAATGCTAC CTGATGATCG CTCATACGAG TGCACGAATA
TAGTATCACG GACTCAGCCG GGATGTGTAT CAACTGGATC AACAATGTCC TCGCCGACAG
AGCTTACTCT AGAAAACTTG AAATGGAGGC CGATGCTGTA GGCTTGGAGG TACATATGCT
CTCGAATTGA GATCAATAAA ACGTGTGGCA TGTGCTGATA GCGATCAATC AGATCATGGC
GACCGCAGGA TACGACCCTA GGGCCGCAAG CGACTTGTGG GAGCTTATGG CATGTGTGGA
GGACGACGCA GCGGCGATGG GACAAGGGAT CAGTGTCGAG AACCGGTTCA CTCTGCTTAG
GACGCATCCG ACAAGTGACG TTCGACTAAA AGTAAGCAAT TGGAAGCGTA CGCATTTTTT
ACTTTGTGCC CTGACTGGAA TTTGATTTCT AGGCTCTCAG CAAGGATATG GAAGGTGCGC
TGAAGATTTG GCGGGACCAT AGGAGGAAGC GTCAGCCCAA GAGAGTGGAG AAAAAGCAGG
AAAAGAAGGA TAACGTCCCT GAATCGGACA AAGCTGTATC GGAATAAGGA TGGTATTCCA
GAAGATAGCA TGCGGTCGTT ATTTAGCATG CACTTATTCG
 
Protein sequence
MLSAILAHEI AHVVERHTVE NLGFLNLATV GFDVLRGLAF AFTISFPFIT DSAGMCINWI 
NNVLADRAYS RKLEMEADAV GLEIMATAGY DPRAASDLWE LMACVEDDAA AMGQGISVEN
RFTLLRTHPT SDVRLKALSK DMEGALKIWR DHRRKRQPKR VEKKQEKKDN VPESDKAVSE