Gene CNK02010 details


Gene Information

Locus tag: CNK02010
Symbol:
ID: 3254620
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 601105
End bp: 602334
Gene Length: 1230 bp
Protein Length: 236 aa
Translation table:
GC content: 48%
IMG OID: 638253694
Product: proteolysis and peptidolysis-related protein, putative
Protein accession: XP_567677
Protein GI: 58260534
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAGTT TCAGCGTTCA CAAAGCGGCT TTGGAAGGTT AGTACTCTTT GCCTCTTGTT 
TGGCTATCAT CTTCAAAAAG GTTTGTTTAA CCTTCATCAT AGGTCAGATA GGTCTTGCGA
GATCATTATT GAATGACGAT CCTAAACTTA TTAACTCGAA GGACGAAGTA CGTTCTGGCT
GGCTCGAGAG GTGACATATG CTGACCTTCC GTTATTCCCG CCTCGCCCCA CTATCTTGTC
CTTGTGCTCG CGCTCTCTTT GATGCCTTCA ATTCAAATTT ACCCACAGGA TGGCCGTACT
CCGCTCCACT GGGCAGCTTC AACTTCAAAC CTTTCTGTCC TCCAACTGTT GCTCAACTAC
CATCCTGACT TGGAAGCGAG AGATACTATG GGATGGACGG CGTTGATGAT TGCCTGTTGG
TTCATTCGGC TTTCATAACT AGCGGGAAGC TAGGTTTAAT CAAAAGTCCC CTTGTTAATG
CCCATTTTTA GCTGCGGCAG GACATCCGGA AATAGTCAGA GAGCTGATAG GTGCGGGTGC
CAAAGTCGAT GCAGTGAATG AGAAGGGTCA AACGGCCCTG TGAGTTTGTC CTCTGTGTCT
GAAATCAGTT GTAGCTGCTA ACCACTCTCA TTTTCTCTGA TAATGCTAGA CATTATGCGG
CTTCCAAGGG AAACGTATCT GTAGGTGCCT CTCGTGTCCC AATATCTACA GATACTAACG
TCGTTGCAGA TTGGCCGTTT GCTCATCAAC CACGGGGCGG ATGTAAGTGT CAAGCATGCT
CCAACACTCA GGAGCAAATA TTTACCTTTA TTCTCATCAA TCAGATTAAT GCCAAAGACC
GAGCGTCACA GCATCCCCTT CACCGAGCGG CAACCACAGG TAACAATGCT TTTTTGCAAT
TACTCTTGAA CCCCCCAGAG GGACGACCAA AGACGAGGTT GAATACCGCT GACCGTGCTG
GTAAGCCTCT GTTTTGTCTT ATTCTTATGG AATTATTTGT TTAAGACACT TGCTGATAAG
GTGGTCAATT ATCAAGGTAA CACACCTCTG CACCTAGCGA TGGAAAGTGG ACATGGAGAC
GCTGCTGTCG TGCTCATTGA GGCTGGAGCG GACCGTGAAC GGTCAAACTC CGAGGGGCAA
ATGGCCGAAG AGATTGAGGG TGTCGGCGGA CAAGAACAAA ACAAGGTTAG AGAATACGTA
GCGTCCAAGG TAGGACGAAG GTCTGAGTGA
 
Protein sequence
MSSFSVHKAA LEGQIGLARS LLNDDPKLIN SKDEDGRTPL HWAASTSNLS VLQLLLNYHP 
DLEARDTMGW TALMIASAAG HPEIVRELIG AGAKVDAVNE KGQTALHYAA SKGNVSIGRL
LINHGADINA KDRASQHPLH RAATTGNNAF LQLLLNPPEG RPKTRLNTAD RAGNTPLHLA
MESGHGDAAV VLIEAGADRE RSNSEGQMAE EIEGVGGQEQ NKVREYVASK VGRRSE
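
The length and GC fields in this record can be cross-checked against the coordinates and the sequence listings. The following is a minimal Python sketch, not part of the source record. It assumes that the start and end coordinates are 1-based and inclusive, and that the spaces and line breaks in the sequence listings are display grouping only. The helper names (span_length, clean_sequence, gc_percent) are hypothetical and chosen for illustration.

```python
# Illustrative helpers only; these names are hypothetical and do not
# belong to any database or library API.

def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a grouped listing; uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C residues in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information fields.
    print(span_length(601105, 602334))

    # First row of the gene sequence listing, in its grouped form.
    first_row = ("ATGTCAAGTT TCAGCGTTCA CAAAGCGGCT "
                 "TTGGAAGGTT AGTACTCTTT GCCTCTTGTT")
    seq = clean_sequence(first_row)
    print(len(seq))          # number of bases in the first row
    print(gc_percent(seq))   # GC percentage of the first row only
```

With the coordinates above, span_length(601105, 602334) gives 1230, which agrees with the Gene Length field. Passing the full gene and protein listings through clean_sequence yields strings whose lengths can be compared with the 1230 bp and 236 aa fields, and gc_percent on the full gene sequence can be compared with the reported GC content.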