Gene CNK00220 details

Gene Information

Locus tag: CNK00220
Symbol:
ID: 3254479
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 67149
End bp: 68299
Gene Length: 1151 bp
Protein Length: 287 aa
Translation table:
GC content: 53%
IMG OID: 638253516
Product: cytoplasm protein, putative
Protein accession: XP_567593
Protein GI: 58260366
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0412] Dienelactone hydrolase and related enzymes
TIGRFAM ID:
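
The Gene Length value follows from the Start bp and End bp fields if the coordinates are read as 1-based and inclusive, which is an assumption that the arithmetic supports (68299 - 67149 + 1 = 1151). The sketch below also estimates how many bases are not accounted for by the protein, assuming the gene span runs from the start codon through the stop codon; for a spliced gene this remainder would correspond to intron sequence. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def non_coding_bp(gene_len_bp: int, protein_len_aa: int) -> int:
    """Bases in the gene span not explained by the codons of the protein.

    Assumes one codon per amino acid plus one stop codon, and that the
    gene span starts at the start codon and ends at the stop codon.
    """
    coding_bp = (protein_len_aa + 1) * 3
    return gene_len_bp - coding_bp


length = gene_length(67149, 68299)
print(length)                      # 1151
print(non_coding_bp(length, 287))  # 287
```

Under these assumptions, 864 bp are coding and 287 bp are left over, which is consistent with the record marking the gene as spliced.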


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGAAC AAACCACCGT CTCTCCCTGC TGCATCACAG GCCGTACGTC CCTGGCACTA 
CTTACCTTTC TCCCACTCGC CAGGCTCCAC GACACCGCTA ACCTGGTTGA GACAGATATC
CACTCCGGAA AGCCCCTCGG CTCCATCTCC ATCCAGCACG GCCTCCGCAC TTACGTCTCT
CTCCCTTCCT CCGCTGAGAA AGGCAAAGCT GAAGGGCAAG TGGGCCAGAA ACAAGACACA
ATCATCTTGA TTTCTGATAT CTTTGGGATC GACCTCGTCA ACTCCAAGCT CGTCGCCGAC
GAATGGGCTG GGCAGGGGTA CAAGGTCCTC TTGCCTGATT TCTTTGAGGG TGATCCTATC
CCCGAGTCTC TTCTTCAGGT AAATTCCCTT CTCTCCTCCC CTGTCCCTTC CCTTTTACTT
GCTGCTTGTG ATAATGTTAC TGACCATGTG ATGGTGATGC GTCAATCTCA TATCGGATAG
TCGATCGTAC CGAACCTGAG GCACCAAGCC GAAGCTACAG CACTCACCAA GGCCGCCGAT
ACCGCCAAAG CGGCTGCTGC CCTTGGTCCT TGGCTTGTCA AGCACCGCGA GGCTGGTAAG
TTTCCAAAAC AACCCCCCAA CATCCTTCTC CTTGCCCCAC CTATGATTGC ATCGCCTGTT
ATGGATATTA TGCTAACCAA CAAGCAAACT GACTGGCGGG ATAAACAGTC ACCCGGCCGC
TTGTGGAGAA ATACGTCCAA TCCGTGCGCT CCGACACCTC CACGGGCAAG ATCGCCGCCG
TCGGTTATTG CTTTGGTGCG CGCTACGCCC TCCTCCTCGC CCAACCCCAG TCTGGCGCCA
AGTCCAGCGT AGACGTCGTG GTAGCCAACC ACCCTTCCTT CCTCGTCCTC GATGACGTCA
AAGACATCAA CTCCACCCCT TGCATCATCC TCAAAGGGGA TAAAGATGAT ATCATGAGTG
AAGATGATTT GGATAAGGTG GAAGAAATTA TGAAGCAAAA TTTGGGTGAG AAATTGGTAG
TCAAGAGATT CCCTGGGGCC GTCCATGGGT TTACGATAAG AGGTGATATG GAGGATGGAC
AGGAAAAGTC GCAAAAAGAG CAGGCGAACA AGGATTCGTT TGCTTTCGTT GCAAAGTACT
TTAAAAGCTA G
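
The sequence above is laid out in space-separated blocks of 10 bases, 60 per line. A minimal sketch of how such a block layout could be flattened and checked against the Gene Length and GC content fields is shown here. The helper names are hypothetical, and the demonstration uses only the final line of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Final line of the gene sequence, as printed in the record.
last_line = "TTAAAAGCTA G"
seq = clean_sequence(last_line)
print(len(seq))                    # 11
print(round(gc_fraction(seq), 3))  # 0.273
```

Passing all of the gene sequence lines, joined together, to `clean_sequence` should give a string whose length matches the Gene Length field, and `gc_fraction` on that string can be compared with the reported GC content.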
 
Protein sequence
MPEQTTVSPC CITGHIHSGK PLGSISIQHG LRTYVSLPSS AEKGKAEGQV GQKQDTIILI 
SDIFGIDLVN SKLVADEWAG QGYKVLLPDF FEGDPIPESL LQSIVPNLRH QAEATALTKA
ADTAKAAAAL GPWLVKHREA VTRPLVEKYV QSVRSDTSTG KIAAVGYCFG ARYALLLAQP
QSGAKSSVDV VVANHPSFLV LDDVKDINST PCIILKGDKD DIMSEDDLDK VEEIMKQNLG
EKLVVKRFPG AVHGFTIRGD MEDGQEKSQK EQANKDSFAF VAKYFKS