Gene CNK00840 details

Gene Information

Locus tag: CNK00840
Symbol:
ID: 3254415
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 268930
End bp: 270786
Gene Length: 1857 bp
Protein Length: 478 aa
Translation table:
GC content: 49%
IMG OID: 638253574
Product: conserved hypothetical protein
Protein accession: XP_567647
Protein GI: 58260474
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3325] Chitinase
TIGRFAM ID:
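
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon; neither convention is stated in this record, and the function names are illustrative only. Under those assumptions the genomic span is longer than the coding length implied by the protein, which is consistent with the gene being marked as spliced.

```python
# Values copied from the Gene Information table.
START_BP = 268930
END_BP = 270786
PROTEIN_LENGTH_AA = 478


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with one stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 1857, matching the listed Gene Length
print(cds_len)             # 1437
print(gene_len - cds_len)  # 420 bp not accounted for by coding sequence
```

The 420 bp difference would correspond to intronic sequence only if the listed coordinates run exactly from the start codon to the stop codon.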


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATTTCG TCAGCAATAC CGCTCTCTTT GCGATTCTCA CGGCTCTTGC TGTTCGTTCA 
GCACCTGCTC CACAATCTGG TACCGATTAC ACTTGTGATA GTGACACTCA ATGGCATGAC
GCATACCAGA CCGTTACTTG TCCTGGCGAT ACCGTTTGTG TCACTGGTGC CAGTGGGAAC
CCTTGCCAAT TTCCATCGGG GTAAGTTTTA TTAATGTTTC GTGACGATAA TATTGACAAA
AGTAGTTATG GACAGTCTGG AGTAGTGGCT GTTGCGGTCA CTTCTGCCGC TGCTGTAACT
TCGGCAGCCG CGGTAGCCAC TTCGGCTAGT GCTGCTGGAG GGGTAACCTC AGCGACTTCA
GCTGGAGGTA TCAGCATCAG CGGAAACGCA GCCACTTCAG CTTCAGAAGA AGCTGTAACT
TTGTCGTCGG GGGCTACAGC TACCGAATCT GGTGGTGGAG GCAGTGCCTC GCACACGAAC
TCTGCAGCAT CGGCAAGTGG TACCTCCACT TCGGACGGCA CTAGTGTATC GAACAGATTT
GTGACTTATT GGGATAAGTG AGTACTGCCA TTTAAGGCAG TCGAAAAGTA CTAATCAGAA
TTCAGCTACG CAAATATGGG AGGGGTCAAC GCTGGTCAAT TGACGGCTGT TACTCATGTA
ATTCTTTGTA AGTTCATCTC CATCAGATTA GTCAACATTC TAACCCGTTT GATCGTCATA
GCCTTTGCCG ATATGACCGA CTGGGCTACC GAGCAAACGA CTTGGAAGTT CATGGAATCT
TCCAACGGCA ACTTTGACTC TTCAACAGCC GCAACGCTCA AGGGCATGCA ATCGGGTCTT
AAAGTTTGTG GAGCTCTTGG TGGTTGGGGT CTCGATAGTG TTATGGCTAC TGCAGTACGA
GGCGGAGACT CAACTATTGC AACGTTTGTG GCCAATGTGA AGGGATTTGC CGACTACTTC
AACTTGGACG GCATTGATAT TGACTGGGGT ATGACCAGCA CTTGATAAAT GCACAAGTCA
CTGATACTGA AAATTAGAAT TCCCCTCCGC CTCTGATGAC GCCAACCTCA TCACTTTTAT
TACCCAGCTG CGTGCTGCAC TTGGTGATGA CAAACTTATT TCAGTCGCAC TTGGCTCCCG
AGTTGATACT ACCGATGCCG CCGCGTTCAA TAGTGACACG TTCTCGAAAC TTGACAGCCT
TGTTGACATG TGGAACCTCA TGACTTACGA CTATGTCAAT CGCTACAGTA CTGTCACCGA
ACAACAGGCT GGTAACCGCG TTGTCACCAC CGTCATGGAT TACTATGAGC AGCAAGGTAT
CACCATGGAG AAATGTAACG TCGGTTTTCC TATGAACGCC AAGTACTTCA CCCTTACCGA
AACCTGTGAT TCTTCAAACC CAATCGGCTG TTCTCTTCCA GGCACCGACT ACTATGAAGA
CAGCGGCGTT GATAATTACA AGTCGGGATG GGTCAGATTT AATCCAGATT TGGATTCTTC
GCTGGGTACA GAAGGAACAG AATGGGCGAC CAAGATGAGG GCGCAGTGGG AAGCTCGGCC
AACCGATGGA AGTACAGAGA TTACTGCCGA TGTATCGAAC GCCTGGGTTG ATGAGACCAA
TGATGTCTTC TGGACTTGGC TGTCTGACTC TGACATGAAG ACAACTTGCC AAAACTGGGT
GACGTCGGGC AAGGTGGGAG GCGCTATGGT TTGGAGTCTG AACCAGGTGA CTATCCATTC
ATTTATAGTA CGTGCTCAAA TGCTGATCCA CTTTTGACAC ACAGGACGAC GAAAGTCAAG
ACGGAGGGAG TCACTTGACT GCACTTGCAG AATGTATCCA GGGGTCGTAA TCCATAG
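
The gene sequence above is laid out in blocks of 10 bases, 60 per line, so the whitespace has to be removed before the sequence can be measured or analysed. The following is a minimal sketch of that step together with a simple GC percentage calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool referenced by this record, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGCATTTCG TCAGCAATAC CGCTCTCTTT GCGATTCTCA CGGCTCTTGC TGTTCGTTCA"

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full block of sequence lines to `clean_sequence` should give a string whose length can be compared with the listed Gene Length, and whose GC percentage can be compared with the listed GC content.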
 
Protein sequence
MHFHLLHNLV PITLVIVTLN GMTHTRPLLV LAIPFVSLVP VGTLANFHRA AVATSASAAG 
GVTSATSAGG ISISGNAATS ASEEAVTLSS GATATESGGG GSASHTNSAA SASGTSTSDG
TSVSNRFVTY WDKLVNILTR LIVIAFADMT DWATEQTTWK FMESSNGNFD SSTAATLKGM
QSGLKVCGAL GGWGLDSVMA TAVRGGDSTI ATFVANVKGF ADYFNLDGID IDWEFPSASD
DANLITFITQ LRAALGDDKL ISVALGSRVD TTDAAAFNSD TFSKLDSLVD MWNLMTYDYV
NRYSTVTEQQ AGNRVVTTVM DYYEQQGITM EKCNVGFPMN AKYFTLTETC DSSNPIGCSL
PGTDYYEDSG VDNYKSGWVR FNPDLDSSLG TEGTEWATKM RAQWEARPTD GSTEITADVS
NAWVDETNDV FWTWLSDSDM KTTCQNWVTS GKVGGAMVWS LNQVTIHSFI NVSRGRNP