Gene CNG01720 details

Gene Information

Locus tag: CNG01720
Symbol:
ID: 3258926
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 489604
End bp: 491827
Gene Length: 2224 bp
Protein Length: 505 aa
Translation table:
GC content: 48%
IMG OID: 638257789
Product: cytoplasm protein, putative
Protein accession: XP_571896
Protein GI: 58269480
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3325] Chitinase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.256066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACATAG AAGATGAAAA GACCATTCGT TGCGTTGGCG TTCTGGCCGG AAATGGTTTG 
GGGCCCCAGA TCGGCAAACG TCGTCTCTTG CGCTTCTCGT GCACCGTTGC CCAACAATCG
TTCACTTCGT TATACATCTG CCATCTTGAC CCTTTTCTTA TGGCTCACTC CCCAAAACCA
GCCCAGTATC CGTCCGCGAA CAAGAAACCT ACATTCAAGC CGAGTCTCGC TTTCCTAGCT
GTCCTCCTTC TAGCTGCTAT CGCTTTCCTC TTCACACAGA TTGATTTCGC ATTCCCGTCG
CATCCCTGGA AGACTGTTGG AAAGCTACTG AAAGGCGGCG AACTAGAAAT GAACAATCCG
AAGCGTACTG TCGGATACTT TGTAGGTGAT CCTGTACCTT GTATGCAGGG AATGATAGCT
AAACGACTTG ACGTAGGTGA ACTGGTCAGA TCTCTCATGT GATCTTGGCC TGGCAGATTC
ACTGACTTGC AGATCTCCAG GGGTATCTAC GATCGCAAAT TTTTCCCTCA AAATATCCCT
AGCCAACACC TCACTCATAT TAATTATGCC TTTGGAAATG TGAAGAAGGA TAGCGGTGAA
GTCGTATTGA GTGACTCTTG GGCTGACGTC GAAGTGAGTC ATTCGCACGG TTTGATGCAT
GCAAGGTGCT CATTGTGTGT ACAAGATCCA CTACGATGGT GATTCGTGGG ATGAGCCTGG
TACCAACCTC TATGGCTGCT TCAAGGCTAT CTATCTTATG AAGAAACAGA ATCGGTGAAT
AGTTGATTTT GTTAAAAGAC TTTAGCTAAT GCCGTATTCA GCAACTTGAA GGTGCTGCTC
TCCATTGGTG GATGGTCCTT CTCCCCTGTA AGTGTGATGG TCTCCATAAC TCCCATATCG
TTCCCAGACT AAACTACGTA TAGAATTTTG CCGGTATCGT ACACCCCAAA TGGCGATCAA
CGTTCGTACA ATCTGCTGTC AAGCTCGTGG AGGATGTCGG TCTGGATGGG TAAGAATTTA
AAGCTTCTGA TCCTAAACTG TCTGCTAACA TCCCCCATAG TCTCGATGTG GGTGTCAAAC
GTATAGCCAC CTGGTGGCTT CTTACTGACC CTATTGAAGA TTGATTACGA ATACCCCAAG
ACTCCGCGGG ACGCCGAAGC CTATGTCGCT CTGCTTCGCG AACTTCGACA GGGACTTGAG
CAACTCGCTC AAAGTAAGGG CAAGCCTCAG GGGCAGTATC AGCTTACTAT CGCTGCTCCT
TGTGGCTGGG AGCAGATGCA AGTCTTGAGA GTGAAAGAGA TGGACCAGGT GAGCCTCAAC
CGTATATCAT CCAGAGAAAG ATCAGTCGCT AATGTAGGAT AGGTGTTGGA TTTCTGGAAT
TTGATGGCAT ACGATGTAAG TCTCTCGATA ACATTTTCAT GACTATATCC TTATCGTGTT
TGATGTAGTT TGCGGGTTCG TGGGACTCTG TGGCAGGTCA CCAAGCCAAC CTTTACTCCG
ACAAGCCTGA CGGACAGTCT GTTGATCGAT CGGTGCGATT CTACCTTGAG GCTGGTGTGC
ATCCTACCAA GCTCGTCGTT GGTATGTCTG TTCTTTCTTA TACTATGTAC GAAACTGACG
AAATGCACAG GATTGCCCGT CTATGGACGT TCTTTCGCTA ACACCAAAGG AATCGGTTCG
CCTTTCTCAG GTACCGGAGA AGGGTCATGG GAGGCAGGTA TGTGGGACTA TAAAGCCCTG
CCTCAGCCCG GTGCCCAAGA AATCAATGAC CACCGTCTGG GTGCTTCATA CAGCTACGAT
CCGTAAGATC ACAAGATCCT CCTATCCCGC CAACACAACT GACCATTTCT TAGGGCAAAG
CGTTTCCTCA TTTCCTATGA TACCCAAGCA ATCGCCCATC AGAAAGCTTC ATACATCGCT
TATCACGGCC TGGGCGGCGC AATGTGGTGG GAGCTGGACT CTGACAAGCC TGAAGAAACA
GGACAGGCGT TGGTGAGGAC TGTGAGAGAC GCGCTGGGCC AACTAGAATG GAGGGAGAAT
GAGCTGGATT ATCCTGGTAG CAGTGAGTCT TTGAGATGTG CAATGGTGGA CAGAACATTG
CTGACTGGGA ATCAGAGTAC GATAATTTGA GGAGAAGGAT GGAGTAAGAT ATAATGAATG
ATACAACTAT AGCAAGGTGT TGAAATTTAG AAGAGCATTT TCTTGCGTAA TGTGCCCGTA
CTTT
 
Protein sequence
MHIEDEKTIR CVGVLAGNGL GPQIGKRRLL RFSCTVAQQS FTSLYICHLD PFLMAHSPKP 
AQYPSANKKP TFKPSLAFLA VLLLAAIAFL FTQIDFAFPS HPWKTVGKLL KGGELEMNNP
KRTVGYFVNW GIYDRKFFPQ NIPSQHLTHI NYAFGNVKKD SGEVVLSDSW ADVEIHYDGD
SWDEPGTNLY GCFKAIYLMK KQNRNLKVLL SIGGWSFSPN FAGIVHPKWR STFVQSAVKL
VEDVGLDGLD IDYEYPKTPR DAEAYVALLR ELRQGLEQLA QSKGKPQGQY QLTIAAPCGW
EQMQVLRVKE MDQVLDFWNL MAYDFAGSWD SVAGHQANLY SDKPDGQSVD RSVRFYLEAG
VHPTKLVVGL PVYGRSFANT KGIGSPFSGT GEGSWEAGMW DYKALPQPGA QEINDHRLGA
SYSYDPAKRF LISYDTQAIA HQKASYIAYH GLGGAMWWEL DSDKPEETGQ ALVRTVRDAL
GQLEWRENEL DYPGSKYDNL RRRME
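
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. The following is a minimal Python sketch of that step, together with a coordinate-based length check and a simple GC fraction. The function names (normalize, gc_fraction, span_length) are hypothetical helpers introduced here for illustration, and the 1-based inclusive coordinate convention is an assumption; it is consistent with this record, since End bp minus Start bp plus one equals the listed Gene Length of 2224 bp. How the GC content value shown on this page was computed is not stated, so the helper below is not claimed to reproduce it.

```python
def normalize(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


# First row of the gene sequence as printed in this record.
first_row = "ATGCACATAG AAGATGAAAA GACCATTCGT TGCGTTGGCG TTCTGGCCGG AAATGGTTTG"

seq = normalize(first_row)
print(len(seq))                     # 60 bases in one printed row
print(span_length(489604, 491827))  # 2224, matching the Gene Length field
```

Passing the full gene sequence block to normalize and then to gc_fraction gives a value that can be compared against the GC content field.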