Gene CNL06640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL06640 
Symbol 
ID3255027 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp848237 
End bp849683 
Gene Length1447 bp 
Protein Length376 aa 
Translation table 
GC content48% 
IMG OID638254141 
Productphospho-2-dehydro-3-deoxyheptonate aldolase, putative 
Protein accessionXP_568187 
Protein GI58261554 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0587923 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGACTTTACA TTTATTACAT CTGCATATTC AGAACCTCTA TTCGAGAAAT GCCCTCCCCT 
ACAAGAGTAT CTATCCGAGA CGTAAGTCAT TCTGTTGCTT GATTTTTGAT TGGCAAAAAG
CTCATTTGAA GACAGGCCAT GGAACTCCTC GACGACCGAA GGGTCAAGAT TGTCAGGCCT
CTTATCCCGT ACGTCTCTGA TCAATGTATT CGTTTGGAAG CTTATGCTCC CTCAGCCCTC
AGATTTTACA TGAAGAGCTT CCCCTCTCAT TGAGAGGCGC CCAAACTGTG CTTGACGGCC
GTCGACAAGT TGAGGCTGTC ATCAAAGGCG ATGATGACCG ATTGCTTGTC GTTGTCGGCC
CCTGTTCCGT GCACGATCCC GAACAGGCCA TCACCTATGC CAAAGCTCTC AAAGAGTACG
CCGACAAGGC TGCTGAAGAT CTTGTGATTG TTATGCGAGT CTACTTTGAA AAGTATGTCT
ATCCAATATG GAAAAAGCAA GTATTATTGA GCTGACATTC AGGTAGACCT CGAACAACTG
TTGGCTGGAA GGGATTGATC AACGACCCGG ACATGAATGG TTCTTACCAA ATTAACCGAG
GTCTTAAGAT TGCACGAAAG TTGCTGTTGG ACATTACCGA AATTGGTTTG CCCGCTGCCG
GCGAGTTCCT TGGTTTGTCT TCACCATCTC TTCCCTTGTT TGAACTCCCA AGCTTACAGC
TTCACAGATG TCATTTCTCC CCAGTACCTC GCCGACCTTT TCGCATGGGG CGCCATCGGA
GCCCGAACCA CGGAATCCCA AGTCCACCGA GAACTCGCGT CTGCACTCTC CATGTCCGTC
GGTTTCAAGA ACGGTACTGA CGGCTCTATC GGGATTGCAA TTGATGCGAT CAAAGCAGCC
GGATCTGGAC ACACTTTCTT GTCTGTTACC AAGCAAGGAT TGTCCGCGAT TGTTGAGACG
GAAGGAAACA GTTCTACACA TGTCATCTTG AGAGGAAGCA GCAAGGGACC TAATTATGGA
GCGGATGATG TGGCCGCTTG TGCGGAAAAA TTGAACAAAA GCGGATTGCC TGCCAAGCTT
ATGGTACGTT TATTAATCGT CTCTTCTGGT TTAAAAATAC ATGGCTAAAA CAAGCAAACA
GATTGACTGC TCTCATGGTA ACTCCTCCAA ACAACACCTC AACCAAATTA AGGTCGGTGC
CGACATTGCC TCCCAACTTT CCTCTGGACC CACATCCAAC GCCATTGTCG GTGTCATGAT
TGAGTCCAAC ATCTTTGAAG GTCGACAAAA TGTTCCTGCC GAGGGACCTT CTGGATTGAA
GTACGGTATC TCTGTGACGG ATGCTTGTAT TTCGATGGAG CAGACTATTC CTTTGTTGGA
TGAGTTGAGG AAGGGTGTGC AAGCGAGGAG AGAAGCTGTC AAGGCTAAGA GAGAGGGACA
GCAGTAA
 
Protein sequence
MPSPTRVSIR DAMELLDDRR VKIVRPLIPP QILHEELPLS LRGAQTVLDG RRQVEAVIKG 
DDDRLLVVVG PCSVHDPEQA ITYAKALKEY ADKAAEDLVI VMRVYFEKPR TTVGWKGLIN
DPDMNGSYQI NRGLKIARKL LLDITEIGLP AAGEFLDVIS PQYLADLFAW GAIGARTTES
QVHRELASAL SMSVGFKNGT DGSIGIAIDA IKAAGSGHTF LSVTKQGLSA IVETEGNSST
HVILRGSSKG PNYGADDVAA CAEKLNKSGL PAKLMIDCSH GNSSKQHLNQ IKVGADIASQ
LSSGPTSNAI VGVMIESNIF EGRQNVPAEG PSGLKYGISV TDACISMEQT IPLLDELRKG
VQARREAVKA KREGQQ