Gene CND02070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCND02070 
Symbol 
ID3257100 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006686 
Strand
Start bp555857 
End bp557082 
Gene Length1226 bp 
Protein Length253 aa 
Translation table 
GC content49% 
IMG OID638256141 
Productconserved hypothetical protein 
Protein accessionXP_570455 
Protein GI58266598 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3836] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.413309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGACTTTTT GCATAGCTAC CAACACAAAC GCCAAAACAA TGGACAAGCG AACCTATCTC 
AGAAACGACC TTCTTGCTGG AAAACCCGGT ATTGGGATGT GGCTCACGTG AGTCAAGTCG
TGGAATATTC CTCGCAATAG CTAACTGATT GCGTGATAAT AGGTTGCCTG GGTCCGCATT
AGCCAAGACC GTAGCCACTA TCCCTGGCTT CAACTGGATC CTTATTGATG CCGAGCATGG
CCAGATTACA GACAGGGACT ACTTTGATGT GGGTATCTAT AGACGAGTAC TGCCTTGCTT
GCCTCGCGAA AGCTCACAGT ACGCTGCTGA ATAGCTCACC AATCATATCA CAACCGAAGG
CGTTTCACCC ATTATCCGTA TCCCCTCCGA TGAACCTTGG TTAATCAAGC GGGCCCTCGA
CTCAGGAGCC CACGGCTTGA TGATTCCTAT GTGTCACAAC CCTGTACGTT CCGGGACATT
CGTTTGCATG ACCACGAACT CGGCACTGAC AGCGACCTTG TAGGATGTCG CCAAAAAGGT
TGTCTCTTCC AGCAAGTACG CTGCTCGAGG CACTCGAGGA TGTGGTTCAC CTTTCACCCA
GATCATCTTC GGTGTTCCCG AGTCTCAATA TGAGGCAACT TGCAACGACA ACTTGATGGT
CATCGTCCAG ATCGAGTCGG CAGAGGGTGT GAAGAATGTG GAGTCCATTG CTGCTGTCCA
GGGAGTGGAT GTTCTTTTCG TTGGTGGGTT CAGTCTCATG GTACGCTTCT TGGATAGATT
TCAGATATTC ATCGTAATCT GAGTAGGCCC CTTTGACCTT GCCAAGTCAA TGGACATAGA
GTTCGGTGGC GAGGAACACG AGGCGGCTAT TGCTCGAACC CTTAAGGCAT GTAAAGATAA
CGGCAAGAAG GCAGCCATCT TTTGTAAGTC AGCAGTGGGG CGGACCCTGT CATGTAACGA
CTAACTGTTG ATTTAGGCAT GTCCGGCGCC CAGTCCAGGA AGCGTTTGCA GCAGGGCTTT
GATATGGTTT CAATCGCCAC CGATACAGAC TCTATAATTC GAGAGTTCTC CAGGCAGCTC
GAAGACATGA AGGCTTGATC CGATGTATGC ATATAATTAT ACAGATAACT CCTACTTTCC
TCATTTATGG TTAAGCGAGT TAATAAGTAA TGGGATCGAA CTCTTAAGCG AACCCTGTCC
GCTGCATGAA TTATTCAATT ATTTTG
 
Protein sequence
MDKRTYLRND LLAGKPGIGM WLTLPGSALA KTVATIPGFN WILIDAEHGQ ITDRDYFDLT 
NHITTEGVSP IIRIPSDEPW LIKRALDSGA HGLMIPMCHN PDVAKKVVSS SKYAARGTRG
CGSPFTQIIF GVPESQYEAT CNDNLMVIVQ IESAEGVKNV ESIAAVQGVD VLFVGPFDLA
KSMDIEFGGE EHEAAIARTL KACKDNGKKA AIFCMSGAQS RKRLQQGFDM VSIATDTDSI
IREFSRQLED MKA