Gene CNC02750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC02750 
Symbol 
ID3256304 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp792864 
End bp794012 
Gene Length1149 bp 
Protein Length263 aa 
Translation table 
GC content46% 
IMG OID638255497 
Productconserved hypothetical protein 
Protein accessionXP_569572 
Protein GI58264832 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3836] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.617702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCTA AGACTTTCCT CAAGAACGCT CTTGCTCAGA AAAAGCCAGG GCTCGGTTTC 
TGGTGCACGT GAGTGAACAC TTCCTTCGAA TGAGAGTAAA AAGCTAGACT ATCAGATGTT
GACCTGTTTT GCTGCAGTCT TCCTGGAGCC GCGACCGTAG CTACAGCCCT TTCGGCTGGT
GGCTTTAACT GGACTCTGAT TGATGCCGAG CACGGCATGA TTACGGACAA GGATTATTTC
GAGGTGCGCT TCCAGATTCC TAGATTGAGT TAAAAGCTCT CAAAGTCAAA CTGGTAGCTA
ACGTACACCA GCTTGTTACC ACAGTCACTT CTCTTGGAGC CTCACCAATT ATTCGAATTC
CCTGGAACGA AGAATGGATG ATCAAAAGAG CTCTAGATGC TGGAGCCCAA GGAGTCATGA
CTCCAATGTG TCACTCTGCC GTGGGTGTTC ATTTTTAGCT CATGCGTTTT CAAAGTTTTG
ACGTTTTCTC ATCCGTTGTT ATCGTCACAG GAGGATGCTA AGAGAATTGT TTCTTACTCT
AAATACCCTC CAACCGGTTC TCGAGGCTAC GGCCCGATGT TTTGTCCCCC GGTCTTCGGA
TGCAAAGGGT CCGACTATGA TGCAGGGGCA GACAAAAACC TCCTAGTTAT CGTGCAGATT
GAATCCAGAA AAGGAGTCGA GAACGTCGAG GAAATTGCCA AGGTAGAAGG CCTGGACTGC
TTATTCATCG GTGCGTAGGA CTGCACATAT TTGATCCCAG GTTGATTTAC TATTGTAGGT
CCATTTGATC TGTCAAAGCA AATGAACGTC CCCTTCGGTG GAGAGGAACA TGAAGCCGCG
ATTGAGAAGA CTCTCCAAGC AGCGCACAGT GCTGGCAAGA TCGCCGCCAT CTTCTGTGAG
TATTCTACTT GATGAAACTG CATGATATAC ATGAATAAAT ATGCTTATAT ATTTCTAGGT
TCCAATGGTG AAATTGCCCG CAAACGCCTT GCTCAAGGCT TTGACATGGT ATCAATAGCT
GTTGACAGTT CTTGCCTAGC AGCGGAAATG GAAAGACAAT TAAGCTTGGT GACGGGTGAA
GCAGGTAAAG GTGACAGGTC TTATTCGTAG TTTAACCATG CTCTTCTTGA ATGCACATGT
AATTTTGCC
 
Protein sequence
MESKTFLKNA LAQKKPGLGF WCTLPGAATV ATALSAGGFN WTLIDAEHGM ITDKDYFELV 
TTVTSLGASP IIRIPWNEEW MIKRALDAGA QGVMTPMCHS AEDAKRIVSY SKYPPTGSRG
YGPMFCPPVF GCKGSDYDAG ADKNLLVIVQ IESRKGVENV EEIAKVEGLD CLFIGPFDLS
KQMNVPFGGE EHEAAIEKTL QAAHSAGKIA AIFCSNGEIA RKRLAQGFDM VSIAVDSSCL
AAEMERQLSL VTGEAGKGDR SYS