Gene Information |
Locus tag | CND02070 |
Symbol | |
ID | 3257100 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | - |
Start bp | 555857 |
End bp | 557082 |
Gene Length | 1226 bp |
Protein Length | 253 aa |
Translation table | |
GC content | 49% |
IMG OID | 638256141 |
Product | conserved hypothetical protein |
Protein accession | XP_570455 |
Protein GI | 58266598 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3836] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.413309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CCGACTTTTT GCATAGCTAC CAACACAAAC GCCAAAACAA TGGACAAGCG AACCTATCTC AGAAACGACC TTCTTGCTGG AAAACCCGGT ATTGGGATGT GGCTCACGTG AGTCAAGTCG TGGAATATTC CTCGCAATAG CTAACTGATT GCGTGATAAT AGGTTGCCTG GGTCCGCATT AGCCAAGACC GTAGCCACTA TCCCTGGCTT CAACTGGATC CTTATTGATG CCGAGCATGG CCAGATTACA GACAGGGACT ACTTTGATGT GGGTATCTAT AGACGAGTAC TGCCTTGCTT GCCTCGCGAA AGCTCACAGT ACGCTGCTGA ATAGCTCACC AATCATATCA CAACCGAAGG CGTTTCACCC ATTATCCGTA TCCCCTCCGA TGAACCTTGG TTAATCAAGC GGGCCCTCGA CTCAGGAGCC CACGGCTTGA TGATTCCTAT GTGTCACAAC CCTGTACGTT CCGGGACATT CGTTTGCATG ACCACGAACT CGGCACTGAC AGCGACCTTG TAGGATGTCG CCAAAAAGGT TGTCTCTTCC AGCAAGTACG CTGCTCGAGG CACTCGAGGA TGTGGTTCAC CTTTCACCCA GATCATCTTC GGTGTTCCCG AGTCTCAATA TGAGGCAACT TGCAACGACA ACTTGATGGT CATCGTCCAG ATCGAGTCGG CAGAGGGTGT GAAGAATGTG GAGTCCATTG CTGCTGTCCA GGGAGTGGAT GTTCTTTTCG TTGGTGGGTT CAGTCTCATG GTACGCTTCT TGGATAGATT TCAGATATTC ATCGTAATCT GAGTAGGCCC CTTTGACCTT GCCAAGTCAA TGGACATAGA GTTCGGTGGC GAGGAACACG AGGCGGCTAT TGCTCGAACC CTTAAGGCAT GTAAAGATAA CGGCAAGAAG GCAGCCATCT TTTGTAAGTC AGCAGTGGGG CGGACCCTGT CATGTAACGA CTAACTGTTG ATTTAGGCAT GTCCGGCGCC CAGTCCAGGA AGCGTTTGCA GCAGGGCTTT GATATGGTTT CAATCGCCAC CGATACAGAC TCTATAATTC GAGAGTTCTC CAGGCAGCTC GAAGACATGA AGGCTTGATC CGATGTATGC ATATAATTAT ACAGATAACT CCTACTTTCC TCATTTATGG TTAAGCGAGT TAATAAGTAA TGGGATCGAA CTCTTAAGCG AACCCTGTCC GCTGCATGAA TTATTCAATT ATTTTG
Protein sequence | MDKRTYLRND LLAGKPGIGM WLTLPGSALA KTVATIPGFN WILIDAEHGQ ITDRDYFDLT NHITTEGVSP IIRIPSDEPW LIKRALDSGA HGLMIPMCHN PDVAKKVVSS SKYAARGTRG CGSPFTQIIF GVPESQYEAT CNDNLMVIVQ IESAEGVKNV ESIAAVQGVD VLFVGPFDLA KSMDIEFGGE EHEAAIARTL KACKDNGKKA AIFCMSGAQS RKRLQQGFDM VSIATDTDSI IREFSRQLED MKA
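
The coordinate and composition fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not the database's own method: it assumes Start bp and End bp are 1-based and inclusive (which agrees with the record, since 557082 - 555857 + 1 = 1226), and the helper names `gene_length` and `gc_percent` are hypothetical. The GC helper accepts a sequence written in space-separated blocks, as in the Gene sequence field.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """GC content (%) of a nucleotide sequence; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(555857, 557082))  # 1226

# Toy sequence (not from the record) to show the block-separated input format.
print(gc_percent("ATGC GCTA"))  # 50.0
```

Passing the full Gene sequence field to `gc_percent` should give a value comparable to the reported GC content, though the exact rounding used by the database is not stated in the record.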