Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC02750 |
Symbol | |
ID | 3256304 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 792864 |
End bp | 794012 |
Gene Length | 1149 bp |
Protein Length | 263 aa |
Translation table | |
GC content | 46% |
IMG OID | 638255497 |
Product | conserved hypothetical protein |
Protein accession | XP_569572 |
Protein GI | 58264832 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3836] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.617702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCTA AGACTTTCCT CAAGAACGCT CTTGCTCAGA AAAAGCCAGG GCTCGGTTTC TGGTGCACGT GAGTGAACAC TTCCTTCGAA TGAGAGTAAA AAGCTAGACT ATCAGATGTT GACCTGTTTT GCTGCAGTCT TCCTGGAGCC GCGACCGTAG CTACAGCCCT TTCGGCTGGT GGCTTTAACT GGACTCTGAT TGATGCCGAG CACGGCATGA TTACGGACAA GGATTATTTC GAGGTGCGCT TCCAGATTCC TAGATTGAGT TAAAAGCTCT CAAAGTCAAA CTGGTAGCTA ACGTACACCA GCTTGTTACC ACAGTCACTT CTCTTGGAGC CTCACCAATT ATTCGAATTC CCTGGAACGA AGAATGGATG ATCAAAAGAG CTCTAGATGC TGGAGCCCAA GGAGTCATGA CTCCAATGTG TCACTCTGCC GTGGGTGTTC ATTTTTAGCT CATGCGTTTT CAAAGTTTTG ACGTTTTCTC ATCCGTTGTT ATCGTCACAG GAGGATGCTA AGAGAATTGT TTCTTACTCT AAATACCCTC CAACCGGTTC TCGAGGCTAC GGCCCGATGT TTTGTCCCCC GGTCTTCGGA TGCAAAGGGT CCGACTATGA TGCAGGGGCA GACAAAAACC TCCTAGTTAT CGTGCAGATT GAATCCAGAA AAGGAGTCGA GAACGTCGAG GAAATTGCCA AGGTAGAAGG CCTGGACTGC TTATTCATCG GTGCGTAGGA CTGCACATAT TTGATCCCAG GTTGATTTAC TATTGTAGGT CCATTTGATC TGTCAAAGCA AATGAACGTC CCCTTCGGTG GAGAGGAACA TGAAGCCGCG ATTGAGAAGA CTCTCCAAGC AGCGCACAGT GCTGGCAAGA TCGCCGCCAT CTTCTGTGAG TATTCTACTT GATGAAACTG CATGATATAC ATGAATAAAT ATGCTTATAT ATTTCTAGGT TCCAATGGTG AAATTGCCCG CAAACGCCTT GCTCAAGGCT TTGACATGGT ATCAATAGCT GTTGACAGTT CTTGCCTAGC AGCGGAAATG GAAAGACAAT TAAGCTTGGT GACGGGTGAA GCAGGTAAAG GTGACAGGTC TTATTCGTAG TTTAACCATG CTCTTCTTGA ATGCACATGT AATTTTGCC
|
Protein sequence | MESKTFLKNA LAQKKPGLGF WCTLPGAATV ATALSAGGFN WTLIDAEHGM ITDKDYFELV TTVTSLGASP IIRIPWNEEW MIKRALDAGA QGVMTPMCHS AEDAKRIVSY SKYPPTGSRG YGPMFCPPVF GCKGSDYDAG ADKNLLVIVQ IESRKGVENV EEIAKVEGLD CLFIGPFDLS KQMNVPFGGE EHEAAIEKTL QAAHSAGKIA AIFCSNGEIA RKRLAQGFDM VSIAVDSSCL AAEMERQLSL VTGEAGKGDR SYS
|
| |