Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB05760 |
Symbol | |
ID | 3255806 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1614698 |
End bp | 1615848 |
Gene Length | 1151 bp |
Protein Length | 259 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255218 |
Product | glucose 1-dehydrogenase, putative |
Protein accession | XP_569317 |
Protein GI | 58264322 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0117369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTTCG GTCTACTCCA GGGTAAAGTC GTCGCCATCA CAGGCTGCTC AACAGGTATC GGGCGAGCCA TTGCCATCGG TGAACCATCC TGTTTTACAC TCCTAAAAAC CTACGAACTG ACGCAGACAT CCAGGCGCTG CAAAAAATGG GGCCAATGTC GTTCTGCACC ATCTTGGGGA TTCGACCGCT AGCGATATCG CTCAAGTCCA AGAAGAATGT AAACAAGCTG GCGCAAAGAC TGTGGTGGTA CCAGGCGACA TTGCTGAAGC TAAAACTGCC AACGAGGTAA ATTGAGACCG TCTTCTTATA TCTCTTCCCA AAGTGATTCG CATAACCGCT GACGACCCTT AATTATTTCC CCAGATCGTC TCAGCCGCCG TCTCCTCCTT CTCCCGCATC GACGTCCTCA TTTCCAATGC CGGTATCTGC CCCTTCCACT CCTTTCTTGA CCTTCCTCAC CCTCTATGGA AACGCGTGCA AGATGTCAAC CTAAACGGTT CTTTCTACGT TGTTCAAGCG GTTGCTAATC AGATGGCAAA GCAAGAGCCT AAAGGAGGGA GTATCGTTGC GGTGAGCAGT ATCTCGGCTT TGATGGGCGG TGGCGAGCAG TGTCATTACA CACCGACAAA AGCCGGAATC AAGAGTTTGA TGGAGAGTTG TGCGATTGCG TTGGGGCCAA TGGGGATTAG GTGTAACTCT GTTCTTCCTG GTACGTCCTA TTTTCTCGTT CTTGCCCTCG TCCTCTTTCC ATTTCCCATC ACACTCCTGC TTATAGTCCC TTGTGGATGA GAATGTTGGC TGAATCACAT CAAATCTGAA ATAGGGACTA TCGAAACGAA CATCAACAAA GAAGACCTTT CCAACCCCGA GAAACGAGCA GACCAAATTC GACGTGTCCC CCTTGGCAGA TTGGGTAAAC CGGAAGATCT CGTAGGACCT ACTCTATTCT TTGCGAGTGA TTTGAGCAAC TATTGCACCG GAGCGAGCGT GCTGGTAGAT GGCGGAATGG CGATTTCTCT TCAGTAAAGT AGAGAAGAGC CTGTGAGCTG TTTATAACTA GTACTTAGTA CCGTGATGTG CGTACGCATG CTAGACCATC GCGATTTTCT AATTTCATAC TCAGTCCTAT ATGTAAACAA CAAGTCTGTG A
|
Protein sequence | MSFGLLQGKV VAITGCSTGI GRAIAIGAAK NGANVVLHHL GDSTASDIAQ VQEECKQAGA KTVVVPGDIA EAKTANEIVS AAVSSFSRID VLISNAGICP FHSFLDLPHP LWKRVQDVNL NGSFYVVQAV ANQMAKQEPK GGSIVAVSSI SALMGGGEQC HYTPTKAGIK SLMESCAIAL GPMGIRCNSV LPGTIETNIN KEDLSNPEKR ADQIRRVPLG RLGKPEDLVG PTLFFASDLS NYCTGASVLV DGGMAISLQ
|
| |