Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNN00720 |
Symbol | |
ID | 3255527 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | + |
Start bp | 233092 |
End bp | 234195 |
Gene Length | 1104 bp |
Protein Length | 322 aa |
Translation table | |
GC content | 48% |
IMG OID | 638254489 |
Product | SUMO activating enzyme, putative |
Protein accession | XP_568612 |
Protein GI | 58262404 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATCTT CCACAGTTCT TATTCTCTCC CTTCGATCTC TCGCCCATGA AACTATCAAG AATCTCGTAC TTGCTGGTAT CGGTCGCTTA ATCGTCGCCG ACTCTGATGT TGTCACAGAA GAAGATTTAG GATCAGGGTT TCTTTTCCGA GAAGAAGACA ACGCAGTTGG AAAACTTAGA ACGGATGCTG CTCTGGAACA GATTCAATCA CTGAACCCAC TTGTGACACT GAGCAAAATA GGTATGGACA GTTTTGAAGG AGAAGAGGAC AAAGTTGCGG AAATATTAAA AAAGGAGGCT GTGGATGTCG TTGTTACGTG TGACTTAAGC GTGAAAGAAA ATGTAAGCGG GGAAGAATAT TTGTACGACG TATGTTGACA AGGCTGTAGG AGAGGATCGA TGCGGCTGCC AGAAAAGCCA GTTCATTGTT CTACGCTGCA GGAACGTACG GTTTCACAGG ATACGTTTTT GCGGACTTGG GCGAGTCATA TGAATACGTT GTCAAGTATG TCGAGTGCCA TATAATCCCT AGTATAATGG TTGCTGACCA AATGCAGCTC AATAGACGGA TTATCAAAGA AAGTGCTCTC CTACCCTTCT TTTTCAACTG TGCTTGACAG GTCGAACTGG GCTAAACCCG GTGGTAGTCC CTTCAAGGGA TTATCCAGAA ATGCGACAAG GTCGGCAGCA CCTGCTACTA TCCTTGGCAT CACTGGTGAA GCAATCCAGA GTCTAAACTC GCATTACAAA TGCTGACCAG AAACAGCCCT TTGGGAATAT GAATCCCAGA ACGGCCACCT CCCCGCTGAG GAATCTTCCC TTTCTGCTCT CACTTCCTCC GCCGAATCCA TCCGCACCGC TCTAGGAGTC AATTCTACCG CCGTCCCGTC CGTCGACTCT TCTTTACTGA CCCATCTCGC TTCTCACGCC ACTCACTTCT TCCCTCCTAC GCTCGCTATT CTCGGGGGTC TGCTTGCACA AGATGTCTTG CGAGCACTGA GTCGGAAAGA TAAGCCTGTT GCCAACTTGT TGGCTGTCGA CAGTATGAGT GGTGTTGGCA CCGTTGGACG ATGGAGCATG ATGGACGCGA AGGACACTCA ATAG
|
Protein sequence | MRSSTVLILS LRSLAHETIK NLVLAGIGRL IVADSDVVTE EDLGSGFLFR EEDNAVGKLR TDAALEQIQS LNPLVTLSKI GMDSFEGEED KVAEILKKEA VDVVVTCDLS VKENERIDAA ARKASSLFYA AGTYGFTGYV FADLGESYEY VVNSIDGLSK KVLSYPSFST VLDRSNWAKP GGSPFKGLSR NATRSAAPAT ILGITGEAIQ TLWEYESQNG HLPAEESSLS ALTSSAESIR TALGVNSTAV PSVDSSLLTH LASHATHFFP PTLAILGGLL AQDVLRALSR KDKPVANLLA VDSMSGVGTV GRWSMMDAKD TQ
|
| |