Gene CNN00720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00720 
Symbol 
ID3255527 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp233092 
End bp234195 
Gene Length1104 bp 
Protein Length322 aa 
Translation table 
GC content48% 
IMG OID638254489 
ProductSUMO activating enzyme, putative 
Protein accessionXP_568612 
Protein GI58262404 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATCTT CCACAGTTCT TATTCTCTCC CTTCGATCTC TCGCCCATGA AACTATCAAG 
AATCTCGTAC TTGCTGGTAT CGGTCGCTTA ATCGTCGCCG ACTCTGATGT TGTCACAGAA
GAAGATTTAG GATCAGGGTT TCTTTTCCGA GAAGAAGACA ACGCAGTTGG AAAACTTAGA
ACGGATGCTG CTCTGGAACA GATTCAATCA CTGAACCCAC TTGTGACACT GAGCAAAATA
GGTATGGACA GTTTTGAAGG AGAAGAGGAC AAAGTTGCGG AAATATTAAA AAAGGAGGCT
GTGGATGTCG TTGTTACGTG TGACTTAAGC GTGAAAGAAA ATGTAAGCGG GGAAGAATAT
TTGTACGACG TATGTTGACA AGGCTGTAGG AGAGGATCGA TGCGGCTGCC AGAAAAGCCA
GTTCATTGTT CTACGCTGCA GGAACGTACG GTTTCACAGG ATACGTTTTT GCGGACTTGG
GCGAGTCATA TGAATACGTT GTCAAGTATG TCGAGTGCCA TATAATCCCT AGTATAATGG
TTGCTGACCA AATGCAGCTC AATAGACGGA TTATCAAAGA AAGTGCTCTC CTACCCTTCT
TTTTCAACTG TGCTTGACAG GTCGAACTGG GCTAAACCCG GTGGTAGTCC CTTCAAGGGA
TTATCCAGAA ATGCGACAAG GTCGGCAGCA CCTGCTACTA TCCTTGGCAT CACTGGTGAA
GCAATCCAGA GTCTAAACTC GCATTACAAA TGCTGACCAG AAACAGCCCT TTGGGAATAT
GAATCCCAGA ACGGCCACCT CCCCGCTGAG GAATCTTCCC TTTCTGCTCT CACTTCCTCC
GCCGAATCCA TCCGCACCGC TCTAGGAGTC AATTCTACCG CCGTCCCGTC CGTCGACTCT
TCTTTACTGA CCCATCTCGC TTCTCACGCC ACTCACTTCT TCCCTCCTAC GCTCGCTATT
CTCGGGGGTC TGCTTGCACA AGATGTCTTG CGAGCACTGA GTCGGAAAGA TAAGCCTGTT
GCCAACTTGT TGGCTGTCGA CAGTATGAGT GGTGTTGGCA CCGTTGGACG ATGGAGCATG
ATGGACGCGA AGGACACTCA ATAG
 
Protein sequence
MRSSTVLILS LRSLAHETIK NLVLAGIGRL IVADSDVVTE EDLGSGFLFR EEDNAVGKLR 
TDAALEQIQS LNPLVTLSKI GMDSFEGEED KVAEILKKEA VDVVVTCDLS VKENERIDAA
ARKASSLFYA AGTYGFTGYV FADLGESYEY VVNSIDGLSK KVLSYPSFST VLDRSNWAKP
GGSPFKGLSR NATRSAAPAT ILGITGEAIQ TLWEYESQNG HLPAEESSLS ALTSSAESIR
TALGVNSTAV PSVDSSLLTH LASHATHFFP PTLAILGGLL AQDVLRALSR KDKPVANLLA
VDSMSGVGTV GRWSMMDAKD TQ