Gene CNA01230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA01230 
Symbol 
ID3253709 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp333448 
End bp337375 
Gene Length3928 bp 
Protein Length1015 aa 
Translation table 
GC content48% 
IMG OID638252456 
Productubiquitin activating enzyme, putative 
Protein accessionXP_566574 
Protein GI58258323 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID[TIGR01408] ubiquitin-activating enzyme E1 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGTATTCTC TTGCATTCCA TCTCAAATAC ATTCATATAA ATTCCTCTTC TTTCTTACAC 
CGTGTACTAG ATACGACCTT TGGAGGAAGA ATCCCTATTG TCACTTTGAA CGAAAACAAG
GAATACGAAG CAAAGCAGCA ATGGCAGCTC CTATGCAAGT TGACGAAGCC GTCCCTGGTG
ACTCTAGTAA GCTCAAGCCA TATACGTCTT CTGATGCATG CTAACGTGAC TTCCCTCTTT
TAGTCGATGG TAAGCAATAG TTTGGTATAC AATGCATAGA CAAGTGCTGA TAGTCTGGTG
CGACTGATCA GAGGGCTTGT ACTCTCGTCA ACTGTGGGTT ATGATCTCTT CTGTACCAAT
AGTGTTGAAT TAACTTTCGG ATTAGCTATG TGTTGGGCCA CGAAGGTGAG GGCATCGGAT
AAGGATGAAC TGAGATTGAA CCTGATGTTA TCTGTAGCGA TGAAGAAGAT GGCCACTTCC
AACGTTCTTA TTGTTGGCAT GAAGGGCCTT GGTGTTGAGA TTGGTAAGCT ATCTTGCTTG
CGACTTGCAT TGCAGCGAGC AATCATTGAA GATACAAAAG CTTATATATT TGATAGCCAA
GAACGTTGCC TTGGCAGGTG TCAAGACTGT CACCATCTAC GACCCTTCTG CAGTAGAAAT
CGCGGATCTC GGTACACAAT TCTTCCTTCG TGAAGAGGAC ATTGGTCGAC CTCGAGCTGA
AGTCACCGCT CCCCGACTTG CCGAACTCAA CTCTTACGTC CCTATCAAGA TTCTCCCTGG
AGCAGGCGAA ATCACACCGG AAATGATCGA GCCTTACCAG ATTGTTGTAC TCACCAACGC
CACTGTCAGG AAGCAAGTAG AGATCGACGA GTACTGTAGG CAAAAGGGGA TCTATTTTAT
CGCAGCGGAT GTGAGGGGTC TATTCGGCAG CGTGTTCAAC GACTTTGGCA AGGATTTTGC
TTGTGTCGAC CCTACAGGAG AGAACCCTCT AAGCGGAATG ATTGTCGAGA TTGACGAGGT
AGGTCGCTAT TTTCTGATCC CAAAAAACAT TTGCTCATCC AGTCTCTACA GGATGAGGAT
GCTATTGTTA CCTGTCTTGA CGAAACGCGA CATGGACTTG AAGACGGCGA CTTTGTCACT
TTCTCTGAGA TTAAGGGTAT GGAGGGTTTG AACGGCTGTG AGCCTAGAAA GATCTCTGTC
AAGGGTGAGC GTTGTCGGGT TACTTGATGA CAAAAAAAAA GCTAACATTT AGATAGGTCC
TTACACTTTC TCAATCGGCG ACACCCGTGG GTTGGGCAAG TACAAATCTG GCGGTCTCTT
CACCCAAGTG AAGATGCCCA AGATTCTTCA ATTTGTTAGT GATGTGCGCA TAAATTAAAT
GACATTGCTA ATCCGATGTA CAGAAAACCC TTAAAGAGTC CCTCACTAAC CCCGAGTTCT
TCATCACCGA CTTTGCCAAA TGGGACCGAC CCGCTGCCTT GCACGTTGGT TTCCAGGCTC
TTTCTGCATT TTACGAAAAG GCCGGTCATC TTCCTCGACC TCGTAACGCC GCCGACGCCG
AGCAAGTCAT TTCTCTCGCT AAGGAGATCC ACTCTGCTGC TGGAGGCGAA GACGTCCTTG
ACGAGAAGAT TCTCACCGAG CTTTCTTACC AAGCTACTGG AGACCTTTCC CCTATGGTTG
CCGTCATTGG TGGTTTCGTC GCTCAAGAAG TCCTCAAGGC TTGTTCCGCC AAATTCCACC
CCATGCAACA AAACATGTAC TTTGACTCAC TCGAGTCTCT CCCTGCTACC CTTCCTTCTG
AGGCTGACGT CCAGCCTCTT GGATCTCGAT ACGACGGGCA AATCGCCGTC TTCGGTAAGG
CCTTTCAGGA AAAGATTTCC AACACTCGCG AGTTCCTTGT CGGTTCGGGT GCTATCGGTT
GTGAGATGTT GAAGAATTGG AGCATGATGG GCCTTGCCAC TGGTCCCAAC GGTATCATTC
ATGTTACCGA CCTGGACACC ATTGAAAAGA GTAACTTGAA CCGACAGTTC TTGTTCAGAG
CCAAGGATGT GGGCAAGTTC AAGGCTGAAA GTGCGGCTGC TGCCGTTGCG GACATGAACC
CCAATTTGAA GGGCAAGATC ATTGCTCACG ATGACAGGGT CGGCCCCGAG ACTGAAAGTG
AGTATATCCA TGTGAATGGC TTGGGCAGTG CTGACCAACG GCAGATGTCT ATGGTGATGA
ATTCTTCGCC AATCTTGATG GCGTCACCAA TGCCCTCGAT AACGTGTCAG CGCGTCAGTA
CATGGACCGA CGATGTGTGT TCTACTGCAA GCCTCTCCTT GAGTCTGGTA CTCTTGGTAC
CAAGGCCAAT ACTCAGGTCG TTGTTCCTCA CCTCACCGAG TCATATTCAT CTTCCCAGGA
CCCTCCTGAG AAGTCTATTC CCTCTTGTAC CGTCAAGAAC TTCCCGAATG CCATTGAGCA
CACCATCCAA TGGGCCCGAG AAGCGTTCGA TTCTTTCTTC GTCAATCCTC CTACTACTGT
CAACCTTTAT CTTTCTCAAC CAGATTTTGT CGAGACCACC CTCAAGTCTT CTGGCCAGCA
CCACGAGCAT CTCAAACAGA TTGAGAAATA CCTTGTGAAG GAGAGGCCCA TGTCTTTCGA
GGAGTGTATC ATGTGGGCCA GGTTACAATA TGAGAACAAC TATGTAAATG AGATCAAGCA
GTTGTTGTTC AACTTGCCCA AGGACCAAGT TAACGCCAAC GGCACTCCCT TCTGGTCCGG
ACCCAAGAGG GCTCCCACCG CTCTTGCCTT CAACATTGAC GATGTAGGTC CTGGAGCTTA
TTTGAGCGGA TAATAGATCT AATTCTAGTA TAGCCTCTTG ATATGGAGTA CCTCATCGCC
GCTGCCAACC TTCACGCTTT CAACTACGGT CTCAAGGGCG AGCGAGACCC TGCTTTATTC
CGAAAGGTGG TCGAGTCTAT GAACGTCCCC GAGTTCACTC CTAAGAGCGG TGTCAAGATC
CAGATCAATG AAAATGAACC TGTTGAAAAC AACGGTAACG ATGGCAAGTT CTCATATCAA
TGGCAAAAGA TTGCGCTAAC TGGTGAGATA TAGATGAAGA TGACATTGAA GCTATTGTTT
CTTCTCTTCC CCCTCCTGCT TCTCTCGCTG GTTTCCGACT TCAACCTGTT GACTTCGAGA
AGGATGATGA CTCTAATCAC CACATTGACT TCATCACCGC CGCTTCCAAC TTGCGTGCCC
GAAACTATGG CATCACCCTT GCCGATAGGC ATAAGACCAA ACTCATCGCA GGAAAGATTA
TCCCTGCCAT CGCTACAACC ACTGCTCTCG CTGTCGGTTT GGTTTGCTTG GAACTTTACA
AGTTGATTGA CGGCAAGAAC AAGCTTGAGG ACTACAAAAA CGGTTTCGTG AATTTGGCGT
TGCCCTTCTT CGGTTTCTCT GAGCCTATTG CGGCGGCGAA GCAAAAGTAT GGCGAGACTG
AGTGGACTTT GTGGGACAGG TTTGAGATCG AGGGCAATCC TACGTTGCAG CAATTCCTGG
AATGGTTCCA GGAGAACCAC AAGTTGGAAG TACAAATGGT TTCTCAGGGT GTTTCCATGT
TGTGGTCCTC TTTCGTCCCC TCTAAGAAGG CGAGTAGACG CTCCGACTGA TGAACTCGTA
ATGACATAGC TAATATTAAC GCTTAGGCTG CCGACCGAAT GAGGATGCGT ATGAGCGAGC
TTGTCGAACA CGTCGGCAAG AAGCCAATCC CTCCTCATGT CAAGAACTTA TTGGTGGAGG
TTATGGTTAA TGATGAGAAT GACGAGGATG TCGAGGTGCC CTATGTGTTG GTGCACATCT
AAAGTGCCAA TAGAAGGGGA AGATGCGAAG AAATAAGAAG TTCCATGTAG AGCATTGTCT
TTAACAAAAC CTAGCAGTGA TGCAGTGT
 
Protein sequence
MAAPMQVDEA VPGDSIDEGL YSRQLYVLGH EAMKKMATSN VLIVGMKGLG VEIAKNVALA 
GVKTVTIYDP SAVEIADLGT QFFLREEDIG RPRAEVTAPR LAELNSYVPI KILPGAGEIT
PEMIEPYQIV VLTNATVRKQ VEIDEYCRQK GIYFIAADVR GLFGSVFNDF GKDFACVDPT
GENPLSGMIV EIDEDEDAIV TCLDETRHGL EDGDFVTFSE IKGMEGLNGC EPRKISVKGP
YTFSIGDTRG LGKYKSGGLF TQVKMPKILQ FKTLKESLTN PEFFITDFAK WDRPAALHVG
FQALSAFYEK AGHLPRPRNA ADAEQVISLA KEIHSAAGGE DVLDEKILTE LSYQATGDLS
PMVAVIGGFV AQEVLKACSA KFHPMQQNMY FDSLESLPAT LPSEADVQPL GSRYDGQIAV
FGKAFQEKIS NTREFLVGSG AIGCEMLKNW SMMGLATGPN GIIHVTDLDT IEKSNLNRQF
LFRAKDVGKF KAESAAAAVA DMNPNLKGKI IAHDDRVGPE TENVYGDEFF ANLDGVTNAL
DNVSARQYMD RRCVFYCKPL LESGTLGTKA NTQVVVPHLT ESYSSSQDPP EKSIPSCTVK
NFPNAIEHTI QWAREAFDSF FVNPPTTVNL YLSQPDFVET TLKSSGQHHE HLKQIEKYLV
KERPMSFEEC IMWARLQYEN NYVNEIKQLL FNLPKDQVNA NGTPFWSGPK RAPTALAFNI
DDPLDMEYLI AAANLHAFNY GLKGERDPAL FRKVVESMNV PEFTPKSGVK IQINENEPVE
NNGNDDEDDI EAIVSSLPPP ASLAGFRLQP VDFEKDDDSN HHIDFITAAS NLRARNYGIT
LADRHKTKLI AGKIIPAIAT TTALAVGLVC LELYKLIDGK NKLEDYKNGF VNLALPFFGF
SEPIAAAKQK YGETEWTLWD RFEIEGNPTL QQFLEWFQEN HKLEVQMVSQ GVSMLWSSFV
PSKKAADRMR MRMSELVEHV GKKPIPPHVK NLLVEVMVND ENDEDVEVPY VLVHI