Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA01230 |
Symbol | |
ID | 3253709 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 333448 |
End bp | 337375 |
Gene Length | 3928 bp |
Protein Length | 1015 aa |
Translation table | |
GC content | 48% |
IMG OID | 638252456 |
Product | ubiquitin activating enzyme, putative |
Protein accession | XP_566574 |
Protein GI | 58258323 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | [TIGR01408] ubiquitin-activating enzyme E1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGTATTCTC TTGCATTCCA TCTCAAATAC ATTCATATAA ATTCCTCTTC TTTCTTACAC CGTGTACTAG ATACGACCTT TGGAGGAAGA ATCCCTATTG TCACTTTGAA CGAAAACAAG GAATACGAAG CAAAGCAGCA ATGGCAGCTC CTATGCAAGT TGACGAAGCC GTCCCTGGTG ACTCTAGTAA GCTCAAGCCA TATACGTCTT CTGATGCATG CTAACGTGAC TTCCCTCTTT TAGTCGATGG TAAGCAATAG TTTGGTATAC AATGCATAGA CAAGTGCTGA TAGTCTGGTG CGACTGATCA GAGGGCTTGT ACTCTCGTCA ACTGTGGGTT ATGATCTCTT CTGTACCAAT AGTGTTGAAT TAACTTTCGG ATTAGCTATG TGTTGGGCCA CGAAGGTGAG GGCATCGGAT AAGGATGAAC TGAGATTGAA CCTGATGTTA TCTGTAGCGA TGAAGAAGAT GGCCACTTCC AACGTTCTTA TTGTTGGCAT GAAGGGCCTT GGTGTTGAGA TTGGTAAGCT ATCTTGCTTG CGACTTGCAT TGCAGCGAGC AATCATTGAA GATACAAAAG CTTATATATT TGATAGCCAA GAACGTTGCC TTGGCAGGTG TCAAGACTGT CACCATCTAC GACCCTTCTG CAGTAGAAAT CGCGGATCTC GGTACACAAT TCTTCCTTCG TGAAGAGGAC ATTGGTCGAC CTCGAGCTGA AGTCACCGCT CCCCGACTTG CCGAACTCAA CTCTTACGTC CCTATCAAGA TTCTCCCTGG AGCAGGCGAA ATCACACCGG AAATGATCGA GCCTTACCAG ATTGTTGTAC TCACCAACGC CACTGTCAGG AAGCAAGTAG AGATCGACGA GTACTGTAGG CAAAAGGGGA TCTATTTTAT CGCAGCGGAT GTGAGGGGTC TATTCGGCAG CGTGTTCAAC GACTTTGGCA AGGATTTTGC TTGTGTCGAC CCTACAGGAG AGAACCCTCT AAGCGGAATG ATTGTCGAGA TTGACGAGGT AGGTCGCTAT TTTCTGATCC CAAAAAACAT TTGCTCATCC AGTCTCTACA GGATGAGGAT GCTATTGTTA CCTGTCTTGA CGAAACGCGA CATGGACTTG AAGACGGCGA CTTTGTCACT TTCTCTGAGA TTAAGGGTAT GGAGGGTTTG AACGGCTGTG AGCCTAGAAA GATCTCTGTC AAGGGTGAGC GTTGTCGGGT TACTTGATGA CAAAAAAAAA GCTAACATTT AGATAGGTCC TTACACTTTC TCAATCGGCG ACACCCGTGG GTTGGGCAAG TACAAATCTG GCGGTCTCTT CACCCAAGTG AAGATGCCCA AGATTCTTCA ATTTGTTAGT GATGTGCGCA TAAATTAAAT GACATTGCTA ATCCGATGTA CAGAAAACCC TTAAAGAGTC CCTCACTAAC CCCGAGTTCT TCATCACCGA CTTTGCCAAA TGGGACCGAC CCGCTGCCTT GCACGTTGGT TTCCAGGCTC TTTCTGCATT TTACGAAAAG GCCGGTCATC TTCCTCGACC TCGTAACGCC GCCGACGCCG AGCAAGTCAT TTCTCTCGCT AAGGAGATCC ACTCTGCTGC TGGAGGCGAA GACGTCCTTG ACGAGAAGAT TCTCACCGAG CTTTCTTACC AAGCTACTGG AGACCTTTCC CCTATGGTTG CCGTCATTGG TGGTTTCGTC GCTCAAGAAG TCCTCAAGGC TTGTTCCGCC AAATTCCACC CCATGCAACA AAACATGTAC TTTGACTCAC TCGAGTCTCT CCCTGCTACC CTTCCTTCTG AGGCTGACGT CCAGCCTCTT GGATCTCGAT ACGACGGGCA AATCGCCGTC TTCGGTAAGG CCTTTCAGGA AAAGATTTCC AACACTCGCG AGTTCCTTGT CGGTTCGGGT GCTATCGGTT GTGAGATGTT GAAGAATTGG AGCATGATGG GCCTTGCCAC TGGTCCCAAC GGTATCATTC ATGTTACCGA CCTGGACACC ATTGAAAAGA GTAACTTGAA CCGACAGTTC TTGTTCAGAG CCAAGGATGT GGGCAAGTTC AAGGCTGAAA GTGCGGCTGC TGCCGTTGCG GACATGAACC CCAATTTGAA GGGCAAGATC ATTGCTCACG ATGACAGGGT CGGCCCCGAG ACTGAAAGTG AGTATATCCA TGTGAATGGC TTGGGCAGTG CTGACCAACG GCAGATGTCT ATGGTGATGA ATTCTTCGCC AATCTTGATG GCGTCACCAA TGCCCTCGAT AACGTGTCAG CGCGTCAGTA CATGGACCGA CGATGTGTGT TCTACTGCAA GCCTCTCCTT GAGTCTGGTA CTCTTGGTAC CAAGGCCAAT ACTCAGGTCG TTGTTCCTCA CCTCACCGAG TCATATTCAT CTTCCCAGGA CCCTCCTGAG AAGTCTATTC CCTCTTGTAC CGTCAAGAAC TTCCCGAATG CCATTGAGCA CACCATCCAA TGGGCCCGAG AAGCGTTCGA TTCTTTCTTC GTCAATCCTC CTACTACTGT CAACCTTTAT CTTTCTCAAC CAGATTTTGT CGAGACCACC CTCAAGTCTT CTGGCCAGCA CCACGAGCAT CTCAAACAGA TTGAGAAATA CCTTGTGAAG GAGAGGCCCA TGTCTTTCGA GGAGTGTATC ATGTGGGCCA GGTTACAATA TGAGAACAAC TATGTAAATG AGATCAAGCA GTTGTTGTTC AACTTGCCCA AGGACCAAGT TAACGCCAAC GGCACTCCCT TCTGGTCCGG ACCCAAGAGG GCTCCCACCG CTCTTGCCTT CAACATTGAC GATGTAGGTC CTGGAGCTTA TTTGAGCGGA TAATAGATCT AATTCTAGTA TAGCCTCTTG ATATGGAGTA CCTCATCGCC GCTGCCAACC TTCACGCTTT CAACTACGGT CTCAAGGGCG AGCGAGACCC TGCTTTATTC CGAAAGGTGG TCGAGTCTAT GAACGTCCCC GAGTTCACTC CTAAGAGCGG TGTCAAGATC CAGATCAATG AAAATGAACC TGTTGAAAAC AACGGTAACG ATGGCAAGTT CTCATATCAA TGGCAAAAGA TTGCGCTAAC TGGTGAGATA TAGATGAAGA TGACATTGAA GCTATTGTTT CTTCTCTTCC CCCTCCTGCT TCTCTCGCTG GTTTCCGACT TCAACCTGTT GACTTCGAGA AGGATGATGA CTCTAATCAC CACATTGACT TCATCACCGC CGCTTCCAAC TTGCGTGCCC GAAACTATGG CATCACCCTT GCCGATAGGC ATAAGACCAA ACTCATCGCA GGAAAGATTA TCCCTGCCAT CGCTACAACC ACTGCTCTCG CTGTCGGTTT GGTTTGCTTG GAACTTTACA AGTTGATTGA CGGCAAGAAC AAGCTTGAGG ACTACAAAAA CGGTTTCGTG AATTTGGCGT TGCCCTTCTT CGGTTTCTCT GAGCCTATTG CGGCGGCGAA GCAAAAGTAT GGCGAGACTG AGTGGACTTT GTGGGACAGG TTTGAGATCG AGGGCAATCC TACGTTGCAG CAATTCCTGG AATGGTTCCA GGAGAACCAC AAGTTGGAAG TACAAATGGT TTCTCAGGGT GTTTCCATGT TGTGGTCCTC TTTCGTCCCC TCTAAGAAGG CGAGTAGACG CTCCGACTGA TGAACTCGTA ATGACATAGC TAATATTAAC GCTTAGGCTG CCGACCGAAT GAGGATGCGT ATGAGCGAGC TTGTCGAACA CGTCGGCAAG AAGCCAATCC CTCCTCATGT CAAGAACTTA TTGGTGGAGG TTATGGTTAA TGATGAGAAT GACGAGGATG TCGAGGTGCC CTATGTGTTG GTGCACATCT AAAGTGCCAA TAGAAGGGGA AGATGCGAAG AAATAAGAAG TTCCATGTAG AGCATTGTCT TTAACAAAAC CTAGCAGTGA TGCAGTGT
|
Protein sequence | MAAPMQVDEA VPGDSIDEGL YSRQLYVLGH EAMKKMATSN VLIVGMKGLG VEIAKNVALA GVKTVTIYDP SAVEIADLGT QFFLREEDIG RPRAEVTAPR LAELNSYVPI KILPGAGEIT PEMIEPYQIV VLTNATVRKQ VEIDEYCRQK GIYFIAADVR GLFGSVFNDF GKDFACVDPT GENPLSGMIV EIDEDEDAIV TCLDETRHGL EDGDFVTFSE IKGMEGLNGC EPRKISVKGP YTFSIGDTRG LGKYKSGGLF TQVKMPKILQ FKTLKESLTN PEFFITDFAK WDRPAALHVG FQALSAFYEK AGHLPRPRNA ADAEQVISLA KEIHSAAGGE DVLDEKILTE LSYQATGDLS PMVAVIGGFV AQEVLKACSA KFHPMQQNMY FDSLESLPAT LPSEADVQPL GSRYDGQIAV FGKAFQEKIS NTREFLVGSG AIGCEMLKNW SMMGLATGPN GIIHVTDLDT IEKSNLNRQF LFRAKDVGKF KAESAAAAVA DMNPNLKGKI IAHDDRVGPE TENVYGDEFF ANLDGVTNAL DNVSARQYMD RRCVFYCKPL LESGTLGTKA NTQVVVPHLT ESYSSSQDPP EKSIPSCTVK NFPNAIEHTI QWAREAFDSF FVNPPTTVNL YLSQPDFVET TLKSSGQHHE HLKQIEKYLV KERPMSFEEC IMWARLQYEN NYVNEIKQLL FNLPKDQVNA NGTPFWSGPK RAPTALAFNI DDPLDMEYLI AAANLHAFNY GLKGERDPAL FRKVVESMNV PEFTPKSGVK IQINENEPVE NNGNDDEDDI EAIVSSLPPP ASLAGFRLQP VDFEKDDDSN HHIDFITAAS NLRARNYGIT LADRHKTKLI AGKIIPAIAT TTALAVGLVC LELYKLIDGK NKLEDYKNGF VNLALPFFGF SEPIAAAKQK YGETEWTLWD RFEIEGNPTL QQFLEWFQEN HKLEVQMVSQ GVSMLWSSFV PSKKAADRMR MRMSELVEHV GKKPIPPHVK NLLVEVMVND ENDEDVEVPY VLVHI
|
| |