Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNL04300 |
Symbol | |
ID | 3254790 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | + |
Start bp | 192228 |
End bp | 196821 |
Gene Length | 4594 bp |
Protein Length | 1175 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253901 |
Product | hypothetical protein |
Protein accession | XP_567981 |
Protein GI | 58261142 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5113] Ubiquitin fusion degradation protein 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.14655 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGTT GTTGCCGCCG CGCAAGTCAA TACGAAGCGT CACCCATAAA GTGCTTGAAC TGCTTACTTA TTACTCTCTC CCCTGTCTTC CTCCCCACCA TGGCTGACAA CTCAAACCTT TCAGACGCTG ACAAGGTGAG CACACTCAGA TCAGGACACC CTCTTACCCC CCACAGATCC GTCTCAAGCG TCTCGCCCGC CTCGGCACCT CCACCCCCAT CCCCTCGCAG ACCCAGCCAC AGCAACAGCC CTCGTCCAGC AGTACTCCCG AGCCTCATCA CCCACCCTCC GCCTCCTCAA GACTTCTTGC AAACCTTCCT CCCGCTGCTA GTCCCACATC CTCCTCCCCC GCTATTGTGT CCCCAAAACC CTCTGCTTCT ACCAAACCTG ACCCTCAAGC CGTCCAACCC AAGCTTTCCG TCACCCCCAG TCTTTCTCTC AAGAGACCAT CATCTGCTAG TACCCCCAGA GATGAACCGG TGGGTCCGAG AATTGTGCAT ACCAAGCCAA TCCAGCCACT TGTCAAGACG GAATACAAGG CATGGGAGAC CGAGAAGGTT GGGCAAGTAT TCGCTGTTAC ACTAAGTGTT AGTCCATTTG CAGACTGTGT TAGCCAACGA CTAACTATTT ACATAGAAGC AGAAGGCTCA GGAAACCGAT TGGTCCCTTT GTTGGCTCAA AGATCTAGAA CAGGAGCTCA ATGAAGAAAG TAAGTTGTGC CTGTCTTGAT CAAATACTAA TGTAATAGAC TATCCTTCTC CGCTCAGGAC CGATATAGAG CTAGCTGATA GACTTCTCAT TGCCCGTCTC TCCATGGACC CCACGCTTAT GGCTCAGTCC GATGATCCCG ATGCCCTTAC GATTCTTGCA GGACTACCTC AGAATGAGAC TGTCTTTGAA TACCTTGCTG GGTGTTGGAA GAGGTTGTAC CAAGCCAGTA GGGACGCCAA CCGGTATGCC TTCTCCGAAG ACGAAAAGAG TCAATGGGGC AAATCAATGG ATAAGATCAA GGGGCTGGTT GTCTCATATT GTGGTATGAC GATCGAGGAC CCTACCATGT TCCCTCAACC AGCAGAGTGA GTTGCTTGGA TCACTCGAGA CGCTAAAAGT AACGTTGGAT TCTTAGGAAA CCGCTTGGGC CGGCCGAATT TCTCCCGCTT CTCCTTTCAG TTCATCAACC GTCATCCGGT GATCTTCTCA TGTCTACCCC TTCCGTCCCG ACGCCGCTAT CTGGTCCCCT TCAGCCCAAC GACTTGTTAC CTTTCCTCCA GGACCTCGCT GCCGGTTTTG ACGATGATAC TTTAAAAGAT GTCATCAGCC CGACACTGAG TCTATTCTTC CAAGAGTGGT TCAAGATCAC TCCTACCCCG GATATCATGG GTGCGGAGTG GAGGAGATAC CTCGGTGCTA TGAACTTGTT GGTACAAGTC AAACATATAG CCGCGTTTGT AAGTTGTACT TCTATGACCT AATCATTAAG GCTAACCATG AGCAGCTGCC CACTTTACCT ATTTGGGTGG CGCCAAATGT GACAGCTCCA AAACTCGAAT GGCAATCGCT TTTAGGTCCC CTTACACGTC TGAGTGTTTT CCCTAGAGAA TTCGTGAGTC ATTAGCGTCA ATTTTGAATC GTGAAACTCT CTAAAACAAC CATGTGCATA GCCTGAAATT TGGAAGACTT ACTTTTCTAA CCCTACCGAA AGGAAGAAGG AAGACATTGA CGCGAACAAA AGCAACCTTC GATTTACCCT CGGAAGCCTA CATGTACGTA ACAAAGTCGC TAATTTTTGC TGACTTGATG ACAGTCATCC CTGTTTAACG TCTACAATGC CATCGTCCGT GCATCACCAG ATGCCAGAGA GGGTATTCTT GACTTTTTCA CACTTGCTTT GCGTCTCAAC GAGAAGCGTG CGGGAATGCG AGTTGACCCT CGAACCGTCT CCTCAGACGG CTACATGACG AATCTTCAGG TCGTGCTGCT GAAACTGTTT GAGCCGGTAA TGGACGCCAG ATTCTCCAAA ATTGACAAAG TTGATCCAGC ATATTATAAA TCCTCGAAAA GGATCGATAT CTCAGAAGAG ACAAAGATTA AGGGCGCCAA GGAAGAGGCG GATGAATATT TTGGAAGCTC CATGGATGGT ATGTTGTCAC GCCTTTCAAT ATAAATGCAT AGACTCAATT CCTTTTTAGT GGACACGAAA CCAAACTTCA TCTCCGACTT GTTTTTCCTT CTCAATAGCT ATCTCCATCT TGGTGTAGTC AAGACTATCT CAACTAGGAT TCGAGCTGAA AAAAATCTGA GTGAAATAGA AAAGGAACTG AAGAGAGTTG AGGCATCCAC AGGCGATTGG GCAAATGTAA GGACATGATT CTTACCCGGT TTTATTTTTG GTTAACATCT TTGCAGAATG CGACATTACA AGCACAGGGC GAAGCAACAA TTAAAAAGTT GAAGAGCGAC ATGTCTGTTC TTCATGCTTC TATTCACGCC TACGACACGC AGCTTTTGGA TCGAGATATG ATTAGAATGG TTGTCTCCTT CCTCAGCTTT GTTATGACAT GGCTCATCCG CCTTGTCGAC CCAAACCACC AGTACCCAGC GTCGCCTCTT AATTTACCGT TACCAAAAGA AGCTCCAATG GCTTTTAGGA TGTTGCCAGA ATTTTTCATC GAAAACATCG CAGAGTACTT TGAGTTCCTC GCCAAGTGAG TGACATTTAA CAAGCTCGGG AATAAACTGA CGCATACCCC AGATATGATC CCGATGCGCT TGATGATGTA GATAAGGACA TCTTTATTAC CTTTGCCATC ACTTTTCTTT CTCCCAACTA CGTTAATAAC CCTTTCCTCA AAGCTAAACT TGTCACGGTG TGTTACCTCT TATTGTCTCG TGTGAAACTT GACTAACAGC ACCCTAGATT ATCTCATATG GCCTGTATCC CATGGGCTAC TGGCGCCATG GTCCTCTCTT TGACAGACTT AGTATACTCA GCGTGGCCAC AGACCACTTG ATGCCAACTT TGATCCGTTT CTTCATCGAT GTGGAGATCA CTGGAGGTCA TACGCAATTC TGGGGTGAGT GAATATGTCT CTAGTCTAAT GTCTTGACTG ACTGGTGATG CAGACAAGTT CAACTTTAGG TTAGCAACGT CTTAATTTCA AGATGAATGG ATTTTGACTG ACTGTGCACA GGCGCGACAT TGGCCACATC TTCAAAGCCA TGTGGACAAA CCCTCTCCAC CGGGAAGCAT TTGTCAAGTC TAGACAGTAC GTCCCAGGTA TCGCTGTATT CATGGTGTTG GGTGTCTAAC TTTTTTGCGT TTCCCCCCCC TCCAGTGATG ACTTTGATCA GTTTATTCGC TTTGTCAACA TGCTCATGAG CGATACTACA TTCCACCTCG AAGAGTCCTT GACCGGTTTG GCCAAGATTG GACAAATAGA GTCTCAAAAA GCCAACACTG CTTCGTGGGA AGCATTGCCC CAGTCAGAGC GGGAGGATCT CGATGGTCAG TTGAGGCAGA CGGAGGGTAG TGTCCCATGG CACACGCAGA TGGGCTTATC AAATGTCAAA TTGATTCGAG ACTTTACGGC AACCACACGA GAGCCATTTG TTGCTCCCGA GATTGTTGAC CGTTTGGCAG CAGTAAGTTG GTTTACTGAT GTGGGCGTGG ATGATGGCTA ACATTCATTA TTTAGTCCTT GGATGAGAAC CTTACAGCTC TCGTTGGCCC AAAGATGTCG GATCTTAAAG TATCCAACCC CGACAAATAC TACTTCAAGC CTAAGGACCT TTTAGCAGCT ATTGCTCAAA TCTACCTCAA TCTTTCCGTC GAATCTGAAT TCATCCGTGC TGTCGCCAAT GATGGTAGAA GTTATTCAAA GGATCTGTTT ATGAAGTTCG CTAGGACTTT GAAGAACAGA GCCATCATGA CTGAAGGAGA GGTAGCCGAG GTTATCAGCT TTACGCAAAA AATAGAAGAC ATGAAAGCGA CAATATCAAT GGAGGACGAG AGGGAGATTC CGGACGAGTT TTTGGATCCA TTGTTGTCGA CTTGTAAGTC AAATCTCGCT TGGCCATATC CCATGCCTTT GCTGACGCTC GCCCTCCGTA GTAATGAAAG ATCCTGTCAT CTTACCAGTA TCTCGAGTGA CCATTGATAG AGGCACCATT CGAACTGTCT TGCTTTCAAA GGAAGTCGAC CCTTTCAACA ACGTGCCGTA AGTGATCATG TACCTCCTTA ATTGCGTATA GACTTGGAAG CTAATATCTT GCATCTAGGC TGAAGTACGA GGATTGCATT CCCGACACAG AGCTAAAGGC CAAGATCGAT GCGTGGCTGG CGGAGAGCAA CGCAAAACAA GCGGACTCGG TGATGGATGT TGATCAGCTG TAGGAGAATG TCGTTGTAGC AAGTATACGA TAGTACTACA TTGTAAGATA CTTAGAATTT GCATGGGAAC AGAAGAAGCA AGCAGTGTAG TACAATAGTG AACAGTGGCC GTCAATCATC TGTGCGATGG TGCCAGTTAT TGCAAATGCT TGCACTGTGC TATGTGCTAA GTATCTGGTC ACATGTATCG TGAATGAGTA TATG
|
Protein sequence | MSRCCRRASQ YEASPIKCLN CLLITLSPVF LPTMADNSNL SDADKIRLKR LARLGTSTPI PSQTQPQQQP SSSSTPEPHH PPSASSRLLA NLPPAASPTS SSPAIVSPKP SASTKPDPQA VQPKLSVTPS LSLKRPSSAS TPRDEPVGPR IVHTKPIQPL VKTEYKAWET EKVGQVFAVT LSKQKAQETD WSLCWLKDLE QELNEENYPS PLRTDIELAD RLLIARLSMD PTLMAQSDDP DALTILAGLP QNETVFEYLA GCWKRLYQAS RDANRYAFSE DEKSQWGKSM DKIKGLVVSY CGMTIEDPTM FPQPAEKPLG PAEFLPLLLS VHQPSSGDLL MSTPSVPTPL SGPLQPNDLL PFLQDLAAGF DDDTLKDVIS PTLSLFFQEW FKITPTPDIM GAEWRRYLGA MNLLVQVKHI AAFLPTLPIW VAPNVTAPKL EWQSLLGPLT RLSVFPREFP EIWKTYFSNP TERKKEDIDA NKSNLRFTLG SLHSSLFNVY NAIVRASPDA REGILDFFTL ALRLNEKRAG MRVDPRTVSS DGYMTNLQVV LLKLFEPVMD ARFSKIDKVD PAYYKSSKRI DISEETKIKG AKEEADEYFG SSMDVDTKPN FISDLFFLLN SYLHLGVVKT ISTRIRAEKN LSEIEKELKR VEASTGDWAN NATLQAQGEA TIKKLKSDMS VLHASIHAYD TQLLDRDMIR MVVSFLSFVM TWLIRLVDPN HQYPASPLNL PLPKEAPMAF RMLPEFFIEN IAEYFEFLAK YDPDALDDVD KDIFITFAIT FLSPNYVNNP FLKAKLVTII SYGLYPMGYW RHGPLFDRLS ILSVATDHLM PTLIRFFIDV EITGGHTQFW DKFNFRRDIG HIFKAMWTNP LHREAFVKSR HDDFDQFIRF VNMLMSDTTF HLEESLTGLA KIGQIESQKA NTASWEALPQ SEREDLDGQL RQTEGSVPWH TQMGLSNVKL IRDFTATTRE PFVAPEIVDR LAASLDENLT ALVGPKMSDL KVSNPDKYYF KPKDLLAAIA QIYLNLSVES EFIRAVANDG RSYSKDLFMK FARTLKNRAI MTEGEVAEVI SFTQKIEDMK ATISMEDERE IPDEFLDPLL STLMKDPVIL PVSRVTIDRG TIRTVLLSKE VDPFNNVPLK YEDCIPDTEL KAKIDAWLAE SNAKQADSVM DVDQL
|
| |