Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF02660 |
Symbol | |
ID | 3258101 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 759709 |
End bp | 762572 |
Gene Length | 2864 bp |
Protein Length | 796 aa |
Translation table | |
GC content | 51% |
IMG OID | 638257393 |
Product | ubiquitin carboxyl-terminal hydrolase 14, putative |
Protein accession | XP_571361 |
Protein GI | 58268410 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5207] Isopeptidase T |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.989129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTGCA CGCACCTCCA GCCATCCCTC CATTCCCTCA AACCACCCTC CCCCTCCCAG CAAGTACACC GGTGAGTGTA CTCCATCCCC ACGCAGCCAT CCCCTGACAA CCGCCAAACC AACAGCGAAG AGTGCACCCT CTGCTTCGAC GGACAGGTGA GCGCCACGCA CCCCAAACGC TGTATTCTGA CTCGTCAACT CACAAACAAG CACAGGATGA CCCGCGAGGT GTCGTCGTAT GTCTTTCTTG CTTCAATGGC GCCTGTCTTT CTCCAGATAG GCAGCATGCG CACCTCCATT ACCAGAGAAC AGGCCACCCT TTGGGTATGG TTATCAAGCG GTCCAGGAAA GAGATTCGTA AAAGAGTACG GTTCTTTAAT TTTTTTTTTA AATACGGGAC CATGACTAAC AATGGGTTTA GGACTCGAGT GAGCCGCCTA TGAAGAAACT CGCCATTTCC GCTCCCAAAG ACGAGGAAAT ATGGGATTAC CACACCGCCT TTGTGTGCTT GGCCTGCTCG TCTGCAGGTC AAGAAATCAC GGCGCAAGAA CCCAAGTTGG AGGAGATGAA AACGGGGATC ATGACCGCGC TTTCCTCTGC CCAGCAGTCT GAGATCAAGG CATGGGAAGA AGAGATTCTT CCTTGTGAAC ATACTTTGAC TCTGCAGCAA GAGTCTGTTG TTGTCCCTGG AAATGGCAAG TTTTTTATTT ATTTTTAGAA GCCATGTGGA ACGTGGGGGC TAATATCTTT TACAGTCCCA TCGCAATGTT CGTCCTGCGA CCTGACTTGC AACTTGTGGC TATGTCTCAC CTGCGGTCTT GCCAATTGCG GACGACAACA ATTCGGCGGT ATCGGGGGTA ACGGCCATGC ATTGAAACAT TTTCATGAGA CGGGGCACAT GTTGGGCGTC AAGCTGGGTA CCATCACCCC GGAAGGAACT GCTGGTAAAG GCTTTCTTCC CTCGTTTTGA CTCAACGTAC GCTAAGCAAG AAAAAATAGA CATTTACTGC TACGCCTGTG ACGATGCCAA GATCGACCCC GAATTGGCCA CTCACCTTTC AACATTCGGC ATCGAAGTGA TGAGTCAGAC CAAGACTGAA AAGTCCATGA CTGAACTCCA GCTCGAACAC AACTTGAAAT TCGATTTTTC CATGGTTGGC GATGACGGTA AAGAACTCGA ACCTGTCTTT GGCAAAGGTC TCACCGGCTT GAGAAACTTG GGAAACAGCT GTTACATGGC CTCTGTCCTC CAAACCCTCT TCTCCCTCCC CGCTTTCCGA TCAAGGTATA CCACACCCGA AGCGTTCAAC CACTTTCAAA CCTGTCCCAA CCTTCTTCCC GCTTCCTGTA TCGAATGTCA AATGCTCAAG CTCGGCGATG GTTTGCTGTC CGGCAGATAC TCTCATGTGG CAAGGCTACC CCCGCCCACC ACCCATTTCG AAGAACAAGA AGCTCCCAAA TTCCAACAAG GAATCAAGCC GACACAGTTT AAAGCACTTA TCGGTAGAGG CCACGAAGAG TTTTCGACTA TGCGCCAACA AGATTCGGAA GAGTTTTTAC AGCATCTTTT GACCCGGTTA AGAGACGAAG CGAAGCGTCA AGGTAGGGAC GAGGCCGCCG AGCCGACTGA GATTCTCAAG TTTGCAATGG AGCAGAGGTT GCAGTGTGGC AAATGCAAGC GTGTCGGTCT CCAGGTGGAA GGTGTAGACT TGGCGAGTTT GCCTGTTGAG GCAGTGGAGG CGGGGGTGAG TGAAGACGGC AAGAAGCTTT ACGAGGGGGT GGAACTTGAA ACGTGTTTAG AGCAATTATG TGCAGAGGAA GCAGTGGCCG AGTACCAATG TGATCATTGC AAGGAGAAGA CGACGGCGTA CAAGTAAGTT TAAAAAATAG CTATTAAAAA AAAAATCTCA GAGGGCCTTT GACGAGGGCT GACAATCCCA TAGATCGACC AAATTCAAGA CATTCCCAGA CTTGTTGGTG TTGCACATGA AAAAGTTTCA ACTTGTGAAT TGGCTCCCAA CCAAGCTCGA TATTCCTGTA TCTGTCCCCG ACATGCTTAC TTTGGACCAC CTCGTTGCCC ATGGACTTCA ACCGGGTGAA GAAGAGCTCA CAGTATCGTC ATCGTCTCCT TCTCTCCCAG AGTTCAACGC CACGGCCATG GCACAGCTTG AGGCTATGGG GTTCCCCACA GTGAGATGCC AAAAGGCATT ATTGGCAACT GGCAACAGTG ACGCTGAGAT TGCTATGGGA TGGCTGTTTG AGCATATGGA AGATCCCGGT GAGTTGGTCG TTTATTAGTG CATGGAAGGA TTCGCTCAAC AAAAGTACCA CTAGACATTG ACGCGCCGAT TGAGCTCGGG GGTTCAAAAG CCGCCAGCAA CGAACCTTCA CAAGAACAGA TTGGCATGAT TGCCGATATG GGATTCTCAC ATAACCAAGC TCGCAAAGCG TTGCGCGAAA GCGTAGGTTT TCTTTTTTTT ACAAGGTCGT TGCCAAGAGA CGGGGCATGG TACTGATAGG AATAGGACGG CAACCCCGAG CGGGCCATTG AGTGGTTGTT TAGTAACCCT GGCGATCCAG GAGAAGACGC TGCCCCAGCT GGTAGTGCCG AACCTTCCAT CGGAGGTTCA TCTTCTCTTC CTGCCAAGTA CCGACTGAAA GCATTCATTT CGCACAAGGG CCCGTCTGTG CATTCTGGCC ACTATGTGGC TACTATCCGG CAGCCGCAAG CGGGGATCGA GGGAGAGAGG GAAGAAGAGG GAGAATGGGT GTTGTACAAT GATGAAAAGG TCGTGCGAGC TGCGTCTGGC GGTGGGGAGG AGATGAGGGG GCTCGCGTAT TTGTATGTAT ATGAACGGGT GTAG
|
Protein sequence | MSCTHLQPSL HSLKPPSPSQ QVHREECTLC FDGQDDPRGV VVCLSCFNGA CLSPDRQHAH LHYQRTGHPL GMVIKRSRKE IRKRDSSEPP MKKLAISAPK DEEIWDYHTA FVCLACSSAG QEITAQEPKL EEMKTGIMTA LSSAQQSEIK AWEEEILPCE HTLTLQQESV VVPGNVPSQC SSCDLTCNLW LCLTCGLANC GRQQFGGIGG NGHALKHFHE TGHMLGVKLG TITPEGTADI YCYACDDAKI DPELATHLST FGIEVMSQTK TEKSMTELQL EHNLKFDFSM VGDDGKELEP VFGKGLTGLR NLGNSCYMAS VLQTLFSLPA FRSRYTTPEA FNHFQTCPNL LPASCIECQM LKLGDGLLSG RYSHVARLPP PTTHFEEQEA PKFQQGIKPT QFKALIGRGH EEFSTMRQQD SEEFLQHLLT RLRDEAKRQG RDEAAEPTEI LKFAMEQRLQ CGKCKRVGLQ VEGVDLASLP VEAVEAGVSE DGKKLYEGVE LETCLEQLCA EEAVAEYQCD HCKEKTTAYK STKFKTFPDL LVLHMKKFQL VNWLPTKLDI PVSVPDMLTL DHLVAHGLQP GEEELTVSSS SPSLPEFNAT AMAQLEAMGF PTVRCQKALL ATGNSDAEIA MGWLFEHMED PDIDAPIELG GSKAASNEPS QEQIGMIADM GFSHNQARKA LRESDGNPER AIEWLFSNPG DPGEDAAPAG SAEPSIGGSS SLPAKYRLKA FISHKGPSVH SGHYVATIRQ PQAGIEGERE EEGEWVLYND EKVVRAASGG GEEMRGLAYL YVYERV
|
| |