Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH01120 |
Symbol | |
ID | 3259198 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 865949 |
End bp | 868972 |
Gene Length | 3024 bp |
Protein Length | 917 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258371 |
Product | Kex protein, putative |
Protein accession | XP_572303 |
Protein GI | 58270294 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4935] Regulatory P domain of the subtilisin-like proprotein convertases and other proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0397465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCATCTGTA ACTTCTGCAA AATGCGCACC TTATTATCCC TTTGGGGGAT TCTTCTGGCG CTCATAGTGC CTCCATCGCT CGCCCTGCAA AGACCTCAAC CAAGGTCCTA CGATACGCAC GCTTATTACG CCTTGGAGCT CGACCCATCA ATATCACCAG CAGCCGCGCT GCAACTCTCA AAATCTTTAG GCGTCGAGCT GGTGGAACGT ATAGGAGAGT TGGACGGACA TTGGCTTGTC AGGACTGAAG GGTGGACACC AGAGCATGCG TCAATAACAA AAAGAAGTGT TTCTCATGAT CCAATATTGA AGCGATGGGA AGCATTGCCT TCAAGTCTTG GCAAGAAATC CCTCACGCCC TTGTCACTCA AGCAACGTGC CAAGCGACAT AAATCATATT CTCCTCGTTC CCGTCATTCA AGAGACGATA GAACAGAGCT TTTATATGCC CAAAATGAGC TGCATTTGGC AGACCCTATG CTCGATCAGC AATGGCATCT CATCAATACC CAGATGAAGG ACATCGAGCT CAATGTCACT GGCCTTTGGG GGAGGGGTAT TACTGGTGAG GGTGTTCACG TGGTGATCAT AGACGATGGA CTGGATGTAG AGAGCAAAGA TTTGAAGGAT AATTTCGTGC GTCATCGCTC GCTGAGCTCA TTTAGTTAAA CTCACTGGCT CTATGCAGTT CGCTGAAGGG TCTTACGACT TCAACGACCA CACTGAGCTT CCGATTCCTC GCCTCAAAGA CGACCAACAT GGTACTAGAT GTGCTGGCGA GATTGCTGCT GTTCCCAACG ACGTGTGTGG AGTCGGCGTA GCATATGATA GCAAAATCGC CGGTGTCCGT ATCCTTTCGG CTCCAATATC CGATGCCGAC GAAGCAGCTG CTCTCAATTA TGCCTATCAA CTCAACGACA TTTATTCTTG CTCATGGGGT CCTCCCGACG ATGGGAGGTC AATGGAAGCC CCTGATGGTT TGATCCTCAA GGCGATGGTG AACGGTGTTC AAAAGGGACG AGACGGTAAA GGATCGGTTT TCGTGTTCGC TGCTGGCAAC GGTGGTGGAT CAGACGATCA GTGTAATTTT GACGGATATA CGAACTCTAT TTTCTCTGTC ACTGTTGGAG CGGTAGATAG AAAAGGATTA CATCCTTACT ATTCAGAGAT GTGTGCAGCC ATGATGGTGG TTGCGCCTTC TTCAGGCAGT GGAGATCACA TTGTGGGTCA TTGTGTCTGT CCTTTGCGCT CAATTGACTT TCTCTGTAGC ATACAACAGA CGTTGGAAAG GATAAGTGCT CACACAGCCA TGGCGGAACT TCTGCGGCTG CACCTCTCGC TGTTGGAGTC TTCGCTCTCG CCCTTTCCGT GCGCCCCGAC CTTACTTGGC GAGACATTCA ACATCTTGCC GTGCGGCATG CTGTTTTCTT CAACCCTGAT GATCCAGCTT GGGAGCTAAC TGCTGCTGGA AGACATTTCA GCTATAAATG CAAGTTCCTT TCTACATAAC GCACATGTGC TGATATCATT CTGCAGATGG TTATGGAAAG CTTGACGCAG GTTTGTTCGT TGAAGCTGCT GAAAAATGGC AACTCGTCAA GCCCCAAACG TGGTATGACT CTCCATCGGT TTATCTTCCT ACCACTTCGC CTGCCGATGT CACCAGACGT CAAGACGAAG CTGCCGACGG CCCCACAAGC TCTGACGAGG AGACCTCCAA CCCGCCGCCT GTGGTCGAGC CCAGTGGATC TTTCATTACA GAAGATGGTG TTATCTCCAC GTATGAAGTC ACTCAGTCTA TGCTTTTTGA TGCCAACTTT GAGAGACTGG AGCATGTCAC CGTTAGGGTT TGGATAGACC ATCAGAGGAG GGGTGATGTT GAGGTGGAGC TTACCAGTCC CAATGGGGTG GTTAGTGTCT TGTGCAGGCA GAGGAGGTTT GACAATGCAG ATAGTGGTTT CCCTGGCTGG AAATTTATGT CTTTGAAGCA TTGGTATGTT TTAATGCGTT TTTGAGTGAG ACCAAATGGA TCTGATGAAA TGTAGGGATG AGAACCCGGT AGGTACATGG ACCATCAAAG TCAAAGACCA AGTCAACCCC GACAAAACCG GCCGTTTCGT CGCATGGTCA CTTCAGTTGT GGGGAGAATC TGTTGATCCT GCCCTTGCCA AACTCTGGGC ACCCGCAGAA GAAGGTCAAC CGGATGAAGA GCAAACAGGT TCCAACCCCA GTACTACTGT CAGCCAAAAG CCCAAACCTA CGGCACTCCT TCCTGGGGAT CATGGTGAGG CTTCTGGTGA AGCGACCCAG CCAGGACTCG GATCTGCTAC AGCCCATCCT CAACCCACAA GCACGACTGG TGACGCTGGA AATGTCGCGG AGCCAACCGG CCCCACAGAT GCCGATGCCG ACGAAGGGTT CTTCAGCGGT ATTTCCAACC TCGCTTCATC CTCTACATGG CTTGCAGGCG CAGGTGCCAT TATCATCCTC TCTGGTGCTG CTATTGGTGC CTTTTTCTTC ATCCGTGCCC GACGACAGAA ACGCAACCTC TTTGGTCTCT CCAACAACGG TCAAGGAGCT CGCGGTGCTT ACGAGCCTGT TGATGACGTG CAGATGAGTC TCCTTGAGAG AGGCAGGAGG AAGTTTGGCA AGAGCAAGAG TGAGAGTCAA GGAACGAAAG ATTTGTATGA TGCCTTTGGA GATGGTCCGA GTGATGAAGA GGAGGAGGAT TTGGATGAGA GGACTGCATT GAGGTATCAT GATGGTTTCT TAGAGGATGA CGAGCCGAAT GAGGTAGGGC CCAAGACAGA GTACAAGGAT GAGCCTGAGT CTGAGCCTGA GACCTTTAAA GATGGAGAGG AAACTGTGGG AACAAAAGAT AAGGGTAAGG GGAAAGGTCC AAGTGAGGGA GAGAGTGGTA GCGGGAGTTC CAGCAGTTGG CAAGATGCCG CCGACGAAGA AGCGCGTGTG TAAGATGGGG GTATCAACGA GTCATGGATG ATGTACTGTC ATATGTATAT ATCG
|
Protein sequence | MRTLLSLWGI LLALIVPPSL ALQRPQPRSY DTHAYYALEL DPSISPAAAL QLSKSLGVEL VERIGELDGH WLVRTEGWTP EHASITKRSV SHDPILKRWE ALPSSLGKKS LTPLSLKQRA KRHKSYSPRS RHSRDDRTEL LYAQNELHLA DPMLDQQWHL INTQMKDIEL NVTGLWGRGI TGEGVHVVII DDGLDVESKD LKDNFFAEGS YDFNDHTELP IPRLKDDQHG TRCAGEIAAV PNDVCGVGVA YDSKIAGVRI LSAPISDADE AAALNYAYQL NDIYSCSWGP PDDGRSMEAP DGLILKAMVN GVQKGRDGKG SVFVFAAGNG GGSDDQCNFD GYTNSIFSVT VGAVDRKGLH PYYSEMCAAM MVVAPSSGSG DHIHTTDVGK DKCSHSHGGT SAAAPLAVGV FALALSVRPD LTWRDIQHLA VRHAVFFNPD DPAWELTAAG RHFSYKYGYG KLDAGLFVEA AEKWQLVKPQ TWYDSPSVYL PTTSPADVTR RQDEAADGPT SSDEETSNPP PVVEPSGSFI TEDGVISTYE VTQSMLFDAN FERLEHVTVR VWIDHQRRGD VEVELTSPNG VVSVLCRQRR FDNADSGFPG WKFMSLKHWD ENPVGTWTIK VKDQVNPDKT GRFVAWSLQL WGESVDPALA KLWAPAEEGQ PDEEQTGSNP STTVSQKPKP TALLPGDHGE ASGEATQPGL GSATAHPQPT STTGDAGNVA EPTGPTDADA DEGFFSGISN LASSSTWLAG AGAIIILSGA AIGAFFFIRA RRQKRNLFGL SNNGQGARGA YEPVDDVQMS LLERGRRKFG KSKSESQGTK DLYDAFGDGP SDEEEEDLDE RTALRYHDGF LEDDEPNEVG PKTEYKDEPE SEPETFKDGE ETVGTKDKGK GKGPSEGESG SGSSSSWQDA ADEEARV
|
| |