Gene CNH01120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH01120 
Symbol 
ID3259198 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp865949 
End bp868972 
Gene Length3024 bp 
Protein Length917 aa 
Translation table 
GC content50% 
IMG OID638258371 
ProductKex protein, putative 
Protein accessionXP_572303 
Protein GI58270294 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4935] Regulatory P domain of the subtilisin-like proprotein convertases and other proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0397465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCATCTGTA ACTTCTGCAA AATGCGCACC TTATTATCCC TTTGGGGGAT TCTTCTGGCG 
CTCATAGTGC CTCCATCGCT CGCCCTGCAA AGACCTCAAC CAAGGTCCTA CGATACGCAC
GCTTATTACG CCTTGGAGCT CGACCCATCA ATATCACCAG CAGCCGCGCT GCAACTCTCA
AAATCTTTAG GCGTCGAGCT GGTGGAACGT ATAGGAGAGT TGGACGGACA TTGGCTTGTC
AGGACTGAAG GGTGGACACC AGAGCATGCG TCAATAACAA AAAGAAGTGT TTCTCATGAT
CCAATATTGA AGCGATGGGA AGCATTGCCT TCAAGTCTTG GCAAGAAATC CCTCACGCCC
TTGTCACTCA AGCAACGTGC CAAGCGACAT AAATCATATT CTCCTCGTTC CCGTCATTCA
AGAGACGATA GAACAGAGCT TTTATATGCC CAAAATGAGC TGCATTTGGC AGACCCTATG
CTCGATCAGC AATGGCATCT CATCAATACC CAGATGAAGG ACATCGAGCT CAATGTCACT
GGCCTTTGGG GGAGGGGTAT TACTGGTGAG GGTGTTCACG TGGTGATCAT AGACGATGGA
CTGGATGTAG AGAGCAAAGA TTTGAAGGAT AATTTCGTGC GTCATCGCTC GCTGAGCTCA
TTTAGTTAAA CTCACTGGCT CTATGCAGTT CGCTGAAGGG TCTTACGACT TCAACGACCA
CACTGAGCTT CCGATTCCTC GCCTCAAAGA CGACCAACAT GGTACTAGAT GTGCTGGCGA
GATTGCTGCT GTTCCCAACG ACGTGTGTGG AGTCGGCGTA GCATATGATA GCAAAATCGC
CGGTGTCCGT ATCCTTTCGG CTCCAATATC CGATGCCGAC GAAGCAGCTG CTCTCAATTA
TGCCTATCAA CTCAACGACA TTTATTCTTG CTCATGGGGT CCTCCCGACG ATGGGAGGTC
AATGGAAGCC CCTGATGGTT TGATCCTCAA GGCGATGGTG AACGGTGTTC AAAAGGGACG
AGACGGTAAA GGATCGGTTT TCGTGTTCGC TGCTGGCAAC GGTGGTGGAT CAGACGATCA
GTGTAATTTT GACGGATATA CGAACTCTAT TTTCTCTGTC ACTGTTGGAG CGGTAGATAG
AAAAGGATTA CATCCTTACT ATTCAGAGAT GTGTGCAGCC ATGATGGTGG TTGCGCCTTC
TTCAGGCAGT GGAGATCACA TTGTGGGTCA TTGTGTCTGT CCTTTGCGCT CAATTGACTT
TCTCTGTAGC ATACAACAGA CGTTGGAAAG GATAAGTGCT CACACAGCCA TGGCGGAACT
TCTGCGGCTG CACCTCTCGC TGTTGGAGTC TTCGCTCTCG CCCTTTCCGT GCGCCCCGAC
CTTACTTGGC GAGACATTCA ACATCTTGCC GTGCGGCATG CTGTTTTCTT CAACCCTGAT
GATCCAGCTT GGGAGCTAAC TGCTGCTGGA AGACATTTCA GCTATAAATG CAAGTTCCTT
TCTACATAAC GCACATGTGC TGATATCATT CTGCAGATGG TTATGGAAAG CTTGACGCAG
GTTTGTTCGT TGAAGCTGCT GAAAAATGGC AACTCGTCAA GCCCCAAACG TGGTATGACT
CTCCATCGGT TTATCTTCCT ACCACTTCGC CTGCCGATGT CACCAGACGT CAAGACGAAG
CTGCCGACGG CCCCACAAGC TCTGACGAGG AGACCTCCAA CCCGCCGCCT GTGGTCGAGC
CCAGTGGATC TTTCATTACA GAAGATGGTG TTATCTCCAC GTATGAAGTC ACTCAGTCTA
TGCTTTTTGA TGCCAACTTT GAGAGACTGG AGCATGTCAC CGTTAGGGTT TGGATAGACC
ATCAGAGGAG GGGTGATGTT GAGGTGGAGC TTACCAGTCC CAATGGGGTG GTTAGTGTCT
TGTGCAGGCA GAGGAGGTTT GACAATGCAG ATAGTGGTTT CCCTGGCTGG AAATTTATGT
CTTTGAAGCA TTGGTATGTT TTAATGCGTT TTTGAGTGAG ACCAAATGGA TCTGATGAAA
TGTAGGGATG AGAACCCGGT AGGTACATGG ACCATCAAAG TCAAAGACCA AGTCAACCCC
GACAAAACCG GCCGTTTCGT CGCATGGTCA CTTCAGTTGT GGGGAGAATC TGTTGATCCT
GCCCTTGCCA AACTCTGGGC ACCCGCAGAA GAAGGTCAAC CGGATGAAGA GCAAACAGGT
TCCAACCCCA GTACTACTGT CAGCCAAAAG CCCAAACCTA CGGCACTCCT TCCTGGGGAT
CATGGTGAGG CTTCTGGTGA AGCGACCCAG CCAGGACTCG GATCTGCTAC AGCCCATCCT
CAACCCACAA GCACGACTGG TGACGCTGGA AATGTCGCGG AGCCAACCGG CCCCACAGAT
GCCGATGCCG ACGAAGGGTT CTTCAGCGGT ATTTCCAACC TCGCTTCATC CTCTACATGG
CTTGCAGGCG CAGGTGCCAT TATCATCCTC TCTGGTGCTG CTATTGGTGC CTTTTTCTTC
ATCCGTGCCC GACGACAGAA ACGCAACCTC TTTGGTCTCT CCAACAACGG TCAAGGAGCT
CGCGGTGCTT ACGAGCCTGT TGATGACGTG CAGATGAGTC TCCTTGAGAG AGGCAGGAGG
AAGTTTGGCA AGAGCAAGAG TGAGAGTCAA GGAACGAAAG ATTTGTATGA TGCCTTTGGA
GATGGTCCGA GTGATGAAGA GGAGGAGGAT TTGGATGAGA GGACTGCATT GAGGTATCAT
GATGGTTTCT TAGAGGATGA CGAGCCGAAT GAGGTAGGGC CCAAGACAGA GTACAAGGAT
GAGCCTGAGT CTGAGCCTGA GACCTTTAAA GATGGAGAGG AAACTGTGGG AACAAAAGAT
AAGGGTAAGG GGAAAGGTCC AAGTGAGGGA GAGAGTGGTA GCGGGAGTTC CAGCAGTTGG
CAAGATGCCG CCGACGAAGA AGCGCGTGTG TAAGATGGGG GTATCAACGA GTCATGGATG
ATGTACTGTC ATATGTATAT ATCG
 
Protein sequence
MRTLLSLWGI LLALIVPPSL ALQRPQPRSY DTHAYYALEL DPSISPAAAL QLSKSLGVEL 
VERIGELDGH WLVRTEGWTP EHASITKRSV SHDPILKRWE ALPSSLGKKS LTPLSLKQRA
KRHKSYSPRS RHSRDDRTEL LYAQNELHLA DPMLDQQWHL INTQMKDIEL NVTGLWGRGI
TGEGVHVVII DDGLDVESKD LKDNFFAEGS YDFNDHTELP IPRLKDDQHG TRCAGEIAAV
PNDVCGVGVA YDSKIAGVRI LSAPISDADE AAALNYAYQL NDIYSCSWGP PDDGRSMEAP
DGLILKAMVN GVQKGRDGKG SVFVFAAGNG GGSDDQCNFD GYTNSIFSVT VGAVDRKGLH
PYYSEMCAAM MVVAPSSGSG DHIHTTDVGK DKCSHSHGGT SAAAPLAVGV FALALSVRPD
LTWRDIQHLA VRHAVFFNPD DPAWELTAAG RHFSYKYGYG KLDAGLFVEA AEKWQLVKPQ
TWYDSPSVYL PTTSPADVTR RQDEAADGPT SSDEETSNPP PVVEPSGSFI TEDGVISTYE
VTQSMLFDAN FERLEHVTVR VWIDHQRRGD VEVELTSPNG VVSVLCRQRR FDNADSGFPG
WKFMSLKHWD ENPVGTWTIK VKDQVNPDKT GRFVAWSLQL WGESVDPALA KLWAPAEEGQ
PDEEQTGSNP STTVSQKPKP TALLPGDHGE ASGEATQPGL GSATAHPQPT STTGDAGNVA
EPTGPTDADA DEGFFSGISN LASSSTWLAG AGAIIILSGA AIGAFFFIRA RRQKRNLFGL
SNNGQGARGA YEPVDDVQMS LLERGRRKFG KSKSESQGTK DLYDAFGDGP SDEEEEDLDE
RTALRYHDGF LEDDEPNEVG PKTEYKDEPE SEPETFKDGE ETVGTKDKGK GKGPSEGESG
SGSSSSWQDA ADEEARV