Gene CNE03590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE03590 
Symbol 
ID3257965 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp1015089 
End bp1018183 
Gene Length3095 bp 
Protein Length782 aa 
Translation table 
GC content47% 
IMG OID638256942 
Producthemolysin, putative 
Protein accessionXP_570927 
Protein GI58267542 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCGTACCCAA ATTCTCGTAA AGCACAGCAA ACATGCCCCC TCTCGTGCGT CGCTCAACCA 
CAAGCTCAAT CCCCACTCTC CTCTCCCTCA CGAAACTCGT ACAACTTGTA CCACGATCAG
CTCAACAATG GTCCTCCTTC GGTGCAGCTA CCCCGCCAAT TGAGCCTGAA GATCCTCCCG
ACAGTCCTAA ATTTTGGTGG AAGCTTGGCC TGAGTGTGGT TCTTGTGTTG GCCGGTGGAG
TATTTGCCGG GTAAGTCTTA GTCCTTGCAT TGCCATAATC AAAGCAAGAT ACTGATCAGT
GGAACGTACA GTTTGACTTT GGCTTTGATG GGTTCGGACG ACTTGAACCT TCGGGTGCTC
TCAACATCTT CCTGCAATCC CAAAGAGCGC AAAGCGGCGA ACAAAGTATT GAGGCTTCTT
GCCAGAGGTC GACATTGGGT ATTGGTCGTG TTATTGTTGG GTAATGTCGT AAGTACTTGC
TCGCTCAGTT TGAAGCCCAG CTTATTAGGT CAAAATGCTC CAGATTGTTA ATGAATCGCT
ACCCATCTTC CTAGATGATG TTCTCGGTGG TGGACTGTCT GCCATCATCG TGTCAACGAC
AATGATCGTC ATTTTTGGCG AGATTATTCC TCAAGCTGTA AGTGTACCTG GTTCTCTTGT
GACATGCTTT TTGAGCGCTT ATTATTGTGG AAGATTTGCG TTAGATATGG CCTTTCTATT
GGCGGCGTCT GTGCACCCGT GGTGTGGGCA CTCATGATTC TCTTCGCGCC CATCGCATGG
CCGATCGCAA AGCTCTTGGA CCGTATCTTA GGGAAAGATG AAGGTCATAC GTAAGCTGAT
CACCCATTTC CGAAACATGG CGTTAATATA ATGTTCAGCT ATAAGAAAGC AGAACTCAAG
AGTTTCTTGC AGTTTCATCG TGAAGGCGAG GAGCCGCTGC GTGACGACGA AGTGAGCGAG
CGCTTTTTGG ATGGTATGGG ATTCACATTG ATCTTTATTG CTAGATTGTG ATTCTCAACA
GCGTCCTTTC ACTGAATGAC AAGCACGCCA AGGAAATCAT GACTCCTATT GAAGACTGCT
TGATCCTTCC TTCCAACAAA ATTCTCAACC ACGATACCAT AGATGAGATC CTATTGTCAG
GGTTCTCTCG AATCCCTATT CATGAGCCTG GTCAGAAAGA CAACTTTATC GGAATGCTCT
TAGTCAAAAA GGTCCGTATC ATCTTCTATA CCACCAATAT TACGACAAGT CTGATATGTT
CATAGCTCAT ATCTTACAAT CCCGATGATG AATGGCCCGT GTCCAAGTTC CCCCTCCTTC
CGCTGCCGGA AGCCAAGCCT GAAATCAACT GTTTTCAGGC CCTGGATTAC TTCCAAACTG
GGCGAGCACA TCTTATACTC ATCAGTGACA CACCTGGCCA AAGGGGAGGT GCTTTGGGAA
TCGTCAGCTG TGAGTAAAGT CTGTATACTT TACTAAGTGG TGCTAACGAA AAGGTCTTAA
GTGGAAGATT TGATCGAAGA GTAAGTAATC TTGTATAATC TAATATTCTC CTTTTCATTA
TTGACCAAAC ATGCCTGCCA AGGATTATTG GTGAGGAAAT TGTGAGCACA TTGTATAGCA
GCATTGTATA CATATCACTG ACATGACTTT AAAGGTGGAC GAGACTGATA GATATCAGGA
CAACCACTCT AAGAAGGCGG TCAAGAGGTC TGGTACAGCT GCTGTCATGC GTGAGTACAG
TTGTTTTCTT CAACGGTGCC GTAATTCACG TTTTTTCAGG TGGCATTATC GAGCGTCGCC
GAGTTCTAAA CGCATTCAGT CGAGCTCCGT CTCGATCCGA TCCGCAGACA CCTGTTGGCC
TTAGCGGCCC CGATGGTAAA GGAGCCAATT CGACCAATGG TATCCTTGTT CAAGTATCTG
AAGGATCCCT CATCGCTGAG ACCGTTGACC TTTCCAACGA CGGTGCGACT GCTGCAGCGG
ACCCCTCCAG ACGATTGGTG ATCAGTGCGG CTGAACCCAT GAGCCCTATA CAAGAGATCA
GTACTAGCCC GGAGATCATG TATGAAGCTA TACAGAATGG AAATAGGGAA AATGGTGTTC
AGGAGGAAAG TAAAGATGTC AAGGATGCTA AGAGGAGTTA GTTGTCACAT TTACACAAAA
ACATGTATCC ACCTTGCTGT ATGCTCATGA TGACCGGCGA CTAGGCGCTT TAGATCAAGA
AGCAAAAAAT GGGTCAAACC TTGAATCTGA AGCATACCCA GATCATGAGA TCTTTGAGAT
GGCCTGTAAA GCATTGGTGG ATAAGTATTG TCTGACAAAA GAAGGGGAAG TTGGGCCGGG
GAGAACGAAC AAAGGTTGGG AATGGAGGGA ACATAAGGTA CGCCACCGGC AAAAGCTATG
TAAGGTGGTC GACGTTGATG GACCCCTGTC GCAGTTTGTA GCCCACCAAG GGTACCTCTA
CCGCCGGATC GTACGTTATG CACCCTCTAC TCTTTTCAGC TATCCTTTGG GCCAGACTCA
CTCGGATCAA TCGTTCAACC AAACCAATGA GGATGAGGAA GATTTAGAAG ACGCAGAAGA
CAGTGCCCTT TTGCCTCAAC AACAAGATGT ACTGCGGTCA ACTAAGGTTG AGATAGAGCA
ATATATCGTT TACTCTAGGA CTTATAGAGT GCCCATGTAC TGTTTCAGGG CTTGGGACGA
AGGTAAGCGT GATTTTCTTC ATTTAATGAT ACTTACGCAT CATGCTAGCT GGAGCGCCCC
TCTCCATCTC TCTTCTTCTC AAGTTGAACA TCCTTCACCC TCCTTCACCC CCACAGCCCC
TGGGTTTTGG CGACTTTGGG GCTGCCTTGG ACACCCTCAT GCTCCCTCCT GAGAATCTGC
CAGGAGAAGA CACATCGCCT TTTCCTCTTG TGCAGTCCAC AGAACATCCC ACGACCGGAG
AGCTTGTCTT TAGTATACAT CCTTGTAGAA TCGGCGGGGC GGTTGAAGAA GTCTTAACGA
CGGAGGGACT GGGTAAAGGG GCTCAAGCCG GTTTGGAGTG GCTGGAAGTT TGGATGGCGA
TGACTAGCAA AATTGCCGAC TTGACATACG CTTAA
 
Protein sequence
MPPLVRRSTT SSIPTLLSLT KLVQLVPRSA QQWSSFGAAT PPIEPEDPPD SPKFWWKLGL 
SVVLVLAGGV FAGLTLALMG SDDLNLRVLS TSSCNPKERK AANKVLRLLA RGRHWVLVVL
LLGNVIVNES LPIFLDDVLG GGLSAIIVST TMIVIFGEII PQAICVRYGL SIGGVCAPVV
WALMILFAPI AWPIAKLLDR ILGKDEGHTY KKAELKSFLQ FHREGEEPLR DDEIVILNSV
LSLNDKHAKE IMTPIEDCLI LPSNKILNHD TIDEILLSGF SRIPIHEPGQ KDNFIGMLLV
KKLISYNPDD EWPVSKFPLL PLPEAKPEIN CFQALDYFQT GRAHLILISD TPGQRGGALG
IVSLEDLIEE IIGEEIVDET DRYQDNHSKK AVKRSGTAAV MRGIIERRRV LNAFSRAPSR
SDPQTPVGLS GPDGKGANST NGILVQVSEG SLIAETVDLS NDGATAAADP SRRLVISAAE
PMSPIQEIST SPEIMYEAIQ NGNRENGVQE ESKDVKDAKR SALDQEAKNG SNLESEAYPD
HEIFEMACKA LVDKYCLTKE GEVGPGRTNK GWEWREHKFV AHQGYLYRRI VRYAPSTLFS
YPLGQTHSDQ SFNQTNEDEE DLEDAEDSAL LPQQQDVLRS TKVEIEQYIV YSRTYRVPMY
CFRAWDEAGA PLSISLLLKL NILHPPSPPQ PLGFGDFGAA LDTLMLPPEN LPGEDTSPFP
LVQSTEHPTT GELVFSIHPC RIGGAVEEVL TTEGLGKGAQ AGLEWLEVWM AMTSKIADLT
YA