Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE03590 |
Symbol | |
ID | 3257965 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | - |
Start bp | 1015089 |
End bp | 1018183 |
Gene Length | 3095 bp |
Protein Length | 782 aa |
Translation table | |
GC content | 47% |
IMG OID | 638256942 |
Product | hemolysin, putative |
Protein accession | XP_570927 |
Protein GI | 58267542 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGTACCCAA ATTCTCGTAA AGCACAGCAA ACATGCCCCC TCTCGTGCGT CGCTCAACCA CAAGCTCAAT CCCCACTCTC CTCTCCCTCA CGAAACTCGT ACAACTTGTA CCACGATCAG CTCAACAATG GTCCTCCTTC GGTGCAGCTA CCCCGCCAAT TGAGCCTGAA GATCCTCCCG ACAGTCCTAA ATTTTGGTGG AAGCTTGGCC TGAGTGTGGT TCTTGTGTTG GCCGGTGGAG TATTTGCCGG GTAAGTCTTA GTCCTTGCAT TGCCATAATC AAAGCAAGAT ACTGATCAGT GGAACGTACA GTTTGACTTT GGCTTTGATG GGTTCGGACG ACTTGAACCT TCGGGTGCTC TCAACATCTT CCTGCAATCC CAAAGAGCGC AAAGCGGCGA ACAAAGTATT GAGGCTTCTT GCCAGAGGTC GACATTGGGT ATTGGTCGTG TTATTGTTGG GTAATGTCGT AAGTACTTGC TCGCTCAGTT TGAAGCCCAG CTTATTAGGT CAAAATGCTC CAGATTGTTA ATGAATCGCT ACCCATCTTC CTAGATGATG TTCTCGGTGG TGGACTGTCT GCCATCATCG TGTCAACGAC AATGATCGTC ATTTTTGGCG AGATTATTCC TCAAGCTGTA AGTGTACCTG GTTCTCTTGT GACATGCTTT TTGAGCGCTT ATTATTGTGG AAGATTTGCG TTAGATATGG CCTTTCTATT GGCGGCGTCT GTGCACCCGT GGTGTGGGCA CTCATGATTC TCTTCGCGCC CATCGCATGG CCGATCGCAA AGCTCTTGGA CCGTATCTTA GGGAAAGATG AAGGTCATAC GTAAGCTGAT CACCCATTTC CGAAACATGG CGTTAATATA ATGTTCAGCT ATAAGAAAGC AGAACTCAAG AGTTTCTTGC AGTTTCATCG TGAAGGCGAG GAGCCGCTGC GTGACGACGA AGTGAGCGAG CGCTTTTTGG ATGGTATGGG ATTCACATTG ATCTTTATTG CTAGATTGTG ATTCTCAACA GCGTCCTTTC ACTGAATGAC AAGCACGCCA AGGAAATCAT GACTCCTATT GAAGACTGCT TGATCCTTCC TTCCAACAAA ATTCTCAACC ACGATACCAT AGATGAGATC CTATTGTCAG GGTTCTCTCG AATCCCTATT CATGAGCCTG GTCAGAAAGA CAACTTTATC GGAATGCTCT TAGTCAAAAA GGTCCGTATC ATCTTCTATA CCACCAATAT TACGACAAGT CTGATATGTT CATAGCTCAT ATCTTACAAT CCCGATGATG AATGGCCCGT GTCCAAGTTC CCCCTCCTTC CGCTGCCGGA AGCCAAGCCT GAAATCAACT GTTTTCAGGC CCTGGATTAC TTCCAAACTG GGCGAGCACA TCTTATACTC ATCAGTGACA CACCTGGCCA AAGGGGAGGT GCTTTGGGAA TCGTCAGCTG TGAGTAAAGT CTGTATACTT TACTAAGTGG TGCTAACGAA AAGGTCTTAA GTGGAAGATT TGATCGAAGA GTAAGTAATC TTGTATAATC TAATATTCTC CTTTTCATTA TTGACCAAAC ATGCCTGCCA AGGATTATTG GTGAGGAAAT TGTGAGCACA TTGTATAGCA GCATTGTATA CATATCACTG ACATGACTTT AAAGGTGGAC GAGACTGATA GATATCAGGA CAACCACTCT AAGAAGGCGG TCAAGAGGTC TGGTACAGCT GCTGTCATGC GTGAGTACAG TTGTTTTCTT CAACGGTGCC GTAATTCACG TTTTTTCAGG TGGCATTATC GAGCGTCGCC GAGTTCTAAA CGCATTCAGT CGAGCTCCGT CTCGATCCGA TCCGCAGACA CCTGTTGGCC TTAGCGGCCC CGATGGTAAA GGAGCCAATT CGACCAATGG TATCCTTGTT CAAGTATCTG AAGGATCCCT CATCGCTGAG ACCGTTGACC TTTCCAACGA CGGTGCGACT GCTGCAGCGG ACCCCTCCAG ACGATTGGTG ATCAGTGCGG CTGAACCCAT GAGCCCTATA CAAGAGATCA GTACTAGCCC GGAGATCATG TATGAAGCTA TACAGAATGG AAATAGGGAA AATGGTGTTC AGGAGGAAAG TAAAGATGTC AAGGATGCTA AGAGGAGTTA GTTGTCACAT TTACACAAAA ACATGTATCC ACCTTGCTGT ATGCTCATGA TGACCGGCGA CTAGGCGCTT TAGATCAAGA AGCAAAAAAT GGGTCAAACC TTGAATCTGA AGCATACCCA GATCATGAGA TCTTTGAGAT GGCCTGTAAA GCATTGGTGG ATAAGTATTG TCTGACAAAA GAAGGGGAAG TTGGGCCGGG GAGAACGAAC AAAGGTTGGG AATGGAGGGA ACATAAGGTA CGCCACCGGC AAAAGCTATG TAAGGTGGTC GACGTTGATG GACCCCTGTC GCAGTTTGTA GCCCACCAAG GGTACCTCTA CCGCCGGATC GTACGTTATG CACCCTCTAC TCTTTTCAGC TATCCTTTGG GCCAGACTCA CTCGGATCAA TCGTTCAACC AAACCAATGA GGATGAGGAA GATTTAGAAG ACGCAGAAGA CAGTGCCCTT TTGCCTCAAC AACAAGATGT ACTGCGGTCA ACTAAGGTTG AGATAGAGCA ATATATCGTT TACTCTAGGA CTTATAGAGT GCCCATGTAC TGTTTCAGGG CTTGGGACGA AGGTAAGCGT GATTTTCTTC ATTTAATGAT ACTTACGCAT CATGCTAGCT GGAGCGCCCC TCTCCATCTC TCTTCTTCTC AAGTTGAACA TCCTTCACCC TCCTTCACCC CCACAGCCCC TGGGTTTTGG CGACTTTGGG GCTGCCTTGG ACACCCTCAT GCTCCCTCCT GAGAATCTGC CAGGAGAAGA CACATCGCCT TTTCCTCTTG TGCAGTCCAC AGAACATCCC ACGACCGGAG AGCTTGTCTT TAGTATACAT CCTTGTAGAA TCGGCGGGGC GGTTGAAGAA GTCTTAACGA CGGAGGGACT GGGTAAAGGG GCTCAAGCCG GTTTGGAGTG GCTGGAAGTT TGGATGGCGA TGACTAGCAA AATTGCCGAC TTGACATACG CTTAA
|
Protein sequence | MPPLVRRSTT SSIPTLLSLT KLVQLVPRSA QQWSSFGAAT PPIEPEDPPD SPKFWWKLGL SVVLVLAGGV FAGLTLALMG SDDLNLRVLS TSSCNPKERK AANKVLRLLA RGRHWVLVVL LLGNVIVNES LPIFLDDVLG GGLSAIIVST TMIVIFGEII PQAICVRYGL SIGGVCAPVV WALMILFAPI AWPIAKLLDR ILGKDEGHTY KKAELKSFLQ FHREGEEPLR DDEIVILNSV LSLNDKHAKE IMTPIEDCLI LPSNKILNHD TIDEILLSGF SRIPIHEPGQ KDNFIGMLLV KKLISYNPDD EWPVSKFPLL PLPEAKPEIN CFQALDYFQT GRAHLILISD TPGQRGGALG IVSLEDLIEE IIGEEIVDET DRYQDNHSKK AVKRSGTAAV MRGIIERRRV LNAFSRAPSR SDPQTPVGLS GPDGKGANST NGILVQVSEG SLIAETVDLS NDGATAAADP SRRLVISAAE PMSPIQEIST SPEIMYEAIQ NGNRENGVQE ESKDVKDAKR SALDQEAKNG SNLESEAYPD HEIFEMACKA LVDKYCLTKE GEVGPGRTNK GWEWREHKFV AHQGYLYRRI VRYAPSTLFS YPLGQTHSDQ SFNQTNEDEE DLEDAEDSAL LPQQQDVLRS TKVEIEQYIV YSRTYRVPMY CFRAWDEAGA PLSISLLLKL NILHPPSPPQ PLGFGDFGAA LDTLMLPPEN LPGEDTSPFP LVQSTEHPTT GELVFSIHPC RIGGAVEEVL TTEGLGKGAQ AGLEWLEVWM AMTSKIADLT YA
|
| |