Gene Information |
Locus tag | CNC04460 |
Symbol | |
ID | 3256148 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 1352869 |
End bp | 1355802 |
Gene Length | 2934 bp |
Protein Length | 856 aa |
Translation table | |
GC content | 51% |
IMG OID | 638255667 |
Product | shk1 kinase-binding protein 1, putative |
Protein accession | XP_569696 |
Protein GI | 58265080 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.776045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTGACTTG TCCTAAACCT CGTTCGACAT CTCTTTTTCA TCCACATCCA ACATGCCTCG CCATAAAGTA GCCCTGTACC TCCCGCATCC CCTTCCATCC CTCCCCATAG AGCCTCCCCC GACTCCCTCT CCTTTGCAGC AAGTCATAGC ATCCACTCTG TCCACAACCG ACTACGACCT CGTCTCGTTC CCCCTCACAA ATGCTGCATG GCAAGCTCGC TGGGAGAAAC TGTGTCTACG ACCCATTGAG GAAGAAGGTC TTTCAGAGGC GGAGTTAGAA CGGAGGGCGA TCGAGGAGAA GAAGGTTGAC CAAGAGGCTG ACATTTGGAG ACGCGACGGA GGGTTGAAAA GGAGCGAGGT CGTTGTGAGT AGACTGGAAG AGAGCCAGGG GGTGATACCC CTTGCAAGTG AGTGGTTGGA GCTTGACTCC CCAGACGAAG GTATTCGCTT CGACTCTGAA CTTGTTGGTC TTTTACATCT TTTGTCTGTG GCATATGACT GGCTGACCAG TTATGCGCGT AGGCCTTGAG AGCAGAGTTC GCACATGCTC TTTACCTGTC TCTCCCGGTT CTGATCCTTC CTGCGCCGTC GCTGGCTAAT AGGGAGTACT TACCTTCCTA TGCCAGGGCC ATCTCCAACT TGCTGCAAAT GGGTGGGCAG AGTGCGGTGA CCAATATATC AATCAGAATC CCGGTGTCAA ACCCATTAGA GCTGATTGCG CCGGAATCGG TCATGCCAAA TGGATTGCCT GGGTCGCCAT CACCCATCGC TCCTCCTCTT GCATCTGGCG CACCCCAGAC GGACAAAAAG CATAAACGTC TTTCGTCGCT TTCTACCCGC CCGCAATCTA TGCAAACTTC ACTTTTTGGC CAGCCTGCAA ACCAAGGCAC AAATCAGAAC CAGCAGCAAG GAATGAGGAT CACCTCTGGA GCAAGTTCAC TCATGTCTGC TAATACTGCT TACGGGTCCG TTGCTGGGGT CGGACAAGCC AGTTTAGGCG TCGCAGCTCA TGGAGGAGAC CTCAGTTCGA CATGGGAAAT GTGGGATTGT ATCAGGACCT TGTGCGGGTA TCACCCGCGA TTATCAGTTA GTATGTTTAA AATTCTCTTT CCGAGCCCTC TATCTAGCTG TGGACTGATT TTGTGGAAAG CTCTGGACTT GACAAACCCT CTGCCACCAT CCGCCGGTGC CCTCGCAAGA TGGTCAGCTG AACCAGTCAA CTATATCTGG TTGCCCGCCT CGTCATTCAT ACCCAATGCC AAAGGATACC CCGTGTTGAG CAAGGCCTGC CAGGCCTTTA TTCGGGAGAT GGGCAAGCAG AATCCCACAT ATATTCTTTC CCAGACAACT ATGAAGAGAC ACTCAGCTGG AGGACATAAC GCCTACCTCC AATACATCAG ACATATCACG TCCACTCCTC AGCCTGGACC TAATGCCCAA CCGCGCGCAA TCATGGCTCT GCCTGCTGGC GCTTCGGAAA AATTTCAAGA TTATTCCGAC TATCTACAAG CGCCCCTACA ACCATTGATG GACGATCTTG GGAGCATGAC ATATAATATT TTTGAAAATG ATCCGGTCAA GTATGCCCAG TATGAGAGTG CCATCACTCA AGCTTTGCTG GATCTACCAG CGAACAAGAA GCAGTAAGCT ATTATGATCT TATAGAGTCT GCCGCAGACA GCTGATAGCC CTAAGTGTCG TGACAGTAGT TGGCGCTGGT CGTGGACCCC TTATAGACTG CACCCTTCGC GCCCTCTTGC ATTCCGGTCG CCAAGCATCC ATCTACGCCG TCGAGAAGAA 
TACCAATGCC TTCGTAACTC TGCAGGAACG CAAAGAGCTT GAATGGCGCG ACAAGGTACA TATTATCAGC GGAGATATGA GGGCAGTCGA TGTTCCCGAA AAGTGTGATA TACTGGTTTC AGAGCTCTTG GGAAGCTTTG GAGATAATGA ACTGAGTCCT GAGTGCTTAG ATGGAGCATT GAGATTAATG AAATGTAAGT CAAGAGTCGC TGCTAGCGCG AGGCAGGCCG CTGATGTCTC TTTTATAGCA ACTGGTGTCT CCATCCCATC CTCTTATACG GCCCATATCG CTCCTCTTTC GACATCAAAA CTTTATCAAG AAACACATTC TCCTACTCGC GGTCCTTCAT CTGCTGAAAC GCCCTACGTC GTAATGTTAT CTCAAGTCGA CCCCATTTCG GGTGACAACA ATGTACCTGG AGTGAGTGCG CGATGCGGCG AAAGAATTCA GCAGTGCTGG CAGTTTGTCC ACCCAAACAG AGATATTACT GTCGATTCAA ATGGTTAGTC TTTTCCCTTC TATGTATAAA TAGAACTTCT TATCCTTATG ATGTTGTTAG GAGTACCCCT TTCAAATTCA CACAATGCTC GTGCGAGCAC CCACACATTT CACATTCCTC ATGCGGCTAC TCTTCACGGG TTTGGTGGCT ATTTCGAGGC TCATCTTTAC GGTGATGTTG GCCTTTCAAT TCACCCCGAG AACGCACACG CCGTATCACC AGATTTGACC AGTTGGTTCC CTCTTTTCTT CCCACTGAAA GAACCAATGT ACCTTCCAAG TGGAGCAGAA TTGCAAGTGA ACTTATGGAG AATGGGCGAT GGAAAAGGAA AGAAGGTATG GTACGAGTGG GCGGTGGAGA GCTATTTGCC GGTAGTGCAA TCAGTATCTT CAGGCCCAGG TGCCGCGACC GTCCCAGGGT CAAGGAATGT CAGTTCGGCG TCTGCATCTG GGATTGGGTT CGGTGGACAA CCTAGCCCTT TGATGGACGC ACAGTTCTCG CCGGGAACGG GACACATGGG CTTGTCAGGT GAACTAGGGA GGGTGAAGAT TGGGCAATCC ACTCTGCATA ATCCAGGAGG GATTCATTCT TGGGTTGGCC TCTAGACGGT ATAGAGGACC TATGGACTTT CAAA
|
Protein sequence | MPRHKVALYL PHPLPSLPIE PPPTPSPLQQ VIASTLSTTD YDLVSFPLTN AAWQARWEKL CLRPIEEEGL SEAELERRAI EEKKVDQEAD IWRRDGGLKR SEVVVSRLEE SQGVIPLASE WLELDSPDEG IRFDSELALR AEFAHALYLS LPVLILPAPS LANREYLPSY ARAISNLLQM GGQSAVTNIS IRIPVSNPLE LIAPESVMPN GLPGSPSPIA PPLASGAPQT DKKHKRLSSL STRPQSMQTS LFGQPANQGT NQNQQQGMRI TSGASSLMSA NTAYGSVAGV GQASLGVAAH GGDLSSTWEM WDCIRTLCGY HPRLSVTLDL TNPLPPSAGA LARWSAEPVN YIWLPASSFI PNAKGYPVLS KACQAFIREM GKQNPTYILS QTTMKRHSAG GHNAYLQYIR HITSTPQPGP NAQPRAIMAL PAGASEKFQD YSDYLQAPLQ PLMDDLGSMT YNIFENDPVK YAQYESAITQ ALLDLPANKK HVVTVVGAGR GPLIDCTLRA LLHSGRQASI YAVEKNTNAF VTLQERKELE WRDKVHIISG DMRAVDVPEK CDILVSELLG SFGDNELSPE CLDGALRLMK STGVSIPSSY TAHIAPLSTS KLYQETHSPT RGPSSAETPY VVMLSQVDPI SGDNNVPGVS ARCGERIQQC WQFVHPNRDI TVDSNGVPLS NSHNARASTH TFHIPHAATL HGFGGYFEAH LYGDVGLSIH PENAHAVSPD LTSWFPLFFP LKEPMYLPSG AELQVNLWRM GDGKGKKVWY EWAVESYLPV VQSVSSGPGA ATVPGSRNVS SASASGIGFG GQPSPLMDAQ FSPGTGHMGL SGELGRVKIG QSTLHNPGGI HSWVGL
|
| |
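The coordinate fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the source record. It assumes the start and end positions are 1-based and inclusive (this matches the reported gene length of 2934 bp) and that the coding sequence ends in one stop codon. The function names `gene_length` and `coding_length` are hypothetical helpers introduced only for this example.

```python
# Values copied from the Gene Information table above.
START_BP = 1352869
END_BP = 1355802
PROTEIN_LENGTH_AA = 856


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (3 per residue),
    optionally counting one stop codon (an assumption, see lead-in)."""
    return protein_aa * 3 + (3 if include_stop else 0)


length = gene_length(START_BP, END_BP)
coding = coding_length(PROTEIN_LENGTH_AA)
# Remainder not accounted for by codons: introns and any
# untranslated sequence included in the gene span.
non_coding = length - coding

print(f"gene length: {length} bp")          # gene length: 2934 bp
print(f"coding length: {coding} bp")        # coding length: 2571 bp
print(f"non-coding remainder: {non_coding} bp")  # non-coding remainder: 363 bp
```

The non-zero remainder is consistent with the record marking the gene as spliced, although the table alone does not say how that remainder divides between introns and untranslated regions.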