Gene Information |
Locus tag | CNJ03250 |
Symbol | |
ID | 3254137 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006679 |
Strand | + |
Start bp | 1016387 |
End bp | 1019436 |
Gene Length | 3050 bp |
Protein Length | 902 aa |
Translation table | |
GC content | 53% |
IMG OID | 638253474 |
Product | MMS2, putative |
Protein accession | XP_567405 |
Protein GI | 58259990 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
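
The length figures in this record can be cross-checked with simple coordinate arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates on the replicon, and that the 902 aa protein length excludes the stop codon; both are assumptions rather than facts stated in the record. The function names `gene_length` and `coding_nucleotides` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue, plus an optional stop codon)."""
    return protein_aa * 3 + (3 if include_stop else 0)


# Values taken from the record above.
length = gene_length(1016387, 1019436)
coding = coding_nucleotides(902)

print(length)           # 3050, matching the listed Gene Length
print(coding)           # 2709
print(length - coding)  # 341 bp not accounted for by codons
```

Under these assumptions, the 341 bp remainder is consistent with the record flagging the gene as spliced, since the gene span would then contain sequence that is not translated.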
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.537283 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTACAATGCG CATATACCTA CCACTGCTGC TCCTGGCAGC CCCCCTTCTG AGAGCACAAG ATGCGTGTGC ACCTCTAGAG GACGCGGAAC AAATATTAGA ATCACTTCAA CCGGCTCCGG ACGCACATCT ACATTCAAAG CTTCCTGACC TGGCGCCGTT TGGGAATCAC GGATGGGCAG ACGGTCTCGG GTGGTCTCAG GATGTGAGTG CATGACCAGC ATCATGAACT AACATATTAC AGGGTCCGTT ATCTACAACA TTGCGTCTTC TCCCACGTGT GTTCTCTGCG ATATCCCCGA CAAGGCTCCT GTCGCAGACT GCCAAGACGA TAGGAAGGAA AGGAAACGCC AAAATGTCGA GGTCGAGGAA AGAAAGGGCC GAGAAGGTGT TGGAGCTCGT GCAAGAAGCG GAAGAAGCTG GTTGTGATAA AGTGTGGCGG TTTAGAGCTC GGCTCCGGAT GTTCCCACCC AGAGGCATCA AGCAGGACCT TGCAGCAGCT TATGAAGCGT ACAAGAAACA TCTCGAGATT GAACCCAGCC CAGAGGCTCA GTTTATCCTC GGGGTATTCC ACTCCACAGG GCTTGGTGGG ATTCCAATTG ATCAAGGCAA GGCGTTATTG TACTATACTT TTGCAGCAGC GCAAGGGTAT AGGCCTGCGG CGATGGCGTT GGGGTACAGA CATTGGGCGG GGATTGGGGT GAAAGAGGAT TGTGGAGTGG CTCTGGAGCA TTATTCTCAT GCGGCTGATA TTTGCAAGTC TATCCAACCT TTCTTTTTTT GACAAAACAT TCGGACTAAC ATTCTCCGTT TTGCAGCCTA TCGACGTTTC CTCGACGGTC CACCTGGCGG TCTCACTCTT CCCCTCACGC CTATCCGCCT GTCTGACCGT GTAGGCGGTA TCTACGGTCC GCACGCTTCA TGGGCGTCAA CAGGCGCCAA CTCTCTCCGT CCTGCTATTC GAGCATCTAT CGCTTCTGCC CGAGGGGAGA CGACCCAAGA GATCCTCGAA TATTACCAGT ACCATTCCGA TCGTGATTCA TACATCTACA CCGCCCGCCT CGGCCGGTTA TTCTACCATG GCTCTGTCCA CTTTTCGGCG AATGGGATAT CTTCTGGCGC GGAGAGCGTA GGCGCCATTC CTCAATCGTT TCATAAAGCC CGGACATATT TCCTCAAAGT CGCCCGTGTC CTCTGGCCTA GCGATTTCCT CCCCGGTACG ACTGACCAGC CTGCCGGGCG TCGCAAATTG ACAAAGGAAC AAGAAGATAA AGTCCGTGAA GCCGCAATGA TTTCCGCGTC CTTCCTCGGT CGGATGGCTC TCCGCGGGGA GGGGCAGAAA GCGGATTACC AACGAGCCAA AATGTGGTAT GAGCGCGCTG CAGAGTTGGG AGATCGGGAA GCGCTTAATG GGCTGGGTAT TTTGTACCGT GACGGTCTCG GTGTACTTGT TGATTTGGCA AGGGCGCAAG GTTATTTCCA AGTTGCCGCT GCTGCCTCTT TACCAGAAGC CCAAGTGAAC GTCGCGAAAC TCTTACTTAA CCGCGGGGAA TACCAAGCAG CTCTTCCTTT CCTCGATTCA GCTCTTCGGG GTGGAAATCC GCTCGAGGCA TTCCACCTTT CCGCTCAAAT CCATACTACC CACGGCCGAT CTTCGAAATC TGCAAGTCTC CCGCCGGCCA TGTGTGGTGT AGCTGTAGCA TACGAGAAGC TCGTCTCGGA GCGCGGATCG TGGAACGAAG ATTACTTGTT AGAAGCGGAT GAGGCGTGGG CGAGAGGGGA GGAAGGGAAA GCGATGATGG 
GGTGGTATAT CGCTGCCGAG ATGGGGTACG AGATTGCGCA GAATAATGTG GCGTTCATGC GCGAAGGAGG GTGGCAGTTT GATGCGGAAA GGGAAGGCGA GTGGGTAATA GGGAAAGGAA AAGCAGAAGA GGGAGATAAG GAGGCGTTGG TATGGTGGTT GAGAAGCGCG GCGCAGGATA ATGTGGATGC AATGGTCAAA GTCGGAGATT ACTACTGTTC GTCCATCTAT TCCACGTCTT TTATCGTTTC TCTTCATCGC TGATAACTTG AACAGACTCA AAGCAAGACT ACCCCCACGC TCTCGCTCAC TACCTCTCTG CATCCGAGAC CCAGCAATCT CCAATGGCAT ACTGGAATCT TGGTTGGATG TATCAATCCG GTCAAGGTGT AGCGAGAGAT TGGCATCTGG CGAAACGCTA TTATGACCTA AGCAGAGAGA CGGGTGAAGA AGCTGGTCTG GCAGTTTGGT TCAGTTTGTG GGGTCTTTAT CTCCAAAGGT ACGCTTCGGT TTTATTCTAT TTTATTGTAA TTTTACCATG CTTCCCCACT TTTCTTTCAT CTGCCCAGAG CATTATGCTG ATCCAGATTG TTCGTTACGT GATGTAGCTG GTGGACACAT TTCAGGACTC GCGGCTCTAT TTCTGGTTTA CCGCTCTTTG AGCCACCGAC ACCATCCGAT TCCAAACCCC TCGGGACATG GTCCCGCCTC AAGTCCCTTT TTACCTCACC TTTCCAATGG GCAGAACTTG AATTTGATGA GGACTGGGAA AACAACCTTG AGCCGGAAGG GATCGTTTTG GGTGAAGGTG ACATTGAAGG AGGGGGAGGA GCACTTGGAG AAGCTGGAAA TGGAGGCCGG GAAGGGAGGG AGTGGGAGGG AGAGACGATG GGGGAGATGA TGGAAGATCT GCTGTTAATA GGGCTGATGT TTGGTATAGG GGGGTTGATG TGGTTAAGAG CGAGGTGGGC GGCAATGGCT GGTAATGGTG GTAGAGTACA GGGCGACGCC GCGCAGGCGC AAGGGCCGAT AGGTGGTGGT GTAGGGTTTG AAGGTGCAGA GGCATTTGGT GCAGGCTTTG AGACTCCACA ACAGCCCGAA AGAGTGCAGC AAGATCGACC GCCCGAGGAA ATACAACGGC CGCAAGAGAC AGAGGACGAG GAAAACGAGC GGAGGGAGGA GTGATGGATG GGTCGAGGGT GTTGTAGGAC TGTCTTCTTT ATGAGCTCAC GATTTCAAGA GCTTTGCATG
|
Protein sequence | MRIYLPLLLL AAPLLRAQDA CAPLEDAEQI LESLQPAPDA HLHSKLPDLA PFGNHGWADG LGWSQDGPLS TTLRLLPRVF SAISPTRLLS QTAKTIGRKG NAKMSRSRKE RAEKVLELVQ EAEEAGCDKV WRFRARLRMF PPRGIKQDLA AAYEAYKKHL EIEPSPEAQF ILGVFHSTGL GGIPIDQGKA LLYYTFAAAQ GYRPAAMALG YRHWAGIGVK EDCGVALEHY SHAADISYRR FLDGPPGGLT LPLTPIRLSD RVGGIYGPHA SWASTGANSL RPAIRASIAS ARGETTQEIL EYYQYHSDRD SYIYTARLGR LFYHGSVHFS ANGISSGAES VGAIPQSFHK ARTYFLKVAR VLWPSDFLPG TTDQPAGRRK LTKEQEDKVR EAAMISASFL GRMALRGEGQ KADYQRAKMW YERAAELGDR EALNGLGILY RDGLGVLVDL ARAQGYFQVA AAASLPEAQV NVAKLLLNRG EYQAALPFLD SALRGGNPLE AFHLSAQIHT THGRSSKSAS LPPAMCGVAV AYEKLVSERG SWNEDYLLEA DEAWARGEEG KAMMGWYIAA EMGYEIAQNN VAFMREGGWQ FDAEREGEWV IGKGKAEEGD KEALVWWLRS AAQDNVDAMV KVGDYYYSKQ DYPHALAHYL SASETQQSPM AYWNLGWMYQ SGQGVARDWH LAKRYYDLSR ETGEEAGLAV WFSLWGLYLQ SWWTHFRTRG SISGLPLFEP PTPSDSKPLG TWSRLKSLFT SPFQWAELEF DEDWENNLEP EGIVLGEGDI EGGGGALGEA GNGGREGREW EGETMGEMME DLLLIGLMFG IGGLMWLRAR WAAMAGNGGR VQGDAAQAQG PIGGGVGFEG AEAFGAGFET PQQPERVQQD RPPEEIQRPQ ETEDEENERR EE
|
| |
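
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how a GC fraction such as the one listed under GC content could be recomputed from the gene sequence; how the database itself derives or rounds that value is not stated in the record, and `clean_sequence` and `gc_fraction` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as a short demonstration.
example = "TTACAATGCG CATATACCTA"
print(clean_sequence(example))         # TTACAATGCGCATATACCTA
print(round(gc_fraction(example), 2))  # 0.35
```

Passing the full gene sequence to `gc_fraction` in place of `example` gives the value for the whole gene, which can then be compared with the listed percentage.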