Gene Information |
Locus tag | CNI02260 |
Symbol | |
ID | 3259534 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 626649 |
End bp | 629467 |
Gene Length | 2819 bp |
Protein Length | 731 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258715 |
Product | conjugation with cellular fusion-related protein, putative |
Protein accession | XP_573002 |
Protein GI | 58271692 |
COG category | [S] Function unknown |
COG ID | [COG2966] Uncharacterized conserved protein |
TIGRFAM ID | |
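The coordinate, length, and GC fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (an assumption that agrees with this record, since End bp minus Start bp plus one gives the listed 2819 bp). The function names `gene_length` and `gc_percent` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for display grouping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length = gene_length(626649, 629467)

# Coding bases implied by the protein length: 731 residues plus one stop
# codon (the stop codon is an assumption about how the CDS is annotated).
coding_bp = (731 + 1) * 3

print(length, coding_bp, length - coding_bp)
```

Under these assumptions the gene span is longer than the coding bases implied by the 731 aa protein, which is consistent with the "Is gene spliced | Yes" field. Passing the Gene sequence field to `gc_percent` is one way to check the listed GC content.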
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTCTTCGG TCTTGTCGTT TGAGTCGGAC TCATTTGTGC AGCCACGTCC CATGAAGCGG AGAGCGTCTA GCCGGCGTTC TTCAGCTGGC TCTTGGGCAT CGTCCGACAA TGACTCAGAC TATGACGCAC AGGTTGGAAC CGGCGGAATC CTGAGCGCCT TGCTAGGTCT ATATGGTAAT GAGGCTTCTT ACAAAAAACG AAAGGCTCTC AGGTCTCTCT TGAGGCGTAG CGACAAAGAT AATGAAGACA GTCACCGAGG AAGAGCAAGA AAGCGTTGGA GTACGGAATC GCTTTTTGCC CTCCCATCTC GCAACTCTAG CAGAAGTGCC AGTCGCTCAG TGAGCAGGGG AGCAACGAGT GCACCTGATG GGCCTGAAGT TGCTAGGAGA ATGCGCGACG AGAGGGAAAA GGTGCATAGG AATCGGAGGC ATCAACGTTT GCCTCATCGA AGTCAAGACC TCAATCAACC GTATATCCCA TCCGCGCGGT ATTCAAATCC CTTACAACCT GCTCCAAATG TCACTGCCGC GCAACGGTTC AGAACTTTTA TGACAGGCAG GAGCGCTCCT GCTACCCTTC TTGGTGCCCC AGATGCTGTA GACCATTCTA AATCACAAGA AGAAAGCAGG TACCGTACTA TGGCCGCCCT GTTGATCACT ACGAACTCCC TTATGAGCAT AGGAAGCCCT ATTCTTGCCC ATGTTGCGCC TGCGTCCGGG CAAGAGGGAG AGTCTAGTGG AGGTCATCGG AAGATCAGCT GGTATGAAAG CGTAGCAGAA AAGGAGCGGG CTGAACAAGA AAGAGAACAG AAGGAAGAAC ATGATCTCGG ATTGATTGAT GGAGAAGAGG ATAACATTCG GGCTATGGAA GAGGGGATGT GTGGAGGGAA GAGGAAAAGA AAAAGAACAA GGGGTAAAAG AATACAGCGA GAGATGGCAG TTACCAAGCA TGTTTCAAAC TTAATTCAAA GGAAGAAGTG GGTCATAAAA CAATATTTGT TATTAAACAC TGACATGCGA CAGGTTTATT GAGGAACTTG CCAAAGCAGT CGTCAAGTAG GTCAATCTCG ATGCCGAATA CTTCGCGCTA AAATAAAACT AGTTATGGTG CTCCGGCTCA TTCAGTGGAG GCATGGCTAG CCTCAACAGC TGATATTCTT TCCGTCGAGG CATCTTTTGT CTATCTACCG ACTGTCCTAC TTGTTGCCTT CCGAGACACC GATGTACACT CTACCGATGT CCTTTTCATT CGTCCAAGTG GAGGCCTAGA GCTGTACCGA TTATCTCTGG TGCATGAGGT CTACAGAAGA GTGACTCACG ATAAAATATC CGCTAGTCAA GGTCGTCGAG CTCTGAAAAG GATCGGGCGC GAGGTGAGTA CCTTTCAAAG AGGCAAAAGG CGTGGCTAAC AGAATACCTT CATTCAGACC CTTCCCTACT CTCGACTGAC TCTCATCCTC ACAGCAGCTG TTGCTTCCGC CATCTCTGCC AGAGTGGCCT TCTCCGGATC GTTCATCGAC ATCCTCATGT CAGGCGCATT GGGGGCCCTT TTCACCTTAG TCCAATTCAC AGTATCGAAG GAGAATAGGG TGTTTTCTAA TATATTTGAA ATTGGAATGG CCGGAATATT GAGTTTTGTA GCTGTAAGTG TTTCTTGCGA AGTTGATCGT GGATGCTGAT CATCTTTAGC GTGGACTTGC TTCATCCAAA TATTTCTGTT ACGAGTCTTT GGCGTCTGTG AGTCGCATAT ATGTATGTAG TGATGGGCTT CCTCTGATTT TGTCAGGCTT CCATTGTTTT 
GATCCTTCCA GGGTGGCATA TATGTCTGGG AGCTCTTGAA CTTGGTTCGA AGAACATCAT TGCAGGTGGC ATGTGAGTGT CTTGCCTGTG TGTGGCTTTT TTCCCTTGTT AAAAAATCAC AGTCGTCTTG TTTGGGCTGT TGTCTACACT CTCTTCCTCT CGCTTGGTCT GGGCATTGGT AGTGAAATTT GGGACTCTTT TGGGCCGCCT CAGCCTGGGA ATGACTCAAA TAGCTCAGAT GCAACGGAGG TGCAGTGCTA TCGTGATCCC AATTGGGATT ACTGGTGGTA CACCGAGCGT ATGTTGCGTG ACTGGGGAGT CTGAGAAGTA AAGGGAATGC TAATAGCATC AACACAGCCA GCGACTGGTG GCTTTTCCTA CTCGTGCCGG TATTTGCATT CTCTCTTGCA GTGTGGTTCC GAGCGGATTG GCGTTCTAAG GATATAGTTG TCATGGTGTT AGTGGCTTGC GCAGGTTATG TTGTCAACTA CTTTCTTTCA GGCGTACGTG ATTCTGGAAT GCCTGCGACA GCGTGTTGAC TCAGTGGTCT ATGTAGCAAA TTTCGCAAAT CAACGTTACA TCCGCTGTAT CGGCCTTCGC CGTTGGGTAA GGAGTCTTTA CTTGTGGATC TATGCCTGGT ATGCTGACTT TGCGTTTCTA TTAGCGTTCT GGGCAACCTT TATTCTCGTC TTGGAAGAGG TTCGGCGTTC CCATCAATGG TTTGCGGTAT CCTGCTTCTT GTGCCCAATG CAATCGCGGC CGCCGGTGGA CTAGCTACTA ACAGCATTGA CCAAAGCGAC TCGACTAATA GCAATAATGA ACAAGAAATC AACACAGCTG TCATTGTTAG CATAAGAATG ATTCTCGTGA GTCAGCTCTT ATGTCAAAGT CTCCCCATAG GATTCTCGAT GATGGCTGAT GTATTGCAGG TTGGAGTCGG CCTAGCAGTG GGACTCTTCG CTTCCACGGT CGCCATCTAT CCTTTTGGAA AGACAAGGCG TTATATATTT AGTTACTAA
|
Protein sequence | MKRRASSRRS SAGSWASSDN DSDYDAQVGT GGILSALLGL YGNEASYKKR KALRSLLRRS DKDNEDSHRG RARKRWSTES LFALPSRNSS RSASRSVSRG ATSAPDGPEV ARRMRDEREK VHRNRRHQRL PHRSQDLNQP YIPSARYSNP LQPAPNVTAA QRFRTFMTGR SAPATLLGAP DAVDHSKSQE ESRYRTMAAL LITTNSLMSI GSPILAHVAP ASGQEGESSG GHRKISWYES VAEKERAEQE REQKEEHDLG LIDGEEDNIR AMEEGMCGGK RKRKRTRGKR IQREMAVTKH VSNLIQRKKF IEELAKAVVN YGAPAHSVEA WLASTADILS VEASFVYLPT VLLVAFRDTD VHSTDVLFIR PSGGLELYRL SLVHEVYRRV THDKISASQG RRALKRIGRE TLPYSRLTLI LTAAVASAIS ARVAFSGSFI DILMSGALGA LFTLVQFTVS KENRVFSNIF EIGMAGILSF VAASIVLILP GWHICLGALE LGSKNIIAGG IRLVWAVVYT LFLSLGLGIG SEIWDSFGPP QPGNDSNSSD ATEVHINTAS DWWLFLLVPV FAFSLAVWFR ADWRSKDIVV MVLVACAGYV VNYFLSGQIS QINVTSAVSA FAVGVLGNLY SRLGRGSAFP SMVCGILLLV PNAIAAAGGL ATNSIDQSDS TNSNNEQEIN TAVIVSIRMI LVRFSMMADV LQVGVGLAVG LFASTVAIYP FGKTRRYIFS Y