Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNJ00350 |
Symbol | |
ID | 3254256 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006679 |
Strand | + |
Start bp | 92773 |
End bp | 95823 |
Gene Length | 3051 bp |
Protein Length | 811 aa |
Translation table | |
GC content | 49% |
IMG OID | 638253192 |
Product | conserved hypothetical protein |
Protein accession | XP_567292 |
Protein GI | 58259759 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATCG AAACTTCGGC CGCCCACGAC GTCGACAGTG CACCTCACGG TCTTCTCAAG AACGCACCTA CGGACGACCT GACTCTCGAA GATGAAAGTT GGCAGCATAG TGTTTGCGTG CACTCTTACC CATCTCCACC CTTAAATGGG GGCGTGACTG AAATCATCTT CGATATTGAA GTCAAGGACC CTTGGAGGGC CCTTGAGGAC CAAGGCTCCG AAGTGACCAA GAAGTTTATT GAGGAGCAGA ACCACGTGAG CCTTCCACTA CCTCCTTAGC ACATCTATGT GGAAGCCACC GCTAATTGAG TTATTCACAG TTGTCCGTCC CTCGCCTCAG CAATCACCCT CTTCGAACGG AGCTTGAGAT TGCTGTCGAA CAATGCTACA ATCACGAGCG CATGACCTGT CCTGAACTTC AAGCCAGCGG CTATTATTAC TGGAAATACA ATCAAGGTAC ATCTCCCCGA GATGTCATTC TCCGTTCAAA AAATCTCGAG AGTGATTTTG GCAAGTTTGC CTCCGAGGAC GGCAAGGGCC CTGAGCTTTT CTTCGACCTT AATACAGAAG AAAACATATC TTTATATGCT CACAGCTTCA GCCCTAGCGG GAAACTTTGG TGTGCCATTC TTCAACAGTC TGGGTGAGTT TTCCGTTCTC CGTATGTATC TGCCTCCTGA TACTCGATCC AGGGGTGACT GGTTGCGTCT TCGTGTATAT GACACTCAGA CTAAGAAGGC TATCGAGAGA AGTGTGGGAG GAGCGAAATT TACTTTTGGT GCTACCTGGG TGGGGGAGAA AGTGAGTAAT CATGCCGTTT ATTACTCGGT GCTATCTAAC AGGCGTGCAG GGCTTCATCT ACAAGCGAGT GATCGACTAC GATACCACCG ACGGCAACTA CCAAGCAAAG GAAGGTCAAT TCGGCTTGTT CTATCATCAA ATCGGCACTC CCCAATCAGA AGATGTCCTC GTCTGGAAAG CCCCAGAGGG CGTCTTCCAG TACATTGGCA AGCCCTTGAT CATCACATCC GATGCCAAAG AGGAGAATAA GAAGAGGGCT TGGTTCATGC TCGATATCTA CAGGAACACA AGTCCGGAGA CTGAGGTTTT GATGGTGGAG CTGCCTGGAG GGACAGCTGG CCCTGTGGGT CATACACTGC CATCTCTTGT GCTACATGGG AAAAAATGGG TATCAAAAGG CTTCACTGGA ATGACCAATT GTAAGTTCCG TTCCATAATC GTTTTGCTCC ACTTTTGACA TTTGCGCTAG ATATTGGCTC TTTGTCCGAC GACACTCACT TGTTCACCTC TTTCACCGAC GGAATCTCCA CTGGCCGAAT TATCTCTGTG AGCGCTGCTG ACTATGACGC CTGTGGCGTC AACGAGGCTA TCAAGTTCAA CACTGTCGTT CCCGCAAACT CCGAGGGCCA TCAACTACGT CACGCATACC TTATTGGCGA CCAAGTCATC GTTCTCGATT ATCTCAAGCA TGGTTGTTCG TTCCTCGTGT TCCTAGATGC TCGGACCGGC AAATCTGTGG GCTCGTCAGA CTCCAGGGGA ACCCGCGGTG ATGCCGCTAT AGATCCAGAT GTGGAAGTGC CTGTTCCAGA GGAGGAGGTA GCCGAGCAAT CACCAACTGA AGATCAAGTC ATCATACCTC AACATGCCTC CATCAATGAA CTCCAAAGCC GACCAGACTC GAACGACTTT TATTTTTCAG TAAACACCTT TGTCGCTCCT CCTTACGTGC TTCGAGGAGA ATTAATTAAA AATCACAAGG TGGAGAAGGG CATCAAGATC AGTGGCATTT CAAAATCTCA CACGATGCCT CAAGAGACTC TTGTCTGTTC TCAGTTGTTT TACGAGTCGC ATGATGGTGT CAAGATCCCG ATGTTCATCT GCCATGCCCA CGATCTCGAT TTGACCAAGC CTAACCCGGC ACTAGTTCAC GCTTACGGTG GCTTCTGTTC TCCATCTCTT CCCCGCTTTG ACCCCATGTT TGTGGCGTTC ATGCGCAACC TCAGAGGGAT GTAAGTACGC AAATAATGGT CCACCACATA TTTCAAGCTT ATTCTACCTG TAGTGTTGCC GTTGCTGGCA TTCGTGGAGG TGGAGAATAC GGGCCTGAAT GGCATGAAGC CGCGCTCGGT ATCAAGCGAT GGGTAGGATG GGACGACTTT GCTTGGGCAG CCAAGTACCT GCAAGGGAAA GGTCTTACAA CTCCGGCGCT CACTGCGACT TACGGAACAT CAAACGGGGG CCTTCTGGTG TCGGCTGCCA TGGTTCGCAA CCCAAGCTTG TATTCTGTCG TTTTTCCCGA TGTTGCCATC ACCGACCTTC TGAGGTATCA CAAATTCGTA TATCTTTTCA CACCTTTTTT AAACTAGGCA AAAGCTTATT GCGCTGTTAG ACTCTTGGTC GTATCTGGAT GGACGAGTAT GGGTCTCCCG AAAAGGCTGA GGACTTCCCC ATTCTCCATT CCACTTCACC TCTTCACAGT GTAGACGGCG ACCCCGCCGT TCAGTATCCC GCTGTGCTTA TAACTACTGC CGATCACGAC ACCCGGGTCG TTCCTAGTCA TTCTCTCAAG TTCCTGGCGG AGCTCCAAGG TCAGTCTTCC ATCAAATATA TTGCTGTTTG TGGCTTTTAC TGACGGTCTC TGCTAATAAA ATTAGCTCGG AAGTCCGAGA ACAAGGGGGT ATTGTAATTC AATCTGTTTT TATCAGTAAA TGACGAGGAC GCTAATGATG GTTTGCATAG CCTCGGCCGC ATTTACGAAA ATGCTGGTCA TGAGCGTGAG TAATGTCCTT CTTCTAGCCT AATCACGATG ATGCTTGATG GACAACTAAA ACTAACGATG CACTCGATAG TCGGTTCAAA ACCTACCAAG AAGAAGGTTG AGGAGGCAGT TGACCGTCTG GTTTTTGTTT TGTACAACTT GAAGGAACAG TGATCAAGTC ATCCGCATTT TGTAACGCTT TCCATTAATT TTTCACCGCA CTATTTTGAC AACTGTCAAA TAAATGTACA GTAAATCCAT GCACAAGAAC A
|
Protein sequence | MAIETSAAHD VDSAPHGLLK NAPTDDLTLE DESWQHSVCV HSYPSPPLNG GVTEIIFDIE VKDPWRALED QGSEVTKKFI EEQNHLSVPR LSNHPLRTEL EIAVEQCYNH ERMTCPELQA SGYYYWKYNQ GTSPRDVILR SKNLESDFGK FASEDGKGPE LFFDLNTEEN ISLYAHSFSP SGKLWCAILQ QSGGDWLRLR VYDTQTKKAI ERSVGGAKFT FGATWVGEKG FIYKRVIDYD TTDGNYQAKE GQFGLFYHQI GTPQSEDVLV WKAPEGVFQY IGKPLIITSD AKEENKKRAW FMLDIYRNTS PETEVLMVEL PGGTAGPVGH TLPSLVLHGK KWVSKGFTGM TNYIGSLSDD THLFTSFTDG ISTGRIISVS AADYDACGVN EAIKFNTVVP ANSEGHQLRH AYLIGDQVIV LDYLKHGCSF LVFLDARTGK SVGSSDSRGT RGDAAIDPDV EVPVPEEEVA EQSPTEDQVI IPQHASINEL QSRPDSNDFY FSVNTFVAPP YVLRGELIKN HKVEKGIKIS GISKSHTMPQ ETLVCSQLFY ESHDGVKIPM FICHAHDLDL TKPNPALVHA YGGFCSPSLP RFDPMFVAFM RNLRGIVAVA GIRGGGEYGP EWHEAALGIK RWVGWDDFAW AAKYLQGKGL TTPALTATYG TSNGGLLVSA AMVRNPSLYS VVFPDVAITD LLRYHKFTLG RIWMDEYGSP EKAEDFPILH STSPLHSVDG DPAVQYPAVL ITTADHDTRV VPSHSLKFLA ELQARKSENK GVFLGRIYEN AGHELGSKPT KKKVEEAVDR LVFVLYNLKE Q
|
| |