Gene CNJ00350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNJ00350 
Symbol 
ID3254256 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006679 
Strand
Start bp92773 
End bp95823 
Gene Length3051 bp 
Protein Length811 aa 
Translation table 
GC content49% 
IMG OID638253192 
Productconserved hypothetical protein 
Protein accessionXP_567292 
Protein GI58259759 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATCG AAACTTCGGC CGCCCACGAC GTCGACAGTG CACCTCACGG TCTTCTCAAG 
AACGCACCTA CGGACGACCT GACTCTCGAA GATGAAAGTT GGCAGCATAG TGTTTGCGTG
CACTCTTACC CATCTCCACC CTTAAATGGG GGCGTGACTG AAATCATCTT CGATATTGAA
GTCAAGGACC CTTGGAGGGC CCTTGAGGAC CAAGGCTCCG AAGTGACCAA GAAGTTTATT
GAGGAGCAGA ACCACGTGAG CCTTCCACTA CCTCCTTAGC ACATCTATGT GGAAGCCACC
GCTAATTGAG TTATTCACAG TTGTCCGTCC CTCGCCTCAG CAATCACCCT CTTCGAACGG
AGCTTGAGAT TGCTGTCGAA CAATGCTACA ATCACGAGCG CATGACCTGT CCTGAACTTC
AAGCCAGCGG CTATTATTAC TGGAAATACA ATCAAGGTAC ATCTCCCCGA GATGTCATTC
TCCGTTCAAA AAATCTCGAG AGTGATTTTG GCAAGTTTGC CTCCGAGGAC GGCAAGGGCC
CTGAGCTTTT CTTCGACCTT AATACAGAAG AAAACATATC TTTATATGCT CACAGCTTCA
GCCCTAGCGG GAAACTTTGG TGTGCCATTC TTCAACAGTC TGGGTGAGTT TTCCGTTCTC
CGTATGTATC TGCCTCCTGA TACTCGATCC AGGGGTGACT GGTTGCGTCT TCGTGTATAT
GACACTCAGA CTAAGAAGGC TATCGAGAGA AGTGTGGGAG GAGCGAAATT TACTTTTGGT
GCTACCTGGG TGGGGGAGAA AGTGAGTAAT CATGCCGTTT ATTACTCGGT GCTATCTAAC
AGGCGTGCAG GGCTTCATCT ACAAGCGAGT GATCGACTAC GATACCACCG ACGGCAACTA
CCAAGCAAAG GAAGGTCAAT TCGGCTTGTT CTATCATCAA ATCGGCACTC CCCAATCAGA
AGATGTCCTC GTCTGGAAAG CCCCAGAGGG CGTCTTCCAG TACATTGGCA AGCCCTTGAT
CATCACATCC GATGCCAAAG AGGAGAATAA GAAGAGGGCT TGGTTCATGC TCGATATCTA
CAGGAACACA AGTCCGGAGA CTGAGGTTTT GATGGTGGAG CTGCCTGGAG GGACAGCTGG
CCCTGTGGGT CATACACTGC CATCTCTTGT GCTACATGGG AAAAAATGGG TATCAAAAGG
CTTCACTGGA ATGACCAATT GTAAGTTCCG TTCCATAATC GTTTTGCTCC ACTTTTGACA
TTTGCGCTAG ATATTGGCTC TTTGTCCGAC GACACTCACT TGTTCACCTC TTTCACCGAC
GGAATCTCCA CTGGCCGAAT TATCTCTGTG AGCGCTGCTG ACTATGACGC CTGTGGCGTC
AACGAGGCTA TCAAGTTCAA CACTGTCGTT CCCGCAAACT CCGAGGGCCA TCAACTACGT
CACGCATACC TTATTGGCGA CCAAGTCATC GTTCTCGATT ATCTCAAGCA TGGTTGTTCG
TTCCTCGTGT TCCTAGATGC TCGGACCGGC AAATCTGTGG GCTCGTCAGA CTCCAGGGGA
ACCCGCGGTG ATGCCGCTAT AGATCCAGAT GTGGAAGTGC CTGTTCCAGA GGAGGAGGTA
GCCGAGCAAT CACCAACTGA AGATCAAGTC ATCATACCTC AACATGCCTC CATCAATGAA
CTCCAAAGCC GACCAGACTC GAACGACTTT TATTTTTCAG TAAACACCTT TGTCGCTCCT
CCTTACGTGC TTCGAGGAGA ATTAATTAAA AATCACAAGG TGGAGAAGGG CATCAAGATC
AGTGGCATTT CAAAATCTCA CACGATGCCT CAAGAGACTC TTGTCTGTTC TCAGTTGTTT
TACGAGTCGC ATGATGGTGT CAAGATCCCG ATGTTCATCT GCCATGCCCA CGATCTCGAT
TTGACCAAGC CTAACCCGGC ACTAGTTCAC GCTTACGGTG GCTTCTGTTC TCCATCTCTT
CCCCGCTTTG ACCCCATGTT TGTGGCGTTC ATGCGCAACC TCAGAGGGAT GTAAGTACGC
AAATAATGGT CCACCACATA TTTCAAGCTT ATTCTACCTG TAGTGTTGCC GTTGCTGGCA
TTCGTGGAGG TGGAGAATAC GGGCCTGAAT GGCATGAAGC CGCGCTCGGT ATCAAGCGAT
GGGTAGGATG GGACGACTTT GCTTGGGCAG CCAAGTACCT GCAAGGGAAA GGTCTTACAA
CTCCGGCGCT CACTGCGACT TACGGAACAT CAAACGGGGG CCTTCTGGTG TCGGCTGCCA
TGGTTCGCAA CCCAAGCTTG TATTCTGTCG TTTTTCCCGA TGTTGCCATC ACCGACCTTC
TGAGGTATCA CAAATTCGTA TATCTTTTCA CACCTTTTTT AAACTAGGCA AAAGCTTATT
GCGCTGTTAG ACTCTTGGTC GTATCTGGAT GGACGAGTAT GGGTCTCCCG AAAAGGCTGA
GGACTTCCCC ATTCTCCATT CCACTTCACC TCTTCACAGT GTAGACGGCG ACCCCGCCGT
TCAGTATCCC GCTGTGCTTA TAACTACTGC CGATCACGAC ACCCGGGTCG TTCCTAGTCA
TTCTCTCAAG TTCCTGGCGG AGCTCCAAGG TCAGTCTTCC ATCAAATATA TTGCTGTTTG
TGGCTTTTAC TGACGGTCTC TGCTAATAAA ATTAGCTCGG AAGTCCGAGA ACAAGGGGGT
ATTGTAATTC AATCTGTTTT TATCAGTAAA TGACGAGGAC GCTAATGATG GTTTGCATAG
CCTCGGCCGC ATTTACGAAA ATGCTGGTCA TGAGCGTGAG TAATGTCCTT CTTCTAGCCT
AATCACGATG ATGCTTGATG GACAACTAAA ACTAACGATG CACTCGATAG TCGGTTCAAA
ACCTACCAAG AAGAAGGTTG AGGAGGCAGT TGACCGTCTG GTTTTTGTTT TGTACAACTT
GAAGGAACAG TGATCAAGTC ATCCGCATTT TGTAACGCTT TCCATTAATT TTTCACCGCA
CTATTTTGAC AACTGTCAAA TAAATGTACA GTAAATCCAT GCACAAGAAC A
 
Protein sequence
MAIETSAAHD VDSAPHGLLK NAPTDDLTLE DESWQHSVCV HSYPSPPLNG GVTEIIFDIE 
VKDPWRALED QGSEVTKKFI EEQNHLSVPR LSNHPLRTEL EIAVEQCYNH ERMTCPELQA
SGYYYWKYNQ GTSPRDVILR SKNLESDFGK FASEDGKGPE LFFDLNTEEN ISLYAHSFSP
SGKLWCAILQ QSGGDWLRLR VYDTQTKKAI ERSVGGAKFT FGATWVGEKG FIYKRVIDYD
TTDGNYQAKE GQFGLFYHQI GTPQSEDVLV WKAPEGVFQY IGKPLIITSD AKEENKKRAW
FMLDIYRNTS PETEVLMVEL PGGTAGPVGH TLPSLVLHGK KWVSKGFTGM TNYIGSLSDD
THLFTSFTDG ISTGRIISVS AADYDACGVN EAIKFNTVVP ANSEGHQLRH AYLIGDQVIV
LDYLKHGCSF LVFLDARTGK SVGSSDSRGT RGDAAIDPDV EVPVPEEEVA EQSPTEDQVI
IPQHASINEL QSRPDSNDFY FSVNTFVAPP YVLRGELIKN HKVEKGIKIS GISKSHTMPQ
ETLVCSQLFY ESHDGVKIPM FICHAHDLDL TKPNPALVHA YGGFCSPSLP RFDPMFVAFM
RNLRGIVAVA GIRGGGEYGP EWHEAALGIK RWVGWDDFAW AAKYLQGKGL TTPALTATYG
TSNGGLLVSA AMVRNPSLYS VVFPDVAITD LLRYHKFTLG RIWMDEYGSP EKAEDFPILH
STSPLHSVDG DPAVQYPAVL ITTADHDTRV VPSHSLKFLA ELQARKSENK GVFLGRIYEN
AGHELGSKPT KKKVEEAVDR LVFVLYNLKE Q