Gene CNJ00580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNJ00580 
Symbol 
ID3254304 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006679 
Strand
Start bp161353 
End bp164240 
Gene Length2888 bp 
Protein Length803 aa 
Translation table 
GC content50% 
IMG OID638253215 
ProductProlyl endopeptidase, putative 
Protein accessionXP_567311 
Protein GI58259797 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGTC AACAAGCAAG TCACTCTTTC AGCACAGACA AGACGGGCCA TGGCACTCTT 
AAGAACGTCC ATGCGTCTGA TTTTACAATC AGCCCCGGAC AGTGGAAGAA AAATGTTAAC
TTTTCACCAT ACCCAGTTCC TCCTCAACAT GGGGGCATTA CTGAAATTAT CCACGGAATT
GAAATCGAGG ACCCATGGCG AGCTCTTGAA GACCCCGATT CCGAGGTGAC AAAGAAGTTC
GTCAAGGAAC AAAATGATGT GAGTGCTTTC AATGTAAGGA AGGACTCTAC CTGACCAGCA
AACACAGTTC TCTGTTCCCA GACTTACCAA CCACCCCCTT CGAAAAGAGC TCGAAGCCGC
CGTTGAGCAA TGTTACAATC ATGAACGTAT GACCAGTCCC GAACTTCAGG GCGATGGATA
CTATTATTGG AAGTTTAACC CTGGTACCTC TCCTCGGGAC GTCATCGTTC GATCGAAGGA
TCTCAAACGC GACTTTGGGA AGGCTCCTGG CGGGAGTGGT CCCGAAATTT TCTATGACTT
GAACAAGGAG GAGAATATCT CTCTTTATGC CCATAGCTTT AGTCCTAGCG GGAAACTCTG
GTGTGCTGTT CTGCAGTATG CAGGGTAAGC GAAATAAATA TGTGAAGGAA GGCTATTGCT
TACCAGTTAT GTAGGAGTGA CTGGCAAAGG ATTCGAGTCA TCGACACCGA GAGCAAAGCT
GTCCTGGAAA AGGACTTGGG AGGATCGAAG TTCACTTTCG GCGTTACTTG GGTAGGCGAG
AAGGTGAATG CTTTTGATGT TCCTCTCTTC GTTAACTGAC ATGGATTACA GGGTTTTATT
TACAAGCGGT CAATCGACTA CGATGCCACT AGTGACGGTT ACGACGGTAT CGACGGCTCC
TTCGGCATGT TCTACCACGC AGTCGGCCAA CATCAGTCCA CCGATGTTAT CGTTTGGAGT
CCCCCGCCTG GAGAGTTTCA ATTCATTGGT AAAGCCAAGG TCGTTGCCGT TGATGAGAAG
GAGGAGAACA ACAAAAGGGC ATTCTTGGCT CTCGACATCT ACAAGAATAC CAGTCCTGAG
ACTGAGCTGC TGCTGGTCGA GTTGCCCGGC GGCACTGCAG GCCCTGCTGG CGTTCTTCTT
CCAGAACTGG TTACCAAGGA GATGAAGTGG GTATCCAGAG GTTTTACTGG AGAAACTCAT
TGTGAGTATC GAATTCGATT ATATTACTTC CATTGTTAAT GAAAGGTAGA TATTGGTTCG
TCCAGTGCCG AACGTCACTT CTTCACTTCT TTTACGGACG GCGTCTCTAC CGGCCGTATC
ATTGCCTTCG ACTCCGCCGA CTGGGATGCC ACAGACATCG ACAGCCCCTT ACCTATGCAA
GAGATTGTAC CCGCGGATCC CGAAGGCCAC CAACTTCAAA GCGCCTACTT CATCGGCGAC
CGACTGCTCG CTCTCATCTA CCTCAAACAC GCTTGCGCCT CTGTTGTCTT CATTGACGCT
CGGACGGGCA AGCCTCTGGG TTCTGCCGAT GCCCAAGGTA CCCATGGTAA CGTTGCTGCC
GACCCAGAGA CTCAAGTGCC CGTTCCAGAG GAAGAGGTCC AGCACGCAAA GGAAGGGCAA
GTCGTCATTC CAGAGCACGG TGCTATCACC AGCATTTCTT GCCGACCTGA CGCCAACGAC
TTTTACTTTA CCGTTGACAC CTGGGTTGCG CCTTCATACG TACTCAAGGG TGAGCTCATC
AAGAACAAGG CTGGTCGGTA CGAGGTAGAC ATTAGTAGTG TCAACTCTTC TGAGACCGCT
GCTCAAGAGA CGTTGGTTTG TTCTCAAGTA TTCTATACCT CACATGACGG TACCAGGATT
CCCATGTTCA TCTGTCACCC TCATGACCTT GACCTCACAC GCCCTCATCC TCTGCTTCTC
CATGCTTATG GGGGCTTCTG CTCGCCTCTT ATTCCCCACT TTGACCCAAT GTTTGCCGTT
TTCATGCGTA ATCTCCGAGG AGTGTAAGCT TCATTTCTTC GCCCGACAGA CTAATTCCGC
TGACTCCTTT CAGGGTTGCC ATCGCTGGTA TTCGAGGAGG TGGTGAATAC GGCAAGGCGT
GGCATGAAGC TGCTATCGGT ATCAAGCGCT CTGTCGGCTG GGATGACTTT GCTGCCGCCG
CTCGATATGT TCAGTCTCGA GGACTTACCA CCCCTTCTCT CACCGCAATC TACGGTAGCT
CCAACGGTGG TCTCCTTGTT TCTGCTGCCA CTGTTCGAAA CCCAGAGCTT TACTCTGTCG
TGTTTGCTGA TGTGGCTATC ACAGACTTGA TCAGATACCA CAAATTTGTG AGTGTCATGT
TTCGCTTTCT GTTTGACTTT CTCGCTTATG CCTACAACCT CATTTGCAGA CACTCGGACG
AATGTGGATG ACTGAATATG GCTCCCCAGA AGAACCTGAA ACCCTCGCGG TTCTTCGCGC
TAATTCCCCT CTTCACAATA TCAGCCGCGA TCCTTCTGTC CAATATCCTG CTATGCTCCT
CACCACCGGT GACCATGATA CACGAGTGGT ACCCGGTCAT TCGCTCAAGC TACTTGCAGA
GCTGCAGAGT GAGTTACTTC TCAATCAAGC CATGATAACT ACTGACATGT ATAAACTAGC
TCTCAAGGCT AAGAACCACG GGGCAATGTG AGATTTATAG TATTGATGCT AGAAACTTTT
TGCTGATCAC TTGTTCTAGC CTTGGTCGAG TGTACATAAA CGCAGGGCAC GAACGTACGT
CTACCATATA ATTGATCCAT TCCAGCCACT GATATTAGAC CGTTGCAGAA TCAACAAAGT
CAACCGAGAA GAAAGTTGAG GAGGCGGTTG ACCGTTTGGT ATTTGCACTT GACAACATCA
AGATTTGA
 
Protein sequence
MSGQQASHSF STDKTGHGTL KNVHASDFTI SPGQWKKNVN FSPYPVPPQH GGITEIIHGI 
EIEDPWRALE DPDSEVTKKF VKEQNDFSVP RLTNHPLRKE LEAAVEQCYN HERMTSPELQ
GDGYYYWKFN PGTSPRDVIV RSKDLKRDFG KAPGGSGPEI FYDLNKEENI SLYAHSFSPS
GKLWCAVLQY AGSDWQRIRV IDTESKAVLE KDLGGSKFTF GVTWGFIYKR SIDYDATSDG
YDGIDGSFGM FYHAVGQHQS TDVIVWSPPP GEFQFIGKAK VVAVDEKEEN NKRAFLALDI
YKNTSPETEL LLVELPGGTA GPAGVLLPEL VTKEMKWVSR GFTGETHYIG SSSAERHFFT
SFTDGVSTGR IIAFDSADWD ATDIDSPLPM QEIVPADPEG HQLQSAYFIG DRLLALIYLK
HACASVVFID ARTGKPLGSA DAQGTHGNVA ADPETQVPVP EEEVQHAKEG QVVIPEHGAI
TSISCRPDAN DFYFTVDTWV APSYVLKGEL IKNKAGRYEV DISSVNSSET AAQETLVCSQ
VFYTSHDGTR IPMFICHPHD LDLTRPHPLL LHAYGGFCSP LIPHFDPMFA VFMRNLRGVV
AIAGIRGGGE YGKAWHEAAI GIKRSVGWDD FAAAARYVQS RGLTTPSLTA IYGSSNGGLL
VSAATVRNPE LYSVVFADVA ITDLIRYHKF TLGRMWMTEY GSPEEPETLA VLRANSPLHN
ISRDPSVQYP AMLLTTGDHD TRVVPGHSLK LLAELQTLKA KNHGAILGRV YINAGHEQST
KSTEKKVEEA VDRLVFALDN IKI