Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG00650 |
Symbol | |
ID | 3258632 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 176261 |
End bp | 179357 |
Gene Length | 3097 bp |
Protein Length | 911 aa |
Translation table | |
GC content | 51% |
IMG OID | 638257682 |
Product | expressed protein |
Protein accession | XP_571781 |
Protein GI | 58269250 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.768082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTCCACCC ACGTCCCTGC CCCCGGCCCA CACCAAGGCT CCTACGCCCA GCTCACAACC GTCTCCCGGC TCAACAAGCT TATATGCAAC CAGCGCCCTC CTCCCATCGA CACTTTTCGA ACGGAAGTAT TTCACCACAT ACCTCTCTCC ACGGCCACGA AGAACTCGAA TTTGAAGATG ACTTTGACTA TCTCTCCGTT GCCACGCCTT CTCGGCCAAA TGCGTCCAAG AGGGCCGCGC CAAACAAGGG GAAAGCAGTA GAAACCACCA AATGGAAGGG CAAAGGGAAA GCTCGAAACG GTATTCCGGG TGGCGAATGG CAAGTCGCTT CTGGTACATG ACACACGTGC GATAAACTGA TTTGCCGGAC CTCGTAGCTA CCTGCTGAAG CTTCCGGGTG ATCTTCTTCA TACGCTAATG TCTTGCCTGT CGCCCAGGAC CTTGTTGAGA TTTTCCGAGA CGTGCAAGTT ACTGAACGAG CAAACGGAAA ACGATACGGT GTGGCGGCAT AGCTATGTCA ACCGGTTTCT GGGCGAAGGC GTTGCCAGAG ATAGCAGGCG CAGGGAAGAG ATTGTGGTCC TTGCTCAGAG TTGTGTGAAT GCCGCCGGAA GGGGATGGAA AAAGGAGGCA TTGGGAAGGG AGGTCATGCT GGAGTATGTC GATCTGGACT GTAGTGTGAT TTATTGCTGA CCTGTTTGCA GTCTTTGGAC GAATTCCAAA ACTAGCATTG TCATGCACAC CCCACCAACG GGTCTTATCC ATTCCATCTC ACTGTACTAC CCCCCTTTCC TCCCCTCAAA TTCGAAAAGC CTTGTCATTG GAAAGAATCG TCAATCGACG CCCAAAGACA AAGTAGCGTA CACGAGCAAG CTCGGGGAAG ATGGGGACAA CGTCTTGCAA CCGACGACCC CAAAAATAAC TCATCGGCAA AAGTACGAAG CTGTTTTGGC CGCTACGACT CGGCCTCCAC CTTACATGCT CAGCGCGTCG TTGTTCATGG GAGGTGTAGT TAGGAGTGAT CCCATCAGCG GCAAGGTCTC TAAAGGTTTC TGGGGACCTG GAAGGGATGG TAAGTGGAAT GACCAACTAC CAAGACTATA TTGACGTCTT CTCAGCCAAC TTCCACATTC GACCACATAT CGACCCGCTT GCTGAGCCCT CAGCCATCTA TCTGCCAAGT AGATCACAAT CCTTCATTCT TTGGGGTCTT CAAACGGGCA GTGTAGTTTT TACAAGCGTT CAAACTCGTA ATCATGCTAC CCACGGCGGT AGAGCCACTT CCGTCAATGT TTACTCCGAC CCGAGAAAAT CACATGAAGG GTTTGTCGTT GATATTTGGG CGCCACAAGG GCAGAATGAT ACTGCTCTCA AATGGATCAC CGCTGGGCAA GACGGGCGAG TCAAGCTGTG GGAACTACAG TCAGGGACAA TCACCAAAGT CGGCAAGCGT CAAGCAAGTC TGGTGGATGG GAACATTGAT TGTGTCTTTA CATCTCCTCT CGCTGAAACT GGGTTTCCCA ACAGATCAGA GTTGGTCAAG AGACGGCAAG CTGGCAAGCC AGATGAAATC GTTCTGGCGA GATATGATTT GCAGCATGAT ATCGTAGCAG GCGTGACGGA AGATGGAGAT TTACGTGTTT GGTTTGAAGC GTCAAGTGGG AATGAGAATG AGGTGCGAAT CGATCTTGGA TCAGCAGAAA TTGAAGGGGA AATCAAAGTC ATGGAAATGA TCGGTTATCG CCATCAAGAC GAGGTTGCCG TCGCCGTTTT GATTCATCGA CGCCGATCCC ACATTCTGAT GCGGCACGAT ATCTCCAAAT CTGGCAATCA CCACATTACC ACTTTTTACT CCTCTGTCGG TGCCCCACTT AGCTGTATAC ACCCCTCACT CTTTCCCAAC CTCCCGATCT CAGCGCCAAA GCACGGAAAC TCCACGCCTA TGCTAGCGAG GATCGTCACG CCTGGCGAGA CACCTGATCC CTCCCCTCCA CCTCTCGACT TGCCCTCTGG AGGGTTCTCC TCTTCCTCTA CCCATTCTGA GCCAGAGTAT GGGCGGTATG TCCTTGCAGG TGATGAGGCA GGCTTTGTCC ATCTCTGGGC TTGGAATGGG GAGGATAATG AGAGAAAGAC TATACGGTCG TGGGAGGCGA TGGAAGGTAA GATCACAGCG CTCGATATGT CTTGCGGCCT CGTCGCTGTC GGAAGGTAAG TCCTTGTTTC TTCGCTTGAG GAGGCATGGG AGAAAAGGTC GAAATACTTG AACAGTTGAG CACGGCTGAT ATCGGATAAT AGTTTCGACG GTTTTGTCAA GATATACGAT CCACTTCCTA CTCCACCTAA GCTCTTACGC ACATTCCATG CATCGCACCT CTCCCCCGGT GAACTGCTCG TCGCTGGCAG CGACCAGCCC GACGCCAGGT TTTACACCGT CAACAAGATC ATCCTGGAGA ATGATATGGT GGTAGCGAGT ATTGGGAGAA AGGTATTCGC TTGGCGAGCT GGGGCTGGAA AGGGTAAGCA CGGTGGGAAG GAGGGGAAGA AGGGTGGTAT AGGCAAGGGT GAGGGAAGAG GTGGAACCCG TGGGATCGGT ACGTCTCCTT TCCCAACTCT CCCCAGAGCG GTTAAAGTGC CCTTTCCTCG GGTGCTAATA AAAAGTGTTT CATTAGTCAT GAAGGCATTA CATCAAGCTG CTGAAGAAGA TTTTGCAGAG TTTGCACCCC ACGCTGCGAC ACCCAGACAA CGTCTCACGA ATCCGCACGA AACCCTTGAA CGCGAGGCCA TGCAGGAGAT GGGTTTGGAA GATGGAGATG ATGCTTTACA GTATGCGCTC ATGCTCTCCA TGGAAGAACA AAGCCATGCT TCACCTCCTC ACAACGACTT ATTGAATGAG GAGCCATCAG TGTCTGGTTG GGTTGAAGAC GAAGATGAAG AGGAAAATGT GGATGATGAG ACTGCGGAGG CTATTAGACA AGTAGAGGCT TTCAAAAAAG CCGAGCAGGA GAATGAGTTG GCGAGAATGC TCGAAATGAT AAAGCAGGCG GAGAAAAAGG AAGGATAGAC TTATGAAAAT CTGTAGAGTT GTCCGTGAAA ATTCAGA
|
Protein sequence | MQPAPSSHRH FSNGSISPHT SLHGHEELEF EDDFDYLSVA TPSRPNASKR AAPNKGKAVE TTKWKGKGKA RNGIPGGECY LLKLPGDLLH TLMSCLSPRT LLRFSETCKL LNEQTENDTV WRHSYVNRFL GEGVARDSRR REEIVVLAQS CVNAAGRGWK KEALGREVML DLWTNSKTSI VMHTPPTGLI HSISLYYPPF LPSNSKSLVI GKNRQSTPKD KVAYTSKLGE DGDNVLQPTT PKITHRQKYE AVLAATTRPP PYMLSASLFM GGVVRSDPIS GKVSKGFWGP GRDANFHIRP HIDPLAEPSA IYLPSRSQSF ILWGLQTGSV VFTSVQTRNH ATHGGRATSV NVYSDPRKSH EGFVVDIWAP QGQNDTALKW ITAGQDGRVK LWELQSGTIT KVGKRQASLV DGNIDCVFTS PLAETGFPNR SELVKRRQAG KPDEIVLARY DLQHDIVAGV TEDGDLRVWF EASSGNENEV RIDLGSAEIE GEIKVMEMIG YRHQDEVAVA VLIHRRRSHI LMRHDISKSG NHHITTFYSS VGAPLSCIHP SLFPNLPISA PKHGNSTPML ARIVTPGETP DPSPPPLDLP SGGFSSSSTH SEPEYGRYVL AGDEAGFVHL WAWNGEDNER KTIRSWEAME GKITALDMSC GLVAVGSFDG FVKIYDPLPT PPKLLRTFHA SHLSPGELLV AGSDQPDARF YTVNKIILEN DMVVASIGRK VFAWRAGAGK GKHGGKEGKK GGIGKGEGRG GTRGIGTSPF PTLPRAVKVP FPRVLIKSVS LVMKALHQAA EEDFAEFAPH AATPRQRLTN PHETLEREAM QEMGLEDGDD ALQYALMLSM EEQSHASPPH NDLLNEEPSV SGWVEDEDEE ENVDDETAEA IRQVEAFKKA EQENELARML EMIKQAEKKE G
|
| |