Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG04380 |
Symbol | |
ID | 3258929 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 1238045 |
End bp | 1240989 |
Gene Length | 2945 bp |
Protein Length | 884 aa |
Translation table | |
GC content | 49% |
IMG OID | 638258062 |
Product | WD repeat protein, putative |
Protein accession | XP_572117 |
Protein GI | 58269922 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.235349 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAGTGGCAA TGAAGTCCAA CTTCGTCTTT CAAAATCTCT GCGGTACCGT TTACAGACAA GGGAATGTTG TCTTTACGCC CGATGGGAAC TCTGTGTTGA GTCCGGTGGG AAACAGGGTG TCTGTGTTTG ACTTGGTGAA GTAAGTAGTC ACGGGTGTGC TAGATGATCT CCGGCTGATC TCCGCTTCAG TAACAAGTCA AGGACATTGC CCTTTGAGAA CAGAAAAAAT ATCGCTTCAA TTGCTCTTTC TCCAGATGGC AATGTACTAA TTTCCATCGA CGAAGGTGAA TATTCCTATC GTCAATTGAT GTATCTTATC TGACACCCTG GATGCTTTAG ATGGAAGAGC ACTGTTTGTG CACTTCCGTA AAGGGACAGT TCTTCACCAC ATCTCCTTCA AGCGCAAAGT CCACCACGTT TCATTTTCTC CCGACGGCAA ATATATTGCG ATAACGCACG GTCACATGGT GCAAATATGG AACACTCCCA GCCATTTGGT CCGAGAGTTT GCACCTTTTA CCCTCCACAG GGAGTATACT GGTCATCACG ACGAAGTTGT TAGTGTCTGC TGGTCCAAAA GCTCCAGGTG AGCGACCACA AATGATGTCT TACAAAACGG AAGTTGATCA GAAACAGGTA TTTTATCACT ACCTCGAGAG ATATGACTGC GAGGCTATAT ACCATTAACC CACTTGAAGG TTTCCAACCT AAGCAATTTG GGGGACATAG AGATGTTGTG TTGGGAGCGT TCTTTTCTCA AGATGAGAAG ACTGTAAGTC ATAATCTCTC TCAGTCGAGG CGACAAATTG ATGTTCTATA GATTTACACC GTTTCTCGAG ACGGTGCAGT GTTTGTATGG AAAGCAAAAA AGGGCGTCTC TGAGGCGGAC TCTGATGTTG AGATGGACAT TCTCGATGCT CCCACCACTT CCACCTCTGC CGCCAACCTT GCCCTTGAGC ACGCCGTCGC CTACACTCGA TGGGGGGTTC ACGCTCGCCA CTTCTTCAAC CAGCCTGGTA CGAAAGTGAT TTGCGCTACC TTCCACCCCA AAACATCTCT CCTCATCGTC GGCTTTTCCT CTGGTGTCTT CGGCTTGTGG GAGATGCCCG AATTTACCCC CGTACATACG TTGTCGATCT CCAACGAGAA GATCTCTAGT GTGGCAGTCT CCGCGTCGGG AGAGTGGTTG GCATTTGGGG CGGCCAAGCT CGGACAGTTA TTGGTTTGGG AATGGCAGAG TGAGAGTTAC GTTCTTAAAC AGCAAGGTCA CTACTACGAC ATGAACACCC TGGCGTTTAG TCCCGATGGG CAGAACATCG CTACTGGCGG TGAAGATGGT AAGGTCAAGT TATGGAATGC TTCAAGTGGC TTCTGCTTTG TGACCTTCCC TGAACACACT GCCGCTATCT CCACTGTCGA ATTTGCTAAG CAGGGACAAG TTTTATTCAC AGCGTCCCTT GACGGTACTG TCCGCGCATA CGACCTCATC CGATACCGTA ACTTCCGGAC ATTCACCTCT CCCACCCCTG TCCAGTTCTC TGCCCTCGCC GTCGATCCTT CAGGCGATGT GGTCTGTGCT GGATCTCAAG ATTCCTTCGA GATCTACATG TGGTCAGTCC AAACCGGTAA ACTGCTTGAC ATCCTCACTG GCCATACCGC TCCTATATCA GGCCTCGCCT TCTCTCCCAC CGGTAATCAG TTGGCATCTT CCTCTTGGGA TCGTTCTATC CGTTTATGGT CAGTCTTCGG GCGATCAAGA GCCACCGAAC CGATTGAGCT TTCGGGCGAA GCGACTGCGT TGGCGTTCAG GCCTGACGGA AATGAGATCT GCGCTTCTAC TTTGAACGGG GAATTGATCT TTATCGATGT GGAAGAAGGA CAGATTAAGT CTGTTATTGA AGGCCGAAGA GATATTTCTG GAGGGAGAAA GGTGGATGAC CGACTTACAG CTGCCAATAA CGCCGCAAGC AAGTATTTCA ACAGTGTCAT CTACACTGCC GACGGTGCTT GTGTCTTGGC TGGTGGAAGC AGCAAGTATG TTGTGTTGTA TGATCGGACG GAAGGCGTGA TGGTGAAGAA GTTCCAGATC AGCGAAAATC TCAGCTTGGA CGGTACGCAA GAGATGTTGG ACTCGAGGAA GATGACGGAG GCAGGGACCA TCGACTCGTT TGATAGACAA GGCGAGGAGG AGGACTTGGA AGATAGGTTG GACTCGACGT TGCCCGGTGC CAGCAAGGGT GATCTTTCCA AGAGACGGTA TAGACGGGAG GCCAGGACCA ATTGTGTACG TTTTTCTGCC ACTGGTCGAA GCTGGGCGGC GGCGAGTACA GAAGGTCTTT TGATATATTC TCTCGATGAG AGCACCACTT TTGACCCCTT TGACCTCTCC CTCGATCTTA CCCCCGAATC AGTGATGCAG ACCGTCGTCA GTGGCGACCA CCTTATCGCT CTCATTATGG CTCTTCGTCT TTCAGAAAAG CCTCTTATCC AGAGGGTATA TGAATCTATA CCTCCTTCAT CAATCCGACT CATCGCCCGC CAGTTGCCTA GGGTGTACAT TACCCAGTTT ATGAAATTCA TCAGCGACCA TATCGAGAAC ACTCCTCACG TGGAGTTCGA TTTGGTGTGG ACAGCTGCCA TGTTGACTAG TCATGGAAAG TTCTTGAAGG AGAGGAAGGG AGAGATGGCT TCGACGTTGA GAGGATTGGT AAGAGGGTTG ATGGGTTTAG AGATGAGTGT TGCCAAGATG TGAGTTAACA AGCAATATAA TTAATCGGGA TTTATAATAA ATCGCTGACG ATTCTCACTT AGATCGGATG AGAACACATT TTCCCTCAAT TATATCCTTT CTCAGGCAGG CAAGGATGAT CAAATCGAAT TTGAAGAAGG TGAAGGGGAT GCGTTTATTT TGGACGTCGA TGGCGCATAG ACATAGCTCA TGAGA
|
Protein sequence | MKSNFVFQNL CGTVYRQGNV VFTPDGNSVL SPVGNRVSVF DLVNNKSRTL PFENRKNIAS IALSPDGNVL ISIDEDGRAL FVHFRKGTVL HHISFKRKVH HVSFSPDGKY IAITHGHMVQ IWNTPSHLVR EFAPFTLHRE YTGHHDEVVS VCWSKSSRYF ITTSRDMTAR LYTINPLEGF QPKQFGGHRD VVLGAFFSQD EKTIYTVSRD GAVFVWKAKK GVSEADSDVE MDILDAPTTS TSAANLALEH AVAYTRWGVH ARHFFNQPGT KVICATFHPK TSLLIVGFSS GVFGLWEMPE FTPVHTLSIS NEKISSVAVS ASGEWLAFGA AKLGQLLVWE WQSESYVLKQ QGHYYDMNTL AFSPDGQNIA TGGEDGKVKL WNASSGFCFV TFPEHTAAIS TVEFAKQGQV LFTASLDGTV RAYDLIRYRN FRTFTSPTPV QFSALAVDPS GDVVCAGSQD SFEIYMWSVQ TGKLLDILTG HTAPISGLAF SPTGNQLASS SWDRSIRLWS VFGRSRATEP IELSGEATAL AFRPDGNEIC ASTLNGELIF IDVEEGQIKS VIEGRRDISG GRKVDDRLTA ANNAASKYFN SVIYTADGAC VLAGGSSKYV VLYDRTEGVM VKKFQISENL SLDGTQEMLD SRKMTEAGTI DSFDRQGEEE DLEDRLDSTL PGASKGDLSK RRYRREARTN CVRFSATGRS WAAASTEGLL IYSLDESTTF DPFDLSLDLT PESVMQTVVS GDHLIALIMA LRLSEKPLIQ RVYESIPPSS IRLIARQLPR VYITQFMKFI SDHIENTPHV EFDLVWTAAM LTSHGKFLKE RKGEMASTLR GLVRGLMGLE MSVAKISDEN TFSLNYILSQ AGKDDQIEFE EGEGDAFILD VDGA
|
| |