Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK02010 |
Symbol | |
ID | 3254620 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 601105 |
End bp | 602334 |
Gene Length | 1230 bp |
Protein Length | 236 aa |
Translation table | |
GC content | 48% |
IMG OID | 638253694 |
Product | proteolysis and peptidolysis-related protein, putative |
Protein accession | XP_567677 |
Protein GI | 58260534 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAGTT TCAGCGTTCA CAAAGCGGCT TTGGAAGGTT AGTACTCTTT GCCTCTTGTT TGGCTATCAT CTTCAAAAAG GTTTGTTTAA CCTTCATCAT AGGTCAGATA GGTCTTGCGA GATCATTATT GAATGACGAT CCTAAACTTA TTAACTCGAA GGACGAAGTA CGTTCTGGCT GGCTCGAGAG GTGACATATG CTGACCTTCC GTTATTCCCG CCTCGCCCCA CTATCTTGTC CTTGTGCTCG CGCTCTCTTT GATGCCTTCA ATTCAAATTT ACCCACAGGA TGGCCGTACT CCGCTCCACT GGGCAGCTTC AACTTCAAAC CTTTCTGTCC TCCAACTGTT GCTCAACTAC CATCCTGACT TGGAAGCGAG AGATACTATG GGATGGACGG CGTTGATGAT TGCCTGTTGG TTCATTCGGC TTTCATAACT AGCGGGAAGC TAGGTTTAAT CAAAAGTCCC CTTGTTAATG CCCATTTTTA GCTGCGGCAG GACATCCGGA AATAGTCAGA GAGCTGATAG GTGCGGGTGC CAAAGTCGAT GCAGTGAATG AGAAGGGTCA AACGGCCCTG TGAGTTTGTC CTCTGTGTCT GAAATCAGTT GTAGCTGCTA ACCACTCTCA TTTTCTCTGA TAATGCTAGA CATTATGCGG CTTCCAAGGG AAACGTATCT GTAGGTGCCT CTCGTGTCCC AATATCTACA GATACTAACG TCGTTGCAGA TTGGCCGTTT GCTCATCAAC CACGGGGCGG ATGTAAGTGT CAAGCATGCT CCAACACTCA GGAGCAAATA TTTACCTTTA TTCTCATCAA TCAGATTAAT GCCAAAGACC GAGCGTCACA GCATCCCCTT CACCGAGCGG CAACCACAGG TAACAATGCT TTTTTGCAAT TACTCTTGAA CCCCCCAGAG GGACGACCAA AGACGAGGTT GAATACCGCT GACCGTGCTG GTAAGCCTCT GTTTTGTCTT ATTCTTATGG AATTATTTGT TTAAGACACT TGCTGATAAG GTGGTCAATT ATCAAGGTAA CACACCTCTG CACCTAGCGA TGGAAAGTGG ACATGGAGAC GCTGCTGTCG TGCTCATTGA GGCTGGAGCG GACCGTGAAC GGTCAAACTC CGAGGGGCAA ATGGCCGAAG AGATTGAGGG TGTCGGCGGA CAAGAACAAA ACAAGGTTAG AGAATACGTA GCGTCCAAGG TAGGACGAAG GTCTGAGTGA
|
Protein sequence | MSSFSVHKAA LEGQIGLARS LLNDDPKLIN SKDEDGRTPL HWAASTSNLS VLQLLLNYHP DLEARDTMGW TALMIASAAG HPEIVRELIG AGAKVDAVNE KGQTALHYAA SKGNVSIGRL LINHGADINA KDRASQHPLH RAATTGNNAF LQLLLNPPEG RPKTRLNTAD RAGNTPLHLA MESGHGDAAV VLIEAGADRE RSNSEGQMAE EIEGVGGQEQ NKVREYVASK VGRRSE
|
| |