Gene Information |
Locus tag | CNH00470 |
Symbol | |
ID | 3259233 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 1054109 |
End bp | 1055316 |
Gene Length | 1208 bp |
Protein Length | 315 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258439 |
Product | conserved hypothetical protein |
Protein accession | XP_572239 |
Protein GI | 58270166 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3386] Gluconolactonase |
TIGRFAM ID | |
|
|
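The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions. It also assumes the annotated span runs from the start codon to the stop codon with no untranslated region, which is consistent with the gene sequence beginning with ATG and ending with TAG. The function names are illustrative and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Coding nucleotides for a protein, counting one stop codon."""
    return (protein_length_aa + 1) * 3


# Values copied from the record above.
gene_len = feature_length(1054109, 1055316)
cds_len = coding_bases(315)

# Under the stated assumptions, the remainder is intronic sequence.
non_coding = gene_len - cds_len

print(gene_len)    # 1208
print(cds_len)     # 948
print(non_coding)  # 260
```

The 1208 bp result matches the reported Gene Length. The 260 bp difference fits the "Is gene spliced: Yes" field, although the record does not list exon boundaries, so the intron positions cannot be derived from it.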
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAAGA AGTTCGTCGT CGACAAGCCC CTCCTCCAGC TTAACGGTAC TTTAGGCGAA GGTGAGCGAT CATTTGTATT TAGTATGGTC AACTGCTTAA TGTCTCTGGC GTGCAGGCTG CGTCTGGGAT ACCAGGACTC AGCGTCTCTA TTTCGTCGAC ATTGATCAAT ATAAGCTCTA CACCTACGAG CCGTCTTCTG GCAAGTATGG ATATGAGTCA TTTGACAAGA AGGTCACTGC TTTGGCATCC CTCGAGAATG GAGAGGGTGT GAGTCGGTAG TCGCACACTC ATCCATGAAG ATCAACTAAT TCAAAATAGT TGATTGCTGC TGTTGAAGAT GGTTTCGCCT ACATTTCCTT TGACAGCCTT CCCTTCCCAC CGACTTCTTC CAAGCAGTCG CTTATCCCTA TTTCCTCTGG TGCCAGTCTC AACTGCAGGG AAAAGCGATT CAACGATGGT GCTGTAGACC CCGCTGGGCG ATTTTTAGCC GGTACATTGG GATTCGAGCA CGGTAGCAAA AATGGGAAGA TGTACTCTCT GCAAGCGGAA AAAGATGGGA GCTACAGTGC TCCGTTGATT CTTGATGGGA TCACTTGTAC GAATGGTATG GGATGGACTG AAGACGCAAA GACTTTGTAA GTATGAAGCA ATACCACATA ACAGAAACAT TCTAAGCTAT CCTTAATAGC TATTTCACAG ACAGCTGGAT CAAAGAAATT GCGAAGTTCG ACTACGACAT TGTATGTGCA AGACACCCTG ATTTGAAGAT TCGTTAACTC TTTTCGGTAG ACGACGGGGA AGCTCAGCAA CCGCCGAGTT TTCTCCAACT TTGACGGCTA CGGTGAACCT GATGGCATGT GCATGGATTC TGAAGGCGGT ATCTGGACAT GTCGGTGGGC TTCAGGAAAG GTCTTGCGCC TTACGCCTGA CGGCGAGATT GATGTCGAGA TTGACTTCCC TACTGCTTGG CACATTACCT GTTGCATCTT TGGCGGTAAG TCGAGTATAA GGCATTGGGC TCAGCAGTGG TACTGATCTA CACGTAGGTG AAAACCTTGA CGAACTCTAT GTTACTTCGG CCGCCTCTGA CTACATCGGC GATAACCTTC CCGACCGTAA GAACGGTGGC GATTTGTTCG TTGTGAAGGG CCTTGGATTC AGGGGAATTG AGCGAGGCAG GTTCAAGGGT ACCATTCCCA ACAAATAG
|
Protein sequence | MFKKFVVDKP LLQLNGTLGE GCVWDTRTQR LYFVDIDQYK LYTYEPSSGK YGYESFDKKV TALASLENGE GLIAAVEDGF AYISFDSLPF PPTSSKQSLI PISSGASLNC REKRFNDGAV DPAGRFLAGT LGFEHGSKNG KMYSLQAEKD GSYSAPLILD GITCTNGMGW TEDAKTFYFT DSWIKEIAKF DYDITTGKLS NRRVFSNFDG YGEPDGMCMD SEGGIWTCRW ASGKVLRLTP DGEIDVEIDF PTAWHITCCI FGGENLDELY VTSAASDYIG DNLPDRKNGG DLFVVKGLGF RGIERGRFKG TIPNK
|
| |
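The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of how a GC percentage like the one reported in the record could be computed. The helper names are hypothetical, and the exact rounding behind the record's "48%" is not stated in the source.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence above, used as a small demonstration.
first_blocks = "ATGTTCAAGA AGTTCGTCGT CGACAAGCCC"

print(len(clean_sequence(first_blocks)))  # 30
print(gc_percent(first_blocks))           # 50.0
```

Passing the full Gene sequence field to `gc_percent` gives a value to compare against the reported GC content, and `len(clean_sequence(...))` can be compared against the Gene Length and Protein Length fields.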