Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNM01900 |
Symbol | |
ID | 3255173 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006682 |
Strand | + |
Start bp | 579104 |
End bp | 581123 |
Gene Length | 2020 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 52% |
IMG OID | 638254344 |
Product | conserved hypothetical protein |
Protein accession | XP_568477 |
Protein GI | 58262134 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2971] Predicted N-acetylglucosamine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGCAAG AGAATGGAAG TGGTTTAGTA TCCGATCAAC CAACCGTCAC TGGCGATCAG GTATCAGACG TCGCTATGCC TACCAGGCTG CCCACTCCAC CAGCTTCACC CACTGTCCCA CAGCTCATCC TCTGTGCTGA TGGAGGGGGT TCCAAGGTGT GTGTGGTGGT TAGGAGTGCG GATGGGTTAG AGGTCAGAGG CACTGCTGGG CCCTGTAATG TGTGAGTTAT TACACTAATA TCACTATCGC AAGAACTGAG CAACTCTCTA GTCAAAGTGT TGGGTATGCA GCGGCTACTC AGTCACTCCT TTTGGCCACC TATCGTGCCC TTGCCCAACT ACCTCGCTCT CACATCCCAC ACAACCTCCT CATGCCCTCG ATTGACCTTT CTGACAGCAC GCGTTCATCT CCTCAACTTC AAGCCGTCCC CATGATACTC AAAAACCAAC TGCCCTCACC ACCTTCTTCC ATTGGTTCAA AATCTGTGGT GATTCATAAT CTTCCATCTG TCATGAACAA CCTCACACTC GAACCGCCTT CTGCTTCTTC TTCCTTCTCC TCTTCCTCTT CCCTCCCCCC TTCATCATCG CCTTCTACAA CCCATTCGTC TCCCACGTCA ATCCCCTTGA CGCACCTCTC GTCATACCCG ACCTTGACGG ACAAATCAAT ATCCACGCCA AACCCAGGCA ACCCCACCCC TCGCACGCGC TTACCGCCCC TTAGTGTACC GATCTTTCAA TACGCGTGGC TGGCGTTGGC TGGAATATCG TGCAAGGCAG ATGAACAAGC TTTTGCAAAG GTCGTCTGTG GAGTCTTGGG TTTAGATATG GAACGGCTCA AAGTTACGAA TGGTAGGTTC CCATAGCGTT TCAAGATCTT TTTTATGACG CAGCTGATAC GTGTGTGGAA AGATGTAAAC CTCTTGGCGG CACCAGCACT CGACCTTCCG GACATAGACC ATGTAATTGC ACTTGTCGCT GGGACAGGGA CGGTAGGACG GGCGATCAAA GTTGGCGATA AGAAGCGAGG GTTGCCTCTG GAAGATGTTG CCATGTCCCG AGGTTGGGGT TATTTGTGAG TGATTTATTT TTTTGCTATT CTTAGTAAAA CTCAATAAAT GTGCTGACTG TGATGCAGAT TATGCGATGA AGGATCGGCA TTTTGGATCG GTCGGTTAGC TATTAGAGCC CTTTTATCTC TTTCCGACCG CCATGCTTCA TCGGGCATCT ATTCCTCCCC TCCACCGCCT TTCCTGCCCC TTCACAACGA CCTCCTAGCA TACTTTGGAA CGTCCAACCC CCTCGACTTG ATCAACGTCG CATCGCTCAC TGCGTCAGGG ATGGCAGAGC CTACCGAAAG TGTGGGCGAA GCGACGAGCC GGAGGAACGC TTTACTAGCA GGTGCAGCGA GGGTGGTGTT CAAACACGCT TTCCCAGGGG ATGTTAGTCC CCGCCCAGGA TTCCTTACGC CGCCACGCAG TACAGATGGA GGTGCTGATA TGGATGAGGA TCATGAAAGC ACGTCGAGTC CTCGACAGCC GGAAGAGTTG AAGCACGATG GTATTTTGGA TCACGCGTCC CACCTTGAAG CACTCGGTAT CGCACGTCAG GCAGCCGCGC CGCTTATTAC GCTTACACTC TCGCTCCTTG GCGACCGCAC AATCGTCAGA CCTGAAAGGT CAGCGTTAAC ACTTGGAGGC GGGCTGATGA TGAGCGAGGG ATACAGAGAG ATGCTCTTGG ATGGATTGAA GAAGGAGGGA GTGAGCTTTG GACGGGTGAT GGTGGTGGGT GACGCTGCTG GTGAAGGGGC CCAGGCTCTT GGTAGAGTTG AGTTTGAGTG AGAGATGTCT CTGAACTGGT ATTATTTGTG TCTGCATTTG TAGTAGCTGC TCCTGGACGC TATTATAATC CATACCATAG CGAGCTATAT ATTTCTATAT ACCATGCCTG TCAGGTTACT TACCGTCGTC ATCGTTTTGG TCTTTGGCGC TGTCACCCAC GTTGTACGAC GAGTTTGCGT
|
Protein sequence | MLQENGSGLV SDQPTVTGDQ VSDVAMPTRL PTPPASPTVP QLILCADGGG SKVCVVVRSA DGLEVRGTAG PCNVQSVGYA AATQSLLLAT YRALAQLPRS HIPHNLLMPS IDLSDSTRSS PQLQAVPMIL KNQLPSPPSS IGSKSVVIHN LPSVMNNLTL EPPSASSSFS SSSSLPPSSS PSTTHSSPTS IPLTHLSSYP TLTDKSISTP NPGNPTPRTR LPPLSVPIFQ YAWLALAGIS CKADEQAFAK VVCGVLGLDM ERLKVTNDVN LLAAPALDLP DIDHVIALVA GTGTVGRAIK VGDKKRGLPL EDVAMSRGWG YLLCDEGSAF WIGRLAIRAL LSLSDRHASS GIYSSPPPPF LPLHNDLLAY FGTSNPLDLI NVASLTASGM AEPTESVGEA TSRRNALLAG AARVVFKHAF PGDVSPRPGF LTPPRSTDGG ADMDEDHEST SSPRQPEELK HDGILDHASH LEALGIARQA AAPLITLTLS LLGDRTIVRP ERSALTLGGG LMMSEGYREM LLDGLKKEGV SFGRVMVVGD AAGEGAQALG RVEFE
|
| |