Gene Information |
Locus tag | CNH02430 |
Symbol | |
ID | 3259110 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 442365 |
End bp | 445079 |
Gene Length | 2715 bp |
Protein Length | 759 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258242 |
Product | conserved hypothetical protein |
Protein accession | XP_572418 |
Protein GI | 58270524 |
COG category | [G] Carbohydrate transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family; [COG2313] Uncharacterized enzyme involved in pigment biosynthesis |
TIGRFAM ID | |
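
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the reported gene length), and the function name `span_length` is hypothetical, not part of any database API.

```python
# Minimal consistency check for the record's coordinate and length fields.
# Assumption: Start bp / End bp are 1-based and inclusive.

def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1

# Values copied from the record above.
start_bp, end_bp = 442365, 445079
protein_length_aa = 759

gene_length = span_length(start_bp, end_bp)

# Bases needed to encode the protein: one codon per residue plus a stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that are not accounted for by codons. For a spliced
# gene this remainder is expected to be non-zero.
noncoding_bp = gene_length - coding_bp

print(gene_length, coding_bp, noncoding_bp)
```

The computed span equals the reported Gene Length of 2715 bp, and the span is longer than the 2280 bp needed to encode 759 residues plus a stop codon, which is consistent with the record marking the gene as spliced.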
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTGA AGTCAGCTAT TCGCTGCAGG ACCCCGTCTA GGTTGGCGTA TGGGTTCAGT GTCTCTCGAC ACCTTTCCAA CGTTGCAACG GCTAAGAAGC TTTGGGTGAG CATAACAGGC CCGGCGTTTG AACATCGGCT CAAAGTTGCG TACTCCAGGG AGACAGCCTC GTCTTCTCTG AAGAGGTAGA GGCAGCACTG CATGCTCGCT CCCCAATTGT GGCGTTGGAA TCAACAATCA TTACTCATGG CATGCCGCAT CCTGTCAATT TGGAAACCGC GCAATCCGTC GAATCCATCA TTCGCGCTTC TTCAGCTGTC CCGGCTACCA TTGCTATTAT CAATGGCAAG ATCCATGTTG GACTAAGTTC CCGGCAACTC GAGGGAATCG CAGATGTTAG CACGGGGCTG GGCAAGGGAT CGGTCAAAGT TTCAAGGCGA GATCTTGCTC CGACTTTAGC CCTTAGGAGG ACTGGCGGTA CCACGGTGGC AGGGACGATG TACATCGCCA ACAGTGTTGG TATTCACGTC TTCGTCACAG GCGGTATTGG AGGCGTTCAC CGTGGTGCCG AGAACAGTAT GTCATCTGAG AGCAGCAGAG AACTGTCAAG CTAACACTTC CTTAAGGTAT GGATATCTCT GCGGACCTCA TTGAACTTGG ACGCACCCCC ATGGCCGTCG TTTGTGCAGG CGCAAAATCC ATCCTCGACA TTCCTCGAAC CCTTGAAGTT CTTGAGACTC AAGGCGTCTG CGTGGCTACC TATGGAGGAA ACTCCGAGTT CCCGGCTTTC TACCACCCGA GCTCGGGATG CGAGGTTAGT CCTTTGTCAT AATATTGGAA TTGCACGTTT ATTTTGTTCT TCAGAGTCCG TGGTCTGTAC CGGATATCAA ATCCGCTGCC AATCTAGTCT GTACGTCTGA CATTTCGACC TTCTGGTGGA TACATCAACT CACAAACCTT TAGATGCCTC TCTTAATCTT CCCACGCCTC TCAGTGCTCT TCTCGCTGTT CCAATTCCTT CCGAGCACGC CGATGCTGGC CTTACGGTAC AGAAGGCTGT CGAGCAAGCG GCACGCGAGT CTGTTGAACA AGGCATCGAC AAGAGGGGTA AAGAAGTCAC TCCTTGGTTG CTGAAGCGTG TTGGAGAGCT TACTAGGGGT ACTGCATTGG GACTTAGTGG GTATTTCGGC AATCGACTCA GGGTGTCTTA CTGATCGTTT ATTTTTTTGC AGACATCAAG CTTTATGAGA ACAATGCCAG GATTGGTAGT CAAGTTGCGG TACAGGTTTC CAAGCTCTTC AGGGAACAGA AAGATGCATC CTCTGCCCTC TACATCCCGG TCTCCTCATC AGAATCTTTC CCTAAGCCGG AGTTGAAGAA TGAATCTCCC AAGCTCGCGC TACCCAACAC ATCTACTCAG TCTTCTCTAC CTTCCCCCTC TACCCTCGTT TTTGGTTCTG CCGCGGTCGA TCTCACTTCC ACTTCAACGC ATAGTCTCGC ACCCCGCACA ACCACTCCTG GAGAAGTGTT CGTTTCTCCC GGTGGGGTGG GTCGTAACAT TGCCGAGGCT GCTCAAAATC TCCTTCCTCC TAATTCCGTT CAGCTTGTCT CTGCATATGG ATCCGTCTCT GGCTCATCTG AAAGCGATAC GACAGAGCCT GATGCTTTCG GGAAGCTCTT GCTATTTGAA CTTGCAGGTG CCAACATGAG AGCAGATGGT CTTGTGGGCA AGGAAGGGCG AAATACGGCA GTTTGCAGTT TGACCTTGGA AAAAGATGGG GATTTGGTTG CGGGAGTTGC 
TGATATGGGT ATTGTGGAAA CTTTGACTGA GGAATTTGTA AGGCCGCGTT TACGGCGAAT GCTGCATACA AAAGTTGACA TCCAATGTAG GTCGCACGAA AAATTGACGA GGAAAAGCCT GAGATGGTCG TCTTTGATTT GAATCTTCTT GAAGGGGTAG TCAAGGCCAT TCTAACGGCG TGCCAAACCC TCAACGTCCC TAGTATGTCA TCATCGTCAT ATAACGACGC GGTTTTACTT ATTAAAAGCG TTTTTAGCAT TTTGTGATCC TACCTCTACT CCCAAACTCC CTCGCCTTAT TCCAGCTCTT AACATCCTCC TCCCATCTTC ACCTTCCTTC CCTCGGCCGC TCACCCACCT CACTCCCAAC CTTCTCGAAC TTGACCTTCT TCATTCTCTC TTGAGCTCGT CTGCTTCTGA CGATACTTCC TCCATCACCT GGGAATTTAT CAACTCCCTA GGGCTTGACG GGGACTGGCG TGCAAAAGTC GAACGGTTCA CCAATGTCAA TGGAAGAGAG TGGATCAACG TTAACGGGGT CGTTCAAAAA ATGGTTTCTT GTCTTCCTTA TGTTGCTTCA TTTTGGGTAA AAGCTGGACA AAGAGGACTT TTGCACCTCC GAATGACATC GGTGCCTCCT CAGCCATCAC CTGATACACT GGTTCATCCC CTCGCCGGAC AGCACCATGG GAAATACTTG GCTTTTACGC ATTACACGCC CCCCGTTATC AAACCTGAAG AGATAATCAG TACGACAGGA GCGGGAGATA CATTGGCAGG TGGGTTAGTA GCTGGTTTGG TGGGCGGCAA AGGTGAGCCC GAAGAAATTT GGGTTAGAAG GGCTTTGGAT CGTGTGGGGA GAAGTCTCAG GAATCGACGG GCTGTTGGAT AGGTTATTCC GATAGTAGCT TTGTT
|
Protein sequence | MLLKSAIRCR TPSRLAYGFS VSRHLSNVAT AKKLWGDSLV FSEEVEAALH ARSPIVALES TIITHGMPHP VNLETAQSVE SIIRASSAVP ATIAIINGKI HVGLSSRQLE GIADVSTGLG KGSVKVSRRD LAPTLALRRT GGTTVAGTMY IANSVGIHVF VTGGIGGVHR GAENSMDISA DLIELGRTPM AVVCAGAKSI LDIPRTLEVL ETQGVCVATY GGNSEFPAFY HPSSGCESPW SVPDIKSAAN LVYASLNLPT PLSALLAVPI PSEHADAGLT VQKAVEQAAR ESVEQGIDKR GKEVTPWLLK RVGELTRGTA LGLIAVQVSK LFREQKDASS ALYIPVSSSE SFPKPELKNE SPKLALPNTS TQSSLPSPST LVFGSAAVDL TSTSTHSLAP RTTTPGEVFV SPGGVGRNIA EAAQNLLPPN SVQLVSAYGS VSGSSESDTT EPDAFGKLLL FELAGANMRA DGLVGKEGRN TAVCSLTLEK DGDLVAGVAD MGIVETLTEE FVARKIDEEK PEMVVFDLNL LEGVVKAILT ACQTLNVPTF CDPTSTPKLP RLIPALNILL PSSPSFPRPL THLTPNLLEL DLLHSLLSSS ASDDTSSITW EFINSLGLDG DWRAKVERFT NVNGREWINV NGVVQKMVSC LPYVASFWVK AGQRGLLHLR MTSVPPQPSP DTLVHPLAGQ HHGKYLAFTH YTPPVIKPEE IISTTGAGDT LAGGLVAGLV GGKGEPEEIW VRRALDRVGR SLRNRRAVG