Gene Information |
Locus tag | CNK01620 |
Symbol | |
ID | 3254519 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 475084 |
End bp | 477277 |
Gene Length | 2194 bp |
Protein Length | 643 aa |
Translation table | |
GC content | 48% |
IMG OID | 638253651 |
Product | conserved hypothetical protein |
Protein accession | XP_567836 |
Protein GI | 58260852 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3118] Thioredoxin domain-containing protein |
TIGRFAM ID | |
|
|
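The reported gene length can be checked against the coordinates above. The following is a minimal sketch that assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the 2194 bp figure in the record). The helper names `span_length` and `coding_length` are hypothetical and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: 3 per residue,
    plus 3 for the stop codon when include_stop is True (an assumption)."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values taken from the record above.
gene_bp = span_length(475084, 477277)
cds_bp = coding_length(643)

print(gene_bp)            # 2194, matches the reported gene length
print(gene_bp - cds_bp)   # 262 bp that do not encode protein
```

The 262 bp difference is plausible for a gene flagged as spliced, since the genomic span can include introns and untranslated sequence in addition to the coding region. How that difference is split between the two cannot be determined from these numbers alone.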
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACCCCTTAT CCCGTCTCAC CCTGTGCATA GGATATGTCC AAGGTTCAGC TATACGTTTA TGATCTCTCT CACGGTCTCG CCAAGAGCAT GTCCCTTATG CTCACTGGCA AACAAATTGA TGGTATCTGG TATGTAATAT CTGCTTGTCC TTAAGTGGAC CTGGAGCTTT ATGCAGTTGC TGATCATCCT TTTGGGTTAG GCATACCTCA GTTGTCGCTT TTGGCCGCGA AATATACTAT GGACAGGGTG TCCTCGAGTC CAAGCCGGGG GCGACTCACC ATGGTCAACC TTTACAAATT TTAGATGTCG GTGAAACTCA TATAGACGAA GCAACATTCA ATGAGTATCT TTCGAGTCTG AGTGGAATGT ACACGCCTAG CAAATACCAC TTGATTGAAT TCAACTGCAA CCACTTTACG GCCGATGTCG TGGGCTTCTT AACTGGAGCA GAAATCCCAG CTTGGATTAG TAGTGAGTTT TTTAAGACTT CCTCTAATGA TGATATTGAC AACATTTGAA GGTCTTCCCT CCGAATTTCT CTCAACACCT TTCGGACAGG CTATGAAACC CCAAATAGAC GCAATGTTCC GTGGTCCTAC AGCACAGCGT CCTATCCCTG ACAAAATTAG CAGCGCCAAT GCTTCGCCAG CACCTTCCAT TGGCTCCTCA TCTGCACCTG GGGGTGATAC TGCTGCGGCT GGCCCTTCTC TTTCTTCTAC ATTACTACAG TCAATCGCCG CGCAAGCTAC TGCTCAGACA ACTGGCCAAT CTACCGCAGC CAATGGATCA TCCAAACAAC CTCTCAACCC TGAAACATCA CCTCTCACTC TCGTTTCATC TACTGCCAAC TTCCATTCCA TCCTCTCGCA GCACTCTGCT GTCGTCGTAA ATTTTACTAA CACACCATCA TGCCCCCCTT GCCGGGTCAT CAAACCCGTC TATGAGTCGA TCGCTAGCTA TCATTCTGCC GTCTATGGAG CCAAGGGTGC TCGATTTGTG GAGGTCGAAT TAGGAATTGG GCAGGGCCGA GAGATTGCGG GTACTTATGG TGTGCAGGCC ACTCCGACCT TTATGTTCTT CAAGGATGGC AAAAAAGTCG GCGAGATGAA AGGCGCTGCC AAAAGGGAAC TGGAGAACAA AGTTGAACAA TTCTTAGAGG AGTGCTATCC GACTCACCCC CACCGCAGAA TGTATCTTCC CGCGGTTGAA GGATTGCCAA AAAGAGCGAT CACAGTTAGC AACCTGCCCA ATTATCCGGC TTTGTTGAAC AAGCTCGAAG GGTTCTTAGC GGACAAGGGA AAGACAGAAA GCTTCATGGT TCTAAAAAAC GAAGTGGTAC CATTCCTAGA GGGCAAGAGT CTTTCTGAAA CAGAATTGGC TGCTCTGCTT CAGAAGTGGT CTGCTGCCAC CCAAGACTTG CTGCCTGCTC TTCAGCCAAC AGAAACTTTC CCTTTAATCG ATCTCTGGCG AATTGCCCTT CAATGCCAAC CAATCATTCC CTTCATTGGT TTGGGGCTCT CGACCGCCTC AAGCAACGCT GAACCCATCA CCAGTATCAT TTCTCTTGCT TCAAACACTT TCTCTTCTTC TCCAGAAGCC ATACCCAAAC CCTTCATCCT CACTGTCCTT CGTCTTCTCA CAAACTTCAC ATCTTGCGTT GAACTGACAA ACCTTGTGCT CGCGCATGAT GGTAATGTTT CTACGTCTGA GCAGCTCATC AGCGTGTTGG TAGAGTCTCT TCTGTATCCC GATGTGGGTG TAAGAAGCGC GGCTGCTGGT GTAGCGTTTA ACATTGGTCT 
CTGGAGGCAT CATAACGTTG TAGAAGAGAC TCCAAATGTG GATTGGGAGC TTGAGGTGGT CAGTGGTTTA GTAGAAGCTC TTGACCGGGA AGAGGATGAG GACGTCGGTG AGTTGCACTA TCTTGACTAC AATATGCGAT CCATTACTAA TGTATGTGAA CAGCTCATCG TCTTCTTGCA GCCCTTGCTT TGGAGATCTA CCTTTCTCCA AGCTATGAAG ATAACGTTCA GCCGATGCTG CAGGTCTTGG AAGCATCCAA TAAGATTGAG AAGAGATGTA AGGTTTGGAA GAGGAAAGAG GTTAAGAAGG TGGGAGAAGA GATCGCTAGA AAGCTTTGCT AAGCCACTAC CAAATAGACG TTGCTTGTAG ATACAACAAA GACACGATCC TGAA
|
Protein sequence | MSKVQLYVYD LSHGLAKSMS LMLTGKQIDG IWHTSVVAFG REIYYGQGVL ESKPGATHHG QPLQILDVGE THIDEATFNE YLSSLSGMYT PSKYHLIEFN CNHFTADVVG FLTGAEIPAW ISSLPSEFLS TPFGQAMKPQ IDAMFRGPTA QRPIPDKISS ANASPAPSIG SSSAPGGDTA AAGPSLSSTL LQSIAAQATA QTTGQSTAAN GSSKQPLNPE TSPLTLVSST ANFHSILSQH SAVVVNFTNT PSCPPCRVIK PVYESIASYH SAVYGAKGAR FVEVELGIGQ GREIAGTYGV QATPTFMFFK DGKKVGEMKG AAKRELENKV EQFLEECYPT HPHRRMYLPA VEGLPKRAIT VSNLPNYPAL LNKLEGFLAD KGKTESFMVL KNEVVPFLEG KSLSETELAA LLQKWSAATQ DLLPALQPTE TFPLIDLWRI ALQCQPIIPF IGLGLSTASS NAEPITSIIS LASNTFSSSP EAIPKPFILT VLRLLTNFTS CVELTNLVLA HDGNVSTSEQ LISVLVESLL YPDVGVRSAA AGVAFNIGLW RHHNVVEETP NVDWELEVVS GLVEALDREE DEDVAHRLLA ALALEIYLSP SYEDNVQPML QVLEASNKIE KRCKVWKRKE VKKVGEEIAR KLC