Gene CNK01620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK01620 
Symbol 
ID3254519 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp475084 
End bp477277 
Gene Length2194 bp 
Protein Length643 aa 
Translation table 
GC content48% 
IMG OID638253651 
Productconserved hypothetical protein 
Protein accessionXP_567836 
Protein GI58260852 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACCCCTTAT CCCGTCTCAC CCTGTGCATA GGATATGTCC AAGGTTCAGC TATACGTTTA 
TGATCTCTCT CACGGTCTCG CCAAGAGCAT GTCCCTTATG CTCACTGGCA AACAAATTGA
TGGTATCTGG TATGTAATAT CTGCTTGTCC TTAAGTGGAC CTGGAGCTTT ATGCAGTTGC
TGATCATCCT TTTGGGTTAG GCATACCTCA GTTGTCGCTT TTGGCCGCGA AATATACTAT
GGACAGGGTG TCCTCGAGTC CAAGCCGGGG GCGACTCACC ATGGTCAACC TTTACAAATT
TTAGATGTCG GTGAAACTCA TATAGACGAA GCAACATTCA ATGAGTATCT TTCGAGTCTG
AGTGGAATGT ACACGCCTAG CAAATACCAC TTGATTGAAT TCAACTGCAA CCACTTTACG
GCCGATGTCG TGGGCTTCTT AACTGGAGCA GAAATCCCAG CTTGGATTAG TAGTGAGTTT
TTTAAGACTT CCTCTAATGA TGATATTGAC AACATTTGAA GGTCTTCCCT CCGAATTTCT
CTCAACACCT TTCGGACAGG CTATGAAACC CCAAATAGAC GCAATGTTCC GTGGTCCTAC
AGCACAGCGT CCTATCCCTG ACAAAATTAG CAGCGCCAAT GCTTCGCCAG CACCTTCCAT
TGGCTCCTCA TCTGCACCTG GGGGTGATAC TGCTGCGGCT GGCCCTTCTC TTTCTTCTAC
ATTACTACAG TCAATCGCCG CGCAAGCTAC TGCTCAGACA ACTGGCCAAT CTACCGCAGC
CAATGGATCA TCCAAACAAC CTCTCAACCC TGAAACATCA CCTCTCACTC TCGTTTCATC
TACTGCCAAC TTCCATTCCA TCCTCTCGCA GCACTCTGCT GTCGTCGTAA ATTTTACTAA
CACACCATCA TGCCCCCCTT GCCGGGTCAT CAAACCCGTC TATGAGTCGA TCGCTAGCTA
TCATTCTGCC GTCTATGGAG CCAAGGGTGC TCGATTTGTG GAGGTCGAAT TAGGAATTGG
GCAGGGCCGA GAGATTGCGG GTACTTATGG TGTGCAGGCC ACTCCGACCT TTATGTTCTT
CAAGGATGGC AAAAAAGTCG GCGAGATGAA AGGCGCTGCC AAAAGGGAAC TGGAGAACAA
AGTTGAACAA TTCTTAGAGG AGTGCTATCC GACTCACCCC CACCGCAGAA TGTATCTTCC
CGCGGTTGAA GGATTGCCAA AAAGAGCGAT CACAGTTAGC AACCTGCCCA ATTATCCGGC
TTTGTTGAAC AAGCTCGAAG GGTTCTTAGC GGACAAGGGA AAGACAGAAA GCTTCATGGT
TCTAAAAAAC GAAGTGGTAC CATTCCTAGA GGGCAAGAGT CTTTCTGAAA CAGAATTGGC
TGCTCTGCTT CAGAAGTGGT CTGCTGCCAC CCAAGACTTG CTGCCTGCTC TTCAGCCAAC
AGAAACTTTC CCTTTAATCG ATCTCTGGCG AATTGCCCTT CAATGCCAAC CAATCATTCC
CTTCATTGGT TTGGGGCTCT CGACCGCCTC AAGCAACGCT GAACCCATCA CCAGTATCAT
TTCTCTTGCT TCAAACACTT TCTCTTCTTC TCCAGAAGCC ATACCCAAAC CCTTCATCCT
CACTGTCCTT CGTCTTCTCA CAAACTTCAC ATCTTGCGTT GAACTGACAA ACCTTGTGCT
CGCGCATGAT GGTAATGTTT CTACGTCTGA GCAGCTCATC AGCGTGTTGG TAGAGTCTCT
TCTGTATCCC GATGTGGGTG TAAGAAGCGC GGCTGCTGGT GTAGCGTTTA ACATTGGTCT
CTGGAGGCAT CATAACGTTG TAGAAGAGAC TCCAAATGTG GATTGGGAGC TTGAGGTGGT
CAGTGGTTTA GTAGAAGCTC TTGACCGGGA AGAGGATGAG GACGTCGGTG AGTTGCACTA
TCTTGACTAC AATATGCGAT CCATTACTAA TGTATGTGAA CAGCTCATCG TCTTCTTGCA
GCCCTTGCTT TGGAGATCTA CCTTTCTCCA AGCTATGAAG ATAACGTTCA GCCGATGCTG
CAGGTCTTGG AAGCATCCAA TAAGATTGAG AAGAGATGTA AGGTTTGGAA GAGGAAAGAG
GTTAAGAAGG TGGGAGAAGA GATCGCTAGA AAGCTTTGCT AAGCCACTAC CAAATAGACG
TTGCTTGTAG ATACAACAAA GACACGATCC TGAA
 
Protein sequence
MSKVQLYVYD LSHGLAKSMS LMLTGKQIDG IWHTSVVAFG REIYYGQGVL ESKPGATHHG 
QPLQILDVGE THIDEATFNE YLSSLSGMYT PSKYHLIEFN CNHFTADVVG FLTGAEIPAW
ISSLPSEFLS TPFGQAMKPQ IDAMFRGPTA QRPIPDKISS ANASPAPSIG SSSAPGGDTA
AAGPSLSSTL LQSIAAQATA QTTGQSTAAN GSSKQPLNPE TSPLTLVSST ANFHSILSQH
SAVVVNFTNT PSCPPCRVIK PVYESIASYH SAVYGAKGAR FVEVELGIGQ GREIAGTYGV
QATPTFMFFK DGKKVGEMKG AAKRELENKV EQFLEECYPT HPHRRMYLPA VEGLPKRAIT
VSNLPNYPAL LNKLEGFLAD KGKTESFMVL KNEVVPFLEG KSLSETELAA LLQKWSAATQ
DLLPALQPTE TFPLIDLWRI ALQCQPIIPF IGLGLSTASS NAEPITSIIS LASNTFSSSP
EAIPKPFILT VLRLLTNFTS CVELTNLVLA HDGNVSTSEQ LISVLVESLL YPDVGVRSAA
AGVAFNIGLW RHHNVVEETP NVDWELEVVS GLVEALDREE DEDVAHRLLA ALALEIYLSP
SYEDNVQPML QVLEASNKIE KRCKVWKRKE VKKVGEEIAR KLC