Gene CNK02040 details

Gene Information

Locus tag: CNK02040
Symbol:
ID: 3254642
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 611640
End bp: 613653
Gene Length: 2014 bp
Protein Length: 482 aa
Translation table:
GC content: 49%
IMG OID: 638253697
Product: expressed protein
Protein accession: XP_567680
Protein GI: 58260540
COG category: [L] Replication, recombination and repair; [R] General function prediction only
COG ID: [COG0494] NTP pyrophosphohydrolases including oxidative damage repair enzymes
TIGRFAM ID:
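
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of the database, the coordinates are assumed to be 1-based and inclusive, and the coding length assumes one stop codon in addition to the 482 residues.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_coding_bp(protein_aa: int) -> int:
    """Coding bases implied by a protein: one codon per residue plus a stop codon."""
    return (protein_aa + 1) * 3


# Values taken from the Gene Information record.
length = gene_length(611640, 613653)
coding = min_coding_bp(482)

print(length)           # 2014, matching the listed Gene Length
print(coding)           # 1449
print(length - coding)  # 565 bases that are not coding sequence
```

Under these assumptions the 565 bp difference would be introns and any untranslated regions, which is consistent with the record marking the gene as spliced.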


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.239052
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGGGCGGATA CCTTGATTTA TATACCGCCC CTCTCTTAAG ATATCAACAA ACTGACATAG 
ACATATCGGC AACATGGTAG CCTCGCCTCC TACAGCAACG CCATCAACCG CGCTCTTGTC
TCTTATCCAC TCTCTCCGCG CCTTGCCTAC TCGCCTCATC CAGTCCCCTC CGACCCAGCC
ACGTCGTGCA TCTGTAGCTA TCATCATACG CCTGAGACCT GCTGAAGAGC TGGTTTTTGA
AGGACATGAG CCAGAAGGAT GGACAGGAGA TGTTGTTTCG AGGGAAGATT GGGGAGAAGG
GCTTGAACTG GAAGATTTTA TGAAGTTGTG TAAGTGATCA TCGTGAGAGG CACGAATGAG
TAACCATCTG ACGCTATGGT ACAGCCTGGG TGAATCACCC TAATACTGTT CCAGAGATTC
TGTTCATTCG CCGTGCCTCT CCCTCTTCTT TACCTCCTCC CGGAGCCCAT CACCGTTGGG
CTTCTCATAT CGCCTTCCCC GGCGGCCGTC AAGAACCAGA CGATCAGTCG GCCTATTACA
CCGCATTGAG AGAGACATGG GAAGAGATTG GAATAGATTT GGCTGAGAAA GAATTCCTGA
ATGTAGGGCG GTTAGACGAG AGAGAGGTGA CCACAAGCTT AGGGAAAAGG CTGTTGATGA
TCCTCAGTCC CTTTGGTACG TTTGATGTGC TGCATCTAGA ACAGGACTGA TTAGGTGGAC
AGTATTCATC CAAACTACGC CAATAAGTCC GACTCCAGAG TTACAGGCGG TAAGTTCCTC
AAATGCGTCT CCGATCATGT ACTGACATCT GGAAAGGCCG AAATCTCGTC TGTCCACTGG
GTGCCTCTTT CTCTACTTAC GCCTCCCTTT TCACCTTCTC GTTGGTCGCA CGTCGAAATT
GATGTCAGCA CTCGGTTGAG TCCCCGAAAC AAATTTGTAA GATGGTGCTT GAGAAATCTC
ATAGGCAAAA TGAAGTAGGT TGTGTCTTTC CATATCACAT GTTATAGCAC TGATCAAAGC
GTCAAAGATT TGGCTGTTTG CTTCTTCCTG ATGAGCCGGC CGTAACAGCT GAAAATTTTG
ATCCCTTGGA TTTTGATGAA ACGTTGGAAG GAAGTGGTAG CTGGACCGAT GCAGCGGATG
GCAGTCGATT TTTGAGGCTT TGGGGCTTGA CCTTGGGAAT GACCCTGTAA GCTGATCCTG
CTCTATATCA TTCTGTAAAT CACTGATAAT CACTCTTTCC CTAGGGATCT TATCTCTCAT
CATCCGTCTG CACCCTCCAA GCTATTAGCA GAAGGCCAGA ACCTTTCAAC CTCACAACCC
AGTACTCCTG TTATGGAATA CAAGCCTCAA CTGGCCCCTC GCACTCCTGT GACCACTCAC
AGTACTTTTG AGGACCAGTG GGAGGCTGCG AGGAAAGCAT TGGCGGAAGA AGAGAAGGAC
AGGGCAATTG AGAAAGCGGC GCAAGGTAGA AGACGAAGAG GTGTAGGACC TTATATGACG
TTAGTTGTTC GAAGATGCCC AGTTTCCATC AGGCTGACAG AATAGTAGTG CGGTATTCCC
GAGATTCACA TATCCCGATG TCAATTTTTG GATATGGTGC GTAGCTTGGT CTGCTTGTCT
AGCCTCCGCT CACTAGTATT CATCAGGGTC TTCTCTCGTC GGTACCGACA AGTGCTGAAA
TCCTGGGAAT TGTCAGCCAT TGGCCCCTCT CGTGCAGCCG ACAGGCGTAT CAATTGGTCT
GGTCAGGCAC TTGCTACCTT TTACACGGCA GTAAGACAAG CTTTGGTTGT GACTCTGATC
ATCCGAGCTC TAGGTCTCGG TGTTGGGCTG GCGGGTGTGG GTTACTTGGC GTTCAAGGTT
ATGGGTGGTG GAGAGCTATA AAAGCAAAAG CTGCATGTAT TCTGCTGTAG CTGTTGACTG
GATAGATAAC TTTGAGTGTA TTAGCCTGAG GCCTGAGATG ATGGTTGTAG GATATTCATA
TGCCTCTTAG CTATGTATGT GGTTCATTGT CATG
 
Protein sequence
MVASPPTATP STALLSLIHS LRALPTRLIQ SPPTQPRRAS VAIIIRLRPA EELVFEGHEP 
EGWTGDVVSR EDWGEGLELE DFMKLSWVNH PNTVPEILFI RRASPSSLPP PGAHHRWASH
IAFPGGRQEP DDQSAYYTAL RETWEEIGID LAEKEFLNVG RLDEREVTTS LGKRLLMILS
PFVFIQTTPI SPTPELQAAE ISSVHWVPLS LLTPPFSPSR WSHVEIDVST RLSPRNKFVR
WCLRNLIGKM KFGCLLLPDE PAVTAENFDP LDFDETLEGS GSWTDAADGS RFLRLWGLTL
GMTLDLISHH PSAPSKLLAE GQNLSTSQPS TPVMEYKPQL APRTPVTTHS TFEDQWEAAR
KALAEEEKDR AIEKAAQGRR RRGVGPYMTA VFPRFTYPDV NFWIWVFSRR YRQVLKSWEL
SAIGPSRAAD RRINWSGQAL ATFYTAVRQA LVVTLIIRAL GLGVGLAGVG YLAFKVMGGG
EL