Gene CNK01250 details


Gene Information

Locus tag: CNK01250
Symbol:
ID: 3254665
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 373606
End bp: 376639
Gene Length: 3034 bp
Protein Length: 847 aa
Translation table:
GC content: 48%
IMG OID: 638253615
Product: hypothetical protein
Protein accession: XP_567733
Protein GI: 58260646
COG category:
COG ID:
TIGRFAM ID:
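
The Gene Length field can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which matches the reported 3034 bp. The function name gene_length is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record above.
length = gene_length(373606, 376639)
print(length)  # 3034
```

Note that 847 aa corresponds to 2541 coding bases (2544 with a stop codon), which is shorter than 3034 bp. That is consistent with the record marking the gene as spliced.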


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACAGATATC AACAAGAAGA GGAAAATACA GGTTAGCACA GCCGATAATC CATGTGCAGT 
TCCAGCTTAC ATTTCACAGC GAGCATGCGA CATCTGTCGG AAAAAGAAGA TTAAGTGCGA
TGGACCTATG AACAGTCTGA GTAGCCATAG TGAGTAAAAG TAAATAGACC CTCGTATGGA
ACCGATCTGG CTAACTCCTA ATGTAGAATG TGCTTACTGT ACTGAACGAG ATTTGGAATG
TACCTATATT GAGGAGGCTC AACGAAGAGG TCCCCCTAAA GGGTCTGTTT TGAGATAAGC
TTCTTACTTA TACTGACATT TTCCTGTCCA GCTATCTGGA AACCGTCGAG CAACGTTGCG
GAAGGCTGGA GAAACTGCTT CAAGAGGCAA GCCATGTTTT CGCCCTTCCT CCTGATATCC
ATCTAACTGG TATTTATTTC TGTTTCAGCT TCATCCCGAC TTCAATTTCG ATTCCTACGT
AGGCCCGCCT ATCGATCGTC AGACATTTGA CCTTGTTACC TATCACGACC TCCTCGATAC
TCTTGAAATC CCACCTTATC CATCCCAAAA GCCTATAGTT ATTGATGATT TACGTCCCGG
ATTATCAGCA TCCGATCCTT CAGTAACTCC GTTATCTTCT GAGTCGTCCG CGAACTCATT
ATCGAACCTC AAGCGGATTC TGACACGCTT CTACAAGGCT GTCAGTTTAG ATGCCCAGAG
AGAGTCTGAT ACGGAAGAAA CTGAGATGGA AGCCCGAACT CAGCTTAGCG TGGCAGAGTC
GATGAAACGT ATGGGTGCAC GTGATCATCA CTGGAGATAT CACGGAACCG CCTCTGGTAT
TCAATTTATC ACTCATCTGC AAGAACTTCG GTCCCGCGCC GCATCGAGAA GAATGGATTT
CATCTCTTGT GTCAACCGTG TAAAGAGGCA GCGATTCTGG GAGGTTCCCG AGTGGGAGCT
TGCCATTGCG AGCGAAGGCT TGAGCCCTCT CGACCTATCA AGCTGGCCCG AAAAAGGCCT
CGACCAGCAA CTCATTGATG CCTATTTTGA TCACGTCAAC CTACATTTAC CGCTGCTTAA
TCGAAAAGTC TTCCAAGAGC AGTATAATTC TTGCAAGTGG AAAAATCATC ACGGGTTTGC
AAGGGTTTGC CTGTTACTAT TCGCGAATGG AGCGCGATAC CTTGATGACG ACCCCAGAGT
CATTTGGCCA CTTGATCAGT GCGCATCGGA AAGTGGCCGC GATACCATAA TCAACGATCC
GTATGATTTG AGGCGGTATT CTGCCGGATG GAGATACGTT CGCAGCTACT TGCGTATGGG
TCAAAGTATC GTCCAGGGTC CAAATCTTTT TGAACTGCAG GGTAGAATTC TTACATGCCA
ATTCGTTGCA GGAACTGCCA TCCCTCACTT TATCTGGATA GTCGCTGGCT ATGGTCTTCG
TTCTGCTCAA GAACTCGGTA TCCACATGCG CGCCACCCTC CTCCATGCCG ACCCTACTGA
GCGAGAGCTC TGCAGTCGAG CGTTCTGGTG TCTTTACCAT ATTGACCGCA TTAGTTGCTC
TGCCATTGGG CGCACTCTCG CCATCCAGGA CTATGATTTC GATTTGCAAT ATCCTGCGGA
TGTTGACGAC GAGTACTGGT CTACCGAAGA TCGAGAAAAG GATTTTCAGC AGCCAATAGG
GAAGGGCCCA TCGATACTCG CCACGTTCAT CGAAACTCTG AAACTTGACC ACATTGTTGG
AGCTGCTCTT CAGACAATTC ACACGCACAG CAAAGGATGG CTGAAATCCA AGGATATCAC
TGCCAACTAC GCCGTCCTGG CTGAGCTTGA CTCTGCTCTC AACAATTGGG AAAAACAAGT
CCCGCAGGCT TTAAAGTGGA ACCCCCAGCA GTCTGACACT CGGCTATTCA AGCAATGCGC
CTTGTTGTAC TGTCGCTATC ACTATGTTCG AATGCTCATC CACAAGCCTT TTGTTCCGAT
CCAATACAAA GCCAAAAAGC AACATTCTGA TGCTTTGCCA TCGCTCCGAA TATGCGTCGA
GGCAGCTCGT TCCTTAGCAA GCATTTTTGA CGCGTGCCTC GTGCGTGGAC GGCGGGAACA
ATATCAGCCT TCTGTAGATG TTGAGATGGC CCTCCCCATG TGGGATGCAG CCACCATCTT
GCTCGTTGAT ATCTTTTCAA GCAAGCAGAC TGCTTCAGAA CGAGAAACGG CGTTGATGTG
TCTGAGATGT TGCCAACAAA CGAGTGCTGT ATTGGAAGGA ACTTGGAGGC AAGTTGCAAA
ATATTCTGAT TTTCTGGTAT CCTTGATGGA TGAGAGTATC ATGCCCCCCA TAGTTGATGA
AAGTCGGGCA GGGAGTGAGA AGCGGACCAG GCAGGATGAT GACAGTGGGC GGTCGAAGAA
AAGGGCTTGG AGAGCAGGTA GCGAGCCCAT AATGAATCCG AGAGGTGAAA ATGAGGAGAG
CAATTTGCAA TCAAGGTCTC AGTCGAGGAT GAACACTCTC GTCTCTCAAC AAGAACCCAC
TCGCACTCCG TTTGTCGGCA CCGACATACC TTTTGTGTCT TCTACCCCTA CTCACGATAT
TCCCAACCCC AACATGCAAC TCCCTTCTTT CCCATACTCT CAAGCCCGAG GAGAATATCC
TAGTCAACCA TCACCGGCTG CTTTTGTACC GTCTACTCAA GACACTTTTC AACCTTCGGA
TATCATGTAT GATTGGTTGA TGAATATGGC CAGTGCTGCC CCACAAACAC TTAGTATGAG
TTCGATGGCG TTAGGTGGGG CCGAAGGTGG GATTGACGAC CCAATTTGGG CGCAGCTTTT
TGGAGATACG GTGTTTTGTG AGTCCCATCA AAAATATAAC TCCTCCATTG CACCGACTAA
CACCCTGGAA GAAATGTATT ATTGCATCTT CACAATAGAT AATCCATGTA CTGTACACGA
CCAAAAATGG TTGATGTTCA TTATTACATC ACATTACGTT CAAATTTGCC GCAATTATTT
TATCACTGGC CCAAGGTCAA TACAATAGTA AGAA
 
Protein sequence
MNSLSSHKCA YCTERDLECT YIEEAQRRGP PKGYLETVEQ RCGRLEKLLQ ELHPDFNFDS 
YVGPPIDRQT FDLVTYHDLL DTLEIPPYPS QKPIVIDDLR PGLSASDPSV TPLSSESSAN
SLSNLKRILT RFYKAVSLDA QRESDTEETE MEARTQLSVA ESMKRMGARD HHWRYHGTAS
GIQFITHLQE LRSRAASRRM DFISCVNRVK RQRFWEVPEW ELAIASEGLS PLDLSSWPEK
GLDQQLIDAY FDHVNLHLPL LNRKVFQEQY NSCKWKNHHG FARVCLLLFA NGARYLDDDP
RVIWPLDQCA SESGRDTIIN DPYDLRRYSA GWRYVRSYLR MGQSIVQGPN LFELQGRILT
CQFVAGTAIP HFIWIVAGYG LRSAQELGIH MRATLLHADP TERELCSRAF WCLYHIDRIS
CSAIGRTLAI QDYDFDLQYP ADVDDEYWST EDREKDFQQP IGKGPSILAT FIETLKLDHI
VGAALQTIHT HSKGWLKSKD ITANYAVLAE LDSALNNWEK QVPQALKWNP QQSDTRLFKQ
CALLYCRYHY VRMLIHKPFV PIQYKAKKQH SDALPSLRIC VEAARSLASI FDACLVRGRR
EQYQPSVDVE MALPMWDAAT ILLVDIFSSK QTASERETAL MCLRCCQQTS AVLEGTWRQV
AKYSDFLVSL MDESIMPPIV DESRAGSEKR TRQDDDSGRS KKRAWRAGSE PIMNPRGENE
ESNLQSRSQS RMNTLVSQQE PTRTPFVGTD IPFVSSTPTH DIPNPNMQLP SFPYSQARGE
YPSQPSPAAF VPSTQDTFQP SDIMYDWLMN MASAAPQTLS MSSMALGGAE GGIDDPIWAQ
LFGDTVF
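
The sequences above are printed in blocks of 10 residues, 60 per line, so the spacing has to be removed before lengths or base composition can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that clean-up. The helper names clean_sequence and gc_fraction are hypothetical, and the GC calculation assumes the input contains only A, C, G and T.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two lines of the gene sequence listed above, used as sample input.
raw = """CACAGATATC AACAAGAAGA GGAAAATACA GGTTAGCACA GCCGATAATC CATGTGCAGT
TCCAGCTTAC ATTTCACAGC GAGCATGCGA CATCTGTCGG AAAAAGAAGA TTAAGTGCGA"""

seq = clean_sequence(raw)
print(len(seq))  # 120
print(f"GC content of sample: {gc_fraction(seq):.0%}")
```

Run on the full gene sequence, the length should come out as the 3034 bp given in the record, and the GC fraction can be compared with the listed 48%. The same clean_sequence helper applies to the protein sequence when checking it against the 847 aa Protein Length field.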