Gene CNK00400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK00400 
Symbol 
ID3254650 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp120112 
End bp123295 
Gene Length3184 bp 
Protein Length919 aa 
Translation table 
GC content54% 
IMG OID638253534 
Productcytoplasm protein, putative 
Protein accessionXP_567609 
Protein GI58260398 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCACCG AGTCCCGGCG GATCGCCCTT GCCGACCCGC ACCACCTCGC CCCGCCTCGT 
TTCCTCGTCG GCGGCCAGCT CGCACACATG CTCTCCGGCG ATGTCAAGCT CTACGAGTGG
GACCGCGACG TGAGTTCTTG TCCGTATTCC ATACCATGCG CGCTCAGCCA TGCCCGTGTC
TGCAGAACGG GAAAATGTCC ATGCTCGGCC TCCAGACAGA GATTGGCCAG ATCAAAGCAC
TCGCCTGGTC GCCGTCTCCG GCCCACAGAC ATCTGGTCGC AACGGGTCTG TCCACCGGCA
AGACACTCCT CCTCAACCTC TCACCATCCA CCTTGTCCCT CCCCGTCGCC AACCCGCCCC
CTTCGCCTCC CGTTATTGCT GCCCTAACAG TCAAGCATAC AAGGGCAGTG TCGAGTATAT
CCTTTTCCCC CCACGACGCC AACTATCTCG CCACAGGTCT TGAACGACAT CGATCCGATT
CGAGCTTGTT GATCTGGGAT ATCCACGATG CCGTCGCTGC TTCCCGCTTA CCACCAGACG
GAGACGTACA CTACACCCGG CCCGAACTTC GACTCCCGAT AACCACACCG CTTGCCAAGA
CATCTTCCGC TTCTGAACCG AGGCCAATAC AGGCATACTG CCCGTCGGAA CAAGTCAACT
CTGTCGCCTT TATCCCTACT GCTCCTTACT CCCTGCTGTC TTCTGCGGGC AACAAGGTCA
TTAGGCTATA CGATCTTCGT GCGCCGTCAA ATACTTTGAA AGAACCCAAT TCTGCAGGTT
CCCTCGCGCA ATGGTCTACA CGTACCGTCA TGTCACTCTG TCCCAACCCG TCCTCTACCC
TTTTCGCATC GTACGAATCA ATCCAAGGAG GCAATAGTAC AGTGAGACTG TGGGATACAA
GATACCCAGG CCAAGAAATT GTTGGGTGGG AAGTGAGAGG TGGGGTAGTG GGTATGAGCT
GGGTGGATAG TATGAGATTA GGTGTAGGCA GTAAAGAGGG CGGTGTGGGC GTTTGGGATA
TCGTTCGATG TAAACCCGAT CAGTCTGGGG CTAGCGAATG GGTTACATTG GGCGGTATGC
GTCAGAGTAA GTGTTATATC TTCATAATCT GCAAATCTGC ATTTGCTAAT CGTGAATGCA
TAAACAGTTA TCAAGCCCAA ACCAAACATG CATTCATTCG CATTCACACC GCCAACCCCG
CCCAAGCACA TCGATGTCAT GTACGTCCTC AAAGACAGTA CGATATCCAT CGGTCCCGTC
TCCACCGCCC CTGTGCTGGC TTCCAATTCT CATGGGGGCC TTTCCATCTC TGAACCTACA
TTGTACTTTA TCGATCCTGA TATCCCTCTA AGATCGTCAT CGTCATCGAC ATCTCCCACT
GCCCACGGTT TTGTCCAAGG GTCAGGGTTG GGGTTAGAAA TCGGCCAAGG AGATCGGGCG
GCAGAAGGAG AGGCAGGGGC AGGGGTTGAA GCCATTTACC CTCGAAACAA GTTCCAACTC
CCTCCCGACC AAGTGTCCAC CATCCTCAAC GAACATTCCC GTCTCCGTTC CTCCTCCTTT
GGAGCTGTAT CCACCATCCA CCCTGCCTCC TCGCCTCTTC ACTCCCATAC CAATCCCCAT
GGCCATGGCC GTCCCCATGT CCCACCCTCC GCGCCTACAA GACATCTCCC CTCCGGCTCG
GGCCCCAACG TAGCTATACA CCATATCCCA GATATGTTGA CCCAGTCGTT CTTATCCAGA
GATAGGTGGC ATGAAGAAGA TCCAACAGGG GTGATTAACG ATCGTGATGA GATTACGGGT
GGGTATGAAG GGTGGAGAAG GGTGTTGGGT GGGGATGTGG GGGTGGTGAT GCGGAGAAGG
GCGATGGAAG GGTACGGTTT GGATAACGTG AGTTGATGGG CCTGTCTTTT TCTTTTCTTT
GATGCTTTGA TTACTTTGGT TGCGACGTTG AGGTCTGTTT CCGTTTAGTT TCTGGGCGGC
TGACCGAAAA GAGAAACAGT TGTTATTGAA CGCGGCGATA GCGACAAAGT ATCGTGGAAA
ATTCAAACTT GCCGGTGTCT GGGAATTCGT CGAACGTAAG TTCCACTCCC GTACCTCACG
TACTCCTCGC CACTTTACTA ACCCGCACAC TTTTTTTCCT TAGACCTAAC CAAAACCATG
TCCCCCTCCG TCTCCTCTTC CGGTGGGTAC AACCTCACTC ATCAAGGGAT TTACCCCATC
TGGACATCCC TCGGTACACT CGACGCTGCA TCCGCTTCTG GCGGACCCAT TTCTGCCGCT
TTGAACGCTG ATGAAGACGG ACCCGGTGGG GAAGGCGGCG GTTTAGTCAG CGAGCTGGAA
GAGAGATTGA GACGCATGAA TATCAAGGGG AGTGGTGGTA GTACAGGAAG GGGAAGCAGG
TCGGCCTCTG TATCTCGAAA AGGTAGTGGT ACCCATACCC CTCGCGAACG CAAGATTTCG
GAACGTACAT CCCACACCCA CACCAACACC CACTCCCACT CCCATGCCCA CACATCATCG
ACCTACCCGC AAGGTCCCCC CTCTTACCTC CCCGCCATCT CCCACCTCCT CTCCTCCCGT
CTATCTTTTC CCGATAAAGC CATCCACCCA TCGGAACTGA GACCGTCTAT TTCATCTTCC
TCGGAGAAAG TCGAGTTGAG GAGGTTGATC CTGACGATAT GCGGGGAAAG TAAGGAAGGT
GGGAAGGGGG AGGTGGAGGA GATGTTGAGG AGGGGGGAGA GGAGTAAAGC AGCATTTAGG
GCGTTTTTCA GGGGGGATGA AAGTGGGACA GTGGGAATTT TGATGTCTAG CGAGGGTACG
TCGAATTTGA GCTCTTCCAC CCTGTGTTTC TTTTGCTCCT TTTTTCTCTG TCATCTCCGT
TTGGAAGAGG GCTAACGGAC AAGAGATAGA TCCAAATGAT ACATTACTCG GCTCGACTAT
CGCCGGGTTC ATGTCCCAAT CAGCTTCTAC CCGAGGATCA GAATATTTCA ACGCCCACTG
GCCCAACCTT ATTCGCCGGG TGGACGACCC GTACGTTCGT GCGATCCTTT CTCGTATTGC
GGGTGAAGAC TGGGAAAGTG TCTTGGAAGA GGAGTATATA CCGTTGTTGG AGAGGATGGT
CGTTGGAGTG CAGTATCTCG ATGATTGGGA GGTAGGTGTT TTTTTTTTTT CTCGCGACCA
TTAG
 
Protein sequence
MATESRRIAL ADPHHLAPPR FLVGGQLAHM LSGDVKLYEW DRDNGKMSML GLQTEIGQIK 
ALAWSPSPAH RHLVATGLST GKTLLLNLSP STLSLPVANP PPSPPVIAAL TVKHTRAVSS
ISFSPHDANY LATGLERHRS DSSLLIWDIH DAVAASRLPP DGDVHYTRPE LRLPITTPLA
KTSSASEPRP IQAYCPSEQV NSVAFIPTAP YSLLSSAGNK VIRLYDLRAP SNTLKEPNSA
GSLAQWSTRT VMSLCPNPSS TLFASYESIQ GGNSTVRLWD TRYPGQEIVG WEVRGGVVGM
SWVDSMRLGV GSKEGGVGVW DIVRCKPDQS GASEWVTLGG MRQIIKPKPN MHSFAFTPPT
PPKHIDVMYV LKDSTISIGP VSTAPVLASN SHGGLSISEP TLYFIDPDIP LRSSSSSTSP
TAHGFVQGSG LGLEIGQGDR AAEGEAGAGV EAIYPRNKFQ LPPDQVSTIL NEHSRLRSSS
FGAVSTIHPA SSPLHSHTNP HGHGRPHVPP SAPTRHLPSG SGPNVAIHHI PDMLTQSFLS
RDRWHEEDPT GVINDRDEIT GGYEGWRRVL GGDVGVVMRR RAMEGYATKY RGKFKLAGVW
EFVEHLTKTM SPSVSSSGGY NLTHQGIYPI WTSLGTLDAA SASGGPISAA LNADEDGPGG
EGGGLVSELE ERLRRMNIKG SGGSTGRGSR SASVSRKGSG THTPRERKIS ERTSHTHTNT
HSHSHAHTSS TYPQGPPSYL PAISHLLSSR LSFPDKAIHP SELRPSISSS SEKVELRRLI
LTICGESKEG GKGEVEEMLR RGERSKAAFR AFFRGDESGT VGILMSSEDP NDTLLGSTIA
GFMSQSASTR GSEYFNAHWP NLIRRVDDPY VRAILSRIAG EDWESVLEEE YIPLLERMVV
GVQYLDDWEV GVFFFSRDH