Gene CNK01820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK01820 
Symbol 
ID3254585 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp531742 
End bp533572 
Gene Length1831 bp 
Protein Length571 aa 
Translation table 
GC content56% 
IMG OID638253675 
Productconserved hypothetical protein 
Protein accessionXP_567661 
Protein GI58260502 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACATCG CATACATTCT CGGCCTCGTA CCCCTCGCTT TCGCCGGTGT CATCAAGCAC 
GACCCTCCCA AGTTCCAGCC TATTCAGTCT ACCAGGATTG TGCGGCTGCA CCCAAACGGG
GACAAAAGCA AGTGCGTTGA CCTCCTGGGT AATACTCGCC AGGATGGTCA GCCCGTGCAG
GTGAGCCCAA GCAATGACAT CACACCATAC TGTGCCACTT TTGCTGATAT CGCTCAGATT
TGCGACTGCG ACGGTACCCC GGCTCAGGAC TGGGTCCTCA ATGCCGGCCG CGGTCAGACC
AAGGTCCAGC TCGCCGGCAC CAGTTTCTGT CTCGATGCCA CCCACCCTTA CGCAGCCGAC
GGGACCAACA TGAAGATCTG GAAGTGCTTG GACGTCCAAC AGCAAGACTG GTATTGGACG
AGTGATAACA GAATCGTTCT CCGCGACCAG GGCAAGTGCC TCGACTGGGC CACTGGGGAT
CGGTCTGATT TCAACCAGCT GCAGGTCTGG CGGTGCAGCA CGGATAACAA CAATCAGTGA
GTCACCTGCT TGCAGTGGCC TGAGTTGATG GGCTGACAAT AATCTCACTA TTAGGGTCTG
GACAACGGGA CCGGACTACG GTGGGAACCA TGGGGGTGAT GCTGGTGGGA ACCCCGGAGG
TAATCAAGGC GATGATTCAA GAGGCAAAAC CAATACTGGT GGAAACCCCG GAGGTAATCA
AGGTGGTGAT TCAGGAGGGA AAACCAATCA CATCATTCCC GACCCCCCAG GGCCAGACCC
CAACAGCGAG CCCCTCAACC CCGCCCTTGA AGCCATTGTT AACGTCACCG AGGCAGCTGG
ACCCTGGCCG CCCATGATCA ACTTCGACGG CGATTACAGC AATGACGACG TGACCGTATC
GGACCAAGTA CCGTTCGACT ACTGTATCGG GGAGGGGTCT GGTAACCCGA CGGATGATGA
AGGACAGCAG CAAGGACAAA ACTTCACAGC AAATGTAGCT GGGATAGGGA GAGACTTCTG
CCTGGACAAT TTTGGCAATC CTGACATTCG GAACACCATT TCTTTCGACA ACAACACCAG
CATTGGGAAC GGAGCGGACA CTGGGCGAGC CCTTCACAAG CGGACATTTG CGGATTCAGG
GGCGACGGGT ACGCCCAACC GGTGGAGACG AGGGTCGGTG ATTTCCATTT GCGTCGAGAG
GAACAACAAT TATCTGGTTC CATATGCGTC CTCCCCCGTT CCCATCCGAG CATCGGCTAT
CGTCGCATCC GCCATGGTAC GTGCAATCAA CTTCTGGAAC GCAGGTCTGA ACAAGCGATT
CGTCTCGTTC GAGTTTGTGG AGAACTGCAA CGACGCCGTG TTCCATACTC TTGCTGTTGA
CCAGATCAAG TCTGCCAAAG AGCCTACTGT GCTCGCGACT GCCCCCTTCC CTCCTCGGGG
TGAAGAGGGT GCTAGGAACC GCAACATCTT CGTGTGGAAT ACGGCTTTCG AGGCCAACTT
TCAGAACGTC CTTACCTTTA TCATGTCACA TGAGCTGGGG CACACTCTTG GCCTGGCGCA
TGAGGACTGC AAATCCAGAG ACCAACCTTG CGAAGTTATC ACTGACAAGG TGGCTGGGTC
AGTCGTGGAA AGCCGTATCT CCGGCAGCAC CACACAGCTG TTCAATGGCC CCACCCCGCT
TGACATAGCA GGGGCGAACG AGTACTACTC ACTTGCAGCG GGACCCAACA CCCCGGAGAA
CATCGTACTC TGGCCTGCGA CGAGGGGTCC GTTTATCAAC TACCCGCCGC TACCGAAATG
CAAGTGGTTC CTCGGTATTT GCTATTACTA G
 
Protein sequence
MHIAYILGLV PLAFAGVIKH DPPKFQPIQS TRIVRLHPNG DKSKCVDLLG NTRQDGQPVQ 
ICDCDGTPAQ DWVLNAGRGQ TKVQLAGTSF CLDATHPYAA DGTNMKIWKC LDVQQQDWYW
TSDNRIVLRD QGKCLDWATG DRSDFNQLQV WRCSTDNNNQ VWTTGPDYGG NHGGDAGGNP
GGNQGDDSRG KTNTGGNPGG NQGGDSGGKT NHIIPDPPGP DPNSEPLNPA LEAIVNVTEA
AGPWPPMINF DGDYSNDDVT VSDQVPFDYC IGEGSGNPTD DEGQQQGQNF TANVAGIGRD
FCLDNFGNPD IRNTISFDNN TSIGNGADTG RALHKRTFAD SGATGTPNRW RRGSVISICV
ERNNNYLVPY ASSPVPIRAS AIVASAMVRA INFWNAGLNK RFVSFEFVEN CNDAVFHTLA
VDQIKSAKEP TVLATAPFPP RGEEGARNRN IFVWNTAFEA NFQNVLTFIM SHELGHTLGL
AHEDCKSRDQ PCEVITDKVA GSVVESRISG STTQLFNGPT PLDIAGANEY YSLAAGPNTP
ENIVLWPATR GPFINYPPLP KCKWFLGICY Y