Gene CNI01400 details


Gene Information

Locus tag: CNI01400
Symbol:
ID: 3259492
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 414959
End bp: 416862
Gene Length: 1904 bp
Protein Length: 519 aa
Translation table:
GC content: 54%
IMG OID: 638258623
Product: choline kinase, putative
Protein accession: XP_572982
Protein GI: 58271652
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0510] Predicted choline kinase involved in LPS biosynthesis
TIGRFAM ID:
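
The gene length listed above follows from the start and end coordinates if they are read as 1-based and inclusive, which is an assumption here and not something the record states. A minimal Python sketch of that arithmetic, using a hypothetical helper named `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length in bp.

    Assumes 1-based, inclusive coordinates (an assumption; the record
    itself does not state its coordinate convention).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
length_bp = gene_length(414959, 416862)
print(length_bp)  # 1904, matching the listed gene length

# 519 residues correspond to 519 * 3 = 1557 coding bases (stop codon
# excluded), fewer than the 1904 bp gene span.
coding_bases = 519 * 3
print(coding_bases)  # 1557
```

The difference between the 1904 bp span and the 1557 coding bases is consistent with the record marking the gene as spliced, although the record does not give intron coordinates.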


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.918034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCGCCCTCA CACGCCCCGC CCTGCGCCGC CTGCACACGC CACCCAGCAT GGTGGTCACC 
CCCACCCGTC CCCCCGTCGA GCCATCCGGC CCCCCCCTCT CCCCCTTGGC ACTCGCCTCC
ACTGCCCCCC GAAACAGCTC GGACTCATAC TTCGCCGCCC AAGCCGGATC AAGGCCGCTG
CCCAGGCACA GGAGGACATC AGGAAGCAGC TTCTCAAAGC TGTCCGAGTT CAGCTTGGAC
CCTCCGCTTC TCGACAAAAC GGATCAGTAC GCTGCCTCGC CTGACCAGAT CGATGAAAAC
GGCGTCACAG GCGTCCGGCA TGTGGCGCTG AGTGTGGATG CCAGGTGAGT CTGTTGTATG
GATCGTGTCT TCATGCGCCA GCTCAGTACC AAACCCCAGC GAATGGCGGC AGCCAGTTTT
CAAGCAAAAA GTCCTAGCCA TCTTACGACG CCTTGTGAGT TCCTTGGAAA ACTATACCGT
CTAACACATG CGCAGCACGT CCCGAGATGG TCCTCTACAC TCTTGACACC CACCAACATC
CATCTTCAAA AAGTCTCGGG GGCCTTGACA AACGCCGTCT TCTTTGTCTC TTTCAATCCC
GCTCCCAACC CAACTTCGCC TTCCGAATCG CCCCTGCTAA CCCCTACCAT TCCTCCATCT
GATCCATCCC ACCCCCCACC GCTCACTCCC GACCAGTACC CCCACACCCT TCTCTTCAGA
GTCTATGGCC CTTCCTCTGA GGCACTCATC TCCCGATCGG AAGAACTACG TATACTCCAC
GTCCTTAGTA CTCAATATGG CATTGGCCCC AGAGTTTTTG GCACTTTTAC CAATGGCCGG
GTCGAGGAAT TCTTCCCATC ACGCGCTTTG ACCGCTCAAG AGTTACGCGA CCCCAGTATT
TCTCGTGGCA TTGCCCGGCG GATGCGCGAA CTTCACTCGG TGGACTTGCG TCGCTTGGGA
TACGAACAAG GTCGCGCTAC GGAGCCCGCT TTATGGATAT GTCTCAAAGA ATGGTCTGAA
GCAGCGGAAG ATGTCATCAC CTCTTTGACG GCCCTCGGCG GAACATTGGA AGCATGGGTA
GAGCGTTTCT CTTTACATCG TATCCGGGAA GAAGTCACAA TCTACCGGAA CTTTGTAGAA
TCGCAAAGTG GAAAGGGCAG TGGCGTTGTG TTTGCTCGTG AGTTTTTTTA TCCGTCCGTT
TCAAATGAGG CTTATCGCTA CGCAGACAAC GACACGCAGT ATGGAAATCT GTTACGCCTC
GATGTTGAGC TTCCGCCCAA CACTCCCGAA CATTGCCGTG TGAGTATACC ACAAGTCACC
GCCAATGAGC TTGCTAACGC ACATAAAATA AAAAAAGTAT ATTGTGATTG ACTTTGAATA
TGCTTCTCCC AACCCCCGTG GGTACGATAT CGCCAACCAT TTCCACGAAT GGCGAGCCAA
TTACCACCAT CCAACTCATT CTCACTCCCT CATTCCTCAT TTCCCCTACC CCACACCTAT
TCAGCGTGAA GACTTTTATC GATCATACTT GTCCGTCGAA GTTGACGGAA GAAACGGCGA
AGAAGTGGTG GGCAAACGCA AGGATGTTCC AGCAGACAAG GTTGCTGCTC TTGAACGTGA
AGTAAGGATT TGGAGCCCGG GGTGCAGTAT AAACTGGGCG TTGTGGGGTT TGGTTCAAGC
TGAAGAACAG GTCTGTGCCT TGGCCACGAA GAAGGAAGGG TATGTCCCAG AATTCGATTA
TCTCGTACGT GCCTCTTGTA TCTCTACACG TGTGAATGAA CACTTGTGCT GACCGCCGGT
CACGGGGGTA ACAGTCGTAC GCCGCTGAGC GACTTGAAAT GTTTCGGGAC GAAGCCAAGA
AGCTTGGAGT TCCGTTATAG ATTAACATAG ATTAGCCGAT TATA
 
Protein sequence
MVVTPTRPPV EPSGPPLSPL ALASTAPRNS SDSYFAAQAG SRPLPRHRRT SGSSFSKLSE 
FSLDPPLLDK TDQYAASPDQ IDENGVTGVR HVALSVDASE WRQPVFKQKV LAILRRLHVP
RWSSTLLTPT NIHLQKVSGA LTNAVFFVSF NPAPNPTSPS ESPLLTPTIP PSDPSHPPPL
TPDQYPHTLL FRVYGPSSEA LISRSEELRI LHVLSTQYGI GPRVFGTFTN GRVEEFFPSR
ALTAQELRDP SISRGIARRM RELHSVDLRR LGYEQGRATE PALWICLKEW SEAAEDVITS
LTALGGTLEA WVERFSLHRI REEVTIYRNF VESQSGKGSG VVFAHNDTQY GNLLRLDVEL
PPNTPEHCRY IVIDFEYASP NPRGYDIANH FHEWRANYHH PTHSHSLIPH FPYPTPIQRE
DFYRSYLSVE VDGRNGEEVV GKRKDVPADK VAALEREVRI WSPGCSINWA LWGLVQAEEQ
VCALATKKEG YVPEFDYLSY AAERLEMFRD EAKKLGVPL