Gene CNA04100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA04100 
Symbol 
ID3253377 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1100719 
End bp1102605 
Gene Length1887 bp 
Protein Length453 aa 
Translation table 
GC content52% 
IMG OID638252730 
Productcholine-phosphate cytidylyltransferase, putative 
Protein accessionXP_566767 
Protein GI58258709 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0615] Cytidylyltransferase 
TIGRFAM ID[TIGR00125] cytidyltransferase-related domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00623495 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AACATACAGC TAGATCCAGC ACTACCCCGT CAACCATGTC TGCAAACCCA CCAGCGCAAA 
AACGACATAA CCGCAACAGG CTGGGAGAGA GACGGGTCAA CCGCGACCCA AGCAGCTCAA
GAGATGCCAG CGAAGAGGGT AGGCAGCGTT CTATATTTTG AGATGCACAA AAAAGAAGCT
AATACTGTGC TCCCCCTATC TGTAATGCCG TGTTTTCCCC GCCTGTCGAC TTCCCCTCAT
CGGCTTCAAT CCTCCTAATG TCACTACTGC AATGTAACCG CTGCGGACTC TTGGCCAGAC
AATGACAATG TCGAAAACTC ATTTTCCGAT GTCGGATCCA TGAACTCTTA CCACGCCGAA
GCACTTTCCA CCACCTCAAC CATTGATTCG CCCACTAGGA TGCCGCCGCC TGCTCTTCCG
CCCCACAGCT CGTCAGGCTC GCCAAGCACT ACCCAGGCTG GAAGACGTTC TTACCAGCAA
AGACGCGTAG AAGAACTGAA TGGAGAAGGG AGCCAGAGCG AGGGTCTCGA TTCCCCAACG
TGAGCCTCTT TCCAACGCAC TTTCTACGGC AAGATGGCGA AAAATATGTC GCTTGCTCGC
AAAATCTCCC GGTCGCACGT GTACGGGTCT GTGCGCCAAA GGATGCTGAC GGTCCCATTT
TTTTTATCAG ATATGACGGC GATGTGGAGA GCTCTTCCAC CATTGGCGGT GCACCTGCTC
ACCACCAACA CACCCACTTC AGAAGGCCCT CATTCCCAGC CCCAGTACCT ACATCAGAAA
CCCCGCATCC TGCAGCCCAC ATTGTCCAGC GGCAACCCAC TCCCAAAGCC TCCCAAATTG
GCTTCTCCGC CGCGGACTAC CCTGCCGTGC CCACTCCTAA AGCAACCTAT GTTCGACCCT
CAGATGTTCC TGTTGCACCT TCTGTAGCCC TTGAAGAGTG CGCGAGAAGC CCACCTACGA
CTTCCTGGAT CCAATCTCCA AATTCCGCTG GAGGACCGCC GAAGATGTAC GCCCGCGCCG
TAGAACGTAC GGAAGAGGAT ATCAAGGGCT TCGTTGAGCG AGCGATCCAC GGCAGAGGGC
AAGAAGATGG TGTTGAGAGA TGGTGGAAGA CCAATCCTCC GCCTGAGGGC AAGGTTGTGA
GAGTGTATGC GGATGGTGTC TATGATCTAT TTCACTTTGG GTACGTTTGG AGTATTTGCT
GTCTGTATGA TGAACATAGC TCATGCTACT CTTTAGCCAT GCCTTGCAAC TTCGCCAAGC
CAAGCTTTCC TTTCCCCAAG TTCATCTCAT GGTTGGCGTT TGCTCTGATG TTCTTTGTGC
GCAGCACAAG TCTGCCCCAG CTATGACCCA CGCCGAGCGC TGTGAAGCAG TCAGGCATTG
TCGATGGGCG GACGAGGTTA TCCCTGACGC ACCTTGGGTT GTTGATCAAG CGTTTTTGGA
TAAGCACCAG ATTGACTATA TCGCGCATGA TGAAGAAGTT TACCCTAGTA AAGATCATGA
AGATGTGTAT GCATTTGCTA AGAAGGAGGG TGAGTGAAAA TCCCGCTGCT CATATATGTT
CTCTCCTCCT CATATATGCT CGCGCTGACA TATGGAAAGG CCGCTTCGTT CCTACTCGTC
GAACACCTGC CATCTCCACG TCCGACCTTC TCGAGCGTAT CGTCCGAGGC TACAGAGATG
GTTTCTTCGA TTCCAAACTT GAAAAGAACG GTCACCCCGA ACTGTTGGCT GCGGATGTCG
ATTGGGACTC TAGCGCATCA ATGGAGAAGC GAGAAAAGAG AAAGGCGGCG CATCACCACA
AAGTGAAAAA GTAGTACCAA AAAGAGAAGG AAAAAAAAAA GTGTTGTCTC TGATGTTTTC
GGTCAGATCT CATTTAGGAT TTTAGTT
 
Protein sequence
MSANPPAQKR HNRNRLGERR VNRDPSSSRD ASEEDNDNVE NSFSDVGSMN SYHAEALSTT 
STIDSPTRMP PPALPPHSSS GSPSTTQAGR RSYQQRRVEE LNGEGSQSEG LDSPTYDGDV
ESSSTIGGAP AHHQHTHFRR PSFPAPVPTS ETPHPAAHIV QRQPTPKASQ IGFSAADYPA
VPTPKATYVR PSDVPVAPSV ALEECARSPP TTSWIQSPNS AGGPPKMYAR AVERTEEDIK
GFVERAIHGR GQEDGVERWW KTNPPPEGKV VRVYADGVYD LFHFGHALQL RQAKLSFPQV
HLMVGVCSDV LCAQHKSAPA MTHAERCEAV RHCRWADEVI PDAPWVVDQA FLDKHQIDYI
AHDEEVYPSK DHEDVYAFAK KEGRFVPTRR TPAISTSDLL ERIVRGYRDG FFDSKLEKNG
HPELLAADVD WDSSASMEKR EKRKAAHHHK VKK