Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA04100 |
Symbol | |
ID | 3253377 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1100719 |
End bp | 1102605 |
Gene Length | 1887 bp |
Protein Length | 453 aa |
Translation table | |
GC content | 52% |
IMG OID | 638252730 |
Product | choline-phosphate cytidylyltransferase, putative |
Protein accession | XP_566767 |
Protein GI | 58258709 |
COG category | [I] Lipid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0615] Cytidylyltransferase |
TIGRFAM ID | [TIGR00125] cytidyltransferase-related domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00623495 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACATACAGC TAGATCCAGC ACTACCCCGT CAACCATGTC TGCAAACCCA CCAGCGCAAA AACGACATAA CCGCAACAGG CTGGGAGAGA GACGGGTCAA CCGCGACCCA AGCAGCTCAA GAGATGCCAG CGAAGAGGGT AGGCAGCGTT CTATATTTTG AGATGCACAA AAAAGAAGCT AATACTGTGC TCCCCCTATC TGTAATGCCG TGTTTTCCCC GCCTGTCGAC TTCCCCTCAT CGGCTTCAAT CCTCCTAATG TCACTACTGC AATGTAACCG CTGCGGACTC TTGGCCAGAC AATGACAATG TCGAAAACTC ATTTTCCGAT GTCGGATCCA TGAACTCTTA CCACGCCGAA GCACTTTCCA CCACCTCAAC CATTGATTCG CCCACTAGGA TGCCGCCGCC TGCTCTTCCG CCCCACAGCT CGTCAGGCTC GCCAAGCACT ACCCAGGCTG GAAGACGTTC TTACCAGCAA AGACGCGTAG AAGAACTGAA TGGAGAAGGG AGCCAGAGCG AGGGTCTCGA TTCCCCAACG TGAGCCTCTT TCCAACGCAC TTTCTACGGC AAGATGGCGA AAAATATGTC GCTTGCTCGC AAAATCTCCC GGTCGCACGT GTACGGGTCT GTGCGCCAAA GGATGCTGAC GGTCCCATTT TTTTTATCAG ATATGACGGC GATGTGGAGA GCTCTTCCAC CATTGGCGGT GCACCTGCTC ACCACCAACA CACCCACTTC AGAAGGCCCT CATTCCCAGC CCCAGTACCT ACATCAGAAA CCCCGCATCC TGCAGCCCAC ATTGTCCAGC GGCAACCCAC TCCCAAAGCC TCCCAAATTG GCTTCTCCGC CGCGGACTAC CCTGCCGTGC CCACTCCTAA AGCAACCTAT GTTCGACCCT CAGATGTTCC TGTTGCACCT TCTGTAGCCC TTGAAGAGTG CGCGAGAAGC CCACCTACGA CTTCCTGGAT CCAATCTCCA AATTCCGCTG GAGGACCGCC GAAGATGTAC GCCCGCGCCG TAGAACGTAC GGAAGAGGAT ATCAAGGGCT TCGTTGAGCG AGCGATCCAC GGCAGAGGGC AAGAAGATGG TGTTGAGAGA TGGTGGAAGA CCAATCCTCC GCCTGAGGGC AAGGTTGTGA GAGTGTATGC GGATGGTGTC TATGATCTAT TTCACTTTGG GTACGTTTGG AGTATTTGCT GTCTGTATGA TGAACATAGC TCATGCTACT CTTTAGCCAT GCCTTGCAAC TTCGCCAAGC CAAGCTTTCC TTTCCCCAAG TTCATCTCAT GGTTGGCGTT TGCTCTGATG TTCTTTGTGC GCAGCACAAG TCTGCCCCAG CTATGACCCA CGCCGAGCGC TGTGAAGCAG TCAGGCATTG TCGATGGGCG GACGAGGTTA TCCCTGACGC ACCTTGGGTT GTTGATCAAG CGTTTTTGGA TAAGCACCAG ATTGACTATA TCGCGCATGA TGAAGAAGTT TACCCTAGTA AAGATCATGA AGATGTGTAT GCATTTGCTA AGAAGGAGGG TGAGTGAAAA TCCCGCTGCT CATATATGTT CTCTCCTCCT CATATATGCT CGCGCTGACA TATGGAAAGG CCGCTTCGTT CCTACTCGTC GAACACCTGC CATCTCCACG TCCGACCTTC TCGAGCGTAT CGTCCGAGGC TACAGAGATG GTTTCTTCGA TTCCAAACTT GAAAAGAACG GTCACCCCGA ACTGTTGGCT GCGGATGTCG ATTGGGACTC TAGCGCATCA ATGGAGAAGC GAGAAAAGAG AAAGGCGGCG CATCACCACA AAGTGAAAAA GTAGTACCAA AAAGAGAAGG AAAAAAAAAA GTGTTGTCTC TGATGTTTTC GGTCAGATCT CATTTAGGAT TTTAGTT
|
Protein sequence | MSANPPAQKR HNRNRLGERR VNRDPSSSRD ASEEDNDNVE NSFSDVGSMN SYHAEALSTT STIDSPTRMP PPALPPHSSS GSPSTTQAGR RSYQQRRVEE LNGEGSQSEG LDSPTYDGDV ESSSTIGGAP AHHQHTHFRR PSFPAPVPTS ETPHPAAHIV QRQPTPKASQ IGFSAADYPA VPTPKATYVR PSDVPVAPSV ALEECARSPP TTSWIQSPNS AGGPPKMYAR AVERTEEDIK GFVERAIHGR GQEDGVERWW KTNPPPEGKV VRVYADGVYD LFHFGHALQL RQAKLSFPQV HLMVGVCSDV LCAQHKSAPA MTHAERCEAV RHCRWADEVI PDAPWVVDQA FLDKHQIDYI AHDEEVYPSK DHEDVYAFAK KEGRFVPTRR TPAISTSDLL ERIVRGYRDG FFDSKLEKNG HPELLAADVD WDSSASMEKR EKRKAAHHHK VKK
|
| |