Gene Information |
Locus tag | CNC04230 |
Symbol | |
ID | 3256732 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 1287882 |
End bp | 1289053 |
Gene Length | 1172 bp |
Protein Length | 239 aa |
Translation table | |
GC content | 45% |
IMG OID | 638255644 |
Product | conserved hypothetical protein |
Protein accession | XP_569671 |
Protein GI | 58265030 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0288] Carbonic anhydrase |
TIGRFAM ID | |
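The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the listed span runs from the start codon to the stop codon with no untranslated regions; neither assumption is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1287882
END_BP = 1289053
PROTEIN_LENGTH_AA = 239


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus a stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return codons * 3


length = gene_length(START_BP, END_BP)   # span on the replicon
cds = coding_length(PROTEIN_LENGTH_AA)   # coding bases including the stop codon
non_coding = length - cds                # remainder, presumably intronic

print(f"gene span: {length} bp, coding: {cds} bp, non-coding: {non_coding} bp")
```

Under these assumptions the span matches the listed Gene Length of 1172 bp, and the difference between the span and the coding length is consistent with the gene being marked as spliced.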
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.32725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCATT CTAGTCTCGA CTACCCGGAG ATGGTATGTC TCATCCGCGG TTGTATTCTT TTATGGTCCT GATTGTCCTA TTCGTTGCAG AAGGAGCTTT TCAACCGCAA TCTTAAATGG TCTGAAAATG TCTGGGCGAG GGACCCTTCT GTAAGCGCTG CGTGGAGTCC CCTCCCTCGG TATTGATGTT CTGACGAATG TCGATAGTTC TTCCCGCACC ACTTTCCTGG CCAACGACCG GAAATCTTGT GGATTGGTTG TAAGTATCCT TCTGCGTCTT AACGTTACAA CATGGGCCCC TGCATTATGT CTCCTATTCC CATCTTTGTA CTATACCATG AACATCTGCA GAAGCTCACG TTTTGATCAC TATCTTTACA TCATTAGGCT CTGACGCGCG CGTTCCCGAG ACGACCATCC TGGGCTGCCA GCCTGGGGAT ATTTTTGTGC ACCGAAACAT TGCAAAGTGA GTAGAAACTG ATCCGAAGCA TTGTCCAGTC TCCCCCTTTT TGGATCAAAT ATCTTTACTG GCCTAAATTT ACTTTATGCT GAAAATACGT CAGCTTGTAT TCACCTCAAG ATGATTCACT CAACGCGGTG TTAATGATAG CTTTGTTCAA TTTCAACGTC AAGCACATTG TCGTCACAGT AAGTCGTGTT AATCTCACAA TGCAAACGTT CCATCAATAT CCGGCTAATG TATATAGGGC CATACAAACT GCGTAGGCTG CCTTACAGCT CTCAATGTTT CTCGTCTCCC GGCCACGCCT CCTACAACCC CTTTGCAGCG TTATGTCAAG CCGCTCGCCA CACTCGCCCG AACATTGTAT ACTCCTGACG GCCCTCCTAC TTTGGATCTA TTAGTGGAGG AAAATGTGGT GCAACAGGTG AAGAACCTAG TAGAAAGTGA TATAATCAAG GGCGTGAGTG GAGTGTATAT ACAGTTGAAA AGGAAAAATG CTGATATATC TCTTCAGAAC TGGAAGAAAC GAGGTGCCGA TGGCGTTGTC ATCCACGGTT GGGTATATCA TCTTGAAGAT GTACGTTTAT CTCTTCTCGA AGATTTGATT AGTACTGATA AGCATTCATT ATAGGGAACT ATTCGGGATC TCAACGTCTC CGTGGGACCA TCTGGGCATA TCCCAGGTAA AAAAGTGAAG AGCTTATTCT AG
|
Protein sequence | MSHSSLDYPE MKELFNRNLK WSENVWARDP SFFPHHFPGQ RPEILWIGCS DARVPETTIL GCQPGDIFVH RNIANLYSPQ DDSLNAVLMI ALFNFNVKHI VVTGHTNCVG CLTALNVSRL PATPPTTPLQ RYVKPLATLA RTLYTPDGPP TLDLLVEENV VQQVKNLVES DIIKGVSGVY IQLKRKNADI SLQNWKKRGA DGVVIHGWVY HLEDGTIRDL NVSVGPSGHI PGKKVKSLF
|
| |
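The sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and summarised follows; the helper names are hypothetical, and the GC calculation is a plain count of G and C bases, which may differ from how the database derives its rounded GC content value.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Only the first two blocks of each field are used here, to keep the example short.
dna_prefix = clean_sequence("ATGAGTCATT CTAGTCTCGA")
protein_prefix = clean_sequence("MSHSSLDYPE MKELFNRNLK")

print(len(dna_prefix), len(protein_prefix), gc_percent(dna_prefix))
```

Passing the full Gene sequence and Protein sequence fields through `clean_sequence` and taking `len` gives values that can be compared with the Gene Length and Protein Length rows.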