Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA07870 |
Symbol | |
ID | 3253555 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 2172442 |
End bp | 2174355 |
Gene Length | 1914 bp |
Protein Length | 279 aa |
Translation table | |
GC content | 48% |
IMG OID | 638253110 |
Product | conserved hypothetical protein |
Protein accession | XP_567134 |
Protein GI | 58259443 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0241] Histidinol phosphatase and related phosphatases |
TIGRFAM ID | [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA [TIGR01664] DNA 3'-phosphatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACCAGTTATG TCCCCTCAGA AGCGAGCAGC TGAATGGCCA GAGCCGCCAG CAAAGAAGAG TATGTTCTCA AAACAACTTT TAAACCTCAA CTAACAACAA ACAGCCCATC CCTTCTTCAC AGGCACGCCA AAAGAACTCG GCAAATTCCA CCCTTCCGAC TCTGCCCTCA TCCACTTTAC CCACCTTGAT CCTTTCGAAT CTTTACCAAA ATCCTCCTCT GAAACCGCCG ATTCCATCGC TTCAATATCA GCCTCCAAAA AGATCCCTGT AGCATTCTAC GATCTTGACG GTACATTAGT CAAGACACGG TCAGGAAACG ATTTCCCCAA AAGCCGAGAT GACTGGATGT GGTGGCACCC TTCTGTCCCC GAAAAGCTAA AGCAAGAATG GGAAGATGGC ACGCATTTGG TCGTGATTTC GAATCAAGGA AGTAAGAAAC CCAAGATCAA AAGTGAATGG CGGGCCAAGT TGCCTTTGAT TGCAGCCAAG GTGCGTTTCC CGGTCGATCC CAGAGACTTG CCATATGCAT ACGTGCCAGG CTAACGTCTT CACGTTACAT TCCTATTTCT TTCCTCTTGA TGGGAGAAGA TGCCGAGCAA CGTCCCACTG CGTATCTTGG CTGCTATAGA GCAAAACAAC GTCTACCGAA AACCAAACAT TGGCATGTTC CAAGCCATCA CCGAAATCTA CCGCGCCCGT GGCCTGGAGA TTGATATGGA GAAATCCATC TTTGTTGGAG ATGCTGCCGG TAGACCTGCC AAAGGATCAC GAAAGAAAGA TCACGGGAAT ACGGATTACA AGTTTGCAAT CAATGTTGGA CTGAGATTCG TCACACCAGA GGTACGCCTT TCAATATTCT TTTTGATCAA ACATTGAATA ATGGCAACGC AAAGATTGAC ATTTTATTAT TGGGACGTTG TGATTCAAAG GAGCATTTCC TAGGTCATCC GCGCCCTTCC TTCCCCAATC CTCCTATAGG CTTCCGGCCC CGCAATCTAG GCACTCTTGA CACTCGTGAG CCCAGCGTCT TTTTCCCTTG GCCGTCCCTC ATTCTCCCTT GTTCCCTTTG TTTCTGCCTA CGCATACCTC AAAAACAAGA CTAACGCTGA CGCAGGCAAT CAGTTCCGCA CATCGTCCCA TCGCATACTC CTATCATTCG CAAGATTGAC AAGGAGGTAG AAATCGTCAT CTTTGTCGGG TATCCAGCAT CCGGCAAATC ATCCTTCTTC CGCAAACATT TCCAGCCCGC GGGTTACGTC CACGTTAATC AAGATACTTT GCGGACGAGG GAAAAGTGTT TAAATGTGGC GGAACAGGCG TTGAAGGGCG GGAAATCTGT TGTCATTGGT TAGTTCAATT CTCAAGTTCG GGTTTGGGCG CACATCCTGG GTATTCGGGG ATAGGCGATG TTTTGGGCAG GACTCTTTTT GAATCTGATC TCACGGGATG ATGGACTGAT GGGATAACTT CTCAGATAAC ACGAACCGAA ATCGAGAAAC ACGAGCTTAC TGGGTATCTC TCGCATCAAA ACTTAACGTC CCTATACGGT AAGCCCTCCC CATCTCTTTG GTCTTTCCTC TCATATACTG ACCCCGTCCA ATCCTTCCTG TCCGGCCTAA TCCACCCCAA CCCAACCCTA GACTATTCCA TTTTCTCTGC CCTCCCGAAT TAGCCAAACA TAACAACCTC TACCGCGCCT ACTATCCTCC TCCTGATGAA CCTACCCGTG GGATACTGCC ATATATCGCC TTTGCCGGCT TCGAGACCGC GTTTGAAGAG CCGAGGAAGG AAGAAGGGTT TGAAGAAATT AAAACAGTAA ATTTTCATTG GGAAGGCTCG GAGGAGCAGA GGAAGAAGTG GGATATGTAT ATTGAATAGT ATATATAGGG GGAAGGATGG TTATTTACTC TAGA
|
Protein sequence | MSPQKRAAEW PEPPAKKTHP FFTGTPKELG KFHPSDSALI HFTHLDPFES LPKSSSETAD SIASISASKK IPVAFYDLDG TLVKTRSGND FPKSRDDWMW WHPSVPEKLK QEWEDGTHLV VISNQGSKKP KIKSEWRAKL PLIAAKMPSN VPLRILAAIE QNNVYRKPNI GMFQAITEIY RARGLEIDME KSIFVGDAAG RPAKGSRKKD HGNTDYKFAI NVGLRFVTPE EHFLGHPRPS FPNPPIGFRP RNLGTLDTRN QFRTSSHRIL LSFARLTRR
|
| |