Gene Information |
Locus tag | CNK01860 |
Symbol | |
ID | 3254564 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 547324 |
End bp | 550501 |
Gene Length | 3178 bp |
Protein Length | 924 aa |
Translation table | |
GC content | 49% |
IMG OID | 638253679 |
Product | conserved hypothetical protein |
Protein accession | XP_567665 |
Protein GI | 58260510 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
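
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the source record. It assumes the start and end positions are 1-based and inclusive, which matches the reported gene length, and that the coding sequence ends with a single stop codon. The function names `span_length` and `min_coding_length` are hypothetical helpers introduced here.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: 3 per residue, plus one stop codon if requested."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
gene_length = span_length(547324, 550501)
coding_length = min_coding_length(924)

# Bases in the gene span that the coding sequence does not account for
# (introns and any untranslated sequence; the record does not break these down).
non_coding = gene_length - coding_length

print(gene_length, coding_length, non_coding)
```

Under these assumptions the computed span matches the listed gene length of 3178 bp, and the span is longer than the sequence needed to encode 924 residues, which is consistent with the gene being marked as spliced.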
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.640491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAATC CTATTCCCCT CCCCACGGTC CCAGAAGACA CTCTTCACTA TGTCATACAT CTTCCTCGGC TGGAAGAACC CCTCACGGCA GCTCTCCTCG TTACTGAGCA CGTCCACTCG CTTCTTCCGG AACGCTGGCT ATGGAACAAG GACCCATGGG AGCTCAAAGT GGCCACAGAT GACGATCACA AACTGGAAGG AAGAATGAGA GTAGGGGATG CGGTAGATGA TGAGTGGCTG GTGGTATGGC TCCTTACACA AGTCTCTAAA AAGTGGCCAG AATTTATCAT TAGGTATGTA TCCGATCACA TCGCCATTCG TTAATTGCAG AATTGGCTAA CACCGCTGGT CCATGGTAGT GTAAGGGACA CGGATGGAGA GTTTCTTCTT ATTGAAGCCG CGAATGAGCT TCCTTCTTGG GTGTCACCCG ATAATGCCAA TAATAGAGTA GGTCTCGTCA TGACTTGCCA CTCGTAAGAG CTCGATCGCT GAATTCAAAT TAGTTGTGGC TCTCAAATGG TCACCTTCAC CTTCTCCCAC TAAACATACA CTCACCTGCA CCTCCACATC AGCTCCCCGA TGACCTCACA TTTGACTCTT CAATTTACCT CTCTGAGACT GATGCTCTTC GGGCCGTACA GACCGGCAGA TATCTTGCTC CCAAAGAGGT CGAGAACGTT GTATGGGAAA GAATAAAGGG CTATCCGGAA GCGATGAAAA CCCATCTTCA TCAGACCAAA ATCTATCTGC CTGCTGGCAT CGCCAAAGCA CTCAATAGGA AACCCGAATT GGTACAGAAA GCTGTAGAAG CTTTTTACGT TCGAGATCCT GCCCAACTTC GTGTGAGTGA TTTTTTTTCC TTATCTCGAT AAACACCCAA AGCTGACTCC TTGCAGGCCG CTTCACGAAT GACACACTTT CCACCCTCCC CTTCAATCCT TAGTCAGCTC ACTCTCACTC GCGCTGCTTA CGCCCAGCTG CAAGGGCAGG TTTTCCATCC ACCCAGAGTC TTTGGCCCTG AATGGAATGT CCGCGACCCT CCCTCTTTGG ACGCTACCGA AGGCACTTAC AAACACCTTG AGAATGAGCG ACGGTGGAGA GATCTAGGTG TTAAACTCGC CACTGGTTTT GAAATTATGT ACCGTGAAGG CGGTAGAAAG TCTCGTTCGG GAGCTACAGG CGAGTCGGTG GATAGCGCCG AGGATAAAGG ATACGCATCC TTCCTTGAGG GAATCAGGAA GGCTGGGTGG TTTGGAAATG AGCTGGAAGG TAGTCAAAAG TGGAAAGAGA GAGAAGAAAA GGCAAGGAAA GGTTACATAA ATGCCAAGTC TGCAGAGTAA GTTTTGACAT TTTGTCGAGA GTGTATTTTG ACAGGAACGT AGTATTGCAT CTCAAAGACC TTCATTTGCC TACCTAGTAG ACAATGCCAT TGCTTCATGC TCATTGTCTC TCGATCAACT TGCCGTTTCA GAAGATGCCC CCGAAGACAA TGACGATTGG CTCCAAATTT CCCCTGACGA ACTCGATTCA ATGATGTTGC ATGCTTCAGG TCAGGCAAAG CAGTCAGATA AGCAGGACGA AGAGCAAAGG AAAAGGGAAA ATGTGGAACT CACTGAAGAG GACGGAAAGG CACTTGGCGA TTTAGCGAAG AAGGTACAAG AGTTTGTTGG GGGCCAAGGT GATCTCCAGG GTGCCCAATT CGTCGAGTGA GTTCCTTCTA TTCTTGACTT GCTATTTCGT GGGTAGCTGA CGAATATGTA GCGAGTTGTC TGACGAAGAC ATGGACTCTG GCTCTGACTC 
TGACAGCGAA GAATTGCAAG CTCATAAAGA CAAGCTTGAG GCCGAGAAGC AAGCGAGAAT GGATAGTCTC GTCCCAACTC TTCCCGCTTC CGACTGGGGA CGCAAAGTTC ACATCTCCGA ACAATCCCAT CAATTACCAG TAGCAATAAA TCAATCATTA ACGTCTCCAA AAGCGGACGG CAAGAAGATT GACCCTCTCG AGTTCATCCC TTCTAAAATG CGTCCTCCTC GTTTTGCTAA ACAAGAATTC GACGGTGTCG TCTCTGATTC CGAATCTGAC TCTGAGTCAG ACCTTCCTGC TGAAGGTACT TGGGGGAGGA AAGTTGCGCA GATGAAATGG AGTGAATTTC CACCCGTGGA TCTTGAGGAT AACCAACAGG CAAGGATTGA AGAAATCGAG GAGGAGGACG AGGATGAGCA ACAGCGTAAA GCAAAGTTAC GACTGGGCGA GGACGTTGAT TTGGAAGAGG AGATGCAGAG GCGCGTCTGG GGTGATAGAG GAGAGGACGA GGATGAAGAT GAAGACGTGG CCGCTGGAGG AGAAGAAGGG GTTGATATTG ATATGGAGGA TGAGACGGAG GAGTTCCTCA AGTTCTCGAG GGAGGCTCTG GGGATCAACG ATGAGATGTG GGAGGGTATC TTGGGCGATC GTCGAGCCAG AGGAGGTGAG CTCTTGCACA TGGTTTGCGG ATATAGCTGT CACGCTAACA TGAGGTTGCA GCCTTTGTTC CGCAACTGTC TGGCAAGGTT AAGCGTAAAG ACGAGCTACC GGCGAAACTA CAGCCACGAG ATTCCACCAG CAAAAAGGTC CAGTTTGCCG AGGCTGACTT TCCCTCCAAA TCAGCTTCAC AGTCATCAGC CACTACCCAA TCGCCTGATC AATCGAACAC ATCTTTAAAT TCTTTTGAGA CCGTCATGCG TGCTATGGAC GAAGCACTCG CTCGCTCCCG AGACGGACCT TCCACCTCTC AACCAAGCCA GCCTAAGTCA TCCAACAAAT CAAAAAACAA GAAATCGACT TCCTCCGCCA ATCCTTTACC GCCTACCTCG CAGACCAATG ATGTTGATCT TGATGCATTC TCAGAGGATG ACATTGCAGC GATGGATCGC GAGTTACGTT CTGTCTTAAA GGGTGCTGGT ATAGACCCTG ATGACTCGGA TGATGATATT GAAGAGGTCG GCGAGCTTGA CGTGAATCAA AAGAGAGAGT ATGAGATGAT GAAGAATTTC TTGGAAAGTT TCAAGTCTCA AGGAGGGGAA AGCGGTGTTG TGGGGAATCT CTTTGGGCGA CTGTCAGAGA AGCATTGAAG AATAGACAGA CCATTGTTGG GATTTACTTG TCGGTAGTGC AGACTTTATT ACACCCCAAG TACTTATT
|
Protein sequence | MANPIPLPTV PEDTLHYVIH LPRLEEPLTA ALLVTEHVHS LLPERWLWNK DPWELKVATD DDHKLEGRMR VGDAVDDEWL VVWLLTQVSK KWPEFIISVR DTDGEFLLIE AANELPSWVS PDNANNRLWL SNGHLHLLPL NIHSPAPPHQ LPDDLTFDSS IYLSETDALR AVQTGRYLAP KEVENVVWER IKGYPEAMKT HLHQTKIYLP AGIAKALNRK PELVQKAVEA FYVRDPAQLR AASRMTHFPP SPSILSQLTL TRAAYAQLQG QVFHPPRVFG PEWNVRDPPS LDATEGTYKH LENERRWRDL GVKLATGFEI MYREGGRKSR SGATGESVDS AEDKGYASFL EGIRKAGWFG NELEGSQKWK EREEKARKGY INAKSADIAS QRPSFAYLVD NAIASCSLSL DQLAVSEDAP EDNDDWLQIS PDELDSMMLH ASGQAKQSDK QDEEQRKREN VELTEEDGKA LGDLAKKVQE FVGGQGDLQG AQFVDELSDE DMDSGSDSDS EELQAHKDKL EAEKQARMDS LVPTLPASDW GRKVHISEQS HQLPVAINQS LTSPKADGKK IDPLEFIPSK MRPPRFAKQE FDGVVSDSES DSESDLPAEG TWGRKVAQMK WSEFPPVDLE DNQQARIEEI EEEDEDEQQR KAKLRLGEDV DLEEEMQRRV WGDRGEDEDE DEDVAAGGEE GVDIDMEDET EEFLKFSREA LGINDEMWEG ILGDRRARGA FVPQLSGKVK RKDELPAKLQ PRDSTSKKVQ FAEADFPSKS ASQSSATTQS PDQSNTSLNS FETVMRAMDE ALARSRDGPS TSQPSQPKSS NKSKNKKSTS SANPLPPTSQ TNDVDLDAFS EDDIAAMDRE LRSVLKGAGI DPDDSDDDIE EVGELDVNQK REYEMMKNFL ESFKSQGGES GVVGNLFGRL SEKH
|