Gene CNK01860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK01860 
Symbol 
ID3254564 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp547324 
End bp550501 
Gene Length3178 bp 
Protein Length924 aa 
Translation table 
GC content49% 
IMG OID638253679 
Productconserved hypothetical protein 
Protein accessionXP_567665 
Protein GI58260510 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.640491 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAATC CTATTCCCCT CCCCACGGTC CCAGAAGACA CTCTTCACTA TGTCATACAT 
CTTCCTCGGC TGGAAGAACC CCTCACGGCA GCTCTCCTCG TTACTGAGCA CGTCCACTCG
CTTCTTCCGG AACGCTGGCT ATGGAACAAG GACCCATGGG AGCTCAAAGT GGCCACAGAT
GACGATCACA AACTGGAAGG AAGAATGAGA GTAGGGGATG CGGTAGATGA TGAGTGGCTG
GTGGTATGGC TCCTTACACA AGTCTCTAAA AAGTGGCCAG AATTTATCAT TAGGTATGTA
TCCGATCACA TCGCCATTCG TTAATTGCAG AATTGGCTAA CACCGCTGGT CCATGGTAGT
GTAAGGGACA CGGATGGAGA GTTTCTTCTT ATTGAAGCCG CGAATGAGCT TCCTTCTTGG
GTGTCACCCG ATAATGCCAA TAATAGAGTA GGTCTCGTCA TGACTTGCCA CTCGTAAGAG
CTCGATCGCT GAATTCAAAT TAGTTGTGGC TCTCAAATGG TCACCTTCAC CTTCTCCCAC
TAAACATACA CTCACCTGCA CCTCCACATC AGCTCCCCGA TGACCTCACA TTTGACTCTT
CAATTTACCT CTCTGAGACT GATGCTCTTC GGGCCGTACA GACCGGCAGA TATCTTGCTC
CCAAAGAGGT CGAGAACGTT GTATGGGAAA GAATAAAGGG CTATCCGGAA GCGATGAAAA
CCCATCTTCA TCAGACCAAA ATCTATCTGC CTGCTGGCAT CGCCAAAGCA CTCAATAGGA
AACCCGAATT GGTACAGAAA GCTGTAGAAG CTTTTTACGT TCGAGATCCT GCCCAACTTC
GTGTGAGTGA TTTTTTTTCC TTATCTCGAT AAACACCCAA AGCTGACTCC TTGCAGGCCG
CTTCACGAAT GACACACTTT CCACCCTCCC CTTCAATCCT TAGTCAGCTC ACTCTCACTC
GCGCTGCTTA CGCCCAGCTG CAAGGGCAGG TTTTCCATCC ACCCAGAGTC TTTGGCCCTG
AATGGAATGT CCGCGACCCT CCCTCTTTGG ACGCTACCGA AGGCACTTAC AAACACCTTG
AGAATGAGCG ACGGTGGAGA GATCTAGGTG TTAAACTCGC CACTGGTTTT GAAATTATGT
ACCGTGAAGG CGGTAGAAAG TCTCGTTCGG GAGCTACAGG CGAGTCGGTG GATAGCGCCG
AGGATAAAGG ATACGCATCC TTCCTTGAGG GAATCAGGAA GGCTGGGTGG TTTGGAAATG
AGCTGGAAGG TAGTCAAAAG TGGAAAGAGA GAGAAGAAAA GGCAAGGAAA GGTTACATAA
ATGCCAAGTC TGCAGAGTAA GTTTTGACAT TTTGTCGAGA GTGTATTTTG ACAGGAACGT
AGTATTGCAT CTCAAAGACC TTCATTTGCC TACCTAGTAG ACAATGCCAT TGCTTCATGC
TCATTGTCTC TCGATCAACT TGCCGTTTCA GAAGATGCCC CCGAAGACAA TGACGATTGG
CTCCAAATTT CCCCTGACGA ACTCGATTCA ATGATGTTGC ATGCTTCAGG TCAGGCAAAG
CAGTCAGATA AGCAGGACGA AGAGCAAAGG AAAAGGGAAA ATGTGGAACT CACTGAAGAG
GACGGAAAGG CACTTGGCGA TTTAGCGAAG AAGGTACAAG AGTTTGTTGG GGGCCAAGGT
GATCTCCAGG GTGCCCAATT CGTCGAGTGA GTTCCTTCTA TTCTTGACTT GCTATTTCGT
GGGTAGCTGA CGAATATGTA GCGAGTTGTC TGACGAAGAC ATGGACTCTG GCTCTGACTC
TGACAGCGAA GAATTGCAAG CTCATAAAGA CAAGCTTGAG GCCGAGAAGC AAGCGAGAAT
GGATAGTCTC GTCCCAACTC TTCCCGCTTC CGACTGGGGA CGCAAAGTTC ACATCTCCGA
ACAATCCCAT CAATTACCAG TAGCAATAAA TCAATCATTA ACGTCTCCAA AAGCGGACGG
CAAGAAGATT GACCCTCTCG AGTTCATCCC TTCTAAAATG CGTCCTCCTC GTTTTGCTAA
ACAAGAATTC GACGGTGTCG TCTCTGATTC CGAATCTGAC TCTGAGTCAG ACCTTCCTGC
TGAAGGTACT TGGGGGAGGA AAGTTGCGCA GATGAAATGG AGTGAATTTC CACCCGTGGA
TCTTGAGGAT AACCAACAGG CAAGGATTGA AGAAATCGAG GAGGAGGACG AGGATGAGCA
ACAGCGTAAA GCAAAGTTAC GACTGGGCGA GGACGTTGAT TTGGAAGAGG AGATGCAGAG
GCGCGTCTGG GGTGATAGAG GAGAGGACGA GGATGAAGAT GAAGACGTGG CCGCTGGAGG
AGAAGAAGGG GTTGATATTG ATATGGAGGA TGAGACGGAG GAGTTCCTCA AGTTCTCGAG
GGAGGCTCTG GGGATCAACG ATGAGATGTG GGAGGGTATC TTGGGCGATC GTCGAGCCAG
AGGAGGTGAG CTCTTGCACA TGGTTTGCGG ATATAGCTGT CACGCTAACA TGAGGTTGCA
GCCTTTGTTC CGCAACTGTC TGGCAAGGTT AAGCGTAAAG ACGAGCTACC GGCGAAACTA
CAGCCACGAG ATTCCACCAG CAAAAAGGTC CAGTTTGCCG AGGCTGACTT TCCCTCCAAA
TCAGCTTCAC AGTCATCAGC CACTACCCAA TCGCCTGATC AATCGAACAC ATCTTTAAAT
TCTTTTGAGA CCGTCATGCG TGCTATGGAC GAAGCACTCG CTCGCTCCCG AGACGGACCT
TCCACCTCTC AACCAAGCCA GCCTAAGTCA TCCAACAAAT CAAAAAACAA GAAATCGACT
TCCTCCGCCA ATCCTTTACC GCCTACCTCG CAGACCAATG ATGTTGATCT TGATGCATTC
TCAGAGGATG ACATTGCAGC GATGGATCGC GAGTTACGTT CTGTCTTAAA GGGTGCTGGT
ATAGACCCTG ATGACTCGGA TGATGATATT GAAGAGGTCG GCGAGCTTGA CGTGAATCAA
AAGAGAGAGT ATGAGATGAT GAAGAATTTC TTGGAAAGTT TCAAGTCTCA AGGAGGGGAA
AGCGGTGTTG TGGGGAATCT CTTTGGGCGA CTGTCAGAGA AGCATTGAAG AATAGACAGA
CCATTGTTGG GATTTACTTG TCGGTAGTGC AGACTTTATT ACACCCCAAG TACTTATT
 
Protein sequence
MANPIPLPTV PEDTLHYVIH LPRLEEPLTA ALLVTEHVHS LLPERWLWNK DPWELKVATD 
DDHKLEGRMR VGDAVDDEWL VVWLLTQVSK KWPEFIISVR DTDGEFLLIE AANELPSWVS
PDNANNRLWL SNGHLHLLPL NIHSPAPPHQ LPDDLTFDSS IYLSETDALR AVQTGRYLAP
KEVENVVWER IKGYPEAMKT HLHQTKIYLP AGIAKALNRK PELVQKAVEA FYVRDPAQLR
AASRMTHFPP SPSILSQLTL TRAAYAQLQG QVFHPPRVFG PEWNVRDPPS LDATEGTYKH
LENERRWRDL GVKLATGFEI MYREGGRKSR SGATGESVDS AEDKGYASFL EGIRKAGWFG
NELEGSQKWK EREEKARKGY INAKSADIAS QRPSFAYLVD NAIASCSLSL DQLAVSEDAP
EDNDDWLQIS PDELDSMMLH ASGQAKQSDK QDEEQRKREN VELTEEDGKA LGDLAKKVQE
FVGGQGDLQG AQFVDELSDE DMDSGSDSDS EELQAHKDKL EAEKQARMDS LVPTLPASDW
GRKVHISEQS HQLPVAINQS LTSPKADGKK IDPLEFIPSK MRPPRFAKQE FDGVVSDSES
DSESDLPAEG TWGRKVAQMK WSEFPPVDLE DNQQARIEEI EEEDEDEQQR KAKLRLGEDV
DLEEEMQRRV WGDRGEDEDE DEDVAAGGEE GVDIDMEDET EEFLKFSREA LGINDEMWEG
ILGDRRARGA FVPQLSGKVK RKDELPAKLQ PRDSTSKKVQ FAEADFPSKS ASQSSATTQS
PDQSNTSLNS FETVMRAMDE ALARSRDGPS TSQPSQPKSS NKSKNKKSTS SANPLPPTSQ
TNDVDLDAFS EDDIAAMDRE LRSVLKGAGI DPDDSDDDIE EVGELDVNQK REYEMMKNFL
ESFKSQGGES GVVGNLFGRL SEKH