Gene Information |
Locus tag | CNB00050 |
Symbol | |
ID | 3255846 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 8921 |
End bp | 15487 |
Gene Length | 6567 bp |
Protein Length | 1955 aa |
Translation table | |
GC content | 52% |
IMG OID | 638254658 |
Product | conserved hypothetical protein |
Protein accession | XP_568751 |
Protein GI | 58262682 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
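The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates, which is consistent with the reported Gene Length but is not stated in the record. The function name `gene_length` is hypothetical. It also compares the span with the number of bases needed to encode the reported protein plus a stop codon; the difference is only a rough indication of sequence not accounted for by codons (introns or other untranslated bases), in line with the "Is gene spliced | Yes" field.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 8921
END_BP = 15487
PROTEIN_LENGTH_AA = 1955

span = gene_length(START_BP, END_BP)

# Bases needed to encode the protein plus one stop codon (3 bases per codon).
codon_bases = (PROTEIN_LENGTH_AA + 1) * 3

# Bases within the span that are not accounted for by codons.
# Assumption: this is only a rough consistency check, not an intron count.
non_codon_bases = span - codon_bases

print(span, codon_bases, non_codon_bases)
```

A span larger than the codon count is what one would expect for a spliced gene; a span smaller than it would point to an inconsistency in the record.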
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0248021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACC TCACGCCATT GCCTACGACG TTTGATCCGG ATAACGATGA TTTTGATGTG GAGGATCGCA GAGGTTTAGG GCATGGAGGT GGAGAGGTGG ATGAGGATGG GGAGGGTGAT CTTTTGATTG TGCATGGAGA GGAAGAGGAG GAGGAGCAGG AGGAGGGGAA TGAAGGGGAA GGGGAAGGGG AGGGAGAGGA CCAGACGAAC TTGATGGAAT ATATTGCGTC GTTGCAACAG ATTGGCGAGC ACGATAATGA CGGTAACCAG GATCAAGATC AAGATCAAGA TCACGATACG GAGACCGGCG ATAATACCCA TGAGCACGGG CACGACGCGG AAGTCGAACT TGGCCAAGAG ATGGAGGAGG AGTCGACAGG ACAGGTGGTG GATACAGAGC TTGGTGAAGC GGATCGAAAT CGTGATTCGC CTCCAGATCC GAATCCGGAT CCGAATCCGG ATCCCATCAC AGTACCGAGC TACGCCCGAC TCCAAAGTGA ACAGCAAGGA CAAGAGTCAA ATGTGATTAG CGGCCCGATG GAGGTGGAGA TGGAAGAGGA AGAGCTGGGA GAGAGCATCG AAGTTGATCC AGGGCAAGAA CCCGGAGAGC GTGAGTCGGT TCTTTTTTTT CTCCCATCGA TTCCAGTTGG TACACTAACG ATCACATAGT TCATGAACCC GAACCCGAAC CCGAACCCGA ACCTGAAATC CAACGTGAAG CATCGTCCCC CCTTTCAGAC CTCCCGGATT TACCTGACGA ACTCGAATTC TCCCCTCCCC CTTCTCCCGG ACCTTTTGCG CCGAGTCGGA ATGGTCAACG CCATCGCCAA CCAAGTAGTA ACAGCAATAA TTCGAACATG GGCACGGGCA TGGATATGGA GGTTGATCGA GATTCCTCGC TCACACCGCT ATACGACGAT TATGACCACG AGGGAGGAGG AAAGGATGGT GTTGGTTTAC CACGTCCACG TTTCCGCGCG GAATCCTCAG CTGCGGGCGC GAGACGATCC GCCTCCATGT CACATGCGAA TGCGACTAAA CGGGGTCGAA ATACTACCCG AAACAGGGAT AGCAGGGGCA GGGAAAAGCA AGGAAGAAGA AGCAGTTTAT CATTTTCATC TTCCCCGCCG CTGGAAAGTA TCGAGGTCGC GCCTGATATC GCGCCGGGTA CGGCGGGTAC GCTCGGGGCT GGGCGGAAGA AGAGGCAGAG AGAACGGGCA AGCAAGGGAA AATTACCTTT ACGCCATGGG CATGAGCGTG GAGATGGGCT TTCTGACGGG GAAGGGGATA ATGAATTTGA GCTTGTTCCA AAGTCTGCCA AACGGTCAAA ATCCGCCCAT CCGTTCCCCG CATCCCACGC GGCTTCTCAC GCCGCCTCCG CCTCCCATCC GCCACACCAC CCTTCCCGCA AACCCCTCGA CCCAATCTCC ACCGGCCTCA AACTCCCCGG GCCCCGTGCC CTCCCGCCCT CCCGCCCCCG CGCGCTCAAC GATCAATCCA TCAAAAAGTT TTTGGTGGGC AAAACGGAAG AAGTGCAGTT GGGCATCTGT CTGAGGGAGA GGTATAAGAA GTGGGGAAAG TGTACGCAGT GTGTGAGTAA GCTAGGGGGA GATTGTTGTA GGTTTAGAGA TTATAGGGTT TTTCCGTGAG TATCTTACCC CGTGTCCTCC CTCCCTTCTC CGTTAACAGC TAACCCGCCG GCTCATGCGA ACAGGATCAA TCCCAAAACA ACAGAAATCA CAGGCCCAGG GTATTTTAAA TCTACCACCT GGCCCGAACC GCTTACCCCA CTCCCAACAA 
AGTTTACGTC CCCGCTCGAT GGGGACAAGG TGGAGGAGAT ACAGAGGACG GTGGCGGGGG TGTTGTTGCC GCTGATAACG GGCGAAGCGA GGCATTGTTT TTGGCCGTTT CATCTGCAGG GGCGGGGTAA ACGAGATGGA GAGGGGGAAG AAGTTTCCGA TCCCGCCCAT TCGACACACA GGGCGAGGAA TCCCATCACC CTCCACCGCG GTGTCGACAC CGCCCTCCAT CGCTCCGTCT GCGATTTCTG CTCCTCCACG ATTTTTGGCG GGTACTATTT CTGTAAGAAG TGTGGGAGGG ATTACTGTTT GCAGTGCGAG CGGTATTTCC CGGAGAGTAT GGAGGCCGTA GGGGAGAGTC CGTGGGAGCT GACGGATGCG GCGAGGCCGA GGTTGTTAAA GTGTATAAAG CAGCCGTGGG AAGAGATGAT GAGCGGAGGC GGAGGAGAAA TGGGAGGTGA TTCGACTGGG AACGGCAATG GCAACGGCAG CACGAGAACG AACACCACCA ACGCAAACAC CGACGGGGGC AGCAACACGA ACAAAAATGG AAGGAAAAGA GAAAAGGCGA TTGCTTGGCA TGTGCGGAGT GATTTGCAGC CCGTTTCGAG ATTTACGAAA GAGGAAATTG AGGGACATTG GCTGAGTCTT GCGGAATTTG TCATTGGGCA TGGGTATGGT CCGGACGCTA GTTCTGGCTC TGGCTCTGGG TTTGACTTTC TCTTTGGTGC TGATGATGGG GATGAACCCG AAGGAGAGAA GAAAAAGGAG GAATGGAGGC TCCAGTGGGA ACGGATGTTA AGATTGATGG GATTGCAAAT GGATGAAGAG GTGAAGAGCG TTTTGGAACA ATGGAGCAAG AATGCTCGTG TTGTCAGGGA GAAGGAAAGT GGGTCTGTGA CGCACTTGGG TCCCGGCGCC AACGTCGAAT CTAATCTCGA TACCGATACG AATTCTCACC CCGTCAATCA GCCCGAAAGC AGTTCGAATA ACCCCATCGA CTCCCAAGCT CCCATCGATA AAACTTCAAC ATCCGTTCGC CGATCAGCCG CCATGACCAC AAAGAACGAG AACGACAACA AGTTGGAACC TGAAATGGAC GAGGTTGAAG AGTTGGAGAC GGGGGAACCG AAGAGTGCGA GCGAAATGTT GAAGGGTATG ATGGATAAGG AGAATGGAGA GGAGACATCG GCGGCTCACA CAGGTCCTGC GGCCTCTCTC AATTCTGTCG CTTCTGCTCC TGCGGCCGTA CGGTCATCGA CGCAAGTCAC CAAAGTCGAA ATCCCCGTCG CCGCAAATGC TTCTCCCACC GTCTCTTCTA GCAGCTCTCC TCAAGCCAAT CCTCCTTTCG AGAAATCTGG CCTACCATCT GCATCCCCAC TAAGATCCGT TCCCACCTCC GGCAACTTTA GGACCGGTTC GAATCCCATG CCAGATCAGC AGCAAGACCT TGAGAAGATT GCATTCAGCT ACACCAAACA TACCCACCCC AATCCACCTT CCGACCCGGC CAGTCTTTCG GACTATTCTC TACCGTTCAT CTACCTCCCT TCACCAGAAG GTTTGGATAA CAAGGCGTTC GATGAGCTGT GGAGTAAAGG GGAGCCGATC GTTGTAGGTG GTGTGAATGT CTATGTTGGT GGTGGTGGTG GTAGACGACG ACGAGAAGAA GGGGAGAAGA TGGGGAAGGA AGGGGAGGAA TGGGGACCCG AGAAGTTTAT AGAAAGGTTC GGGGAAGAAC AGTGTTCGGT GGTGGATTGT CAGAGTGACA CGCCGTTGGT TTCCACTGTT GGGGCCTTCT TCGCTGCATT 
TGGGGAAAGT GTAGGCAAGC CATGGGAGAG GGAAGGGGAG GATGGAAAGA GGAAGGAAAA GAAACGGCAG GGGATTTTGA AACTGAAGGT GAGTTTTTTT TTTTTTTCGA TTGGTTGTCG AATTGGTGAA GAATAACGCG CTAAAAAGAG ATATCTTTCA TTAGGACTGG CCTCCAGGTG ATGAGTTTGT GGATACACAC CCAGAGTTGT ATCATGACTT TTGTGCGGCT TTGCCTGTAC CGGACTATAC AAGGAGAGAT GGAGTGTTGA ACCTATATTC ACATGTAAGT CTTGATGGCT TGTGATAATC GCGTCGCTGA TAGGAAAACG GTTCACATCT TCTCGCCTTA ATTCGACCTT ATCCCTTCGG CCCTCCACTT GCTATCGCGC TCACGCCTTG TCCCCTCTAC AGTTCCCTCC GGGTCCGACG AGGCCTGATA TCGGACCAAA GATGTATGCT GCCTTTGCTG CACTTGAGAC GCCCGGCGGA TTTGGCTCAA CGAGATTACA TATGGACGTC GCGGATGCTA TTAACATCAT GCTACATGCT TCCCCCATTC CTGACGACTC GTCCTCATTA GAAGCTCCCA TATCCTCTGC AACTTGTTCG ACATCACCTT CTTCCTCGCC AGAAATCACT TTAGGAACCG ACTCAAAGAT ACCATCTAGA CCTGGCTGCG CTGTATGGGA CATCTACCCT GCGCAAGATG CCGACAAGAT CAGAGAGTTT CTCAAAGAAA AATTTGACAA GACTCATAAC TTTGTGGACC CCATTCACTC GCAAATGTTT TATCTCGATG CGAAATCGAG AAAAGAGCTG TGGGAACGTA AAAGAGTTGT CAGCTGGAGA GTCTATCAAT ATCCTGTGAG TGTTATTGCT TTTGTGATGC ACGATATGGT TTCTGATAAC GTGTATAGGG CCAAGCAGTT TTTATTCCTG CAGGATGTGC GCATCAAGTT TGTAATCTTT CCGATTGTAT TAAGATGGCT CTCGACTTTG TTAGCCCTCG TAAGTCTCCT ACACTATCGC TACCACTGTT GCATGCCAAA CTCATGTACA ACTATTTGTT TCATAGACAA TGTTCCAAGG TGTCAGCAAC TTACCAAGGA TTTCCGCAGA GAGAATTATT TGAAAGCATG GAAAGAGGAT GTACTGCAAT TGTATAACGT CCTCTGGTAC GTTTTTTTCT TCTTGTTTTT ATCTCCGTTC CTATCAATTG TGGCTGACAT TATTATCGCA GGTACGCTTG GCTCTCAGCC CGTGAAACTA TCGCCCGGAG AGAAAAGGAA GTGGCCGCTG CAGAAGCTCG ACAGAAGCAG CTTCAATCTT TCCAATTGGA CGACAGTAAC TCTCCACTAT TCCAATCGAA GCATCCTACA TCTTCCCCCC TCTGCTCCGC TACGCCTACT ATCACATCGA CTCTTGATCT AGATGCTTCG TCAAGAATGG GGTACGACCG TTTGAACATG GGTATAGGAC TTGGCGAGCG AGGTATGGGT ATGGGCCTGG GAAGGGTTGG TATGGGCAGT TGGGGAAACG TTAGTTCAGT GAGGGACGAG CCGCCGAGAG GTGGACCTTT GCTGTTAATG TTGAATGATG AATCTGAGAA GGTCTCAAGA GGTCAACTTA CGGAAGGTGA GAAGTCAAGT GAACCTGTTA GGGGAAAGGG GGAGGATCAC GAGAAGTCAG ACCAAGGCAA ACAATTGTCC AGCGATCCGG TCTCATTTGC CAGTTCTCTC TCCAACTCTT ATTCACTCCA ACTCGCCAAG TCTCTCTTCA CTCTCTCCCT CCAACGCGAC GAAACCAATC 
CTTCGGAATT ATTTGTGCCT TCTGCCGCCG CCATGAACCA CCGACCCCGT GGTCCGTCCT TACCCGATTC ACACTCTTCC ATTGCCCATC CGCGAACTTC ATCTCGTGCG CCCTCAACGC ATACCATGGA CAATCGATCC TCGCAAACCT CCGCATCCGT CCGCACAAAG GATATCAAGG AGAAAGAGAA TAAATTTACG CCTCCAAAGA AGCTCAAGAA CCCCGCCCGT CCGCTTCGTA CGTCAACGCT TATCAAGCTT GGAGCAAAAC GAATGGATGA GATCCTTGAA CTTGCAAGAG GGGATCTGGG AGAGGGTGAG AGTTTGAGCA ATGCACTTGG AGGCCTTGGG AACTTTGCGG AACTTGGTAC TATGGAGTCA CATGGTGGGA TGCAAGGTTG GGCAGGGTTG AGGGCAATGT CGGAGAGCGT GAGAAGTCGA GAAGATGGGG ATAGAGAAGG AAGTCCATTG GTATCCATCA ACGATGTCCA CAATTCCAAC TCTAGCCCCA ACGCCGAGAC CGATACCTCC ACGCCACCTG ACGATGCCAT CGACCGTACG GACGCTGAAG CCGCTGCTGA TATCGCCGCC GAGATACTCT CCATGGATAT TGACCATCAA GAAGACGAGA CAAACATTGA AGATGATGGG GATCTTGAAG GCTTGAGGCG GATTGTCGAG CGAGGAATAT TGGAGCAGGA ACAAGGGCAA GAGCAAGGGC AGGAGCAAGA ACGGGAACAA GAGCGGGAGC AGGAGGATGA CGAGAGGGAT AACGAGCTTG GTGAGGAAGG GAGGGAGGCG ATAGCGTCTA TTCAGAGGCT ACAGGCCGTA GAGGAGGCAG AGGAGGAAGA GAGGGAAAGA GAGAATCTTG CGAAGGAGGA AGAACGTGAA AGGGAGATGG GAAGACCCGA GGGCGTTGTT ATCAATGATG ATAAAGAGGA AGGCGAAGGG GAAGAGGAGG AAGATGAACA GGCAGATAGA GAAAAAGCGA GACAACGGGA AGAGGAAGAA GAAGAAGAGA TTGAAGGTGT AGATGAAGAT GCCGAAGGCG AGGAATTTGA AGAGGTTTCA AAGATGAGTG TAGATTTTGG CATGTTTATA TAGTTGGGCA TAATTAACTG CTGTAGTATA ATCTGTGTAT ATATAAC
|
Protein sequence | MDDLTPLPTT FDPDNDDFDV EDRRGLGHGG GEVDEDGEGD LLIVHGEEEE EEQEEGNEGE GEGEGEDQTN LMEYIASLQQ IGEHDNDGNQ DQDQDQDHDT ETGDNTHEHG HDAEVELGQE MEEESTGQVV DTELGEADRN RDSPPDPNPD PNPDPITVPS YARLQSEQQG QESNVISGPM EVEMEEEELG ESIEVDPGQE PGELHEPEPE PEPEPEIQRE ASSPLSDLPD LPDELEFSPP PSPGPFAPSR NGQRHRQPSS NSNNSNMGTG MDMEVDRDSS LTPLYDDYDH EGGGKDGVGL PRPRFRAESS AAGARRSASM SHANATKRGR NTTRNRDSRG REKQGRRSSL SFSSSPPLES IEVAPDIAPG TAGTLGAGRK KRQRERASKG KLPLRHGHER GDGLSDGEGD NEFELVPKSA KRSKSAHPFP ASHAASHAAS ASHPPHHPSR KPLDPISTGL KLPGPRALPP SRPRALNDQS IKKFLVGKTE EVQLGICLRE RYKKWGKSNP PAHANRINPK TTEITGPGYF KSTTWPEPLT PLPTKFTSPL DGDKVEEIQR TVAGVLLPLI TGEARHCFWP FHLQGRGKRD GEGEEVSDPA HSTHRARNPI TLHRGVDTAL HRSVCDFCSS TIFGGYYFCK KCGRDYCLQC ERYFPESMEA VGESPWELTD AARPRLLKCI KQPWEEMMSG GGGEMGGDST GNGNGNGSTR TNTTNANTDG GSNTNKNGRK REKAIAWHVR SDLQPVSRFT KEEIEGHWLS LAEFVIGHGY GPDASSGSGS GFDFLFGADD GDEPEGEKKK EEWRLQWERM LRLMGLQMDE EVKSVLEQWS KNARVVREKE SGSVTHLGPG ANVESNLDTD TNSHPVNQPE SSSNNPIDSQ APIDKTSTSV RRSAAMTTKN ENDNKLEPEM DEVEELETGE PKSASEMLKG MMDKENGEET SAAHTGPAAS LNSVASAPAA VRSSTQVTKV EIPVAANASP TVSSSSSPQA NPPFEKSGLP SASPLRSVPT SGNFRTGSNP MPDQQQDLEK IAFSYTKHTH PNPPSDPASL SDYSLPFIYL PSPEGLDNKA FDELWSKGEP IVVGGVNVYV GGGGGRRRRE EGEKMGKEGE EWGPEKFIER FGEEQCSVVD CQSDTPLVST VGAFFAAFGE SVGKPWEREG EDGKRKEKKR QGILKLKDWP PGDEFVDTHP ELYHDFCAAL PVPDYTRRDG VLNLYSHFPP GPTRPDIGPK MYAAFAALET PGGFGSTRLH MDVADAINIM LHASPIPDDS PGCAVWDIYP AQDADKIREF LKEKFDKTHN FVDPIHSQMF YLDAKSRKEL WERKRVVSWR VYQYPGQAVF IPAGCAHQVC NLSDCIKMAL DFVSPHNVPR CQQLTKDFRR ENYLKAWKED VLQLYNVLWY AWLSARETIA RREKEVAAAE ARQKQLQSFQ LDDSNSPLFQ SKHPTSSPLC SATPTITSTL DLDASSRMGY DRLNMGIGLG ERGMGMGLGR VGMGSWGNVS SVRDEPPRGG PLLLMLNDES EKVSRGQLTE GEKSSEPVRG KGEDHEKSDQ GKQLSSDPVS FASSLSNSYS LQLAKSLFTL SLQRDETNPS ELFVPSAAAM NHRPRGPSLP DSHSSIAHPR TSSRAPSTHT MDNRSSQTSA SVRTKDIKEK ENKFTPPKKL KNPARPLRTS TLIKLGAKRM DEILELARGD LGEGESLSNA LGGLGNFAEL GTMESHGGMQ GWAGLRAMSE SVRSREDGDR EGSPLVSIND VHNSNSSPNA ETDTSTPPDD AIDRTDAEAA ADIAAEILSM DIDHQEDETN 
IEDDGDLEGL RRIVERGILE QEQGQEQGQE QEREQEREQE DDERDNELGE EGREAIASIQ RLQAVEEAEE EERERENLAK EEEREREMGR PEGVVINDDK EEGEGEEEED EQADREKARQ REEEEEEEIE GVDEDAEGEE FEEVSKMSVD FGMFI
|
| |
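The gene and protein sequences above are printed in blocks of ten residues separated by spaces and wrapped over several lines. A minimal sketch of how such a field might be normalized and checked against the GC content and Protein Length fields follows. The helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the example uses only the first twenty bases of the gene sequence, so its result is not expected to match the 52% reported for the full gene.

```python
def normalize_sequence(seq_text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the residues."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already normalized nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, as printed in the record.
prefix_text = "ATGGACGACC TCACGCCATT"
prefix = normalize_sequence(prefix_text)

print(len(prefix))
print(f"{gc_fraction(prefix):.0%}")
```

Applied to the full Gene sequence field, `len(normalize_sequence(...))` could be compared with the Gene Length field, and the same length check on the Protein sequence field could be compared with Protein Length.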