Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA08250 |
Symbol | |
ID | 3253845 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 2265720 |
End bp | 2269450 |
Gene Length | 3731 bp |
Protein Length | 1018 aa |
Translation table | |
GC content | 52% |
IMG OID | 638253147 |
Product | hypothetical protein |
Protein accession | XP_567067 |
Protein GI | 58259309 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00319789 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAAT TAGGACAGAA AAAGCAACCC CCAACTCCGA GCAAGACGCC CAAGAAGGCT GCGCAGAATG CCGAAAAAGG TGCAAGTGGT GCCCTCGGCA GCGTGAAAGA TACCGCCGGC GGTGCGACAA AGGCTGCAGG AGACACTGCT AGCGGAGCCA CAAAACAGGC TCCTAGTCCA GCCAAAGACG TTGGCAAGAA GGTCGGAGAC ACTGCTGGCG GAGCCACAAA ACAGGCTCCT AGCCCCGCCA AGGACGTTGG CAAGAAGGTC GGAGACACTG CTGGCGGAGC CACAAAACAG GCTCCTAGTC CCGCCAAGGA CGTTGGCAAG AAGGCTGGAG ACACTGCCAA AGACGTCGGT GGTAAGACCT CACAAGCCCA GAATCCGACG ACGCCAAAGA AAACGCCGCA AACCAAGCAG GCCGTCGATA GTCCAACAAA GAAGGTCGGC AAGGGCGCTG ATACAATCCA AAAGCAAGGT AAATCAGTTT TCCCAATGTC GTTGCCTCGG TGCCAACTTT TCTTACGTGA TCCATTCTAC CAGGCAAGGA CGTAGCAGAG CAAGCGCAGA AGCGAGATGT TAAACCTGCA ACCCAAGGCA AGGACGCCGC AAAGACAGCG CAGAAGCAAG GTGTTGAACC TGCAACCCAA GGCAAGGACG TCGCAGAGAC CGCGAAGAAG CAAGATGTTG AAGGCAAAGT CCCAGAAGAG GCAGGAGCCC CCAGGACCCC TCAGCAAGGT GAGTAATGTT TCCAGTCACA TTTGGTAAAC ATGGCTTACT CGGGCTTTCG GTACAGAATC TGACGACGAA GACGAAGAAG AAAACCAAGA CGACAACGAA GACGAACAGG AAGAAGACAA TGACGAACCA GCAGCAGAAG AGCCCGAGGA GCAGGATCAA GACCAGGGCG GTTCTGAGGC AGAGTCAGGC GATGAGGACG TAGCTCAAGC TGAGACTCAG GATGAACCTG CTAGTGATGA AGGCGAAGAG CAAGACGAAG AGCAAGTCGA AGAGGAAGAC GAAGAGGAAG ACGAAGAGGA AGACGAAGAG CAAGGCGAAG AGCAAGACGC TGAGCAAGCC GCTGAGCAAG ACGAAGAGCA AGGCGAAAAG GGGGAGGTTT CCTCGGGCGA CAACCAACCT GTACCCGAGG ATCAGGAAGG CGGCGCCGAT CAAGCTCTAG ACGAAGCCAA TGACGTCAAA CCCGAGTTCA CAAGCCCTCT GACCGTTGAC CAAGCTGGCA AGATCAAAGA CGCCGATGAC CGGACTATCG CAAATATGTC TGAGGAGGAC GCCCAAAACC TCGAAGGCTC AGAGATCGCG GACATTGATG CTTCTGGCAA CCTCATGAAC AAGGATGGCG ACGTGATTGG TAACGCACAG CTTGCGTCTG AGCTTGACCA GGCCGAAGAG CAGATAGACT TTAGCGTGCT CAAGGGTCTC AGTCCCAACA AAGCCGGTAA TGTAAGTTCT GCACATCGGA TGTTGCCTCA AACCTTCCCC AGTTGCTGAC TATTAACTAC AAACAGTTGG TTGCCAAGGA TGGCGTCACC GTCGGCAAAG TCGTCGACGG CAACCTCAAA AAGCTCCAGG GTCGACGATG TACCGCAAAG GGCGAATTCT TTGACGATTC AGGCAAAAAG ATTGGACAAG CCGAGCCTAT TCCAGAAGAC GAGAGGTCGA ATCCAGAAGG CGCACCTTTC GAAGACTTTG AAGGTGCTAC AGTCCAGAAA AATGGCGATG TAGTGCACAA TGGAGAGAAG GTTGGCGAGA TCGTCGAAGG AGACGCAAAG AAGATTGCAG GAAAGACAGT CGATGCCGAC GGCGTGAGTG CTCATCCCAA CGATTACCGT CGAAAACAAT TTTTTATTAT TTTGGCAAGC TAACGTGTAG AATCCTATAG GACGTTGTCG ACAAGTCTGG CACAATCGTC GGCAAAGCAC AGCGATGGTA CGAGCCAGAA GAGCAGGAGC CAGAGAAGAC CGATCTCTCT CTCCTCGCCG GCAATCGCGT CAACAAGAGT GGCTACGTTG TCGATAATTC TGGCAAGCTA CTCGGGCGAG TGGTCGAGGG CGATCCTGCG AAGCTAGCTG GCAAGATGTG CGACAAAGAG GGCCAGATTT GGAATGAAGG TGGTAGCGTC GTAGGACGCG CGGAGCTGTT GCCCGAGTCT GAGCGCGAAG GACAGAAAGA AGGTCCCTTC TCCGATTTCC AGCCTGCAAC CGTCCGCAAA GACGGCAAGG TTATTGACAA TTCGGGAACG GTCGTTGGAC GTCTGACTGA GGTGAGAAAC CCGCTCCGCA TATGTGCGCG TTTCCGAAAT CATGCTAACT GATGTTTAAT TTGTTTCCAG GGCGATGCCA AGAAGCTTCA CGGCAAGTCA GTCGATGAGC AAGGCGACGT TGTCGACAAG AACGGCAATA AGATTGGTGC TGCCGAACGA TGGGAAGAAG AGGAGAAAGA AGAGGAGAAG CATCCTGCTG CCGGTCTCAA GGTTAACAAC GAAGGGGCTG TCATCGATAG CAACGGAAAC ACCGTGGCCA AGCTTACAGA GGGCGAGATC GCACGATGCC GAGGCAAAGA AGTCGACAAC GACGGAGATG TCTTAGACGC CAAAGGAAAG AGGTGAGCAA GGATACTGAC TCGATTGAAG ATTTCGAGCT GTGAACTGAT GATGAAATGC ACCACAGTAT CGGCCACGTA ACGCTCATGC AGGATCTGCC TAAGAGAGAC GAAGCTGCCG AGCAGAAGAA AGCAGAGGAA GAGGAAGCAC GAAAGGTAGC CGAACAGAAG GAAGCAAAGG AAGAGGAAGC ACGAAAGGAA GCCGAACAGA AGAAAGCAGA GGAAGACGCA GAACGTAAAA AGATAGAGGA CGAGGAGCTC GAGAGGCTCG AAGGCGACAG CAAGCTGGCT AAACGCATGG CTTACGAAGT ACGTACTTGA GCAGAAATTG CAATCACGCG GATGTTGAGG CTGATCTCCT TTTTCTGCAC TACTAGGTTG ACAGTACACT TGATCGTGTC AAGCCTATAC TCGAAGAAAT GACGGACTTG ATCGAGGATG CGCACTCCAA GCCGAAAGAG GAAGTCAACC AAGAAGAACT TGTGAAGAAA GTGAAGCCCC TTATTCAAAA GAGCTCGGAC ATCCTCAACC AGTGCAACTC CAACATTCGC GAGATGGATC CCGACGGCCG AATGCAGCAA CAGTCAAAGG AGAATGCTGC CACTGGTGAA GCTACTCCAG AAGAGCGACA CCTCGCTGAA GGCCTCAAGA AGCTTTCGAC AGAAGTCACA GATGCTATCA GGAAGGGCAA GAAGGGTATC GAAGGTATGC CCGAAGCTAA GAAGGGGATC AGTCCACTTT GGCACCTGCT CGAGCGTAAG CAAACATGCT CTTTTACTTC CTTCTGTGCG TTGGCTGATT GTGACTGCTT TGTTGAAATA TAGAACCATT GGTGCAAATT CTTGCAGCAG TCGGGTTACT CTTGTCCGGT GTTCTGGGCC TTGTGAACCA GCTCTTGGGC GGGCTTGGTC TGAATGGCCT GCTCGATAGT CTCCTTGGTG GATTGGGTCT TGACAAGCTC CTAGGCGGTC TTGGAATTCC TAAACTTGGT GGAAAATAAT GTGACTGCGA AACAAGGACG TGGACCGACA TTGGAAAAAT CACAACTCCT GACGTACAAA TGTAGCTCTA ATAACCACAC ACACGAGTAG CAATTATTTA AGTACTGTAG C
|
Protein sequence | MSQLGQKKQP PTPSKTPKKA AQNAEKGASG ALGSVKDTAG GATKAAGDTA SGATKQAPSP AKDVGKKVGD TAGGATKQAP SPAKDVGKKV GDTAGGATKQ APSPAKDVGK KAGDTAKDVG GKTSQAQNPT TPKKTPQTKQ AVDSPTKKVG KGADTIQKQG KDVAEQAQKR DVKPATQGKD AAKTAQKQGV EPATQGKDVA ETAKKQDVEG KVPEEAGAPR TPQQESDDED EEENQDDNED EQEEDNDEPA AEEPEEQDQD QGGSEAESGD EDVAQAETQD EPASDEGEEQ DEEQVEEEDE EEDEEEDEEQ GEEQDAEQAA EQDEEQGEKG EVSSGDNQPV PEDQEGGADQ ALDEANDVKP EFTSPLTVDQ AGKIKDADDR TIANMSEEDA QNLEGSEIAD IDASGNLMNK DGDVIGNAQL ASELDQAEEQ IDFSVLKGLS PNKAGNLVAK DGVTVGKVVD GNLKKLQGRR CTAKGEFFDD SGKKIGQAEP IPEDERSNPE GAPFEDFEGA TVQKNGDVVH NGEKVGEIVE GDAKKIAGKT VDADGDVVDK SGTIVGKAQR WYEPEEQEPE KTDLSLLAGN RVNKSGYVVD NSGKLLGRVV EGDPAKLAGK MCDKEGQIWN EGGSVVGRAE LLPESEREGQ KEGPFSDFQP ATVRKDGKVI DNSGTVVGRL TEGDAKKLHG KSVDEQGDVV DKNGNKIGAA ERWEEEEKEE EKHPAAGLKV NNEGAVIDSN GNTVAKLTEG EIARCRGKEV DNDGDVLDAK GKSIGHVTLM QDLPKRDEAA EQKKAEEEEA RKVAEQKEAK EEEARKEAEQ KKAEEDAERK KIEDEELERL EGDSKLAKRM AYEPILEEMT DLIEDAHSKP KEEVNQEELV KKVKPLIQKS SDILNQCNSN IREMDPDGRM QQQSKENAAT GEATPEERHL AEGLKKLSTE VTDAIRKGKK GIEGMPEAKK GISPLWHLLE QPLVQILAAV GLLLSGVLGL VNQLLGGLGL NGLLDSLLGG LGLDKLLGGL GIPKLGGK
|
| |