Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB02550 |
Symbol | |
ID | 3255949 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 741698 |
End bp | 744829 |
Gene Length | 3132 bp |
Protein Length | 816 aa |
Translation table | |
GC content | 47% |
IMG OID | 638254905 |
Product | hypothetical protein |
Protein accession | XP_569178 |
Protein GI | 58263801 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCACTCACCC TTTGCCTCCA AGTTAGCTTC TGGAATCATA TATTAAGGCC TGGAAATAAT GGCCTTCTCG TCGATTCCAT CCCAAATCCG CACTCTCACA CAGCCGCCGC CTCCCAACGC ATTGGCATCT CAGATCCGAG CTCATGTCGA CGCACATTTC TCTGATGTAG CCGCTCTTAT AAAGCCTACA TATCGAGACC AAACTCAGCC TAGTAGTGGA TTACGGAAGA AGAGAAGGGA AGGCCTCGAT GATGAAATAT CTCACTGGGA GGGAAAAGAC ACCAAGGCTG CAAAGGAGGT ATGTATCGGA ATTTGAAGAC AACTCGTACT CATATCGGCC ATTAGCTTCA AGAGACAAGC AAGGCGCTTC CATCTCTCCT TTCAAATACT CAGGACTCTC TACAGCAACT TCTTCAATCC GCCCAAGAAT TATCTCTACA AAGATACAAC CTGGCCGACA AGCTCTCAGA GCTTATTGGT GACATTTCAG GGTCGGTAGA GGGACAAGGA ACTCTGGATG CCGAAGCAAA GATTAGAAAC AAGCAGAGAA CGATCTTAAA TGAACTCGAA GGGTTGCAGA ATGACCTAGG AAACCTCCAA GCTGGTCTCG CGTGGACGAA CATGTTGGAA GAAGCCGTCG CGTTGAGGTA GGGCAGATCT TTACTTGCCA AAGATCCGAG CTAACTCATC TGCCAGTGAC TCCACTTTGA ACCCTCAAAA TCACAAGCCA TCACCCCTGG TAGCCTTACC CCACTACAGG CGTCTCCATG CCCTAGTGGA AGGGATGACC AAATCTTTGC CGAAAGAGAT GGGCCTTTTG CATGTGGTGA TGGATATTAA AGAGCAGACG TGGCAAGGGT TGAAGGATAT CATGTCTGAG TAAGTGATAG CTTCTGCTTG AGCGCTCCGA TTGCTAAACA AGCAATGAAT AGGAACCTGT TAATCGCAAG CGAAGCATTA GGTTGGCCGA AACAGGTTGT TTATGAGAAT GTTCCTTTGG AAGCAAGACG TTTGTTTGAA CGAGCGTTTC AAGATCTTTT GTATCTGCAA GCGGAGTGTG TTGCTTTTCT TATAAAGTAA TACCTTACGT AGATACTGAT ATGCGGCACA GAAAAGAATC TCTTGAAGAA ACTGGCGCTT CCAGGCACCC ACAATGGTCG CTTGGAACGG GTTTGTATCC CGTACAGGCC TTGGTGCACC CGATTGAACT GAGATTTAGA TATCATTTCA TGGGAACCAA AGGTACCAAT AGGATTGACA AGGTAATTCC GCCACATTAT TCCTTTCTCA ATAAAACGCT CACTGAGATA CAGCCAGAAT GGGCTTTTGC CAATATCCTT GACCAGACGT ACATCCACCA GACGTTCCTT GCCACTTACA TTCAAACACT CACGTCTCAG GCCGGCTACA CTTCAGTATC TGTCAAGTCC GAGTTTACCC TTCTCCTCCT CCCTATCCTC CTTTCTCTTT TACGGGCTCG TATACCTCAC CTTTTGGATC ACCCGGCCCT GTTAGCGCAT ACTGTATATC AGACTGTTGT GTTTGATGAA GCTGTTAGGG GCGGAGGGTT TGATTTAAAA GCTACGAGCT TGTATGAGGG CAGAGATGCA CCAGCTTGGG AAGGTCTCGT CGGCGTTGTC CTTAGGGAAG ATGATTGGTT TGAACGATGG TTAACAGGGG AGAAGAAGTG TAAGTTATCC TTTTTGGGGA AAACAGCATA CTAACCCGGA GTAGTTGCAA ATGCCCGCTT GCAGGATATA ATCTCTGCTA ACGATGCCTG GGTCATCAGT GAAGAGTTGC CTGAAGAAGA CGAGGGTCTA TCTAACATGC GTCCAACTAT TAGCTCTCGT CAGGTGAAAG GACTCGTAGA GCAAATAATT GGTAGGTGAT ATTCAGCATC GTCAGGGGCG ACACGCGCTA ATTGAGAACA GATCGTTATG CCCCACTTCC TGAATTGGAG TACAAGCTTC CTTTTCTCCT TACCGTCCAG TTCCCCATCC TTGCAACATA TCAAACCCGC ATTTCAGGCT CGCTAGATGC TTTTGAGACT CTCTCATCTG CTTTCGTCAG AGCGGTCCCA GGAGCCCTAT CCGGAAACAC AAGATCGGGG ATCAACTTTG ATCAGAGGGC TCTGACTAGC GGGAAGATTG GTGTTGAAAG GCTGGTGAAG GCGTTGTTAA GCTCAGACTG GGTTGGAGAA GCAATGCGCA AGTGGGCAGA TGGGATAGTA AGTTACTCTA CCTCGAAAAT GTATGATAAT AACACCTGGT AGTTCTTTGT GGAGCTATCC AATGACCTGC ACAACTCTAC AGCCCTCAAA TGGAAAATTC AATCAGACCC CCTCGTCCCG CAATCCATCA AAGCACCCAC GGCGGCTGAC ACGTCATACC AGACTGCATC AATATTTGAC GTGTTGATTG GACAATATGA GCAGTTATCT AGAAGAGCAG AGGATATGAT TGTTAAGCTT GTAACAGTAG AGGTGGAAAA TGAATTGAAG CAGCATTTAA CGAGGTTTGT TGTTCTTGTA TTCATATCCA GGGAAAGAAA GTTGACAGGT AACAGGCGAT GGGATAACCC TCCATCAGCA GAACCCATCA ACCCCTCAGC TCATTTCGTC TCTGCTCTCA CAACATACAC ATCACACATC TCTACCCTTC TATCACTTCT TCCTTCTCTC ACCGCCGCTC GTCTATACAG ACGTATCGTA GACGAACTTT CTAGGCACAT TTTGCAACGC GGGGTGTACT CTGGATGGTC AAAGTTCAGC GAAAAAGGTG GACAAGACTT CCGGGAAGAG ATTAACGAAT GGAAGGAAGT CACTGCTCAA GTTTTCAGAA GTAACCGATG GAACAAGGAT GTATGGGCGA TCCCTTACGA TGCCCCATGG AACAAACTCG TTCATGTTAG CAAACTTCTG TCTTTGCCAA CAGCACCTTT GCCGCAATCG GAGAACGACC AAGAACCAAC ATTTTCTCAA GCGATGGCCG TCGCCTGGAC AGACAGTTCC AATTTGGTCG AGTTTGAGGA AAGGCTGGGG GTGGAAATAG GAAAAGAAGA GATGCAAAGG ATTTTGAGAA GAAGGATGGA ATGCTGGCGT TAACACTTTT AATGTGTAAT AGATATAATT TC
|
Protein sequence | MAFSSIPSQI RTLTQPPPPN ALASQIRAHV DAHFSDVAAL IKPTYRDQTQ PSSGLRKKRR EGLDDEISHW EGKDTKAAKE LQETSKALPS LLSNTQDSLQ QLLQSAQELS LQRYNLADKL SELIGDISGS VEGQGTLDAE AKIRNKQRTI LNELEGLQND LGNLQAGLAW TNMLEEAVAL SDSTLNPQNH KPSPLVALPH YRRLHALVEG MTKSLPKEMG LLHVVMDIKE QTWQGLKDIM SENLLIASEA LGWPKQVVYE NVPLEARRLF ERAFQDLLYL QAEKESLEET GASRHPQWSL GTGLYPVQAL VHPIELRFRY HFMGTKGTNR IDKPEWAFAN ILDQTYIHQT FLATYIQTLT SQAGYTSVSV KSEFTLLLLP ILLSLLRARI PHLLDHPALL AHTVYQTVVF DEAVRGGGFD LKATSLYEGR DAPAWEGLVG VVLREDDWFE RWLTGEKKYR YAPLPELEYK LPFLLTVQFP ILATYQTRIS GSLDAFETLS SAFVRAVPGA LSGNTRSGIN FDQRALTSGK IGVERLVKAL LSSDWVGEAM RKWADGIFFV ELSNDLHNST ALKWKIQSDP LVPQSIKAPT AADTSYQTAS IFDVLIGQYE QLSRRAEDMI VKLVTVEVEN ELKQHLTRRW DNPPSAEPIN PSAHFVSALT TYTSHISTLL SLLPSLTAAR LYRRIVDELS RHILQRGVYS GWSKFSEKGG QDFREEINEW KEVTAQVFRS NRWNKDVWAI PYDAPWNKLV HVSKLLSLPT APLPQSENDQ EPTFSQAMAV AWTDSSNLVE FEERLGVEIG KEEMQRILRR RMECWR
|
| |