Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB03870 |
Symbol | |
ID | 3255923 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1142747 |
End bp | 1145604 |
Gene Length | 2858 bp |
Protein Length | 896 aa |
Translation table | |
GC content | 52% |
IMG OID | 638255033 |
Product | conserved hypothetical protein |
Protein accession | XP_569216 |
Protein GI | 58264120 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTCCGCCC ACAAGATGCG TCCACAGCAT CTCCTCCCGC TCCTCCTCCT CCTCCTCGCG CCTGCCATAC ATGCAGCGGT GCTCGCCATC GACTACGGCG CAGAGTTTAC AAAACTCTCC CTCATCAAGC CCGGCGTGCC CTTCGATGTC GTTTTAGACA AGGACAGTAA ACGAAAAATC GCGAGCGTAG TAGGATGGAA ACGAGACGAA CGAGTATTCG GCGCCGAAGC AAAATTGGCT GTAAGTAATT GACACGCTGA TAAAAACAAA TTGGCTAAGT AAAGCAGGCC ACGAGGTTCC CCGACACCCA CTACCCTTTT ATAAAACCTC TTCTTGGCAC CACTACTCCC AACACCTTTC CAGTATACCC CGTCAACCCT CATGTTACCA ATGACACATT ATACTTCCCT CACCCCTCGC CTCCCTCCTA CATCTCTCCC GAGCTTGTTT CTCCTGAAGA CGCTTGGACG CCTACTGCCC TCTTGGCACA GCAGCTCTCA TATTTCCGCC ACCTCGCAGA GTTGGTTCAG CCTGCTGGAT CAAACAAAGA GAGCATAAAC TCTGTTGTTG TCACCGTTCC TGCGTGGTGG GATCAGGCTC AGCGCCGTGC GTATCGAGAT GCGTTGGAAC TTCAGGGTAT GAATTGCTTG GCGATGATCT CAGAGGGTAC CGGTGTTGCA CTCAATTATG CCATGACAAG ATCTTTCCCC AACTACGACC CCGTAACTGG ACAAGGGGAA AAAGAGTATC ACATTGTTTA TGACTCTGGA GCTATGACCA CCACCGCTAC TGTCTTGGCA TTCTACCAAA CTAGCGAATA CGCGACACCG AAGTCAAAGA CAGCGATCAA CACAACTCAC ATTGAAGTGC TTGGAACTGG ATGGGAGCAC GTTGGTGGAG TGATGCTGGA CACTGTTATC CAGGATATTC TCCTCACTGG CTTTGTCAGT AAGACTGGGC GGGAGGAGGT CAGACAGGAC AAGAAAGCCT TGGCCAAGGT CGCAAAAGAA GCTACCAGAG TCAAGCAGAT TTTGAGTGCC AACCAGGAAG CCAATGTAGC TGTAAGTATT GGATCCACCA TTATCGAGTT TGGCTAATCT TTTAAATAGA TCGAGTCTCT CTTCGATGAC GTCGACTTCC GCTCAACTAT CTCTCGTGCC GACCTCGAAA AAATTGTGGG AGCTGTCGAC CAGTTGTACG GCAGCCCTGT CATTTCTGCT CTCGAGGCGG CGGGTTTACA ACTTGGAGAT ATCAACTCTG TCATACTCTT TGGTGGTAAT ACCCGAGTTC CTCTTGTACA GGCCTCCCTC AAATCTGTTC TTGGCGGTGC TGAGGACAAG ATTGCGCAGA ATGTTAACAC CGACGAAGCC GCCGTGCTTG GTGCCGCATA CTATGGAGCT GCGTTGAGCA AGCAGTTCAG GATCAAAAAC ATCGATATCA AGGAAAGGAG TGTCAGCGAG ATTGCCCTCA AAAATGGCAA CGCAATCTTC CCCGAGGGCA GCGTTCTAGG CGAGCGAAAG GCGATCACGC TTCCTGCCAA GGGAGATGTG ACTCTTGAGT TCACTGAGCG CATTTCTCAT CCCGACAGTG CCCACGCATC CTCTAGCGAG CCCCAGTCTA TTCTCTCCGT TGAAGTTCAC GACGTCGAAA AAGCCCTTGC AGATTTCACT GCCCCTGAGC CTGTCATCAA CATCACTATG CGTCTCGATC CCAAGGGTCA TGTATCGGCT GCCAACGCTG TCCTTGTCTC TAATGTTACC GATTCAAAAG ACGGTGGTGT CGCTGGCGCT ATCAAGGGCC TTTTTGGTAG CAAGGAGGAA GAAGCGAAGG AAACTGAAGA AGACGAGGGG CAGAAGGACG CTAAAGGCAA ATCTCTCAAA GTAGCCCTCA AGTTTCGCGA GAAGCACCTC GGCCTGAAGC CTTTGTCTGG CGAAGAGAAG CGTACCACCA ACGCCCGTCT TATTTCCATC TCTGCCTTTG AGGCCGCCAA GGCGTCCCGC GAAGAGGCTC GCAACTCACT CGAGTCGTAC CTCTACGCCC TTCAAAACTC CCTCAATATC GACGATGGAC CTACCGCCCT TACCGACTTT TCTACTCCTG CCGAGCAACA GGCTTTGAAA AAGCTTCTGG GCGAGACATT TGAGTGGTTA GGCGAGAATG ATGAAGTCGC AGAGGAGTCC AACTTGAGGA GAAAGTTGGC TGAACTTGAG GGGTTGGAGA GGCCCGTCGT TTTCAGGTAC AACGAGTACC GCGCGAGGGA TAAGGCCGTC GCCGACTTCC AGCAGGCCAT GCATCTTGCT CGTGCCTTCT TCATTGACGC TCAAACTAAC TACACGAAGG CTATGGAAGC TGCTGCCACC GCTACTCCCG AGGACCCAGT GGCGCCTCCC AAGCATACAG AAGAAGAGTT GAAGGGTGTC GAAGCGCTCT TAAAAGAGTA CACTCAGTTC ATCGATGAAA AGATGAAGGT GCAAGTAACT CTGGACGGAG ACAAGACTAA GGACCCGGTC ATCACAGTCA GAGAACTGCA GGAGAAGGGA AGGCGACTCC AAGCTACTGT AAGTACCTCA GCATCAATGT TTAAGATTGA CTAACATTTT TACAGGTCTT GACTCTCCAG AAGAAAAAGG CCCCTCGTAA GCCTCGACCG ACCACCTCTT CATCCTCTGC AACGAGCTCT ACTACTCTCG CCTCCCCCAC CGACGACGGT CCTTCCCCTG ACGTTACCGA GTCCCCATCT GACGTCTCTT CAACAACCCT GGCTTCACCG ACTGATCACG GGCCATCATC AGAGACGTCG GCCGCACCTA CTGAAGGTTC TGATGCACTT AGACATGAGG AGCTGTAGTG TAGCATAG
|
Protein sequence | MRPQHLLPLL LLLLAPAIHA AVLAIDYGAE FTKLSLIKPG VPFDVVLDKD SKRKIASVVG WKRDERVFGA EAKLAATRFP DTHYPFIKPL LGTTTPNTFP VYPVNPHVTN DTLYFPHPSP PSYISPELVS PEDAWTPTAL LAQQLSYFRH LAELVQPAGS NKESINSVVV TVPAWWDQAQ RRAYRDALEL QGMNCLAMIS EGTGVALNYA MTRSFPNYDP VTGQGEKEYH IVYDSGAMTT TATVLAFYQT SEYATPKSKT AINTTHIEVL GTGWEHVGGV MLDTVIQDIL LTGFVSKTGR EEVRQDKKAL AKVAKEATRV KQILSANQEA NVAIESLFDD VDFRSTISRA DLEKIVGAVD QLYGSPVISA LEAAGLQLGD INSVILFGGN TRVPLVQASL KSVLGGAEDK IAQNVNTDEA AVLGAAYYGA ALSKQFRIKN IDIKERSVSE IALKNGNAIF PEGSVLGERK AITLPAKGDV TLEFTERISH PDSAHASSSE PQSILSVEVH DVEKALADFT APEPVINITM RLDPKGHVSA ANAVLVSNVT DSKDGGVAGA IKGLFGSKEE EAKETEEDEG QKDAKGKSLK VALKFREKHL GLKPLSGEEK RTTNARLISI SAFEAAKASR EEARNSLESY LYALQNSLNI DDGPTALTDF STPAEQQALK KLLGETFEWL GENDEVAEES NLRRKLAELE GLERPVVFRY NEYRARDKAV ADFQQAMHLA RAFFIDAQTN YTKAMEAAAT ATPEDPVAPP KHTEEELKGV EALLKEYTQF IDEKMKVQVT LDGDKTKDPV ITVRELQEKG RRLQATVLTL QKKKAPRKPR PTTSSSSATS STTLASPTDD GPSPDVTESP SDVSSTTLAS PTDHGPSSET SAAPTEGSDA LRHEEL
|
| |