Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK00140 |
Symbol | |
ID | 3254466 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 40603 |
End bp | 44073 |
Gene Length | 3471 bp |
Protein Length | 883 aa |
Translation table | |
GC content | 50% |
IMG OID | 638253508 |
Product | hypothetical protein |
Protein accession | XP_567586 |
Protein GI | 58260352 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.518899 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTATTAAAAG GAAAATAATC ACCTGTCTTA TAATAGTAAG TTGAGCACTT GCACTCGCTC GCACCATTTG GTTCCCAGTT CATGATCGAG CTCCCACTTT CCAACCTATG AAAAAAGAAA AAACCACAGC TGATACCAAC CAATCCAGAG AAACAAAATA GCAATCGGGT TTCGTCTCCC CACGAATCAC CACAGACAAA AAAGAAGGAA AGAAAAAGGA TGTTCGAGTC CCACCTAAAC TTAAACCCAC ACGGCGCGAT TGGCCAGCCT TCACCTTCTA TAGGTGTAAT CGGCGAACAA CCTCAAGCTA TTGATAATCT CTCGCCCGCG CCCGCGCGAC CGGCTGCACA CCCTCGGCAG GTGGCAGCGG CCAGTCTATC TCAAGGACAA TTAGGAGGTC GACCTAGGAC GCCGATCAAT CGTGATTGGG ATAGGGATAC GGATACGGAT AGGGATAGGG ATGTGGTGAT GGGTGAAAGG AGAGAGCAAG CGGATGTAGA GTCGGATTTG ACGAATTTGT TGGTGCAAAC GCGATTACAT CGGACGGGGC TCTTTTCCGA TCGGGAGGTA CACTCTCAGC CTCATTCCCA TTCCCAGTCC CAATCGCAAT CGCAACCACA ATCAGAAGCA CCCTCCCAAC GCCAACGTCA ATACCAATCC CAATCCCGAT CCCAACCCCA ATCACAATCA CCATCACAAG GAAAGGATGA TGTAACAGGA GGCATTGACG ATTTTCGCAA ACGTTGGGAA AGAGAGAGCC GGGAAGGTGG AAAGAGGTTT GATGAGGAAA CTTGGGATCG GAGTCTCTTG GGCATGGGCA TGGGTATGGG CACGGTGCAG AGGCAGGGAA GAGGGTCGCC TGCTCCTGGA GGCTTTGGTA TAGGTCTGGG TTCGGGTTTG GGTTTAGGTG GAAAAGCGAT GGGTATAGGA GGGACTGTGG GCGGGAGCAC GCAGTCCAAG ATGACTACAG GGACGTTTGC TGGTTATCAA CCTTTTATGG GCAAGATCTC GCCCGTCTCT TCTGCTCAGT ACCGACCTTC TCTCTCTCCC AGTAATCATC ATCTCCCATC TTCGACATAT GGGAATCAAG CACCGTCCCA ACCTCAATCG TATACCCAAC CTCAGACTCC AAGCCATACA GCGGCTCCCA ATACTTTGGT GGCCGGGGCC GGGACCGGCG GGCAAAACGA CACGACTGCA ACCCAGATAA CTCGCCACCT CGAATCCCTA ACCATCCTCC TTAACCCATT GTTGGCCCAG GCCGATGAAG TAGAAAGGCT CCGTAAGGAA GTCGAGATGT GGAAATCCGA GTGGGCACGG GCGGAGAGGG AGAGGAAGAG GTTGGAGGAA AAGGTGGGTG GGATGGGCAT GGAGTTGGAG AAGGCAGTTG GGAAGAGTAG GGATGTGAGT TTTATTTTTA TTTTTTATTT TTCTCCTTAC CGCTCCGCCT CTTATTCTTA CCTTCCCCTC ACCTTTCCTT TTTCACCGCG CCATCTCCAA GCGTGTACAA TAAAATAGGA GTGAACTGAT GCGTGTCGAC TTTAGATTGC TGGACCATCG TTCACTGCGG TATTGATCGA CGGAAACGGT CTTATAGTGA GTGCAAACCC TCCCAATCGA AGCTTTACGC TTGTCTTCTT TACAAGATGC TAAACGTCAA GATTCTTAGT TCCAAGATCC ATACCTTCAA GCAGGGTTCA AAGGCGGTCA GCTCGCAGCC CATCATCTCC TCTCTTCCAT CCCCAACCTC GCACCTGGTT CACCCTCCTC GAAGACCACT CATACCGGCA TTATTGCGAA AGAAGTCACT CTAGGCCTTG ATAGTCTTCC TGTTAATAAT AATAAGGGGG GAGATGACGA TGATTATGGT GGGAAGAAGA CGGAAAAAGG AGGGAGAGAG ATGGGGAGTG TGGTGGTCCA GATTTTCGTT AATAAACAAG GTCTCGGTGG GGCTCTTATC AAGGTAAGCA CGCCGAATAT GGGATTGCTA TTCTGCGCAG TTCTGTACGC TGACAAGCAG GTATTTAGTC TGGTATTGTA CCCTCGTGGA ATGTATACGA CCAGTTCTGG CAGGGTTTAT CTTCCTCACA CGAACTTTTC ACAGGTGTGT TGTTCACTTT TCCCTTTCAT GTTACTTTGT TCAAACCGTA CTTTTACTGA AGTGAATTTT TTGGGAGATA GTATGTGACG TAGGGCAAGG TAAAGAAGCG TCCGATGCCA AGATCAGGGA ATACCTCAAC TTATACGCAA GTAATGCGCA ATGCCGGTCT ATCATCCTTG GTGCTTCGCA TGATAATGGG TACGCCAACG TACTTTCCTC GTACGTCCAG ATTCAAATTT ACCGCCGTTG AGGCGCTAAC GCGTCCATGC CAATCGCCAT TCATATCTCA GATTGCAAAC AGGATCACGT CTCTCTAATG TCGTCCTTCT CAAGGGTTAC GCTACGCTCG CGCCGCAGCT CAAGACATAC TCTAGCCGTG TCGTCTCTAT CCCTGATTTA TTCAGGCTCG AAAAAGTACC ACCCCCTCTA CCGTCTTTTA CTTCTTCTAC TACTGCCGAC GTTGTCTCGT CTATCCAAGC GGGAGGTTCG CCCGACCTCC TTTCCGCGGT CACGGGTCTC GCGGGCGTAT CCTTCTCCTC CATTGTTGCT AGTAGCCCAA AGGACAAAGA AGCCGATTTC GCGGGACCTT ATGGCGCCAA CGTCCGTGAT AACACCAACA CTGCAACATC AGCACGCAAT ATCAGCTCGG CGGATATAGG TAAAGCGAGG ATAAGCACAC CGAAAAGGGA CGAGTATGAA GAGAGTGAAG AGGAGATTGA GGAATTCGAG TACGAATGGG GGAGTGGTGC TCAGTTTAAG GGAAGGTCGG ATCTGGCCTC AGGGTTCGCT CCAGGTAGCG CGAAGAAGAA GAAAATCCCA GCTCCTTTTT CCCGCGAGGC AAATTTTTCT CGTGTAGGAG GAAAAGATAG GGAGATGGAT GATGAATGGA CAGAAATGGC GCCTAAGAAG AAGGTTAAAG GCAAAAGGAA GGAGGCGGCA GAGTATGTGC GGACTTTGAA ACCTCGACCT TGCCATACGT GAGTCATACA TTCCTTTCTT ACATTCTTTG ATAATACGAT AAGGGCTGAT GAGGTACGGG CGTAAGATTT TATTTGGGCC CGCGGGGGTG TAAGAATGGG GACGACTGTC AATATGGCCA TGAGTATAAG CTCAATGCCG CCCAGCTCGA CGAACTCGCC CGCCTGGCAA AGTGCATCAT GTGCCCATAC GTCAAGGATG GACGATGTCG ATACTCGGAT GATGATTGCG TCTATGGACA TCAATGTCCC AACCCTGATA AATGTGTCTT GTACGTTTCA CTGTTCTGTC TCCCAGGCGG GAGTAAGGGG CCGTTCACTG ACAGCTGGAT ATTTAGCGGC GAAACTTGTA GATTTTACGA GTTGCCCAAC GGACATGGCG AATTGAATTA A
|
Protein sequence | MFESHLNLNP HGAIGQPSPS IGVIGEQPQA IDNLSPAPAR PAAHPRQVAA ASLSQGQLGG RPRTPINRDW DRDTDTDRDR DVVMGERREQ ADVESDLTNL LVQTRLHRTG LFSDREVHSQ PHSHSQSQSQ SQPQSEAPSQ RQRQYQSQSR SQPQSQSPSQ GKDDVTGGID DFRKRWERES REGGKRFDEE TWDRSLLGMG MGMGTVQRQG RGSPAPGGFG IGLGSGLGLG GKAMGIGGTV GGSTQSKMTT GTFAGYQPFM GKISPVSSAQ YRPSLSPSNH HLPSSTYGNQ APSQPQSYTQ PQTPSHTAAP NTLVAGAGTG GQNDTTATQI TRHLESLTIL LNPLLAQADE VERLRKEVEM WKSEWARAER ERKRLEEKVG GMGMELEKAV GKSRDIAGPS FTAVLIDGNG LIFQDPYLQA GFKGGQLAAH HLLSSIPNLA PGSPSSKTTH TGIIAKEVTL GLDSLPVNNN KGGDDDDYGG KKTEKGGREM GSVVVQIFVN KQGLGGALIK VSTPNMGLLF CAVLYADKQV FSLVLYPRGM YTTSSGRVYL PHTNFSQVKK RPMPRSGNTS TYTQVMRNAG LSSLVLRMIM GSRLSNVVLL KGYATLAPQL KTYSSRVVSI PDLFRLEKVP PPLPSFTSST TADVVSSIQA GGSPDLLSAV TGLAGVSFSS IVASSPKDKE ADFAGPYGAN VRDNTNTATS ARNISSADIG KARISTPKRD EYEESEEEIE EFEYEWGSGA QFKGRSDLAS GFAPGSAKKK KIPAPFSREA NFSRVGGKDR EMDDEWTEMA PKKKVKGKRK EAAEYVRTLK PRPCHTADEL NAAQLDELAR LAKCIMCPYV KDGRCRYSDD DCVYGHQCPN PDKCVFGETC RFYELPNGHG ELN
|
| |