Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC00300 |
Symbol | |
ID | 3256822 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 88126 |
End bp | 92215 |
Gene Length | 4090 bp |
Protein Length | 952 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255249 |
Product | hypothetical protein |
Protein accession | XP_569407 |
Protein GI | 58264502 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.182785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAACAGCCA GGTGATTGAT TACTCCCATC ACAAATATGA GAAGGAGATA GTGGTGCTTG AGACTCGCAT TCTCAAAGTC AAACAAAGGG ATTCCTACAA GCCGTACAAT GGACCCATAT TCCGTCGACT ACACTCGGCA CTCCCAGTAC CAACAGTACA ACCCGGTTCG GATAGCCCCA GGTCTGGAGG CTAGTTCTAG CAGGACAGCA TCGACCAGTG ACATCTATGG CCGGGGAAAC AGCGCATATT CCTCCTCGTG GACTGGGGCG AGTCCGGTAG ACGAAAGACC TGCCCGAAAG CTTGACAGGC CATGGGAGCA TGAAAACGAC ATAAAAGGGT TGAGCCAACC TCAAGCGTCG GCACAACAGC CCCCTCAGCA GCAACAGTCA TTGCAGCCGA GCGTATTTCC TAGCTCCGGC CCTACGGCCA AGAGAGGAAG TAGAGCTTGC ATGGCTTGTA AGTGAGCCAG ACCCTCTGAA TAACATGCTC TTATCTACGG ATGTAGGTCG ACGGGGCAAA AATAGATGTG AATGGGATCC CGCTGGCAAA GAGACATCAT GTCGACGATG TCTCATGAAC GGCACACAAT GTGTATTCGA GAAGCCGGCA GAAAAAAACG GAACGCGAGC CAGGACGAAT AGCAGTGCGG TGGCCGATCA TCCAAACGAA GCTGAAAAGT AAGTTAATTC CAAAAGCAGC CACATGCATC AACTCACCGC GAGTAGTCGG ATAACTAGTC TGGAGAAGAT GGTGGAGACC CTTTCAAGCG GACAAACGCA AATGCAAACC ACGGTATATA TGGCTTGACC ATCCGTATGG TGACACATAC TAATTAATCC CGTAGCTTCA ACAAATCCTC ACATTGCTTC CTCAAACTCA ATCTCTATTA ACGCCCACAG GACATCCGTC ATTCCTTTCG CACAATACCC CTTCTACTGG TGTACCACCT TCGACCATAT TCGCCTCAGT ATCTCCGCCT CAGACCTTCC ATGATGGTGC TGGAATGTCT AGTGTTAGGG ATCACGCACA GTCGGATAGA CCTCGTTCAC CTCAAAATAC TGCTAGCGGT CGAGGTATAT CGAACTCGAC GATGAACAGG GTCAAGTCTG TGAAAGAGTT TCCTCCTTTG CCAGGATTTG CACCTCCAGT ATGGTCATCC CCTCCCATAA TGTACTGAAA TATCGCTAAT GACATGCAGA ACCATCGTTT CGGTACTTAC GGGATCATCC CTCTACCTTC AGCTCCGCCT TCACCATCAC GCTCCCGTCA CTCTTCCCGG TCATCTTCTT CCAGCCGATC CCGCGACGAG TACTCTAGCG CTTTACCGCA TGAAACCCTC ACAGCACCTA TCCAGGCGTT ACAGGCGCTT GCGAATGCCG CAGATCAAGC AGCTGCTCTG GTTGATAAAG ATGATGAAGG TGAAAAAGGG AAGTCGACGA GTCAGGAAGT CCGGTCGGAT GGAAGTGATG AAGATTCTCG GAGAGGGGAA GGTGAAGCCA AAGGAGGAGC TGAAAGGCAG TGGAAGTTGA ACGGTGAGGA TGATGAAGAT AGAGGAAGAA GTAGGAAAAG GAAGAGAGTG ACTATTGATG GACAGAATAT TCATTTGAGA GTTCGAAAGA AGACTAGACC GGATCCAACT CCAAGGAACC CGTTCCCAGA TGTGGTCACC AAAGGTTTAG TTTCAGAGAA AGAGGCCAGG GAGCTTTGGG ATATGTGAGT ACCTTTTCTA CGCACGGGGA AAGCCGCTGA ATGAGTCATT GCCAGATTCT TCTCTGGCTG CCATTACTTT GTACCACTGT GGGACAAGTC TTACGACACA TTTGAGACTT ATATCGAGCG CACACCATTC TCCACCAACG GTCTTCTCGC CGTAGCGGCC AAAATCAGAG CAGGTAATGG GCCAGTGGAT GAGACATTTC ATCGATGTTT AGAGGAAGCG CAGGGTATCG CGAGAAGCAC ATTGTTCGGA CCGATAGTGA GAAAGGAGGC GGTGATGGCG ATGTTGATAT TGAGTCTTTG GAGTCAAAAT GGATGGTTGC CTTGCGGGTC AGTCCTGAGC TGTTACAGTC AGGCGTACTT GGTGAACTGA CTTGATTGAT TTAGACACGC TTTAAGAATG GGGTTGGATA TTAACATCCA TAGGGCTTTG GACAAGTTGG CAAATAAGGA GGACGGGAGA ACAGAAGCTG AAGAACGGGA TTTAGGTGAG TTTCCTCTAA CACTACTACT TGATCTTGAT CTAACGTGGA GACAGTCGTG TCGGCGAGAA TCTGGTTAAA TTGCTACATG CATGAGCACT TGTCAGTCTG TTCTCGAATG GTTTTACCGT AAACCTAACT GACAGCCTTA CAGAGTTAGC CTTGGGACAG GTAAACCAAT ATTATTACGC GATGATTCCT CGGTAAGGAA GGCCAGGGAG CTTTTGTAAG TCGTAGCTGT ATTTGAGACT TGGCTAGGAT ACTGATTGAT TTATAGGGAT CACCCAATGG CATCCGATAC GGACGTGCGA TTAATCGCTG GCGTAGAATT GGTCAACATC AGAAGTATGC CACTCTTCTC TGTCTGGTAT GACCAACACT TACATGCACG ACCTCTTAGT CACTATTCTC GAGCATCTCA CGCCTCTTCA TGGTAGTACA GACGCACCTA CCATATCATT TGTCAAGAGC AAACTTGCAG AACTTCAAGC GTGGCATCAA GAGTGGTACT GTATTCACAA ACGCCGATAT GACGATGAGA GTGTGGTGGT AAAATTGCTG GAGACGGAGA GGGTTTATGC GGAGCTGTGG ACAGTCTGCA TGGCTCTCCG AGGTTGTTCT TGGGATAAGG TAAGTGAAAT AGGGTGTTGC ATTAGTGAAA AAAAGCTGAT TAAACCCCGG TGTAGTTATC GCATGAACAA AAGGAACTAG CTTTTCATGC CAAAGACTGC GCGTGGAGGT GTCTTGAAAT TTTCTTGAGA TCCGATAATT TTAGGAAGCA TTTAAAATAT GCCACCCACG ACCAGCTTGT GTCAGTTGCA TTTGCAGCTG TCTTCCTACT AAAAGTGTAA GCCTTCATTA CTTAAATGAC TTGGTTCGTC AGATGTTGAT AAATGTCATA GTGCGATCCT CTACCCAACA TCTCTCTCCA TCCCTCTCCT CGTCAGCCAA GTATCTGAAC TTGCACACTG CTTATCGGCC GAATGCTTCG CTGAGCGATA CGCGCTCACC CTCCGTCTCA TGCTTTCCAA CTTCCGCCGC ACAACCGGTG CCATGTCCAC CATACCCGGA ACCCCGCGTA CGGCAGCTTC AGCAGCCGCA GGAGCTGTCC ACGGACTAAG CGGACTGTCT ATGACTGCTC TGAACGTGAA CGGCCAACTG AACGATGTGG AAGGAAGTTT CCAAAACTTT TTGAGTCTAC CTCAGATGGA AAATCATAAC ATGGAAGTAG GTGGAGAAGG AGAGAATGCG AGTAATGGTA TGGGGGCTAC AGGTTTTACT GGGACGGAAG GAGATGGGGG CCTATGGGGT ATGATGGGTT TGGAAGGGTT TGATTGGCCG ACGGAGCTTA GTCCGAGCTC TTTGCCAGTG TGGCTCCAGG ATGGGGTGGG TGTATTTATC TTCGTGATGG TTTGGTTAGT GCTAACTACG AAGCGTAGAA TGCCACAGAT CTTGGCTTAC CAGTCGATGG GTCAGACTCC CTCTTCTTAC CTGTCGAGTA CGTCCTATAT ATTCGGCTTC ACTATAGACA TCTAGTATAT TCGCTAACCA TTGTCGCCTT TTGCAGACTG GCAAACATGT TCATGCCCTC CAACTCTCAG ACCAGCGGAT TGTACCAGTT TACACTCCCA GATTCTGGCG ATGTGGGTAC AGAGGCATGG TGATCAGGAA TTGTATATCT GTTGGTAGTG TAGGAAGAGT AGCCTTAAAA GGGTGCTATC AGACTGTCGT GTGTTGTAGA TGTACCGTGA TCTTGTAACT TAGGGTCTGC GTTATGCCAG GAAGCAGAAG AGAGAGGCCC ATCTTCGGGA AGAGATAAAG ACGAAAGAGT GTTCACCTTT CCACAGCGAG AGGTAGAGTT TTGCCTTTCA TTATTCCTCT AGGATCTAAT
|
Protein sequence | MDPYSVDYTR HSQYQQYNPV RIAPGLEASS SRTASTSDIY GRGNSAYSSS WTGASPVDER PARKLDRPWE HENDIKGLSQ PQASAQQPPQ QQQSLQPSVF PSSGPTAKRG SRACMACRRG KNRCEWDPAG KETSCRRCLM NGTQCVFEKP AEKNGTRART NSSAVADHPN EAENRITSLE KMVETLSSGQ TQMQTTLQQI LTLLPQTQSL LTPTGHPSFL SHNTPSTGVP PSTIFASVSP PQTFHDGAGM SSVRDHAQSD RPRSPQNTAS GRGISNSTMN RVKSVKEFPP LPGFAPPNHR FGTYGIIPLP SAPPSPSRSR HSSRSSSSSR SRDEYSSALP HETLTAPIQA LQALANAADQ AAALVDKDDE GEKGKSTSQE VRSDGSDEDS RRGEGEAKGG AERQWKLNGE DDEDRGRSRK RKRVTIDGQN IHLRVRKKTR PDPTPRNPFP DVVTKGLVSE KEARELWDIF FSGCHYFVPL WDKSYDTFET YIERTPFSTN GLLAVAAKIR AGNGPVDETF HRCLEEAQGI ARSTLFGPIV RKEAVMAMLI LSLWSQNGWL PCGVSLGTGK PILLRDDSSV RKARELLDHP MASDTDVRLI AGVELVNIRI TILEHLTPLH GSTDAPTISF VKSKLAELQA WHQEWYCIHK RRYDDESVVV KLLETERVYA ELWTVCMALR GCSWDKLSHE QKELAFHAKD CAWRCLEIFL RSDNFRKHLK YATHDQLVSV AFAAVFLLKV AILYPTSLSI PLLVSQVSEL AHCLSAECFA ERYALTLRLM LSNFRRTTGA MSTIPGTPRT AASAAAGAVH GLSGLSMTAL NVNGQLNDVE GSFQNFLSLP QMENHNMEVG GEGENASNGM GATGFTGTEG DGGLWGMMGL EGFDWPTELS PSSLPVWLQD GNATDLGLPV DGSDSLFLPV ELANMFMPSN SQTSGLYQFT LPDSGDVGTE AW
|
| |