Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB04070 |
Symbol | |
ID | 3255609 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 1196865 |
End bp | 1201195 |
Gene Length | 4331 bp |
Protein Length | 1281 aa |
Translation table | |
GC content | 53% |
IMG OID | 638255051 |
Product | conserved hypothetical protein |
Protein accession | XP_569231 |
Protein GI | 58264150 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.217491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGAAACAGCG CGCCGAGAAG AGAAGAAGAG TATAGACATA GCAGACACAG GGACACGAGA GCAGGCCAGC GGCAGTCGTC CCACACGCCA TACACATAGC AGACCGCTCA GGCAGCATGT CTCGCGCCCC CTCGCCTTCC ATCCCAGCCA GCCAGTCCGT CTTGCCACTG TGCGAGTCCA CGTCAACCAC GCCCACGCCC ACGCCCGCTC GTCCCGGCTT GCGCATCGAC GTCGATTCCC TGCCTTCCTT CCCTGTGAGT GCCAGTGACA AGCACGATGT CGCCATGGCA GTAAGAAACC TGCGAAGCGC ATCGCCGCCT CCGCCTCCGC CGAAATCCGC GAGCCGTCTA CGCGCAAATA CCTCTCCGCC AGATTCTGGA GGCGTGGCGC TTGCACCTGC GCCCGGTATA GACGGGTCGG GTCAAGCTGG TGTCGGAAGC TCAGCGTCGG CCTCGACCTT GGTGTTTAAC GAGGAAGAGA GGACTCTGAG GATGATCAAG CAGGCGCTAG AGAAATCAGA CGATGACGGA GATACATTAG ATCTGAGTAG AAAGGATATA GATCGGATAG GAGAAGAGGC GGTTGAAATG TTCAGAAGCG GCGTTGGCAA AGGTCAAAAG GGTGTATGGC GGTGTGTCCG GTCTTCATGA ATAGAAACGT GCGCTGACTT ACGACCTGTA GGCTGGCTCT GTCTTACAAC TTTCTGAGGG ACGACTCTAT GGTAGACTCA TTCTCCAAGC TGAGCAGATT ACGGTACCTC AATCTCAAGG GAAACAAGTT TACTCGGTTT CCTCATGCAG TATGTTTTCC TCCATGTCTA TCATATGTAT ACTGACCGTC TGCAGATCAC GCAAATGCCC GCTTTGGAAA TCCTCGATTT CTCAAAAAAC GAACTGTCCT CGCTTCCCGA ACAGCCCGGA CACTTGGTCA AGCTCAAAGT CCTTTCGTTG ACAAGTAACA AGATCCACAC ACTCCCTAAT TATCTTGTCG AGTTCGGGGT GTTGAAGGTG TTCAAAGTGG ACCAGAACCC TGTAGACTGG CCTGTATGTC ATCTCATCCT GTTTGAAAAC TGTTATATAC CCCCACTGAC ATGCGCAACT AGCCTGCCCA TGTGCTTGGT CCCTTGATTG AATCTGTCCC AACCAAGTCC AAGACGAGCA GTACCCCTGG AGGCAAGAAA AAGGCAAAGG AGGAGGATCT GCGGCCTTGG ATTGAGAGTA TGAAGGTCTG GATGGCCCAG CAAACACCTG GTTCCGAGAA CGCGCCTCGC AGAGGCGAAG AGGAAGCGTA CCTGGCTTCA GAGTGAGTTG AAAATCAATT TTCGAAGCAG GTTAGCATAC TAAACATAAC TGCAGGGAGG AACCGTCGAG CGCAACGTCC ATTGAAAATC CCGGAGCGAC AGAATGGCCT GAAGTATCAG AAAGCTCTTC CACGGTCGCT CCTTCTTCTC AAGCAACCAT CCGTCGCACA CAGCCACTTT ACGGAGATAC CCCTGATTTA GTGTTCGCCC AGCGAAACAT CTCGACAACT TTTTCTGAAG ACTCTCTGTT ACGCTCGTCC TCTCCCAGTC ATTTCTCTCC TGCAAACCAC AGTCGCGACG CATCCGTTTC TTCCTTCACC TCCCCTCCCT CTGCATCCAC AGATGCGTCC TTCCACTCGC ACTCGCGTAA TCCCTCTACC AACTTTCCTC ATCTCGCTTC CCAACCCGTC CAGGGACATA CTCGCGGCGC GAGCTATACT CCTGCACAAC GCAACTCTGG GCAGTTGACA GCAAAGAAAT CATTACCAGA CTTGCGTCAA AGCCACGCAA AGATTATCCG CGAACGCAGA GGCGATGCAG ACGAAGAAGT GGTCACCATG GTGGCGGCGC CAGATATCGA TCCACGGGAA ATTGGGGTAC GATCACCTCC ACGTTTGGCT AGTGCTTTAC GACCGAAACG AGCTAGTAGG GTGATGGGAC GGAAGACATC TGTGGATTTG CTGCCTCGCA AAATGGGAGA AATGACGAGC CACGACATGG CAGATCGGAA CGTTTCTCAT GACAGTATCC TCGACGAATC TCGCAACTCT TATTTTCGCC GTCTTTCTAC CCTGCCGGCG TCCTCCATAT CTAAAGCTAT CCCGCCTTTC CTTCTCAAGT TCATCGACTC CATCCGAGGT ATACTGTTTG CGCTTTCACA ACTGCACAGT TCGCTGAGGC AATATCTCGT TTTTGCTGTC AACGAACGGG TTTCAAGCGT GTTTGGACGG GTGATGGAAC CTGCTGGGAT GTACATAAAC AAGCTAATCA ATGCACTCGA TCGATTCGAC TCCATGTCCC GCCGGGGTAC TCCCCCAACG CAAGCTATAC AAAACCTCTT TGACGCTACG AAAGAAAGCG TTGCCGTCTT TGGTAAAGTG GCGGCTGTGC TCAAGATGCA GGTTCCCGCT ATGCGAGGGA ATGACGTGAG ATATACTCGA ACGTTGATGG TGCATGTCTA TGGGGCTATG GCAGAGGTGG CGAGCTCGTG GAAGGCCATG ACGGGGCTGT TACCCGAGGT AAAGATGTTA TTGCTCATTG ATGCGCCTGG TGTGGTAGGC GGAATGGGCG GACAGAAGAT TGTGCCAAGC GGGTCATTCA CTGGACGTAC CCCCATTTCA CCCATCATTG AAAGACGGGA ATCCCATTCA CCTCAATCGT CAACTCTGGA ATCGTCTCCC CTCAACCTGG AGCGTCAACA ACAACAACAA CAAGCCGAAA TTGATGGTTC ACCTACGGCT CATGCGGAGA GGCGGGCAGG CGTGCGAGGT CATGGTGTTG GAACAGAAAG GAGACACGCG GGATCGTATA GCAGTTTAGA TGTGGAAAGA GGTATGATGA TGGGCTCGCC TTTGGCAAAA GGCATGTCTA TTGGTGTTAG GGAAGACGAG GGACATAGGA GAGGGGAATC GGGGACGATT CATTTACCGA CGCCCGACGA GGAAGATGAA GAGGATGAGG AAGAGTACAG CAGTATGAGG CACATGTTGG GCCCTGTGTC GCCGCAGTCT GTGTCTGGTC GGACTTCTCA ACAACGTCAT AGACCGACTT CATCTTCTGG CTCCAGTCAT GCCCTTTGTC TCCCTCCTAG CTTGCCTCAT CACCCTTCTC GGCAGCTGTC GGTCGATGTC CGCGCACCAG CATCGGCGGC TGCAACTCTC TTTGACGAAG ATTTGTTGGA TGTGATCGAG ACGGCCACAG AAGCGGCGTT TGCATGTTGG CTCAAGTTGT CTGAAGAAAT TGGAGCGTTG TCACCTCAAA TCAGCAGGTC GACACATAGG AACAACTCTT CAGTCAGCAG CGTGTCAAGT CAGAGCCAGG GTCATCTCGA TTCGGCCCGT ATGGCTCTAC CGTTCTTTAC GCCTCTTGGT GCCCACAGTC GTCGTCCATC CACCATCTCG CCTAAATACC ATGCTGAACT CGTACATAAC CTATCTGTCG CCGAACAAAT TACTGCCGCT TTGCGCGAAT CTCTCCTACG TCTTCGTGCG GACCCCCTTG CTTATAGCCA TACTACGCTT CCAGACGATG CCCAATCCTT CATAAAGATT GTCGTCAAGG TGTCTGAGCT TGTCAAAGCT ATGTCGGGGA CACATCCGTT CCCAATAAGT GTCCGACAGG CGGTGGGCAA GTTGACGCAG GCGACGAGAG AATGTGCAAT CTTAATCCAA GTTAGCTCAT TGCGGCCTGG ACAAGGGACT CCAGCGCCGA CAGCACCCGT GTCTTCGGGG AGGTCCGTGA TGCCAATCTC GCGCGTGGGC TCCGCTACAG GAATAGGCAC AGGTTATAGC TCTAGCCACC ACGGACCCGA ATCAAGCACC GACGATCTAT CTCACCTTCA GCCCCCTTCC TCAGCCGGTC CATCCCACGT ATCGACTCCG CTTTACACTC CTTCCTCATC CACCACGTCT TTCGAGCATT TCCATCACAT TTCTTCCTCG ACTGGTGTGA CCACAAATGC CAGTGGCTTG AGGGATCTAC ATTTGCCCAG TAGACTTGGT CATGCTCCCT TTGGTAGGAA TCGATCTGCG AATGGGTCTG CGCAAATGGC ATTGCCAAGT ATACAGATGC CGGTGCCGTA TGTGAATGGG GGCGCTGTGG GAAAAGCAAA TCAGCCCAGG AGTGCACAAG CTAGTCAGGT TGCTTTTTGA AACGTTTGTT TGAATTTGGA CTGAGCGCCT GGGCGCGCGA AAGTATGTGT GAAGGATAAA GAATCTTTTA TGATAATTAT TCATGTGGTT CACGAAATAT TATGAGAGCA ATTAATGGTA TTTGTACAGT AGTCTTTATT TCCATTTCAA ACGGTGTATG C
|
Protein sequence | MSRAPSPSIP ASQSVLPLCE STSTTPTPTP ARPGLRIDVD SLPSFPVSAS DKHDVAMAVR NLRSASPPPP PPKSASRLRA NTSPPDSGGV ALAPAPGIDG SGQAGVGSSA SASTLVFNEE ERTLRMIKQA LEKSDDDGDT LDLSRKDIDR IGEEAVEMFR SGVGKGQKGV WRLALSYNFL RDDSMVDSFS KLSRLRYLNL KGNKFTRFPH AITQMPALEI LDFSKNELSS LPEQPGHLVK LKVLSLTSNK IHTLPNYLVE FGVLKVFKVD QNPVDWPPAH VLGPLIESVP TKSKTSSTPG GKKKAKEEDL RPWIESMKVW MAQQTPGSEN APRRGEEEAY LASEEEPSSA TSIENPGATE WPEVSESSST VAPSSQATIR RTQPLYGDTP DLVFAQRNIS TTFSEDSLLR SSSPSHFSPA NHSRDASVSS FTSPPSASTD ASFHSHSRNP STNFPHLASQ PVQGHTRGAS YTPAQRNSGQ LTAKKSLPDL RQSHAKIIRE RRGDADEEVV TMVAAPDIDP REIGVRSPPR LASALRPKRA SRVMGRKTSV DLLPRKMGEM TSHDMADRNV SHDSILDESR NSYFRRLSTL PASSISKAIP PFLLKFIDSI RGILFALSQL HSSLRQYLVF AVNERVSSVF GRVMEPAGMY INKLINALDR FDSMSRRGTP PTQAIQNLFD ATKESVAVFG KVAAVLKMQV PAMRGNDVRY TRTLMVHVYG AMAEVASSWK AMTGLLPEVK MLLLIDAPGV VGGMGGQKIV PSGSFTGRTP ISPIIERRES HSPQSSTLES SPLNLERQQQ QQQAEIDGSP TAHAERRAGV RGHGVGTERR HAGSYSSLDV ERGMMMGSPL AKGMSIGVRE DEGHRRGESG TIHLPTPDEE DEEDEEEYSS MRHMLGPVSP QSVSGRTSQQ RHRPTSSSGS SHALCLPPSL PHHPSRQLSV DVRAPASAAA TLFDEDLLDV IETATEAAFA CWLKLSEEIG ALSPQISRST HRNNSSVSSV SSQSQGHLDS ARMALPFFTP LGAHSRRPST ISPKYHAELV HNLSVAEQIT AALRESLLRL RADPLAYSHT TLPDDAQSFI KIVVKVSELV KAMSGTHPFP ISVRQAVGKL TQATRECAIL IQVSSLRPGQ GTPAPTAPVS SGRSVMPISR VGSATGIGTG YSSSHHGPES STDDLSHLQP PSSAGPSHVS TPLYTPSSST TSFEHFHHIS SSTGVTTNAS GLRDLHLPSR LGHAPFGRNR SANGSAQMAL PSIQMPVPYV NGGAVGKANQ PRSAQASQVA F
|
| |