Gene Information |
Locus tag | CNK02850 |
Symbol | |
ID | 3254673 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 839347 |
End bp | 841742 |
Gene Length | 2396 bp |
Protein Length | 501 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253776 |
Product | hypothetical protein |
Protein accession | XP_567880 |
Protein GI | 58260940 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.374748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTTGGTCTT CACCAGATCA TATCGCAGTT GTGACCTTCA TCGATATTGT TGCTGTAGCA TAGAGCCTTT AGCATTGAGC CTTTGGTCGC CAACCCTTTG GCCAGACGCA GCGATTATCA CACTTCCCCA CCAGGTGAAC AGCCCTGATA GACTAATCGT CGCGTATTCA TTTGCTATCA TGCCTTTCGC CTTTCCCAAC GCCACCGATC AAACTACCTT ATAATCCGCT CACTGAAACG GCTGCCAACA GCTTTCCCAC CAAGCATCCC AACTTCAAGC AAATCCAAGA TGTTCGACCA TCAACAAATG ACAGAAGAAG AGATCGCCCT TGAGCGCAAG GTCGTCAGAA AAGTTGACGC GATTCTTTTG CCTATCATGC TCGTTAGCTA CGGCTTACAA TACTATGACA AAAGCGTATT AGGAACAGCC GCTGTGTATG GAATAATAAA GGATTTGGTA AGCTCCAGTA AAATCACTGT TGACAGCATG GCATTGACAT CTATGATGTT GGCAGGATCT CCAAACAACC GTCAACGGTG TGGTATCTAC CACTAGATAT AGTACAGCCA CAGCTGCATT TTATTATGGT TACATCGTTG GTGTGAGTTC ATATCCCTAT CAAGCTTAAT GGCATTGTTC AATCATTGAT TTCCATCCTA GGTCCTTCCT ATCGCCTTTC TCTTCACTCG TCTGCCTCTG GCAAAGGCTA CCGCCTTCTT CGTCATCATT TGGGGTCTGG TCTGCATTCT TACAGTCGTC TGCACAAACT ATCCAGGATT CACCGCGCAG CGAGTGCTAC TGGGTATCTG CGAATCGGCT GTCTCTCCTG CTTTTGTTGC TATTTGCGCG TTGTGGTGGA AGCCTCAAGA ACAGGCGAAG AGAATCGGGG TTTTCTACTC TGCGACAGGA GTGAGTGCTG TTTTTATTTT GCCTTACTTG ATCCGAGTTG ACCGTTCCTC TGCTCGTATC AGGTATTTTC AATGTTCTCC TCTCTAGTGA ACATTGGTTT GGGAAAAACT GGTGGTACTC ACCCTTGGAA ATCCATGTAT TATTTCGTCG GGTTCGTCGG TTATCCAGCT CCCATCCCTC TGGGAAAAGC TTGCTAACCT TATCCACAGA GCTCTTACAA TTTTCTGGGG CTTTGTCATT CTCCTGATTC TACCCGATCA CCCTCTTCGT CCTGGGCGAT GGTTCACCGC GGAAGAGCGT ACCGTCCTTG CTCGCCGCTT TGCTCAGAAT CAAGCCGGAG CTAGTCAACA GCCCATCAAG CCCTACCAGA TCCTTGAAGC CGTCTCCGAT ATCAAGACGT GGCTGTACCT CCTTATGGCC GCTTCTATCT ATATCTGCAA TGGTTCCGTC ACCGCCTTTG GCGCAAAGAT CATCACTGGA TTAGGCTACA CCAGTTTGCA GGCGACAGCG TTGTTGGTAC CTGGTGGAGC GATGACTGTG ATCACCATTG CCATTTTCAG TTATCTTGCC GACAAATATA CCAATATCAG AACTTTGGTG AGTTAATCAA CACCCATATG CTTGCATACT CTCTGACATT GTTTTATAGC TGTTGCCCAT TAGCTGTATC CCTGTCGTCG TTGGTGCAAT CGTCATGTGA GCTACTTTTA TTTATTCTTA TGGTCTCTGC GGAGCTGTCA ATGCTGATAT TGCGTCAAGC TGGACCGCCC CTTGGAAGCC AACAGCAGGC CCCCTTATTG GCTACTACTT GGTTGCCAGC TTCGGTGCCC CTTACGTCGT ATGTCCTTTT CTTTAACATA TCTTCCTTTT GCTTACCCAA ATCCAACCGT 
TTTAGCTTCT CCTCACTCTC GCTTCCTCTA ACACAGCTGG CGCGACCAAA AAAGCTGTCA CTACAGGCTT CATCTTCATT GGCTACAACG CCGGTAACAT TGCCTCTGCT TACCTTGTTT TCGCCAAGGA GGCTACCATC AAGTACCGCT CGACATGGAT CTCTGTGATT GTCGGGATGG CGTTTGCTTC GGCGGCGAGT TTGGTGTTGA GGTGGTTGTA TGTGAGAGAG AATAAGAAGA GGGATAAGGA AGAGTTGGCA GGGAAGCAGA GTGAGTTATC GACGGAACCG GATGAAGAGA AAGGTGCTCC AAAGGGATCA GTCGACTCGC TTGATGGTCT TGCCACACTG TTAGTATCAG ACAGGTCAGA CAAGAAGACA CCTGGTTTCC GTTATACTCT TTGATTGTGG TTGGACTTGC TTGATTCAGA ATCTATGATT TTATTTGTTT TTCCTTCAGT TTGGGTTCCT TTGTCAGACT ATGTATAATT TACGCGAACC CAGCGATGGT TTGTAGGGCC AGCAAACTGC AATTAACATG GTGGTGATAG CAAAGTAGGT ATGAACCCAA GGAGTTAAGC TACATGCATT GAATAA
|
Protein sequence | MFDHQQMTEE EIALERKVVR KVDAILLPIM LVSYGLQYYD KSVLGTAAVY GIIKDLDLQT TVNGVVSTTR YSTATAAFYY GYIVGVLPIA FLFTRLPLAK ATAFFVIIWG LVCILTVVCT NYPGFTAQRV LLGICESAVS PAFVAICALW WKPQEQAKRI GVFYSATGVF SMFSSLVNIG LGKTGGTHPW KSMYYFVGAL TIFWGFVILL ILPDHPLRPG RWFTAEERTV LARRFAQNQA GASQQPIKPY QILEAVSDIK TWLYLLMAAS IYICNGSVTA FGAKIITGLG YTSLQATALL VPGGAMTVIT IAIFSYLADK YTNIRTLLLP ISCIPVVVGA IVIWTAPWKP TAGPLIGYYL VASFGAPYVL LLTLASSNTA GATKKAVTTG FIFIGYNAGN IASAYLVFAK EATIKYRSTW ISVIVGMAFA SAASLVLRWL YVRENKKRDK EELAGKQSEL STEPDEEKGA PKGSVDSLDG LATLLVSDRS DKKTPGFRYT L
|
| |
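The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the stated Gene Length), and that the coding sequence ends with a single stop codon not counted in Protein Length. The function names `span_length` and `coding_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
gene_len = span_length(839347, 841742)   # Start bp, End bp
cds_len = coding_length(501)             # Protein Length in aa
noncoding = gene_len - cds_len           # remainder of the gene span

print(gene_len, cds_len, noncoding)      # prints: 2396 1506 890
```

Under these assumptions the span matches the reported Gene Length of 2396 bp, and about 890 bp of the gene is not protein-coding. That is consistent with the record marking the gene as spliced, and with the listed gene sequence beginning upstream of the first ATG. How those bases divide between introns and untranslated regions cannot be determined from the fields shown here.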