Gene Information |
Locus tag | CNB05710 |
Symbol | |
ID | 3255917 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 1601208 |
End bp | 1604158 |
Gene Length | 2951 bp |
Protein Length | 553 aa |
Translation table | |
GC content | 47% |
IMG OID | 638255213 |
Product | sugar transporter, putative |
Protein accession | XP_569312 |
Protein GI | 58264312 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00879] MFS transporter, sugar porter (SP) family |
|
|
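The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported gene length. It also assumes the 553 aa protein is full length and followed by a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


# Values copied from the record above.
length = gene_length(1601208, 1604158)
coding = coding_nucleotides(553)
noncoding = length - coding  # introns plus any untranslated flanking sequence

print(length, coding, noncoding)
```

Under these assumptions the gene region is longer than the coding sequence alone, which fits the record's "Is gene spliced | Yes" entry: the remainder would be introns and untranslated sequence.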
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.19394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAACCCACTG GGATCCCCAG ATAGGCATCG GGTTGTTGTA GGGTTTTGCT TCAAATTTGC GGGGTTTTAC GCATTTACAC AGGAACATGA CAGGAACGAG CAGCTGGTCT TCCACCTTTT GGCCTGCTGA GGGTTCGAGC TGTACAAGAG GAACGACTCT GGCACAGGGA GGTGAGTTGC GAAATACCCT TGATTGGCGA GAGGCGAAAG AACAAACCAA GCGGCTTAAT TTTCCACCTG CCCTTCGGGA TGATCCGTCG GGTAGGAGGG CGATATTTAC TTTTGCTTTC AAACAATGAC ATGATGCTGC TCCAAACCCA AACTGGCCAT ATATACCGCG GCCTCCAAGA GTCAATACTT CCTTTGCTGG TCCTCTTCAC ATTTCCAAAC TTCTCCCTCG CATTCAAAGG CCTTGACTTT TCAATTCCAC ATTTTAATCA TGACCTCCGA CCAACAGCAC CCTCACTCAG GGTTCCACGA TATTCAAGCT ATCCCTCTCG AGATGAACGA GAAAGACAAT GAGCTCATTC ACACTGAACG TCTTCAAACC GCCGAAGAGA TTCGAAAGAC CATGTCCAAG GGCGAGGCGG TGCAGATGAG GTCCAAGTTT GCCGACTTGG GTACCATCCA AGCCCTCAAG GAATACAAGC TCTTAGGTTT CGTCGCAATG GCAGCGGCCT TCAGTGCTTC ACTTGACGGC TATCGTAAGT CTTAGACCTA AGGTTGAGCC GCAGTACCAC TAAGGTACAT CAGAAATCAA CTTGAACGGT GGTATCGTCT CCAACAAAGG TTTCATTCAA CAGATGGCAG CCCCAGGCAC TGAAGCTATC GATGGAAAGT ACGTCTCGGC ATGGGGTGGT ATCCAATCTG CCGGCCAAAC CTTAGGGCAA ATCGTACGTC TCCTATTGCA TCTATCAGTT TTCTGGATCT CGAATGAAAG CTGATTCAAT ATCAGTTTCT TCAATACGCT ACAGACGCTC TCGGTCGAAA ATACGCCCTC TATATCCTAT GGGTCTTCCT CGTAGCTTCC ATCTTCGCCG AAACCTTTGC TAGCCACTGG TCCCACTGGC TTGTCGCCAA ACTCTTCTCT GGTATGGGTG TCGGTATGTT GCAAGCAACG ATGCCGGTCT ATTTGAGTGA AGTAGCGCCT TCTCAGCTTC GAGGTTTCTT CATCAACGCC TACTCATTGT GAATTTTGTA TTTTTTTATT CCATTTACCC CATTTACCCT TGGTAAACGA AGGAAGAAGC TGACAAACTA GTTAGTTGGT TTTGCCTCGG TCAACTTGCA GCTTCTATCG CGCTCAATGA GCTCAATGAT ATGAAGCCTT ACGATTTCCG AACTGCCATT TACACTCAGG TATGTCCCTC TTCCATCAAT TCTGTCAAAG TACCTTCTTT CTCAAAGCTA ACCAAGTAAT TAACGTGTAG TGGCCAATGG TCGGTGTCAT GGGTATCGTC TTCCTCTTAC TTCCTGAATC TCCTTGGTGG CTTGTCAGTA AAGGCAAGCT TGAAAAGGGT TCCAAAATGT TGTCAAGATA TCAAGGTCAT CTCCAAGGCT ACTCCGTTGA AGAAGAAGTT GTCAGTATGA TTCCTAATCG AATCGTGTGA ATCGTATCTG ATACGAGTAT GAACATAGGC CATCATGACA GCTACACTGG AAGAAGCCAA GCTCATCGCC AAACGTCAAG GACAAGAAGG TCAATTGGCA GTATTCAAGG GAAGCAACTT ACTCCGTCTG TTCATTGCTT CTTGGCCGAA AATGATTCAG CAATTCGTTG GGTACGTTCA TACCAATTAC 
CTCTTTATGC CGAATCACTT AGATCGTTTA TCAATAGTCT CTCTGTCTTC AACACTTACG CTACCTACTT CTGTAAGCTT AATCATCGTC TCGCTCTCTT TCCCCTCTCC CTCCATGGCA ACCACCCTTC AATACGAAAC CGCTAACTCT CTCAAACAGT CCAACTTGCG GGTAACAGCA ACCCCTTCCT CGTTACCGTC ATCCTCTCCT GTGTCCAGCT CATTTCTATG CTCATCACCG TCTCCCTCAG TGACAATATC GGCCGTCGAC CGCTTACTGT CTACCCATAT GGAATCACTG TCCTCTCTGT TCTTTCGCTT GGTATTATCG GCTGTTTCGA CTACACCAGC AAATCTCTGG GTTCTCTCCT CGTATGTCCA GTCCTTCTCT CCATCTTTAT GTTGAAATGA AGAACCCTTC CCTAATCAAG CTTTATCATA GATCTTCTTT GCATGTCTCG CCACCTTCTC AACTACCGGC GCTTCCGCTA TCGGCTACGC CTATGCTGCT GAAATCCCTA CTCAACGACT TCGAGCCCAA ACTGCAGGCT GGGGTCTTGC GTTGTCCAAC ATGGTTGCCA TCATGTTCTC TTTCTGCACT CCGCTCATGC TTAATGGTAA GGCTAAATGG AATGTCAAGA CGGGATTCTT GTAAGTATTT CTCCATCTGC CCTCCTTTTG ATGAAGACCA TATCTCGAAG GGACCGTAAT CGGAAAGTCA AATTTGGTAC TGACAAAAAG GTATATAGCT TTGCCGGAAC AGGCTCGGTC GCTACTGTCG TTGGGTGGTT TATCCTTCCT GAAGTTGCCC GTAGGACACC TGCCGAGATC GACGAGCTGT AAGTCACCCC TTCTTCACCC TCTTGCATTT AAGCAATAGA AGCAATAGCT TGACTGACCT CCTGGGTTTC ATTTTGTTTC GTTCAGGTTT GAGAAGAAGG TCAATCCACG CAAGTTCAAA GGTTACGTTA CAGACGTCGA GATCTCTTAC CGAGAGGCAA ACGAGACTTC TGCTTAAGGT TTGCTCAGTT GTTAAGATTT GAGGGAATGT AAAGATTACT AGGTCTATGT CATTGTGTAT ATGCTTTAGG GGGTTGGGTA TAATCGAAAG TAGGGACCTT ATAGGGCTAC TATCATCATC TTTTGTTAAT C
|
Protein sequence | MTSDQQHPHS GFHDIQAIPL EMNEKDNELI HTERLQTAEE IRKTMSKGEA VQMRSKFADL GTIQALKEYK LLGFVAMAAA FSASLDGYQI NLNGGIVSNK GFIQQMAAPG TEAIDGKYVS AWGGIQSAGQ TLGQIFLQYA TDALGRKYAL YILWVFLVAS IFAETFASHW SHWLVAKLFS GMGVGMLQAT MPVYLSEVAP SQLRGFFINA YSFWFCLGQL AASIALNELN DMKPYDFRTA IYTQWPMVGV MGIVFLLLPE SPWWLVSKGK LEKGSKMLSR YQGHLQGYSV EEEVAIMTAT LEEAKLIAKR QGQEGQLAVF KGSNLLRLFI ASWPKMIQQF VGLSVFNTYA TYFFQLAGNS NPFLVTVILS CVQLISMLIT VSLSDNIGRR PLTVYPYGIT VLSVLSLGII GCFDYTSKSL GSLLIFFACL ATFSTTGASA IGYAYAAEIP TQRLRAQTAG WGLALSNMVA IMFSFCTPLM LNGKAKWNVK TGFFFAGTGS VATVVGWFIL PEVARRTPAE IDELFEKKVN PRKFKGYVTD VEISYREANE TSA
|
| |
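The sequences above are displayed in space-separated blocks of ten residues. A minimal sketch of how such a field might be cleaned and its GC fraction computed is shown below. The helper names are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full record, so its GC fraction is not expected to match the reported whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence field, used as a small example.
example = clean_sequence("GAACCCACTG GGATCCCCAG")
print(len(example), round(gc_fraction(example), 2))
```

Applying the same two helpers to the complete Gene sequence field would give a value that can be compared against the GC content entry in the record.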