Gene Information |
Locus tag | CNC03220 |
Symbol | |
ID | 3256199 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 1015066 |
End bp | 1018523 |
Gene Length | 3458 bp |
Protein Length | 561 aa |
Translation table | |
GC content | 46% |
IMG OID | 638255545 |
Product | receptor, putative |
Protein accession | XP_569994 |
Protein GI | 58265676 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00879] MFS transporter, sugar porter (SP) family |
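The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the coding sequence ends in one stop codon. The function names `gene_span_bp` and `coding_nt` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_span_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nt(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the Gene Information table.
span = gene_span_bp(1015066, 1018523)  # genomic span of the gene
cds = coding_nt(561)                   # coding nucleotides for 561 aa plus stop
print(span, cds, span - cds)
```

Under these assumptions the span is 3458 bp, matching the Gene Length field, while the coding portion is 1686 nt. The remaining 1772 bp would be non-coding sequence such as introns and untranslated regions, which is consistent with the gene being flagged as spliced.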
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.630871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GGTGGGCCCT CGATCCCTCA TTCTCTCGTT CCCTCGCGTC GTCTGATAGC ATAGTCGGTC TACTCCTTGT ACAATTGTCG ATGGTCCGCC GTCTACATGT CCTTTCGCAC TTCAAATAAC CAGTAATAGA CCAACTGCAC CAGAAGACGC CTTCTCCTCC TCCCCATTGT GTATGCGTGC CACACTCGCA TAGTTATAAC AGTCCCAAGT CAAGCGCTGC TCAGATTAGT ACAAAGGGAC AGCACACTGC GTGTTCGACA CGTCTGAGAA GAGCACAACC ATGCCCGATC CCTCTATTCC GGTAGTCGGT CACAAGACTC AGCGTAAGCT GGTCGGACAT AACCTACTGT ACAGTGTTTC AGTGTTTCTT AGCATAGGAG TCTGGTTATT TGGGTAGGTG TTTTGTGAAT CCGGATGGTG TTGTGCTGAT ATGATAACCG GTTAAGATAT GACCAGGGGT AAGTTCACTT CAAGCGCTTA TCCTTAAGCT TCGCTGATCA TGGCCTTCAA ACTGTGAAAG AGTAATGTCC GGTAAGTTGG TATCACCATC ACATATCTCG ATAGCTAGTT GACTTTCTAT GTAAGGAATT ATTACCGGCC CATACTTTAA GTAGGTATGA AGTAAATACT GAGTATCTTT CATTAACATT TATGGCACGT GTCTACAGAG CTTATTGTGA GTAAAGCCCT AATCAAGCAA ACTAAGCTGA CGTTTTTAGT CAACCAACCA ACGTCAACGC AGATTGGCAA GTAAGTTAGT CATTCAGTGC CAGTAGCAAT GTCTGAACAT TCGATAGTAT GGTGGCCGTT TTGGAGATTG GTGCCTTCAG TAAGATTATT CCATGTCATG TCATTCTTTC AAGAACTCAC GGCAATCACG CAGTTACTTC TCTGGCTGCC GCTCATATTG CAGATAATTA TGGAAGGCGT ATGACCCTTC GCACAGGTGC AATAGTCTTC ACCATTGGAG GTGCTATACA GACTTTTTGC GTTGGATATA ATTCCATGGT ACTTGGAAGA ATTGTCAGCG GCTTTGGGGT AGGGATGCTG AGTATGGTCG TGCCAATCTA TCAGGTATGT GGTTTACAAT AGTAAAGGCG CGGAGCATAA GCCAATGTGT GTCACCCTCG CGCAGTCCGA AATATCTCCT GCAGACCATG TAAGAAACTC TTCAATATTT CCAGAAGATT TTTCTAATAA AACACGATGA TAGCGAGGCC TTTTGGGCTC TGTCGAATTC ACAGGTAATA TCATTGGCTA TGCCTCCTCT GTTGTACGAT GTGTCACCTG CCTATGTCAA GCACCATTCT CACTCAACGT TCATAGTGGA TCGACTATGC CTGTTCATTC TTCCAGTCTG ACTGGTCTTG GCGCCTCCCG CTTTCTGTTC AATGTATAGG CGGCTCTGTT CTCTTTATCG GCAGCTTCGT CACACCAGAG TCTCCCCGGT AAGCCTTCTA TATGTGCATC ATATGTAGGT GCAGAACTAA GAGCTGTTCA AAGGTATCTT GTCGATACAG ACCAAGAGGT GGAAGGTTTA GCAGTCATCG CTGATTTTCA AGGGAAAGCG CTGGACGATA TTTCAGTGCA AGCCGAGTAC AAAGAAATTC GAGATGCTGT TCTAGCCGAC GTGAGACAAT CCTCTTAACG TTATCCCCAT ACACTTGCTT ACTCTGTTTT GTTTTTTTTT ACTAGAGAGC TGTCGGAGAT AGAAGCTATA GGGCTTTATG GAGGAGATAC AAAGGACGAG TTCTGATTGC AATGAGCAGT CAATTGTTTG CTCAACTGGT GAGTCAATCT 
TTGCAAAAGT CAAAGAAACA TGAAAATAAA GCGGTCTGTA CTAATGATTA TTGGCAGAAT GGCATCAATG GTGAGCTTAG AAGAGCGCTG CAAACAATTA CTAACCATTT GGCCAGTCAT CTCATATTAT GCACGTGCGT CCCATCTCAT TCTGCCGATC ATTCTTGACA GCTGATGTGG ATTACAGCTC TTGTCTTTGA ACGTTAGTCT CGCTGAAGAC CTAATACTGC CGTGTTACTA ATGGTGATGA AGAGGCGGGG TGGATTGGGC GTGACGCTAT CCTTATGACA GGTATCAATG CCTTATTTTA TGTGGCAAGC TCACTTCCGC CGTAAGTTCA GGTCCATCCG AAATTGTGCA CATTCTCAAA TTATTACTAG ATGGTATCTC ATGGATCGAG CGGGTCGAAG GCCCATTTTG CTCTCGGGAG CAGTGGCCAT GGCGATTGCA CTGACGGCTA CAGGATGGTG GATATATATT GATCAAGCAA TAACACCCAA TGCTGGCTCG TCTTTTGTTC TGCCATGTCG GATGAAGCTG ATGGTATTTG TCGATAGTGG TCATTTGCGT AGTGATTTAT AATTCCGCAT TTGGCATGAG CTGGGGACCT GTCCCATGGT ATGTGTCATT GACATATGAC CGGTCGGAAA TGAAGTTAAT TAATTAATTG ATCCCAGGCT TTATCCTCCG GAAATCATGC CGTTGTCATT CCGAGCAAAG GGAGTATCCT TATCTACTGC TACAGTACGT CCAATTTTAT CCCGAATGCA CAACGATGGG CTAATTTGAG TTATCAGAAC TGGATCTCAG TGGGTCTGGA GCGCTTTAAT CCTGCTTTTT GCTAACGACT GTTGTATCTG CAGAATTGGT GGGTAGGGGT TTCAACACCG CTCTTTCAAG AACTTATCGG ATGGCGATTA TATCCGATGC ACGCATTCTT TTGTGCATTA TCATTCATCC TCGTGTACTT CCGTGAGTTG TCAGCCGAGA TTCGTAAACC ACCACTCATG CACATGGAAC TAGTCTATCC CGAAACCCGA GGCGTACCGC TTGAAGAAAT GGACAAATTG TTTGGGGATG AAAGTGATGA AGACGAGGTT GATTCGGACT TCGATGAAGT TGAGGAAGCC GAATCAGAAA TATCCTCTCT AGTCAGCAAT CCTCGACACC GACGCCGCTC GGCCAGCTCT TCATTGGGCC CATCTTTGCC GACCTCCCGA AAACCGTCAC CCATACCCTC TAGGGAGGCT TCATCTAGCC GAGGACTGTT TGGACGTATA ACTGACTCGG TGAATGGTCT GATTGGAAGC ACAAAACAGC AAAGCAGGAG CGTGGGGTAT ACTGCTGTCA ACGAGGAATA GGAACTCGCG AGTGAGCATC ATCCGGACTC ATTTCAAGTG ACACTTGACA CTAAATGACC AGGGTTCGAT AAACGGAATC CAGGGCGTCG TCATGACTTT ACGGGACAGT TCGAAGAGCT CTCTGAAAGT CACCACGAAG ATGACTGGGA AGTAGACGTA GGAGATATAG AAATGGGGAT AGGAGAAGGG TTGCTTGCCA GGCGTGTGGC GATTTCCAAT GTCGAGCCTC GAGAGTGACA ATATTTTGTA CGAGAAAGTG AAGGGTGTAG CATAGTCGAA TCGTATTTGA TGGTCTTC
|
Protein sequence | MPDPSIPVVG HKTQRKLVGH NLLYSVSVFL SIGVWLFGYD QGVMSGIITG PYFKAYFNQP TSTQIGNMVA VLEIGAFITS LAAAHIADNY GRRMTLRTGA IVFTIGGAIQ TFCVGYNSMV LGRIVSGFGV GMLSMVVPIY QSEISPADHR GLLGSVEFTG NIIGYASSVW IDYACSFFQS DWSWRLPLSV QCIGGSVLFI GSFVTPESPR YLVDTDQEVE GLAVIADFQG KALDDISVQA EYKEIRDAVL ADRAVGDRSY RALWRRYKGR VLIAMSSQLF AQLNGINVIS YYAPLVFEQA GWIGRDAILM TGINALFYVA SSLPPWYLMD RAGRRPILLS GAVAMAIALT ATGWWIYIDQ AITPNAVVIC VVIYNSAFGM SWGPVPWLYP PEIMPLSFRA KGVSLSTATN WISNWWVGVS TPLFQELIGW RLYPMHAFFC ALSFILVYFL YPETRGVPLE EMDKLFGDES DEDEVDSDFD EVEEAESEIS SLVSNPRHRR RSASSSLGPS LPTSRKPSPI PSREASSSRG LFGRITDSVN GLIGSTKQQS RSVGYTAVNE E
|
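The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any calculation. The following is a minimal sketch of how one might strip the spacing and compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short example string is just the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence, used only as a small example.
example = "GGTGGGCCCT CGATCCCTCA"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

Applying the same two functions to the full gene sequence would give its length and a GC fraction to compare against the Gene Length and GC content fields. Whether the database computes GC content over exactly this span is an assumption.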