Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI01350 |
Symbol | |
ID | 3259690 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 401769 |
End bp | 404689 |
Gene Length | 2921 bp |
Protein Length | 582 aa |
Translation table | |
GC content | 47% |
IMG OID | 638258618 |
Product | transporter, putative |
Protein accession | XP_572860 |
Protein GI | 58271408 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.671168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCTGACCTT TCTACCTATC CTGTTGATAC GGGACTCCTC TTTTTAGTCC TTCCCCAGTC CGTTTGTCTT CTCAGCTCCT ATATTTTTTC GCTGGTCAAC AATTTGACAT TCACCCTTCC CCGGGACAAA GTACTGGAAA CAATGACTCT CGGCGAGAGA GAACGACTCC TGCAACCGGC TCCCGCCCCT CCTGGGACCA CGCTCTATAG CGAACCAAAC CATTCTGAAG ATATCGAGAC TACCGATGAA CATAAATTGA GCTACAACAG GGTAGGGTTG AACGCCCGTC GATTTTGGAT CTTGGTTGGT CATACTGCTT GTGTTCTTGT ATGAGAATAA TGGCTAGCGA AAAGCTTACA ATGATCATCA CGTTAGTGTG CTTCGATGTG GATAGCCTCT TTCCTGAACG CCTTCGACGG TACCGTAGGT GAGTGTGAAC ATTTTCCGCC ACATCCGTTG TTTTGCAAAG GTAGTTAGTT GGCTGAGTGA TATGATGCAG TCGCTACTTT GCTGGGACCC ATATCTTCAT CGTTCAAAGC GACCAACTTG GCATCATGGC TCGGTACATC ATAGTGAGTC GTATCTAAGC AAGAAGGGAC GGCGTATGCT TATATTCCAC GCCGCCTTCG TAGTATGCTC TCAGTCTGCT GCTTCACTCC CATTTACGGA CGATTGTGTA ACATTATTGG TCGTCAGGGT TCAATGCTGC TTGCGCTTGC AATCTTCAGT AGGCATTCAA TGACCCGCTA ATAGCTTTTT GAGGTATCAA GCTGATTGCA ACTGTCACTG CGTAGCAACT GGTAATCTTC TGTGCGCGTT TGCTCCTTCT ATGGAGGCTT TGATTGCTGC TCGTGCACTA GCCGGTATGG GTGGAGGTGG TCTCAGTATA AGTGGGTTTT CCGGGACGAA AATGGCTCCC CGAAGTGGAA TGAAGCTGAC CATTGGAATA GTTGGAAGTA CCATCATGAG CGACATCGTC CCTATGTGAG CATCGCGTTT TCGGTAACTC TATCCTGTGA GGGCCAATTC GTTAACTCCA TGTATGCAGC ACCCATCGAG GTATCTTCCA AGGTCTTGCC AATCTTGCCT TCGGTAGCGG AATGGGGTAA GCTGCTCAGT TGTCATGAAA TGCGATTCTC TGTGTGACAC TGTGATTTGT AGTCTCGGCG CTCCCATCGG CGCTCTCATC AACGATTGTC TCAATTGGCG ATGGGCTTTT TGGGTTCAGG TACGTTTTTT CAACATATTT CTCACGCAAA AATTCTAAGT TTACATTATC CGTCCTTAGA TTCCTGTTCT CCTCTTTGCC AGCTATCTTG TCCATTCCAA TGTTCGATAT GATGTCCCAT CACGCCCCAG CTCAGGTGCC GCTACACCTA ACCCTGCGGC TGTTAAGCAA ACCGCTATGC AGCTTTTCAA GCGGATCGAC TTTCTGGGAT GTTTCCTACT TGCCGGATGG GTAGGCGCCG CTCTGATCGC CATCTCGCTC AATATTAACT CTACTGCAAC AAATGCGTAC AACTGGTCTG ATCCGATCAT GATCGGCCTA TTCGCCACCA GTGCTGTCTT ATTCGTCCTC TTCCTATTTG TAGAACTCAA ATGGGCAGCC GAGCCCGTCA TGCCTTTTGA GCTACTGGTC AGTCGAACTC CGGTTGCGGT TGCTATCAAT AACTTTGTGT TGTCTGTGGC CAACTTTGCT ATTGTAAGTA CTTCCGCCTT CGTCCCCTGA CTATGTCAGT CCTCCATTCC TGCGCGGTAA TGATACCATT GACTAATAGT CTCTCTCGAA CAGCTATATA GTGTCCCTCT CTACTTTACA ACTGTACGAC AAATGTCCGC TTCCAACGCC GGCGCTCATC TTATTCCAAA CTCGTTCGTC GGCGTGATTG GCTCTCTCGG CGCTGGACTC ATTGTTCGAC GAACTCATAA ATATTACTGG CTCAACACTT TTTGTGCATG CTTTGGAGTG ATTGGTTGCT TCTTGATCTC CACTTGGAGA CTTGGTACAT CTGAGTGAGT TCTTTGGATC ATGGTCAACG TTTAACACGC GAAGAAGCTG ATCACAAGTG CCACATCTCA GGTGGATGCT CTGGACGAAC ATGTCATTCA CCAGTTTTGC CATGGGGGCT GTTACCACCT TGACCATCGT CGCTCTTATT GCAGATGTCG GGCCTGAGCA TGTCGCCATT GCTACCAGTT GTGAGCATTT ATGGCTACGA AAAGTATTTC CTGCCTTTGA TCTGACCTAC TTTCTCTCAG TGTCCTATGT GTTCCGTACC ATCGGCCAAG TCTTGGGTGT AGCCTTGTCT GGAGCTTTGA CTCAGGCAGT TCTGACCTGG GAACTGGAAA AGAGGATACG AGGTCCTAAT GCAGAAGAGG TGAGTTGCAC ACCAAAGAGA TCGAGCTTTT TTCTGACATC TAGGACAACG ATTGTTCCAC AGATCATTGC GTCGATCCGA GAATCGTCTG CTTCTATTCG CTATCTCCCA GAGCCTCTCA AGTCCATCGC GATTGCATCT TATCAGAAAG GTCTACACGC TGTCTTCATT TGTACCGTAG TCCTGAGTGT GATCACTCTC TTATCAGGCT TAGGAATCAG AGAACTTGAT ATGAAGCAGA TCATGTCCGG AGGAAAACAG GCAAAGCAGG TACAGAACGA GAGCGAAGAG GAGGAGGCTT AAGGTCGTCG AGAAATATAT CGAGAACATA TACCTGCTAA GGGAAGAGTA CTTTGTAAAG CGGAGGCGGC CTCGTCATTA TACGTGTGGT GTGGGAGTGT GACACAATCA ATTGTATCTT AAAATAGAAC ATCTAGCATA GAACATCTAG CTAATTGTTA TAGATTCATA TCTAGCCATC ATCTTGGACT TGTCATATGT GATGCTTACT CATTGGATGC TTTGTCTGTG ATGTTGCATA A
|
Protein sequence | MTLGERERLL QPAPAPPGTT LYSEPNHSED IETTDEHKLS YNRVGLNARR FWILCASMWI ASFLNAFDGT VVATLLGPIS SSFKATNLAS WLGTSYMLSV CCFTPIYGRL CNIIGRQGSM LLALAIFTTG NLLCAFAPSM EALIAARALA GMGGGGLSII GSTIMSDIVP ITHRGIFQGL ANLAFGSGMG LGAPIGALIN DCLNWRWAFW VQIPVLLFAS YLVHSNVRYD VPSRPSSGAA TPNPAAVKQT AMQLFKRIDF LGCFLLAGWV GAALIAISLN INSTATNAYN WSDPIMIGLF ATSAVLFVLF LFVELKWAAE PVMPFELLVS RTPVAVAINN FVLSVANFAI LYSVPLYFTT VRQMSASNAG AHLIPNSFVG VIGSLGAGLI VRRTHKYYWL NTFCACFGVI GCFLISTWRL GTSEWMLWTN MSFTSFAMGA VTTLTIVALI ADVGPEHVAI ATSLSYVFRT IGQVLGVALS GALTQAVLTW ELEKRIRGPN AEEIIASIRE SSASIRYLPE PLKSIAIASY QKGLHAVFIC TVVLSVITLL SGLGIRELDM KQIMSGGKQA KQVQNESEEE EA
|
| |