Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNL04910 |
Symbol | |
ID | 3254875 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | - |
Start bp | 381027 |
End bp | 383532 |
Gene Length | 2506 bp |
Protein Length | 526 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253963 |
Product | conserved hypothetical protein |
Protein accession | XP_568033 |
Protein GI | 58261246 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.413844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGATC GTCCACCTGA TCAAACTGGT CCATCGAAAA ACTCTGAGAA ACAGGCGGTG AAGGGTCTCA ATGCGATTGA CATTGAGCAT CTACCAGTCG ATAATGATCC AAGGAAGTGG AGTGATAGGA AAAAGTGGAC AGTATTGATG ATCATGACTT TTGCGATGGT GAGTACCTGT AAGCAAGTCC CATGGCGAAG AACATAGGGT ACTGACTGGG TGGTCGTCCT CTAGTTGGGC CCAATGATGG CTCCCAGTAT TTACAACCCG GTAATCAGCG ATGTTCGCGA AGACCTGCAC GCCACAGAAA GTGAACTCGG TTTATCGATC TCTCTTTACA TCCTGTGAGC TTCGATCAAT CTTTGCGGCC GATGCCCAGT ATCAAACCGA GAACGCTAAT GCACCTGTCA GATTTCAAGG GTGCACTCCC GTCTTGTGGG CAGCTATTGC GGAAGTAAGT ATCATCTATG GCGATCATTA TTAAGAATGC TTACTCACAT CTGGCCATAG CTCAATGGAC GCAAGGTATG TCTTGTATAG AAACCGGTGG AGCACTCGCT AACGTATCGT TGCAGACTGT CTATCTGGTG TCATACACAG TAAGCCTTAC ATCAAAACGT ATCACCAGTT GCCAGACTAA ATGATGATTG TAGCTCTATG TTATCGCTCT CGTGGTTGCT TCTCGGGCCA ATTCAATGCC CCTTCTGATT GTAATGCGTA TCTTACAGTC CACAGGCAGC GGACCAACAG TTTCTTTGGG AGCCGGCAGT CTGGCCGATA TGTACGAGAC GCATGAGCGA GGGGCAAAGG TGAGTTTCAT CCTTTAAATT AGCCATTTGC TCATTTACTG TATGTCTGCA GTTGGGCCTG TTCTACGGAG TGCCGATGAT AGCTCCTGCC GTAGCACCAC TCATCGGTGG AGCTCTAGGC CAAGTACGGT TTCATCTTTT GGCTATGTTG ATGTTTGCTA AAGTATAACA GTCGTTTGGC TGGCGAAGTG TTTTCTACTT CCTTGCTGTG TATGCTTTCA TCATGTTGTG CTGCTTCATG GTGTTTCCGG ACTCTTGGCG ACGTCAGGTG ATTTCTTTCT GCTCATTTGG AGTAAAAATA ATGTCAACTG ATTTCCACAA TGTTAGCGTT CGAGAGTTTA CCAAAGAGCG CTTACCAAAG CAATCGAAAG AGCAGAAAAT CGGGATGCGA AAGATTTAAA GCGGAAGGCC AAGCTCGCAA AACAGTCGCA AGTGTTAGAC ACCATACCAG CCACGCCAGA TGCCACTCCC GGCAACTCTC CTGGTAATAG CAGAAGGCCT AGTGCCGAGG CTGGAGGGGC CGCGGTGACG GATACTACAC TGGTTGATGT GACCTTGAGT AATGACCAGA AGCAAACGGT AGGCAGATCG GGCAGGATAA AGGTCAAAAT GATGAAATGG TTGCCGTTTG GGGCAAAGGA AGAAAAGATA CAGGCCGAGA ACGAAGTGGA ATTCAAACCC ACCTTCAGGG ATCTCAACCC TTTGCCATCG ATGGCTCTTA TTCTCAAACA GCCCACCAAT TTACTCATTC TTACCTCATC GGGTGAGCGC CCTTCTCTTT TCATAAGTAC TCAGTCTATT AACATGCCGG CAAATGTCAG CCTTGAGTTT CTCTGCTCAA TATACCATCG TCTACACAGC ATCAATTACC CTTGGTAAAG CTCCTTACAG TTATGGATCG CTCAAGATAG GTCTTGTGGT CTTGGCGTTT GGAATAGGCA ATATTCTAGC TAGTATGATT GGAGGGAAAT ATTCTGACAT GGTGTTGAAA AAACTGAAAA AGAAGAACGG AGGTGTGGGC AATCCCGAGG TACGCGAATA TGTTTATATT GCGGAAATGA CAAGCTGAAG CACTTGGACA GATGAGGCTG AGATCCACAG TGTTGGCGAT GCCAATCCTC GTCGCTAGTT TCCTTGCCTA TGCTTGGACT GCGGAGGAGA AAGTGCACAT CGCAGCCTTG GTGGTTTGCC TGTTCTTTGC TGGTTTCTCT CTCTTGCAAG TACAGTTTCT AGTGTAGCGC ACGACTCTTC TGACATGCGG CTTTGTTATT CAGGTGGATC TATAGCAGTA CTCTGGCGTA CGTGGTGGAC GCCAATCCTG TACGTGTTAC ACTTTTTTCA TCCCTGCGTC AAGTCAGTTC TGATGCTGAT GATTGTCTAG GGTGCTTCCA GCTCTGCAGT CTCTTGTAAT TCAATGTTTA GGGGTATCTG CGCTTGCGTT ATGTCACAAG TCGCTACTCC GATACAGAAT GGTATAGGAG ACGGGGGGTT ATATACCTTG TTTGCTGGGA TCTTGGCATT CGCCTGTGCT TGCAACTTAT TGCTTATGGG TGCGTTTCCG CCCTTGCTTT TTGATACATA AACAGAAAGC TGACACTTGT ATACCTAATA CAGTGAAAGG TGAGCAATGG AGGTCTCCCG AACATCGTTG GCCCTGGCAA AAGAAGCGAG AAGAAGGAAA CGATGAGAAG GAATGA
|
Protein sequence | MIDRPPDQTG PSKNSEKQAV KGLNAIDIEH LPVDNDPRKW SDRKKWTVLM IMTFAMLGPM MAPSIYNPVI SDVREDLHAT ESELGLSISL YILFQGCTPV LWAAIAETVY LVSYTLYVIA LVVASRANSM PLLIVMRILQ STGSGPTVSL GAGSLADMYE THERGAKLGL FYGVPMIAPA VAPLIGGALG QSFGWRSVFY FLAVYAFIML CCFMRSRVYQ RALTKAIERA ENRDAKDLKR KAKLAKQSQV LDTIPATPDA TPGNSPGNSR RPSAEAGGAA VTDTTLVDVT LSNDQKQTVG RSGRIKVKMM KWLPFGAKEE KIQAENEVEF KPTFRDLNPL PSMALILKQP TNLLILTSSA SITLGKAPYS YGSLKIGLVV LAFGIGNILA SMIGGKYSDM VLKKLKKKNG GVGNPEMRLR STVLAMPILV ASFLAYAWTA EEKGASSSAV SCNSMFRGIC ACVMSQVATP IQNGIGDGGL YTLFAGILAF ACACNLLLMV KGEQWRSPEH RWPWQKKREE GNDEKE
|
| |