Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB02100 |
Symbol | |
ID | 3255657 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 610092 |
End bp | 612854 |
Gene Length | 2763 bp |
Protein Length | 529 aa |
Translation table | |
GC content | 45% |
IMG OID | 638254860 |
Product | expressed protein |
Protein accession | XP_569138 |
Protein GI | 58263456 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.320661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACACTACAA TGGTCACTGC AAACTGCAAC GCTGCATGCC GAGACGCTTA TCGAGCGTGG AGATGGAGAT CCCCGATCTC GACCATACGG CGGTTCACCG ATCTCGGCAA TCCGCGAAGC ACTCGGGCAT TGTCAGATGG GAATATGAGG ATAACATCAT ATCGCAGAAA TCCACCACAG CCTTAAAAGC GCTGAAGAAA GCAACATTAC AAACAGGAAG GAAGGATATA AAAGCCCATG TATGTAGTGT CTTTCAACGT TCCCACATAC ACCCCAATCG CGTAACATTA TGTCTGACGA TTCAACCTTG GCTGACATTC ATCCTATCGA CGCTAGCAAG CTAGAGAAAG CTGAATCTGA ACACAATCAG TTACACCAAA ATGCGAGCCA TCATCACTCC GAGAACACGC TTGCCAACCT GAGCCAGGCT AGGAAGAACT TCTTAGTGCT TATCTTCTCC ATAGCAACGT TTGTCGACAT CTGCAAGTGA GTTCTAATAT CAGATGAATA TGTCAGCACT GACATGAGCT CGTCCTATTC AATCAGTGTT TCTGGAGTGG CCGTAGCGGT TGCCCAAATT TCAACTGACA TCAAACTCGA CTACTCTCAA ATCGTTTGGA TCGTCACATC TTATTCCCTG TGTTTTGCTG CTCTTTTGCT CTTTGCCGGG CGACTGGCAG ATTTGTTCCC AGCCCAAATA GTGTTCGAGG GAGGTTTTAT TATGCTAGGA ATATTGAGTT TAGTCACCTC TTTTGTGACT TCTAATAAGT GGGTCCATGC TTTGTCATAA CCTTATGCGA GGCAATGTAC TGAACATTTT TTTAGGTATG GGTTTTTGAT TTTACGTGGC CTTGGAGGTA TTGCCGGTGC CATGAGTGAG TTATTTCCCC AGGATAACTA GACTGATGCT CACTTCTACA AAAGCAATCC CTTCAGGCTA GTGAGCATAT GTGGCGACCT TTAACGGCTG GGACTGACAT TAATGGCATA GTCACCTCAC GGTCCATCTC TTCCCTGAAC CTGCTGAGCA ACAAGCCAAA TTAGCCCTTT TAGGATTAGC AGGTGCTATT GGAAATGTAC TCGGATTGTA AGTATCAAAT GATTGGCAAG AAGAGGTGCT CACCCAATGC AGGGTTCTAG CAGGTGTGTG TATGTTAGCT AGTTACAAAT GGTTCTTTAG GGTCATTGCC ATCATCTGTA TGTTTTCAAT CCCGATATTT CCAGGAGAAT TGCTAAGATG ATTGCGAAGG TATTGTCTTC ACTATCATTT GCGTCTTGGT TTTGCCTTTC ACAGGGTCAA CGTACAGCCC TGACCCTAAT ATGCCTCGTT GGAAGAGGCT TGACTTTATG GGTGTCGGAC TTATGATGAC CTCTCTTATC TGCTTTATTC TTGCCTTGAC TCAAGGCCCA ATTGATGGCT GGGGTTCCGC CTCATTCATT GCTCCATTCA TCCTGAGTTT CCCTCTTGCA ATCGGCTTCT TTTTCTGGGG TGCGTAAATC CCTCGCTATA CTCGGCATCT CTTAAATGGG AAGTTTTGCT GATCATGAGC AATCGCAGAA TCTAAGATTC CAGCCAAGAG CGCCGTATTA CCCAGTTCAG TCTGGAAGAT CACCAATATT GTGATCTCCA GCTTGGCGAT AGGTATCCCT TGTACGTGGC CTTCCTACAG ACCGCGGAAC CTCTCTGACA TTTTGCCTGA AAGTTCCGTT CTGGGCGACT TCTCAGCTTC TGTACTCTAC TTACTTCCAA GAAGTATTTG GCTGGACCCC AAGTGAGTTT TATGACTCAT AATTTTTTCG AAACCGATTG CTGAACCCCA TTATGTCAGT CAAAGTCGCG GCGGCAATGG TACCCCAGGG AGTTACTGCA TTGATAATTG GCGCTTCAGC GCAGGTCATC CCCCAAATCA TCACAAAGCC GCGAATCACG CTTCCCATCG GTGGAGCTCG TGAGTATTAG AGTTTCAAAG GAAAAGTCAT CTCCTAAACA AATGGTAGTG GTGATTATCG CCGAGATTCT GCAAGTGTTC TCTAACGGAG GACATGGTAC AGATTACTGG AGGTATTGTT TCCCTGCATT TGTGCTCGGC AGCGCAGGAG CGGTTATGAC TTTCTTTGCC TCAGCGTAAG CAGAAAAGCA CAGAATGCGG GAATATATAA ATTGACCATC TTTTAATAGT ATCAATCTCA TCTCCTACTG TCCTCCAGAA ATGGCTGGTG TTGCAGGTGC TTGGACCCAA GTGATCGTGA GTTTCTATCA GTTTTATTTG TGTCATAAAC CAGGCTGATA AACAATATCT CCAGTCTCAA ATCGCGGGTG CTATTACACT CGCAGTTCAG GCTTCTTTCG AAGGCGACGG TGTTGCTGAC TGGAACAAGG CTGGCCGCCG ATCCTTCTAT TTCCAAATTG CTTGGACAGC TATATTGTTA CTCCAGTTTT TAATTTTCTA CAAGACGCCA GGAACTCCCG ACGAAGAACA CGAGGCCGCT AGGAAGAGAA TCAAGGAGAG TGGGAAGGAT GCTGGTGTGT GATTGTGAAC AATGCTTTAG AGTTGAGTCA AGCAAAACTG GGAGAAGCAC CTCTCTGTAC AGCACATTGT CGATATTAAG TATAACTGTG TATAGATAAG AAACAAATTG AATAGTCAAC AGTAAAGTGA ATATTTATAT ATAGATGCAG AAGGATACCG ACCACGAATT TACTGTATAA TATAATATAA TATAATCATA GACATACTTT ACAACCTTAC ACAGCTTATG AAC
|
Protein sequence | MSDDSTLADI HPIDASKLEK AESEHNQLHQ NASHHHSENT LANLSQARKN FLVLIFSIAT FVDICNVSGV AVAVAQISTD IKLDYSQIVW IVTSYSLCFA ALLLFAGRLA DLFPAQIVFE GGFIMLGILS LVTSFVTSNK YGFLILRGLG GIAGAMTIPS GYHLTVHLFP EPAEQQAKLA LLGLAGAIGN VLGLVLAGVC MLASYKWFFR VIAIICIVFT IICVLVLPFT GSTYSPDPNM PRWKRLDFMG VGLMMTSLIC FILALTQGPI DGWGSASFIA PFILSFPLAI GFFFWESKIP AKSAVLPSSV WKITNIVISS LAIGIPFPFW ATSQLLYSTY FQEVFGWTPI KVAAAMVPQG VTALIIGASA QVIPQIITKP RITLPIGGAL VIIAEILQVF SNGGHGTDYW RYCFPAFVLG SAGAVMTFFA SAINLISYCP PEMAGVAGAW TQVISQIAGA ITLAVQASFE GDGVADWNKA GRRSFYFQIA WTAILLLQFL IFYKTPGTPD EEHEAARKRI KESGKDAGV
|
| |