Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA01120 |
Symbol | |
ID | 3253685 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 299868 |
End bp | 302803 |
Gene Length | 2936 bp |
Protein Length | 561 aa |
Translation table | |
GC content | 49% |
IMG OID | 638252443 |
Product | conserved hypothetical protein |
Protein accession | XP_567087 |
Protein GI | 58259349 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0180752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACCAACAG TGGCCTCGAA AGTAAAGGCA AACGTCGTCT TCCTGCTTTT CGACCACATC TCACACTCAT TTACCAACAC CACGCTCATT CAAACGCATC CCAGCCTCTC ACCGACCCCG ACCAACCATC CTTTCCTTTT TTCTCCACCG GCCTATCATC CGATCAGTGT CAGCCAAGCT ACACGACTTT CTTCACCAGC TCCCTAGGTT GAGAGCAAAG TTGCTACGAC GATTTCCCTT CCCTTCTCCC TTTTCAATCC ACTCTTTCAC TCGACATCCT CGCCACACAC ATTCGCCATT CCGTTGACGC ATCCGTGCAT TCATCTGTGC ACTGTCCATC CTTCCCTTGT AGTCACCGCG AGCTTCTTCC TTACACATTC ATCACCATCA CTCGATTCTA ATAGTCACAT GGTCGATCAG TAGACCTATT TCCGGCCACA CATATCCATA TCCTATAATC AGCCCGTATC GACCCTTGTC GCTCATTATA CTATCTCCTC CTCTCCACCG ACTGTCTTTA ATCAGCCCTT CCCGACACCT CTTGCTCCAA CTGCAATAAT GAGATCACAC TTGTCATTGT CATTATTGTT TTCTTCCCTC GTGGCGATCA GCGTCGTAAG TGCAGCTTCT GCCGACGAGT GGAAAGGCAA ATCCATTTAC CAGTGAGCAT TTAATCCCCT TATTTGATAT GAACGTTTTT GCTCATTAGC ACGGGCACTG CAGGCTTTTC ACCGATCGAT TCGCTCCAGT CTCCGACACT GCGCCCGCTC GGTCCTCCCC GATACCGGAT GAATGTGATC CCATCGACCA AACGTGAGTG GGGGAATCAA GCGTAGTATT GAGAATGCAT GATTGATTAA CGCATTAATA GATGGTGTGG CGGAACATGG CTGTCTATCA TTGACAAGCT TGATTATATT TCTGACATGG GTTTTGACGC TATTTGGATT TCTCCCGTTA GTCGTAAGCC GCATTTCGTT TCTGTACCTG GTCGCGTGAC GCACATGTGC TGATTTTGCC TTTAGAAAAC ATTGACCGCG ATACCCCCTA CCACTATGCG TATCACGGTT ACTGGGTCAA TGACCCTCGT GCTCTCAACC CTCGTTTCGG CACCGCCGAT GACCTCAAGG CGCTCAGCAA AGCTCTTCAC GACAGGGGAA TGTACTTGAT GGTCGACATT GTCGTCAACA ACATCCCTGG AACCACTGTC AACGATTCTT TCAGTACCTC TGATCTTGTC GCTGACGGTT CTATTTGGAC CGATCCCTCA GAATTCCACC CTCAATGTTG GATCGATTAC AGCAATCAGA CATCAGTAGA AAACTGTTGG TTGGGTGACG ACAAGTTGCC TTTGATGGAC GTTAACACTG AGAACGAGGC TGTCGTCTCA ACATTGCAAG CTTGGATTTC CAACTTGACT GCTGAGTACG AGATTGACGG TTTACGTATC GATGCTGCCA AGCACGTCCC CGGAGAGTTC TGGACAGGAT TCTGCGGTGC TGCCGGTGTT TTCTGCATGG GCGAGGTCTA CACAGACGAC ATTAAGTGAG TGATTATTGC CTATACGGGG GAGTACTGAC TGATCTTGCA CAGTTTTGCC GCCAAGTTCC AAACCCAAAA TTGGATGGAC TCCGTCCTTG GCTACCCTCT CTACTACGGT ATTGTGGATG GATTCGGTAC TCCAAACGGC AACATGTCCA GATTCGTCGA CATTGCTACT CAGGTTTTGG GCACGTTCCC TACCCCCGGT CTCATCGGTA ACTTTATCGA AAACCATGAT CTTCCTCGAT GGCGAAACAC TACCGCCGAC TCTCAGCTGG CTTACAACGC GATGACTGTT CAGTTCATTT TTGAAGGTTT GCCAGTAGTG TATTACGGCC AAGAGCAAGA CTTTGCTAGC GGTGCCGGCG ATCCTTACAA CCGACAAGCG CTTTGGACTT CCGAGTATGC CAACACGACT AGTTACAACC ACATCAAGAG GCTGAATGAG ATCCGACACG CTGTGATCTC TAACAACACC TTGTTTGACG GAAAGAACTT TTTGGACTCT CAGACCAAGA TCGTGGCTTC GACCGACTAT GATGTAGCGT TCAGGAAGGG ACCTTTGCTT GCTGTCTTGA CCAACGTGAG TGAGATGTTC CTGGCCAGGA AGCGGTTCAA CTGACAGTTG TTTAGCGAGG AAGCCCCAGT CAAAACGTCG GGTTTGGCGT GCCCACTAGC GGCTGGCCTT CCCAGTCCAG TGTCGTTGAG TAGGTCCAAC AGTAGTTGTA AAAGTTGCAG AGAAGCTGAC TCTACATAGC CTTCTTTCTT GCAAGCAGTT CACTGTTGGA TCTGGTGGTG CCATGCTCGT CTCTTACTCT GCTTCTGGCT ACGGAGGTAT GCCTTATGTA AGTGCCTTTG TTCCAGGTCA TCCCTGATAT TATCCTAACA CCCGCTTCAG GTCTTTGCCG CACAGAGCGA TGCTTCGGCA ATGGGAATTT GCGGTGATGC TGGCATGTCA ACCTATGTGT CCCCCAACAT CACCTCGGCT GCTTTCCCCG CGTTGGCACC CGCAACAGGT CTCGGATCAG CTCTCAGCTT GCCAGCAGCC GTTGCTGGTG CACTGGGACT GATGTTCATA CTATGATACC CTCTTCGCTT TCGACAGATA GCAATCCGAT TAAAACGAGG CCCTGTTGCC CATATACCCT TATTCTTATC GCCATCATCC TCGTGTATTC CACTAAGCAC CGTCAATCCT ATCATTGTCA TTGGGAGAAC AAACATCACC TACGCATTAT AATGGCCTTT ATGATTAACA TTTTCTATTT CCTGGCCTTA TAAAGGCAAT CCAGGTGCAG GTGCCGGTGA TGGGAGGTCC TGTCCATAAG CTCAGCAGAC AAACATCCTC GTTTCCGGTC ATTGAATAGC AATATGAGCG TGTTGTAATA CTATTA
|
Protein sequence | MRSHLSLSLL FSSLVAISVV SAASADEWKG KSIYQLFTDR FAPVSDTAPA RSSPIPDECD PIDQTWCGGT WLSIIDKLDY ISDMGFDAIW ISPVSQNIDR DTPYHYAYHG YWVNDPRALN PRFGTADDLK ALSKALHDRG MYLMVDIVVN NIPGTTVNDS FSTSDLVADG SIWTDPSEFH PQCWIDYSNQ TSVENCWLGD DKLPLMDVNT ENEAVVSTLQ AWISNLTAEY EIDGLRIDAA KHVPGEFWTG FCGAAGVFCM GEVYTDDINF AAKFQTQNWM DSVLGYPLYY GIVDGFGTPN GNMSRFVDIA TQVLGTFPTP GLIGNFIENH DLPRWRNTTA DSQLAYNAMT VQFIFEGLPV VYYGQEQDFA SGAGDPYNRQ ALWTSEYANT TSYNHIKRLN EIRHAVISNN TLFDGKNFLD SQTKIVASTD YDVAFRKGPL LAVLTNRGSP SQNVGFGVPT SGWPSQSSVV DLLSCKQFTV GSGGAMLVSY SASGYGGMPY VFAAQSDASA MGICGDAGMS TYVSPNITSA AFPALAPATG LGSALSLPAA VAGALGLMFI L
|
| |