Gene CNA01120 details


Gene Information

Locus tag: CNA01120
Symbol:
ID: 3253685
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 299868
End bp: 302803
Gene Length: 2936 bp
Protein Length: 561 aa
Translation table:
GC content: 49%
IMG OID: 638252443
Product: conserved hypothetical protein
Protein accession: XP_567087
Protein GI: 58259349
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
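
The gene length reported above agrees with the start and end coordinates if both are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic follows; the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length in bp.

    Assumes 1-based, inclusive coordinates (an assumption; the record
    does not state its convention).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(299868, 302803))  # 2936
```

The result is larger than three times the protein length (561 aa), which fits a gene flagged as spliced, where the listed span also covers non-coding sequence.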


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0180752
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAACCAACAG TGGCCTCGAA AGTAAAGGCA AACGTCGTCT TCCTGCTTTT CGACCACATC 
TCACACTCAT TTACCAACAC CACGCTCATT CAAACGCATC CCAGCCTCTC ACCGACCCCG
ACCAACCATC CTTTCCTTTT TTCTCCACCG GCCTATCATC CGATCAGTGT CAGCCAAGCT
ACACGACTTT CTTCACCAGC TCCCTAGGTT GAGAGCAAAG TTGCTACGAC GATTTCCCTT
CCCTTCTCCC TTTTCAATCC ACTCTTTCAC TCGACATCCT CGCCACACAC ATTCGCCATT
CCGTTGACGC ATCCGTGCAT TCATCTGTGC ACTGTCCATC CTTCCCTTGT AGTCACCGCG
AGCTTCTTCC TTACACATTC ATCACCATCA CTCGATTCTA ATAGTCACAT GGTCGATCAG
TAGACCTATT TCCGGCCACA CATATCCATA TCCTATAATC AGCCCGTATC GACCCTTGTC
GCTCATTATA CTATCTCCTC CTCTCCACCG ACTGTCTTTA ATCAGCCCTT CCCGACACCT
CTTGCTCCAA CTGCAATAAT GAGATCACAC TTGTCATTGT CATTATTGTT TTCTTCCCTC
GTGGCGATCA GCGTCGTAAG TGCAGCTTCT GCCGACGAGT GGAAAGGCAA ATCCATTTAC
CAGTGAGCAT TTAATCCCCT TATTTGATAT GAACGTTTTT GCTCATTAGC ACGGGCACTG
CAGGCTTTTC ACCGATCGAT TCGCTCCAGT CTCCGACACT GCGCCCGCTC GGTCCTCCCC
GATACCGGAT GAATGTGATC CCATCGACCA AACGTGAGTG GGGGAATCAA GCGTAGTATT
GAGAATGCAT GATTGATTAA CGCATTAATA GATGGTGTGG CGGAACATGG CTGTCTATCA
TTGACAAGCT TGATTATATT TCTGACATGG GTTTTGACGC TATTTGGATT TCTCCCGTTA
GTCGTAAGCC GCATTTCGTT TCTGTACCTG GTCGCGTGAC GCACATGTGC TGATTTTGCC
TTTAGAAAAC ATTGACCGCG ATACCCCCTA CCACTATGCG TATCACGGTT ACTGGGTCAA
TGACCCTCGT GCTCTCAACC CTCGTTTCGG CACCGCCGAT GACCTCAAGG CGCTCAGCAA
AGCTCTTCAC GACAGGGGAA TGTACTTGAT GGTCGACATT GTCGTCAACA ACATCCCTGG
AACCACTGTC AACGATTCTT TCAGTACCTC TGATCTTGTC GCTGACGGTT CTATTTGGAC
CGATCCCTCA GAATTCCACC CTCAATGTTG GATCGATTAC AGCAATCAGA CATCAGTAGA
AAACTGTTGG TTGGGTGACG ACAAGTTGCC TTTGATGGAC GTTAACACTG AGAACGAGGC
TGTCGTCTCA ACATTGCAAG CTTGGATTTC CAACTTGACT GCTGAGTACG AGATTGACGG
TTTACGTATC GATGCTGCCA AGCACGTCCC CGGAGAGTTC TGGACAGGAT TCTGCGGTGC
TGCCGGTGTT TTCTGCATGG GCGAGGTCTA CACAGACGAC ATTAAGTGAG TGATTATTGC
CTATACGGGG GAGTACTGAC TGATCTTGCA CAGTTTTGCC GCCAAGTTCC AAACCCAAAA
TTGGATGGAC TCCGTCCTTG GCTACCCTCT CTACTACGGT ATTGTGGATG GATTCGGTAC
TCCAAACGGC AACATGTCCA GATTCGTCGA CATTGCTACT CAGGTTTTGG GCACGTTCCC
TACCCCCGGT CTCATCGGTA ACTTTATCGA AAACCATGAT CTTCCTCGAT GGCGAAACAC
TACCGCCGAC TCTCAGCTGG CTTACAACGC GATGACTGTT CAGTTCATTT TTGAAGGTTT
GCCAGTAGTG TATTACGGCC AAGAGCAAGA CTTTGCTAGC GGTGCCGGCG ATCCTTACAA
CCGACAAGCG CTTTGGACTT CCGAGTATGC CAACACGACT AGTTACAACC ACATCAAGAG
GCTGAATGAG ATCCGACACG CTGTGATCTC TAACAACACC TTGTTTGACG GAAAGAACTT
TTTGGACTCT CAGACCAAGA TCGTGGCTTC GACCGACTAT GATGTAGCGT TCAGGAAGGG
ACCTTTGCTT GCTGTCTTGA CCAACGTGAG TGAGATGTTC CTGGCCAGGA AGCGGTTCAA
CTGACAGTTG TTTAGCGAGG AAGCCCCAGT CAAAACGTCG GGTTTGGCGT GCCCACTAGC
GGCTGGCCTT CCCAGTCCAG TGTCGTTGAG TAGGTCCAAC AGTAGTTGTA AAAGTTGCAG
AGAAGCTGAC TCTACATAGC CTTCTTTCTT GCAAGCAGTT CACTGTTGGA TCTGGTGGTG
CCATGCTCGT CTCTTACTCT GCTTCTGGCT ACGGAGGTAT GCCTTATGTA AGTGCCTTTG
TTCCAGGTCA TCCCTGATAT TATCCTAACA CCCGCTTCAG GTCTTTGCCG CACAGAGCGA
TGCTTCGGCA ATGGGAATTT GCGGTGATGC TGGCATGTCA ACCTATGTGT CCCCCAACAT
CACCTCGGCT GCTTTCCCCG CGTTGGCACC CGCAACAGGT CTCGGATCAG CTCTCAGCTT
GCCAGCAGCC GTTGCTGGTG CACTGGGACT GATGTTCATA CTATGATACC CTCTTCGCTT
TCGACAGATA GCAATCCGAT TAAAACGAGG CCCTGTTGCC CATATACCCT TATTCTTATC
GCCATCATCC TCGTGTATTC CACTAAGCAC CGTCAATCCT ATCATTGTCA TTGGGAGAAC
AAACATCACC TACGCATTAT AATGGCCTTT ATGATTAACA TTTTCTATTT CCTGGCCTTA
TAAAGGCAAT CCAGGTGCAG GTGCCGGTGA TGGGAGGTCC TGTCCATAAG CTCAGCAGAC
AAACATCCTC GTTTCCGGTC ATTGAATAGC AATATGAGCG TGTTGTAATA CTATTA
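
The gene sequence above is laid out in space-separated blocks of ten bases, so the whitespace has to be stripped before counting anything. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC content figure was derived, so this is only an illustration of the usual definition (G plus C over total length).

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence above, used as a small example.
example = clean_sequence("CAACCAACAG TGGCCTCGAA")
print(len(example))         # 20
print(gc_percent(example))  # 55.0
```

Passing the full sequence text to `clean_sequence` and then to `gc_percent` gives a figure that can be compared with the GC content listed in the record.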
 
Protein sequence
MRSHLSLSLL FSSLVAISVV SAASADEWKG KSIYQLFTDR FAPVSDTAPA RSSPIPDECD 
PIDQTWCGGT WLSIIDKLDY ISDMGFDAIW ISPVSQNIDR DTPYHYAYHG YWVNDPRALN
PRFGTADDLK ALSKALHDRG MYLMVDIVVN NIPGTTVNDS FSTSDLVADG SIWTDPSEFH
PQCWIDYSNQ TSVENCWLGD DKLPLMDVNT ENEAVVSTLQ AWISNLTAEY EIDGLRIDAA
KHVPGEFWTG FCGAAGVFCM GEVYTDDINF AAKFQTQNWM DSVLGYPLYY GIVDGFGTPN
GNMSRFVDIA TQVLGTFPTP GLIGNFIENH DLPRWRNTTA DSQLAYNAMT VQFIFEGLPV
VYYGQEQDFA SGAGDPYNRQ ALWTSEYANT TSYNHIKRLN EIRHAVISNN TLFDGKNFLD
SQTKIVASTD YDVAFRKGPL LAVLTNRGSP SQNVGFGVPT SGWPSQSSVV DLLSCKQFTV
GSGGAMLVSY SASGYGGMPY VFAAQSDASA MGICGDAGMS TYVSPNITSA AFPALAPATG
LGSALSLPAA VAGALGLMFI L