Gene CNB04620 details


Gene Information

Locus tag: CNB04620
Symbol:
ID: 3256030
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 1322086
End bp: 1323335
Gene Length: 1250 bp
Protein Length: 309 aa
Translation table:
GC content: 47%
IMG OID: 638255105
Product: hypothetical protein
Protein accession: XP_568952
Protein GI: 58263084
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0702] Predicted nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
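
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed Gene Length) and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values copied from the Gene Information fields.
gene_len = span_length(1322086, 1323335)
cds_len = coding_length(309)

print(gene_len)            # 1250, matching the Gene Length field
print(cds_len)             # 930 coding nucleotides including the stop codon
print(gene_len - cds_len)  # 320 nucleotides of the span that do not code for protein
```

The 320 bp difference is consistent with the "Is gene spliced: Yes" field, since a spliced gene spans more genomic sequence than its protein requires. The arithmetic alone does not say how that remainder is divided among introns and any other non-coding sequence.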


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value: 0.599307
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAGCA ACGCAGCCAA GAAAATTGTT GTGTTCACTG CCACCGGCAG TCAGGGTAGT 
TCTGTAGCTC GATACCTCTC CGAGGCCGGT TACAAGATTG TTGCTCTTAC TAGGGATACT
GAGAGTAAGA GCGCCAAAGG TGAGACAGAT CCTTTCATGT CTTCTGAACA AGATCCAAGC
TGACCTATGA TGGTGATTTT TGTTAGCTCT CAAGGCTAAA GGCTATGAAG TTGCGAGAGC
TGACAACACC GATCCTGAGT CCTACAAGCC TGCCCTCCAA GGAGCTTATG GTGCTTTTGT
CAACACTGAT TGTGAGTCTT CAGTTATTCC CTGTTTATGA GATACTCATA ACACTACTTC
TAGTCTGGTC GATTTTTCCC ACTAAGAACT TTGACCCCGA ACTCACCCAG GCGGAAGAGT
TTAAACAAGG CACAGCCGCC TTGCAGGCTT GCAAAGAGGC GGGATTGAAA CAGATCGTCT
ACTCAACTTT GGATGATGGA ACGGGATGCG TGCACTGGCA GTCCAAAGCA GAAGGTAAGA
TTGATCACAA TTATTATAAG GATACATGCT TATAGTCAAT TCCTAATTAG TCTCCAAATG
GGCAAAGAAC AACGACATCC CCATTACCAA CCTTGTACTC ACGTTCTACT ACGAGAACAT
CGTCAAAATG AACGCATGTG CCGGTGATGA CCAGGGCCCC AATACCTTTA CCCTTAACTT
GCCTCTCCCA GAGGATTCCT TAGTCCCTGG GTTCCCCGTT GCTCAGACTG GATTGTGGGT
CAAGACAGCG TTCGATGACC CTAAGAACTG GATCGGTTAG TGCTTTCAGT TTGGGATGAG
TTCCTGCATG GCTTACGTGG CCCTAGGCAA AGACATATAT GCCTGCACTG ATATTATAAC
AGTCAAGGAG ATGGCGGATC AGCTCTCTGC TGTCAGCGGG AAGACTGTTA AGACCAACGG
GTTGCCAGTT GAAGTCTTTA AGAGTAAGGA TTTTCAAAAG AAGGTCGGTC AAGAGCTTTG
GGACAACATG GACCTTTTCT ATCGAAGGTG AGCTCCTCAT ACTCTTATCA TAAGCCAAGA
TGCTGACAGT CTATGCCTAT AGATTCCTCC AAAGAGATGT CCAGGAGAGT GTGCGTCTGG
CACCTGGTGC TTGGAGTTTT GAGGCTTGGG CAAAGCAGAA TGATCAGCTC AAGAAGGCTC
TAGGCTTTTA AATTAGTATT GAGGCATGAA GATGAAAGCC AGCTAGTCAA
 
Protein sequence
MSSNAAKKIV VFTATGSQGS SVARYLSEAG YKIVALTRDT ESKSAKALKA KGYEVARADN 
TDPESYKPAL QGAYGAFVNT DFWSIFPTKN FDPELTQAEE FKQGTAALQA CKEAGLKQIV
YSTLDDGTGC VHWQSKAEVS KWAKNNDIPI TNLVLTFYYE NIVKMNACAG DDQGPNTFTL
NLPLPEDSLV PGFPVAQTGL WVKTAFDDPK NWIGKDIYAC TDIITVKEMA DQLSAVSGKT
VKTNGLPVEV FKSKDFQKKV GQELWDNMDL FYRRFLQRDV QESVRLAPGA WSFEAWAKQN
DQLKKALGF
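
Both sequences are printed in space-separated blocks of ten characters, so they need to be cleaned before length or composition checks. The following is a minimal sketch of that cleanup, assuming the sequence has been copied into a Python string. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example string is only the first four blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First four ten-base blocks of the gene sequence, for illustration only.
example = "ATGTCCAGCA ACGCAGCCAA GAAAATTGTT GTGTTCACTG"
seq = clean_sequence(example)

print(len(seq))          # number of bases after the block spacing is removed
print(gc_fraction(seq))  # GC fraction of this short fragment, not of the whole gene
```

Applied to the full gene and protein sequences, the cleaned lengths can be compared against the Gene Length and Protein Length fields, and the GC fraction against the GC content field.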