Gene CNM00450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM00450 
Symbol 
ID3255256 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp108534 
End bp111417 
Gene Length2884 bp 
Protein Length436 aa 
Translation table 
GC content48% 
IMG OID638254204 
Productexpressed protein 
Protein accessionXP_568456 
Protein GI58262092 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.98204 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGTTGGGTTA TTTACCAGTT CCTTGTCTCC AGGGCTTCTA TAGATGGGTA TCGGATGCCT 
TCTACACTTC ACAACTTGAC AACCCAACGC AAAAACTCCT TCCTCTTCTT TTTGTTGAAA
TTCGAAACAC TGCCACTCAC GTATCCACAT TCCCAACGGC GGTCGTAACA AACAGCGATT
CCGCCCTCCG GCTGGGCAGC TTCCGGTTTA TTTCCGACAA ACATTTCGGA CATGTCCTGT
CCGTCACTCG TTTGGCAGAA GTTCGCAGGT TTTTAATAAC TTCCTTCCCC CGAAAGGTGA
TCTTCGTAAA CGGTTCTAAA GGCCGATATT CGCCTTGTTT CGGCTTTGCA ACTCAAATTC
CCCCATGTGC TGGTCACTGC TCTCGGTCAA ATTTCACCAC CTCTTTTGAC TCCCCCCTTC
CAAGCATTAT CTTTTGACGA TTCGTTAGCC GCGCTCCACC AAAAGTTTTG CTTTGGAAGC
CTCAATACGG CTCGAAGGAG GTCATCAACC GGCGCAAAGG AAGAGGATCA GCGATCAGTC
GCTTGTCGAC AACATTATTA AACGTTACGA TTGTCATATC AAGTCCCGGA CTCACATATA
GATTGTCTCT TTGAGTCCGA ACGACAGACT TCCTCACCGT CGCTGGTGGC CAATTCTTTT
CGCGCGGTAA CAAAGAAAGT ACTGTGAACG TCCGCTATCT AGCGACCTTG CGACGACGAC
CAAAGTCAAT CTTTCGAAAA CCACAACGCC AAAAGATTCC AGGAGTGATC ATCATTAATT
CCGTACGTCT GGCTCTCGTC ATCTTCTTCC CATTTTATCA TCAGCTGATT CCAGCTTTAA
AGCAGTTATT ACTGGTGATT CAAGTGGTAT TCTAAATTTT TTATAATTAT CTGCCCTCCT
ATTCGTGGCA GAAGGACGAG TGAAAAGAGT GGCAGCCAAC TCGCCGACCT TTCCCCCACT
TCCCCCGCCG GCTCGGGACT TTTGTTTACT TCATCAGATA ACACCTCGGT TCAGCGCTAT
TTAACCAACG ACGTATATAT TGTTGGTCCT ATCCACTGTG CGGCTAGTTT CTCTTGTTCA
TTACAGCTAT AAAGGACTTA AGTCGTCGAC CCTGTGTATA GCCGGAAAAT GAGCCCCTAC
ATCCATCCAT CCGAAGCAAG GGCCCCCGGG GAGTCTGTCA GATGTACCAC ACGGCTACCC
CCCATTCTGG CGTCCATCGG TTAGTTATGA GATCAAAACA TCTCCTTCCT TAGTTTCACT
CGACTGATCA GTCCTCCTAG GAACCGTCCT GATTGATGCA TTCGACAGTT TGCCTCGACC
TATTGTGGAC GATAGCCAGG CTTCTTCTCG GCCAGCGTCC CCAGTCATCG ACGAGCGGGC
GCCTCAACCA CAGGGGCATC TTCAACACTT AGTTATTCCC GCCCGTCGCC TTCGGCTCTC
CCCTCCCCTG TCGTCCCCGG CAACTTCCGG AACCTCGGGC CCTTCGACTC CCCAGATGAC
CCTCGATGAT ATCCCTGTTC CGAATGCGGA AGAAGTTTAT GAAATGCTAG GCGGTGGTGC
TCTGTATGCA ATCGTCGGGG CGAGATTCTG GCTTCCTCCT TGCCAATTAC GAACTCTTGT
CGATCGAGCC CCGGCTGAAA ATGATGACTG CCCAAAAGAT GTGGAGCAAA AGCTGGCGAA
ATTGGGTAAT GAAATATGGG TTTGGAATAG AGGTGAGGGA ACGAGAATGA CAAGAGCAAG
AATACGGTAT GAGGGCGATG TCCGATAGTA CGTGACAGTC ATGAATCTGA TAGTTATAGC
TAATGGATAT CATCAGTTTC CAACCCGTTG TGAAGGCTCC ATATCGGACA ATACAGGAGC
TTTCTACTTC ACCTCTTCTC TATGCTGAAT ATCTTCACAT TTCCCCTCCA TACTCTCCAG
AAAATGTGGC TGTTATCGTC TCTGATCTGA AAGCTTTGCC GAAAGATAGC TGGCGACCTA
AGATTGTCTT TGAGCCTACC CCCCCTTCAT GCCATCCTGG CCAGAAGGAC TGGCTCGAAC
ACATTCTTCC CGATATCGAA GTACTCTCGT AAGTCGTTGC TTGATGAAGG AAACCTATTA
TGCTGATTGG CCCTTAGCCC CAATCACGAA GAGCTCTTTT CTTTCTACTC TATCCCTACC
ATGGCGACCT CTTCTATCTC GCTGCGTCCA ACAGTTGAAC GCCTGGTGAC CCATATTCTG
CACGATGTCG GCATTGGCGC GAATGGACAA GGTATAGTGG TCGTCAGGTG TGGTCGGCTC
GGAGCATGTG TAGGCACCAA GAAAGGCGGA TTAAAATGGT GTCCGGCTTA TTGGGAAGGT
GATGATGTGA AGAATGTAAA AGATGTGACT GGAGGTGGGT TGGATTGGAA ACCATGTATC
GTCAATATTG ATGTTTTCAT TGTCGTAGCT GGCAACTCTT TCCTGGGAGG TTATGTAGCA
GGCCTTTCCC TAACTAATGA CCCTTATGAA GGTAAGATAC TTTCCAACAC TATCAGGAGT
CCTACTGATT CCGTCCATAG CTCTTTTATA CGCCACCATT TCATCCTCCT TCGTTGTAGA
GCAGTTCGGA CTGCCACGTC TAATGGATTG CACCGATCCT CTGACGGGCG AAGAAATTTG
GAATGCCGAC ACACCCTCTC GTCGATTGAA GGAACTGAAA CGACGCTTGG GTCTACTATA
ATGTTTTCCC TACATATACA CATCATGCGA GATCGGTTCA GGACATTTCT CTGCCACTTT
TTTAGGATAC CGTTCATTAG GTGTATACCG TGATAAGCCT CGTGGTATCA TTATAGTTTG
ATTAATAGAA TATGTCATAG CAATACAACT GTAACATCCA CAAAATATGC ATGTCAACGT
ATGT
 
Protein sequence
MSPYIHPSEA RAPGESVRCT TRLPPILASI GTVLIDAFDS LPRPIVDDSQ ASSRPASPVI 
DERAPQPQGH LQHLVIPARR LRLSPPLSSP ATSGTSGPST PQMTLDDIPV PNAEEVYEML
GGGALYAIVG ARFWLPPCQL RTLVDRAPAE NDDCPKDVEQ KLAKLGNEIW VWNRGEGTRM
TRARIRYEGD VRYFQPVVKA PYRTIQELST SPLLYAEYLH ISPPYSPENV AVIVSDLKAL
PKDSWRPKIV FEPTPPSCHP GQKDWLEHIL PDIEVLSPNH EELFSFYSIP TMATSSISLR
PTVERLVTHI LHDVGIGANG QGIVVVRCGR LGACVGTKKG GLKWCPAYWE GDDVKNVKDV
TGAGNSFLGG YVAGLSLTND PYEALLYATI SSSFVVEQFG LPRLMDCTDP LTGEEIWNAD
TPSRRLKELK RRLGLL