Gene CNM02030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM02030 
Symbol 
ID3255274 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp628975 
End bp631339 
Gene Length2365 bp 
Protein Length527 aa 
Translation table 
GC content47% 
IMG OID638254357 
Productconserved hypothetical protein 
Protein accessionXP_568285 
Protein GI58261750 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGAAGTAACC ATAATCCAGA ATATCATTAG GACCCAACTT CAACTTAAAG CCGCCCTTTA 
AATTTTCCGC CTAACCCATT TTGCCCTATT GCCACCCCAG CAAGATGGTT CGACACGCAA
CTTTTGACAC AGAACCGGTG GAGATTGAAA AACCGGCCGA GGTGCAGACC GAGGATGTTA
ATGTTGAGAA GCAGTCTTCC TTGCATCCCA TCGGCACGCG TACGACTGAA GGCGGTGACA
CCGAGAAGGG AAGTACAAGA TATGCCGAAA GCGAATCTGG TTTGCTCTAC AAGAAGGAGC
AAAAGAAGGC CGAAAGGAAG TTATTGATGA AGTTGGGTAA GTGTTTTCGC TCTTCTGGAT
CGAAATGAGA GGCTCATATG ATATATGTAT ATATATTACT TAATGCGCAG ATGTTGCCAT
TTTGCCTTTT GCGGTTTTGT TGTACCTGAG TGCCTACCTT GATCGAGGGA ACTTGTGAGT
TGTTACACTG AAGTGGGCAT GTTGCAGGGC TAACTTTGGC GGAAAGAGCA AACGCGAGAC
TGCAAGGTCT GCAGGACGAA GTCCTGGATG GAAAAGACAA GAACTACTCG ATTGCACTCT
GCTGTTTCTT CGTGACTGTA CGTTCTCGGT TCATTTCTCG ATGCGTATTT ATGTTAATGT
CGTTTCTACA GTACATTGTG TTCTCAGTGC GTAAACATAT GACCGTGCTG CTCGCGTCTC
TCTTCTAATG TCCTTCAAAA GGTCCCCGGT ACTCTCATGG CCAAGCAATT CCTTCCTTCT
AGATCTATCG CCTGCGGTGC CATGATCTGG TCCATCGCCG CAACCTGCCA AGCAGCCGCT
TTCAACAAGG CCGGACTTTA CGTTTGTCGC CTTTTTGTTG GTATCGGTGA ATCCATGTTC
GGTCAAGCGA TGGCTCTTCA CTTCTCCTAC TGGTACACCA AGACCGACCT CGCCAAGCGC
GTCGGTCTCT TCATCTCGGC TGGTGCAGTC TCTGGTGCTT TCGGTGGTTT GATCTCGTTT
GGGGTGTCTA ACATCAAGAA CAGTCCTATC GAGCAATGGA GGATCTTGTT CTTGATTGAG
GGTTGCCCCT CCATTCTTCT TGCCATTTGT GTATTTTTCT TCATGCCCAG TAAGCCCGAG
AAGAGCAAAT ATCTTAATGA AGAAGAGAGA ACTCTTTGCT TGACTAGGTT GAATCAGGAG
AACAACGTTG AGAAGGATTT GGGTATCGAT TGGGGGGGTG TCAAGAGGTG TCTCACCGAC
TGGAAGACCT ATGTCATTAG CATTGCGTAA GTTAAATTTT ACGATATTAT AGTATATGAC
TCACCTTGTT TTAGTTATTC TTGTATGAAC CTCACCCTCG GATCTGTCAG CGGTTTCTTG
CCTACTATCA TCAAGGGCTT TGGTAACTTT TTCATACTCG TCGCTACAGC TTATAAGTAC
TCACATATTG TCTCTAGGCT ACTCCAACGC TCGTGCCCAA TTATTTACCG TTCCTCCCTA
TGCCGTTGCT CTCGTCTTCA TGCTCATCCT CACTTCCTTT TCCGATTACC GTCAAACCCG
TGGTCTCCCC GCCGCTTCCG TTTTCTGCCT CGGTATCATC GGCTGGGCCA TCCTTCTCGC
TGTCCCTGCT GACGAACACT ACTCTGCTCG ATATTTTGGG TGTATCTGTG TTGTCACAGC
GGGTTACACC AATATCCCGT TGATAATGAG TTGGCAGAGT GGTTGTACTG CGAATCAGAG
TCAAAGGGCG ACAAGTTTGG GTATGCTTAA CACTTTGGGA CAGTGCTTGT CTTTGGCTGC
TGCGTTTTTG TAGGTCATAA TCCTTACATA TGCTTTCAAG GCATTTACTA ACAGTCCTGG
CAGGTTCCCT TCTGCGGAAG GTCCTCAGTA TACCAAAGGT GCCTCTATTA ACTTGGCCTT
CCAAGGTCTC GGACTTATCC TTACATTGTT CATGACCTCG TACTACCGAT GGGAGAACCG
ACGACGAGAC ATGAAGGAAG GTGGACAACC TGCCGTAGGT GTTCCTATCG ACGTCAAGGA
AGGTTACGAT AGGGCTGTCG GTCAGTATTT TTCTCCGCCA AGCCTAATAT ATCGGCGCGT
TCTGAGGCTG ACATTCTGGT GACAGGGTTC CGATACGTCC CATGATCTCG TCCTTTTTCT
TTTTAGGTCA TTAGCAGTGG TGTATATCTG TATTTCTATA GTATTTACGA AGAGTACGTA
CAGAATTAAA AGAAGATTAA GTGCAGAAAG TCAAAAGTCC CAGGCGTGTG TTAGTACGTA
TCTTCGTGGC GCGGAAACAG CAATACTGAT ACGTGGAACA AAGCCGAACA TTCCCTTTGC
CATTTTCATA GAGATCTTAT GGAGC
 
Protein sequence
MVRHATFDTE PVEIEKPAEV QTEDVNVEKQ SSLHPIGTRT TEGGDTEKGS TRYAESESGL 
LYKKEQKKAE RKLLMKLDVA ILPFAVLLYL SAYLDRGNLA NARLQGLQDE VLDGKDKNYS
IALCCFFVTY IVFSVPGTLM AKQFLPSRSI ACGAMIWSIA ATCQAAAFNK AGLYVCRLFV
GIGESMFGQA MALHFSYWYT KTDLAKRVGL FISAGAVSGA FGGLISFGVS NIKNSPIEQW
RILFLIEGCP SILLAICVFF FMPSKPEKSK YLNEEERTLC LTRLNQENNV EKDLGIDWGG
VKRCLTDWKT YVISIAYSCM NLTLGSVSGF LPTIIKGFGY SNARAQLFTV PPYAVALVFM
LILTSFSDYR QTRGLPAASV FCLGIIGWAI LLAVPADEHY SARYFGCICV VTAGYTNIPL
IMSWQSGCTA NQSQRATSLG MLNTLGQCLS LAAAFLFPSA EGPQYTKGAS INLAFQGLGL
ILTLFMTSYY RWENRRRDMK EGGQPAVGVP IDVKEGYDRA VGFRYVP