Gene CNH01690 details

Gene Information

Locus tag: CNH01690
Symbol:
ID: 3259214
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 655690
End bp: 658662
Gene Length: 2973 bp
Protein Length: 674 aa
Translation table:
GC content: 51%
IMG OID: 638258319
Product: conserved hypothetical protein
Protein accession: XP_572346
Protein GI: 58270380
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID:
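
The length and coordinate fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive. The record does not state this, but the assumption reproduces the listed gene length. The function names are hypothetical helpers, not part of any database API. It also compares the gene span with the number of bases needed to encode the listed protein. The difference is consistent with a spliced gene, though the record itself does not list the intron positions.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(655690, 658662)  # coordinates from the record
cds_len = coding_bases(674)             # protein length from the record

print(gene_len)            # 2973
print(cds_len)             # 2025
print(gene_len - cds_len)  # 948 bases not accounted for by coding sequence
```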


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.887535
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGACA CACCGCCACA CAAACACCAC TACACCTCCC TCCGCTCCCT CATGCACAGC 
CCAGATCCCG GCAACCCCGC CAAGCGTGAC TCCCTCACCC CGCGCCAGCC ACGCCCAAGT
CGCAACAGCA GCAGCTCACG CAGCCGCGAA CGCAGGCTGG ATCCTCCCAG AAAGGCAGAC
AAGGACAAGG GCCGCTTATC GACATGGAAA CTCATGTGCC TCACCGTGTC CATGGGCGGG
AGCCAGATCG CGTGGACAGT GTACGTGCTC AACGATCACC TGCAGCTGCC TGCTGACCTG
CCACCAGGGA ACTGGGATAC GGAACGCCCT ATCTCCTCTC GCTCGGCCTC TCGGAACAGC
TCACTTCTCT CGTCTGGCTC GCCGGGCCTA TATCCGGCCT CATCGCCCAG CCGCTCATCG
GTGCCATTTC AGATTCCTCG CATTCTCGCT ACCGGAGGCG GTACTGGATT GTAACATCGA
CCATGCTGCT CGTATTTAGC GGTCTCGGTC TGGCTTTCAC AGAACCTATT GCCAAAGCTT
TGGTCGATTT GATAGGCGGC GGCCAAGGCG ACTGGGATCC AAAGACTATC AGATTGGTAT
GTCATTCAGT TTTGCCGCCT TGTAAACGAT CAAAACGGGA TTGTGGCTTA CACATGCAAT
TAGGTGAAGA ACACGGCTAT CGGTATAGCC GTCTTCTCAT TCTACTGCCT TGACTTCGCC
CTTAATGCGT GAGTAATCCA TGCGCCTTAT GCATCAGTCT GATATCCCGG CGCAGTCTCC
AAGCTTCCCT CCGCAATCTT GTCCTCGACA TCACGCCAGG TGAACAACTT GCCACCGCCA
ATGCTTGGCA TGGACGCTTC AACCATGTGG GTAACATTGT GGGGTTCACC ATGGGTTCGT
CCGTCTCCTT GATCCCAGAC CGGCTTCTTT GGTCTGAATA CTCATCTGGA TACTCTGCAG
GTTTCTTAAA TCTCAGTCAT GTACCGATTA TCCGTTTGGT CGGAGGCGGT CAGTTTCGTA
AAGGTACGTC TGGCACCCCA CACACCTGCG TTTAATAGCT GACATTATAT CCAGTATGTA
TCGTGGCGTT GGTACTGTTG GTCATGACCG TGTGGATCAC GTGTTGGACG CAAGAAGAAA
AGGAAACGGA TAGTATTTTT GGCGAAAGGC GCTCGTACGT TGTTATATGC TCGCGACTCT
GTCATACGCT GACATCAAAC ACCCCTATAG GAAAATACGA GATGTAGTGG GTACAATCTA
CGAAGCGGTA CTCCATCTTC CAAAACCCAT TCGCCGAGTT TGCATCGTAA GTCTTTCCCC
CATACCTTCT TTCCAAAGAC CACTCAAACC TAAAATTGCA TCGGGCGCGG AAAGGTACAA
ATCGCCGCAT TCATGGGATG GTTCCCGTAT CTTTTCTACT CTACCACTTA CGTCGCCGAA
GTCATGGCCA AAGAATTACA TCATAAACCT GATATCGATC GAGCCACCCG AGCTGGTAGC
CTGGCCCTTT TGATCTATTC TTTCGGTAAG TGCTGCGTCT TTCTATATTA CTATACAAAT
TTGCTCAACT GCCAACACCA TTTGATCAAA AAAAAAAAAC AGTCGCCATC ATCGCTGGGA
CACTTCTCCC TTACCTCGCC GCGCGAGATC GCCGACTGCT CAAACCCACT TCGGAAAAAC
TGCGAGATGG CGAGATTGAG ATTGAAAACG AAGATGAAGA AGATGAAGAG CATGTGGAGA
TGGAGAGGAT CAGAGAGATG GTACAGCAGT GGAAAGCCGA GGCTGCTAGA GAGGGAAGAC
CTCTGAAATT GCCTACTAGT AAGTTGCAGC TTCCGTTAAA ACACACACAT ACGGCGTGTG
CTGATATATC ATCCGGTCAT TTCCAGTGCC GTTCATGTTG AGAAATATTT GGACGGCGGG
CTTGGTCATT TTCGGGTGCT TGATGATGTC CACGTTCTTC ATCACAAAGG TCTGGCAAGC
AACAGTGATG ATCGCTTTGG TAGGCATCTG TTGGGCTATT GCTTGTTGGG TGTGAGTGGT
TTTATTTTGG TTGAAAGGCA AAAAAAGGTT TCAAGAACTG ATGGAGAAAT TGGCGTCTTA
TTAGACCGTT TGCGTAAGTG TTACGATTTC ATTTTCTTCC TCAGTGTATA TTTACTCATG
AACAAATGAT CGATGGATAT CGCAGAATCA TCATGGAGGT CAGTTTTGGA TTTCCACTTA
TTAAACCGCA CAACCCCCCA GACCCAAACC GATTGACTGA CAAACACCCC CAGTTCCTCA
AAGAGCTCGA CGACAAGCCT CCTCCCCGAA TATCAGACGG TCGCCCCCGT CCCACCCACG
CGCGCACGGC TTCCACCCCC CTCGGCTGGC GATCACATCC CACCAGCCCT GCAGGCCGCG
CCTCCCCCGA CGAACGTACA CCACTTGCCA GAAGCTACTC GACCGCCGAT CTTGATGGCG
CTAATGAAAT GGAATATACC GGCCAGGGAC CAGTAGCTGG CGGGACTATT ATGGGTATCC
ATAACCTCGC CATCGTTTTC CCTCAATTTA TCGCAAGTCT AATTTTACCT CCTTTCTTGC
GTTATCAGCT AATAGTGGTT GTCGTGCGAT CATTTTTTTT TTAGATTGCA GTCGTAGCCT
CTATCATTTT CAAGCTAGCC GACACTCAGC CCGACATCCA GCCCACCTCG CCCGAAATCG
GTGGACCGCA CGGTCAAGAT AAGAATGGAG TCGCCTGGGT CCTCCGGTTC GGCGGGTTGA
TGGCGTTTGT GGGGGCTTTG GTATCGAGGA AAGTACCGCC TACCAAGACT GAAAAGGCGA
TGAGGAGGAG ATTGGCGGAT ATGAGGGAGG AAAGTGCAGA GTGAGAGATG TAGGCGGCGA
GCGAGAGAAA TGTCAGAGTG GAAAGGTCGG ATTGGGACGT GTTATGTGTA TATATGCGGA
GGAGAGTTGA CCGAGTTTAG TCTTGTTATT TAC
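
The sequence above is displayed in space-separated blocks of ten bases. A minimal sketch of how such a display could be joined into one contiguous string and its GC fraction computed is given below. `join_blocks` and `gc_fraction` are hypothetical helper names, and the GC calculation simply counts G and C over all bases, which may differ from how the database derives its reported GC content.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C (plain count, no ambiguity codes)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGCCAGACA CACCGCCACA CAAACACCAC TACACCTCCC TCCGCTCCCT CATGCACAGC"
seq = join_blocks(first_line)

print(len(seq))                   # 60
print(gc_fraction("ATGCCAGACA"))  # 0.5 for the first ten-base block
```

Applying `join_blocks` to the full gene sequence and taking `len` of the result gives a quick check against the listed gene length.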
 
Protein sequence
MPDTPPHKHH YTSLRSLMHS PDPGNPAKRD SLTPRQPRPS RNSSSSRSRE RRLDPPRKAD 
KDKGRLSTWK LMCLTVSMGG SQIAWTVELG YGTPYLLSLG LSEQLTSLVW LAGPISGLIA
QPLIGAISDS SHSRYRRRYW IVTSTMLLVF SGLGLAFTEP IAKALVDLIG GGQGDWDPKT
IRLVKNTAIG IAVFSFYCLD FALNALQASL RNLVLDITPG EQLATANAWH GRFNHVGNIV
GFTMGFLNLS HVPIIRLVGG GQFRKVCIVA LVLLVMTVWI TCWTQEEKET DSIFGERRSK
IRDVVGTIYE AVLHLPKPIR RVCIVQIAAF MGWFPYLFYS TTYVAEVMAK ELHHKPDIDR
ATRAGSLALL IYSFVAIIAG TLLPYLAARD RRLLKPTSEK LRDGEIEIEN EDEEDEEHVE
MERIREMVQQ WKAEAAREGR PLKLPTMPFM LRNIWTAGLV IFGCLMMSTF FITKVWQATV
MIALVGICWA IACWVPFAII MEFLKELDDK PPPRISDGRP RPTHARTAST PLGWRSHPTS
PAGRASPDER TPLARSYSTA DLDGANEMEY TGQGPVAGGT IMGIHNLAIV FPQFIIAVVA
SIIFKLADTQ PDIQPTSPEI GGPHGQDKNG VAWVLRFGGL MAFVGALVSR KVPPTKTEKA
MRRRLADMRE ESAE