Gene CNF01620 details


Gene Information

Locus tag: CNF01620
Symbol:
ID: 3258477
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 478721
End bp: 480649
Gene Length: 1929 bp
Protein Length: 397 aa
Translation table:
GC content: 52%
IMG OID: 638257287
Product: nucleotide-sugar transporter, putative
Protein accession: XP_571496
Protein GI: 58268680
COG category: [G] Carbohydrate transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5070] Nucleotide-sugar transporter
TIGRFAM ID: [TIGR00803] UDP-galactose transporter
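
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `coding_length` are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive, which is consistent with the reported gene length. It also shows why the spliced gene is longer than three times its protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting the stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


span = gene_length(478721, 480649)  # values taken from the record above
cds = coding_length(397)            # 397 aa plus one stop codon

print(span)        # 1929
print(cds)         # 1194
print(span - cds)  # 735 bases not accounted for by codons
```

The 735 bp difference is what one would expect to be taken up by introns and untranslated sequence, in line with the "Is gene spliced: Yes" field.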


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAGGAATAAC CATCCCGGCA CCATGTCCAA ACCTTTCGTG CCCACACCCA ACATCTCTCG 
CCCAGCCACT CCCTCCTCCC TCGACTACGG CAAGGACGAG GCCTCTTCCA CCCTCCTGAG
GGACATGGGC GAGCGGGGAG ACAGAGAGAG GAAAGACAGG GAAGAGAGGG ACAAGAAGGA
GGCTATGCCG TCGGGACAGG ATCAGGGTGA GTCTGGCCTC TGACTGACTG CGCATGTATG
GAGGGAGAGA CCAAGCTGGA CACGGCGACT GGACTAGATG TATTGGCGGA TGACATGGGA
ACGCGCCCTG CCATGCGCTA TATCCCGTCT CGCTGCCACG TATGGCGTGG GCTGAGGATA
GACAAGTGCT GACCACGCGT CCTAGTCCTC CCCATCCTCT CGTACTGTGC TGCCAGTATC
ATGATGACCG TCGTCAACAA GGTGAGCCGT TCTGTCTGTT CCGGGTGCTG CGGACTGCGC
GACGCGTTTC AACGCTGATC ATTACATCCG CAGTACGTCG TGTCCGGCGC GAACTTCACC
ATGACCTTTT TGCTGTTGGC TATCCAATCG AGTGTCTGTG TTTTGGCCGT CACTACCGTG
AAGAAGCTGG GTTTCATCTC TTGTTAGTCG ATCGGCATGC TACATGTAAA CAAGTCGGGC
TGATAGGCTG GGTTCTATCT AGTCCGTGAC TTTGACAAGA ATGACGCCAA GGCCTGGTGG
CCCATCTCTA CATTGTTGGT GGCTGTCATC TACACTGGTT CAAAGGCTTT GGTAAGTTTA
GTGGGGTTTT GAGTTGCAAT CGATGCTAAC AAAGATGTAG CAATTCTTGT CTATCCCCGT
CTACACGTGA GCCGGCCGAC CATCTTGAAA ATGTGGACTG GACTGATCTC GTTTGTAGTA
TCTTCAAGAA CTTGACCATT ATCCTCATTG TCAGTATCGA CACCATGGTT CAAGCCTTAA
AAGCTAATCT CTTTTATAGG CCTACGGAGA AGTGTTTATG TTCAACGGTG CCGTCAGTGG
TCTCACACTC TGTTCATTTG CTCTCATGGT GAGTACATTG AGTAAGGTCT AGGCATGTCG
CTGATAATAT CCCAGGTTGG CTCTTCCATC ATCGCCGCCT GGTCCGATAT CACTTCTGTG
TGGAACAAGG AGCCTGAGCT TGACCCTATT ACCGGTCTCG AGATTACTGT TGGCCCCGTA
TCTACGATTG GTGGCCTTAA TGCTGGTTAC ATTTGGATGG CGCTCAACTG TTTCGTCTCT
GCTGCCTACG TACGTACACT GTTTTGGAAT GATGGGCATA GCTGACAATA TATTTGACAA
TCCAACAGGT TTTGTTCATG CGAAAGCGAA TCAAGGTCAC TGGCTTCAAG GACTGGGACT
CTATGTATTA CAACAACCTT CTCTCCATCC CCATCCTTGT CGTCTTCTCT CTTGTCATCG
AAGACTGGGG TTCTGAATCT CTTGCCCTCA ACTTCCCTGC TTCCAACCGT GTGCTCCTTC
TCTCCGCCAT GGCCTTTTCC GGCGCCGCTG CCGTCTTCAT TTCATACTCT ACCGCCTGGT
GTGTTCGTAT CACTGGTTCC ACAACATACA GTATGGTCGG AGCTTTGAAC AAGTTGCCTG
TCGCCGCGAG CGGTATCTTG TTCTTTGGTG ACCCCGCCAA CTTTGGTAAC ATCTCGGCCA
TCGCTGTTGG TGGTGTCGCT GGTGTGGTGT ACGCTGTGGC CAAGACTAAC CAGGCAAAGG
TAGAGAAGGC TAGGCAAGCA AGGGCCGCGG GTGGTAGGCC ATGAGGTGCT TTTGAAAAAC
AGGGGACCAG AAGTACGTTT GGTGATGATT CACGGTGTAT ACACATTGTG CAAAACGGGA
TTATTTAGAG AGAGCTCTAC AGAGAGCGCT TGTTGCTTTG ACCATGCATA GATAACAATC
CGTTTGATA
 
Protein sequence
MSKPFVPTPN ISRPATPSSL DYGKDEASST LLRDMGERGD RERKDREERD KKEAMPSGQD 
QVLPILSYCA ASIMMTVVNK YVVSGANFTM TFLLLAIQSS VCVLAVTTVK KLGFISFRDF
DKNDAKAWWP ISTLLVAVIY TGSKALQFLS IPVYTIFKNL TIILIAYGEV FMFNGAVSGL
TLCSFALMVG SSIIAAWSDI TSVWNKEPEL DPITGLEITV GPVSTIGGLN AGYIWMALNC
FVSAAYVLFM RKRIKVTGFK DWDSMYYNNL LSIPILVVFS LVIEDWGSES LALNFPASNR
VLLLSAMAFS GAAAVFISYS TAWCVRITGS TTYSMVGALN KLPVAASGIL FFGDPANFGN
ISAIAVGGVA GVVYAVAKTN QAKVEKARQA RAAGGRP