Gene CNC06820 details


Gene Information

Locus tag: CNC06820
Symbol:
ID: 3256557
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 1994159
End bp: 1996565
Gene Length: 2407 bp
Protein Length: 604 aa
Translation table:
GC content: 50%
IMG OID: 638255902
Product: Arylsulfatase precursor, putative
Protein accession: XP_569926
Protein GI: 58265540
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
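
The reported gene length agrees with the start and end coordinates if those are read as 1-based and inclusive. That convention is an assumption, since the record does not state it, and the helper name `span_length` is hypothetical. A minimal sketch of the arithmetic:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates.

    Assumption: both endpoints are counted, so the length is end - start + 1.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp and End bp fields of this record.
print(span_length(1994159, 1996565))  # 2407
```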


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TGTACATAAG GACCTGGCCA TATTTTCCTA TTCAGACAAC CCCGAGGGAC GACTCTCAGA 
ATGCTAGGAC ACAAAGCGCT CCTCCTTGCT AGTGTAGGCT CGCTTCTGCT CTCTGCGGAC
GCTCTCGCTA TCAACAAGAA GCCCAACATC ATCGTAATCC GTGGGTCTCA ATTGCGTGCC
TGGCGGTGAT TTCGCTAACG ATTACCAAAA AAAGTTACGG ACGACCAAGA CGTCAGTACG
CTTGCAAAGC GTGAATACCT TCCTCGCATC CACGAGCATC TCGTCGACGA GGGTGTCCTC
TACGACAACT TCTTTGCGCC CGTATCCATA TGCTGTCCCA GTCGAGTGTC TTTGCTCCGG
GCTCAGTACG CTCACAACCA CAACGTCACT TTTGTCAGTG CTCCTTGGGG TGGTTGGGAT
GTCTTCAACA AGCTCGGTTA TGTTGGGCAT ACCTTGCCTG ACTTTGTTCA GGCTGCTGGG
TACAACACTT ACTACACCGG CAAGTTCATG TAAGCCTTGC ATGCTAACTC CCGAACCAGT
AACTAATCCG CTGCATAGGA ACGACCATAC CGACGCCAAT TGCGAGTCCT TGCCAGTTTC
TGGGTTCAAC TCTTCCGACA TTCTTGTCGG TGAGTGACTC CCTCATCCAG ACCTTGGGTA
AAATAGCTTA CAAGTCGCAG ACCCTTACAC CTATGACTAC TGGACTCCCG GTTTCTCTCG
AGACAGTATG TACAATCACG CCTGGACGAA GAAAAAAGAA AACCAATGGC TAATAATTAC
CGCAGACGGT CCTGTCAAGG TTCACGCCGG CGAATACTCT ACGGACCTCG TCCACGAAAA
GGCAGTCAAC TACCTTGAAG ATGCTCTTCA GGAAGACCGA CCTTTCTACC TTACCGTTGC
TCCCGTCGCT TGTCACTCCT GTACGCAATC ACATGACTAC CAAGTACTCC AAGCTAATAC
CACTTTCAGG GCTCGACTAT AAGCAGCAGG GTGACATAAA AAAATTCGTT ACCGACATTC
CTGCAAGTCA CCCCCGACAT GCCAGGCTTT TCCCTGTCGA GCAGATTGAA AGGACTGAGA
ATTGGAACCC TGATGAGCCT TCTGGTGTTT CCTGGGTAAA GGAGCTCCCC AAGCTTGTAA
ATATTTGCCC TCGGCGATGT TGAGAACAAA ATGTTGACGG TGCCTAAATA GAACTCTACT
GAAGAGGCTT ATCTTGATGA GTTCTTCCGT GGAAGACTTA GGGCGTTGCA AGCTGTCGAC
GAGCTAGTGG AGGATATCGT TGACAGGCTT GAAAAGGCCG GCGAGCTCGA TAACACATAT
ATTTTCTACA CCGGTACGTG AACCCCAAAA TTACTAGAAA TATATCTGAT CATATACTTC
ACAGCGGACA ACGGTTATGC TCTTGGCTCC CATCGCCGTC AGGTAAGTAC TCCCCGCTCC
AGTCACAAGT TCGTGCCTGA TCTTCTACTA TCAGCCCGGT AAGACCCTCG GTTTCGAAGA
AGACATCCAT GTCCCCTTTA TCGCCCGAGG TCCTGGTATC AAAAAAGGTT TCCGCGACAG
TCTCTCATCT TACGGCATGG TCGACCTTTC TCGGACCATC CTCGACATTG CCGGCGCCAA
CCCGGATTAC ACGGACGACG GACGAAAGAT CAACCTCCAC CAACATGGGG AAAAGAACCT
TGAGCACCAG ATCGCCCGAC ATGCTATCAG CGAATACTGG GTTTTGGGTG CGGATGAGGG
CGTGTTTGGC GGGCACACTC GACTGAACAA CAGTAAGTAG TCGGAAGAAC TTCAAATGAT
CGTCCTTACA CAAAAAGAGT TTCAAACTAA TAATATGAAG CCTACCGAAC TCTCCGTATC
CACGATGAGC ACGACGGCAA GTCTCACTCT CACTCGTATT CCGTATGGTG TACCGGCGAG
CGAGAACTCT ACGACCTCGA AAAGGATCCT AAGCAGATTA ACAACCTCCT TTCTCCTCTT
AACGAACTCG GCGCTTTCGC TCCTTTCAAC TCTACCGCTT CCAATGGCGA ACCTGTCCTC
GTCCGCCATC TCCAACATCT CCTTAACCGA CTTGACGCCG TCCTGCTCGT TCTCAAACGA
TGTACCGGCG AAGCATGCCA CAACCCTTAT CGCGAACTCT TCCCTTCATC CCAAGCTACC
GGCGGGGAAA TCTTCAAGTT CTCTCAGATT CTTGAAAGCC GATTCGATGA CTTTTTCAAA
GACCTGCCCA AGGTTCAGTT TGATAAATGT GCATTAGGAT TCCAGGAGGA GCTGGAGAAG
CCTGATTGGA AGAAGGAGTG GGCGTATGGT TCGGATGCGC GTGATGCGTC TATTGGTGGG
GGGATTGTTT TCCAGGATTT TTGATGTAGA CGAAGATGGA ATAGAAGTAG TATAGTTGAA
GTAGTTT
 
Protein sequence
MLGHKALLLA SVGSLLLSAD ALAINKKPNI IVILTDDQDV STLAKREYLP RIHEHLVDEG 
VLYDNFFAPV SICCPSRVSL LRAQYAHNHN VTFVSAPWGG WDVFNKLGYV GHTLPDFVQA
AGYNTYYTGK FMNDHTDANC ESLPVSGFNS SDILVDPYTY DYWTPGFSRD NGPVKVHAGE
YSTDLVHEKA VNYLEDALQE DRPFYLTVAP VACHSWLDYK QQGDIKKFVT DIPASHPRHA
RLFPVEQIER TENWNPDEPS GVSWVKELPK LNSTEEAYLD EFFRGRLRAL QAVDELVEDI
VDRLEKAGEL DNTYIFYTAD NGYALGSHRR QPGKTLGFEE DIHVPFIARG PGIKKGFRDS
LSSYGMVDLS RTILDIAGAN PDYTDDGRKI NLHQHGEKNL EHQIARHAIS EYWVLGADEG
VFGGHTRLNN TYRTLRIHDE HDGKSHSHSY SVWCTGEREL YDLEKDPKQI NNLLSPLNEL
GAFAPFNSTA SNGEPVLVRH LQHLLNRLDA VLLVLKRCTG EACHNPYREL FPSSQATGGE
IFKFSQILES RFDDFFKDLP KVQFDKCALG FQEELEKPDW KKEWAYGSDA RDASIGGGIV
FQDF
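
The sequences in this record are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks have to be removed before a length or GC content can be compared with the Gene Length, Protein Length, and GC content fields. The following is a minimal sketch of that clean-up. The function names `join_blocks` and `gc_fraction` are hypothetical, and the GC calculation assumes a plain G+C count over all bases, which may differ from how the database derived its rounded figure.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input).

    Assumption: a simple G+C count divided by the total length.
    """
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "TGTACATAAG GACCTGGCCA TATTTTCCTA TTCAGACAAC CCCGAGGGAC GACTCTCAGA"
seq = join_blocks(first_line)
print(len(seq))  # 60
```

Passing the full gene or protein listing to `join_blocks` and taking `len()` of the result gives a value that can be checked against the length fields above.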