Gene CNC00020 details


Gene Information

Locus tag: CNC00020
Symbol:
ID: 3256284
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 4572
End bp: 6132
Gene Length: 1561 bp
Protein Length: 383 aa
Translation table:
GC content: 51%
IMG OID: 638255222
Product: conserved hypothetical protein
Protein accession: XP_569351
Protein GI: 58264390
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
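
The Gene Length value agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive: 6132 - 4572 + 1 = 1561. The sketch below shows that arithmetic; the function name is illustrative and the inclusive-coordinate convention is an assumption that happens to match the listed length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(4572, 6132))  # 1561
```

A 383 aa protein needs only 3 x 383 = 1149 coding bases, fewer than the 1561 bp span, which is consistent with the gene being marked as spliced.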


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATACATCAAT CCAAGTTAAA CGCGTACATG CACAATGGCA TCATTCACTC TCAGTTGGGG 
TATCATCTCC ACCGGCGGCA TCGCCACCAC CTTTGCTCAT GATCTTCTCG TCGACCCGGC
CTCCCGCAAC ACCAAGGACG TCAAACACAG GATTGCTGCT GTCGGCTCCC GCTCCCTCCA
GTCTGCTCAG TCATTCATTA ACAAGCTCAA GGAAAGCTCC GAGGGGAAAT CATGGGCTTG
GGGAGTGAAG AATGGGGTTC TCGACGGTGT CAAGGCGAGG GGTACTTACG AAGAAGTGTA
TAACGATCCT GTACGTAACC TTGGCTCATC TTAAGTGTTA CGATAAGGCG TATTGACGTA
CTTACTCTCA CAAATAGGAT GTCACAGCCG TCTACGTAGG CACCCCGCAT GTTTTTCACC
ACCGTAACGC AAAAGATGCT CTTTTGGCTG GTAAACATGT CCTCCTCGAA AAGCCTGCTT
GTCTCGAAGT TGAAGAGTTG GACGAGTTGA TCAAGATTGC AAAGGAGAAA AATCTTTTCT
TTATGGAGTG AGCAAAGTTG TATCCTTCTC CCTTTGTTTT CCCTCATCTT TTCCTTCGTA
AGCATGTGCG CTGATAAGCA GCGATGCAGA GCCGTGTGGA CCCGATTCCA GCCCATCGCC
TATGCTGTTG AAGAAGTGAT CAAGTCCGGT ATTCTCGGTA AACCCAGACG TTTTTCTGCC
GATTTCTCCA TGGACTGGAA CTTGGACGGT ATGCTCCCTT TCCCCGTCTA ACAATCCATG
AGCATACTGA CTTGACACTA TTTCAGCCTC ACCCGACTCT AGCCGAATGG TCAACCCTGC
TCTTGGCGGT GGTTCACTCC TTGACATGTC CGCCTATCCA TCTGTCTGGG CCATGCTCCT
CGTCCACCGC AACCCTTTCA ACACCGATAA GGACCCCAAG GTGCTCTTCA CCAATCAAAC
GATTTATCCT CGCTCTGGAG TGGACATCAA TTCGAGGTGG TTGGTAGAGT GGAAGGGGCT
TTGTCAAGGA ATGTTGATGA CCGATTTGGA TAATGCGGGA CAGAAGGATT CGACCGCGGT
TTTGCAATGT GAGGAAGGGG ATTTGGTTGT TGCCTGTATG TTCACTCTCT CCGACCCTCA
TTTCTCTCCC CTTCCCTCGC GCTAACTTAT CCTTCCCTCT CCCTCTTTCT TCCTATGTAG
ACCCCCCTTA CAAACCCGAG ACATTCTACA TCCACCCCCG CCCTTCCCGC TTCCCCGGTA
CTATCACCTC CTCAACCACT CATCACCACC CTCTCCACGA TGGGAACAAT GGGATGTCAT
ACGAAGCAGA TGAGGTTGCG AGGTGTATCA GGGATGGGAA GATTGAGAGT GAGAGGATGC
CATGGGAGGA GAGCAGGGTG GTGCAGGGGT GGGTTGATAA GGTGAGGAAG GAAGGGCCGA
CCGAGACTGC CAAGTTGGTG GGGACTGCTG GGCAATAGAG TAGGAAGCGG CAAGTGACAA
GTGACGAAGG TCCTGGTGAG ATAGATCAAA AGTAGAAGGC GAAAACGAAT ATGCAGACAT
C
 
Protein sequence
MASFTLSWGI ISTGGIATTF AHDLLVDPAS RNTKDVKHRI AAVGSRSLQS AQSFINKLKE 
SSEGKSWAWG VKNGVLDGVK ARGTYEEVYN DPDVTAVYVG TPHVFHHRNA KDALLAGKHV
LLEKPACLEV EELDELIKIA KEKNLFFMEA VWTRFQPIAY AVEEVIKSGI LGKPRRFSAD
FSMDWNLDAS PDSSRMVNPA LGGGSLLDMS AYPSVWAMLL VHRNPFNTDK DPKVLFTNQT
IYPRSGVDIN SRWLVEWKGL CQGMLMTDLD NAGQKDSTAV LQCEEGDLVV AYPPYKPETF
YIHPRPSRFP GTITSSTTHH HPLHDGNNGM SYEADEVARC IRDGKIESER MPWEESRVVQ
GWVDKVRKEG PTETAKLVGT AGQ
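
Both sequences are printed in space-separated blocks of 10 residues, 6 blocks per line, so the whitespace has to be stripped before computing lengths or base composition. A minimal sketch, assuming the sequence text has been copied into a Python string; the helper names are hypothetical and not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above: 6 blocks of 10 bases.
first_line = "ATACATCAAT CCAAGTTAAA CGCGTACATG CACAATGGCA TCATTCACTC TCAGTTGGGG"
print(len(clean_sequence(first_line)))  # 60
```

Applied to the full listings, the cleaned lengths can be compared with the Gene Length and Protein Length fields, and `gc_fraction` with the GC content field.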