Gene CNC06850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC06850 
Symbol 
ID3256764 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp2002112 
End bp2003622 
Gene Length1511 bp 
Protein Length361 aa 
Translation table 
GC content49% 
IMG OID638255905 
Productconserved hypothetical protein 
Protein accessionXP_569928 
Protein GI58265544 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0423295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGCCG CCTTTTCCAA CAAGCTCAAA TACGCTGTCC TCGGTGCTGG CCGTATGGGC 
CAGCGTCATG CTCTCAACGT TGCCTTCAGA TCTCCCCGTG CAGAACTGGT AGCAGTAGCC
GACCCCAAAC CTTCTATTCC TCAATGGATG AAGGACAACT TGCCTCCGAG TACCAAGTAC
TTCGAGAACT ACGAAGACTG TCTCGTGAAC AGTGGAGCAG ATGCTGTTCT AATTGCGAGT
GCGACAAGCT GGCACGCTCC CATGGCTATT GATGCTATGC ATGCTGGCAA GGTGAGTTCA
TTAATGTGAA CTGTGCTGTT CTATGCTGAC TTCTCATTAG CATGTCTTAC TGGAGAAGCC
TATTTCCATC GATCTCGAAA CCTCCAGGAG TGTCGTTGCG GAGGCTGAAA AATTCCCAGA
CTTGAAGGTC ATGATTGGCT TCAGTCGCCG ATGTAAGTCC GAACCGGTTC CCCTACCAAG
AGAAGCCCTG CTAAACATAT CATTTTGAAG TTGACGAGTC TTACCGACAG GCGAGGAAGA
TGATTGAAAA CGGACAACTA GGCAAGGCCC ACTTGATCAA GTCTGCTACC AACGATCAGT
ATGACCCGTC CGGATTTTTC GTCTCCTATG CAGCCGCTTC CGGTGGCATT TACATTGACT
GTGGTATCCA CGACATTGAT TGCGCCCGAT GGCTCCTTGA CGCCTCTCTT GGTATTCCCA
ACCCCAAAAA ACAAGTCCGC CGTGTATTTG CTGCGGGCCA CAACATCCGG CACCCCGAGC
TTGTCCAGGA CAACGATGTC GACAACGCAG TAGGGTTTGT GGAGTTTGAA AATGGCAAAA
TGCTGGTGTT ACACCTGAGC AGGACTTCTA TGCATGGTCA CGATTGCTTT GCTGAGGTTT
TCGGAACGGA CGGAAAGGTA ATCGTTAACG GAGTGAGTTG TCATAGGTTA ATCCGGATGC
GGTGGAGGCT GACGCGATGT TGCAGAACCC TCAGCTTAAC CGAGTGGAGA TTCGCGATGT
TCACGGTGTC CGTAGCGAGT CGACGTGAGT AGTTTTATCA GATATCTGGT GGCAGAGGAG
ATAATCCGCT TACATATAAT TAGCCCTACC TATTACGAGC GTTTCAAGGA TGCTTTTGTG
ACAGAGATCA ACGAGTTTAC TTCCGCCGTC CTCGATAACA AACGTACGTG CTTCACCTAG
TCTTGTCTGA TATCTATTAA CTCGATACTT CCCATAGCTC TCCCAGTTAA CGCCATCGAT
GCTCTTGAGG CAAGCAAGAT TGCGACCGCT TTGACACACT CCTTCAAAAC CAATACTCCG
GTCTTCTTCG ATGACGAGGG CGAGCCGATA TTGGCGTAAT TGTGAAAGAA GACGGCGTTC
GTTTGGGGTT ATAAAAAGTG TGGGGACCGT GCAATTATAC AATGCAATTA GGAAGATAAT
TTGAATAGAC TAAGAAATAG AGAAAAAGGT AGACGCAAGG AAAGAAAGAA ATGCAAAATA
GATAGCGAGT C
 
Protein sequence
MSAAFSNKLK YAVLGAGRMG QRHALNVAFR SPRAELVAVA DPKPSIPQWM KDNLPPSTKY 
FENYEDCLVN SGADAVLIAS ATSWHAPMAI DAMHAGKHVL LEKPISIDLE TSRSVVAEAE
KFPDLKVMIG FSRRFDESYR QARKMIENGQ LGKAHLIKSA TNDQYDPSGF FVSYAAASGG
IYIDCGIHDI DCARWLLDAS LGIPNPKKQV RRVFAAGHNI RHPELVQDND VDNAVGFVEF
ENGKMLVLHL SRTSMHGHDC FAEVFGTDGK VIVNGNPQLN RVEIRDVHGV RSESTPTYYE
RFKDAFVTEI NEFTSAVLDN KPLPVNAIDA LEASKIATAL THSFKTNTPV FFDDEGEPIL
A