Gene CNG02950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG02950 
Symbol 
ID3258931 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp812368 
End bp813820 
Gene Length1453 bp 
Protein Length351 aa 
Translation table 
GC content51% 
IMG OID638257918 
Productconserved hypothetical protein 
Protein accessionXP_571984 
Protein GI58269656 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAGACTTGT ACTAGGCGTT GAGATACCGA TACACCACTT CACCAAAAGG AGTTAATTAA 
GGTAAGCGGG CCGTTTTGGG TCCGGTGCCT CTTCGGGCCG GAGCTACCAT GGAGGGGCCG
GCTGGTTGTT TTCGGGCATG GGGGATAGCA GAGACACTCA TCTCTATAAA ACCCCCAGCT
GCAATACGCA TCAGTAGCTT TCCCTCGGTA CATATCCCTA TTACGAGTAT ACACTACTGA
AATCAATAAT GTCATCGCTC AACGGTCATT CCAACGGCAG CAAAGGTGCC ACCCACAAGA
GAGTGCTCAA ACCAGGCGTC TGGGCCCCCA TCCCCACTTT TTTGGATGAC AAGGAGGAGC
TTGGTGAGTG GTAATACCTA AACTTTACAA ATTTTCCGAA ACTTACTCGA TAACTGGAAA
TGGTTTAGAT ATCTCCACCT TCAGAAAACA TGTTGTTGAT CTTGCTAAAA TCGGCATGCA
GCCTGTCATT TGCGGGTCGA TGGGTGAAGC TTTCCAACTC ACAGACGATG AACGAGTAAC
TCTCTTCAAG GAGACCCGGG CTGCTCTGGA TGAGGCTGGG TTGCTCGACA CTGTGGTGAT
CGCCGGAACG TAAGTTCAAT ATTAAACGCA ATCGACCAGG ATGAGCTAAT CATCAAGTAG
AGGAGCTAAT TCCACTCGAG CGACCATCAA TCTCTGTCAT TTGGCTGCCT CTTCTGGTGC
CGATGTTGCC ATCGTTATCC CGCCCGGTTA CTTCGCAGGA GCCATGACTC CCCTTGCCCT
CAAGACCTTC TTCCTTGAAG TCCAAGCCTC TTCCCCCATT CCCGTTATGG TGTACAACTA
TCCAGGTGCT GCTGGTGGCA TCGATCTCTC TTCCGACCTC ATCGAAGAGA TCGCCAAGAA
AGGCTCCAAC ATTTGCGGCG TCAAGCTCAC TTGCGGAGCC GTAGGAAAGC TGACAAGGAT
ATCTGCTGCT ACTGCCACTC CGGCCTTTGC AGACTATCCC AGGAAAAGTG ACGTCGCACC
AGAGTTCCTC ACTCTTGGTG GGTTTGCAGA CTTCCTCGCG CCCGCCGTGC TGGGTGGTAG
AGGCCATGGC GCTATCATGG GTCTGGGCAA CATCTATCCT CGTTCGCTTG CCAGATTGTT
TGAGCTCTCT TACAAGATTG CCACAGACGC CCAGCCTTCT GCCCAAGATC TGAAGAAGGT
TCTCGAGTTG CAAGATCTGG TTTCCGGTGC CGATGCATCC TTTGCAAGGG CAGGGATTGC
TGGAACCAAA TGGTACCTCA AGACTCACAG TGGTTATCCC TCTGCAAGGT TGAGGCACCC
CTTGTTGGAG TTTACGGATG AGCAAGGACG GGCATTGGAG AAGGAAGAGG CAGTTGTCAA
GTTGATGGAA GTCGAGAAGA GCTTGGCCAA TAGCCAATGA AAGTTACACT AGAGATAGCA
TTTATGTAAC AGG
 
Protein sequence
MSSLNGHSNG SKGATHKRVL KPGVWAPIPT FLDDKEELDI STFRKHVVDL AKIGMQPVIC 
GSMGEAFQLT DDERVTLFKE TRAALDEAGL LDTVVIAGTG ANSTRATINL CHLAASSGAD
VAIVIPPGYF AGAMTPLALK TFFLEVQASS PIPVMVYNYP GAAGGIDLSS DLIEEIAKKG
SNICGVKLTC GAVGKLTRIS AATATPAFAD YPRKSDVAPE FLTLGGFADF LAPAVLGGRG
HGAIMGLGNI YPRSLARLFE LSYKIATDAQ PSAQDLKKVL ELQDLVSGAD ASFARAGIAG
TKWYLKTHSG YPSARLRHPL LEFTDEQGRA LEKEEAVVKL MEVEKSLANS Q