Gene CNC04680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC04680 
Symbol 
ID3256500 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1414663 
End bp1416532 
Gene Length1870 bp 
Protein Length458 aa 
Translation table 
GC content49% 
IMG OID638255687 
Productthreonine aldolase, putative 
Protein accessionXP_569726 
Protein GI58265140 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGACTCATCT TGCCACCACA TGATCCATTT CCTGCCCTAT GGTCCGCGGC GCTTGCTTTA 
AATACGCCAG CTTGCCAGTA AGGATTTCCG CATCTATCAT CTTTCCAACA AGAACATTAG
CGACGACCGC ACCTATCGCC ATGCCCGTAG CTTCCTCCAA GTCCTCAGCA GACGTGACCC
CCGACGTTGG TGGACAAGCA AACGTTGATC AGTTGCACAG GGTATCTCGT GACTTTCGTA
GTGAGTCTGA CACTGTCACA AAAGACGGCA AAGCATTGGG ACACCAACTG ATGGTCATTA
TTTGCAGGTG ATACTATTAC CATTCCTACA GATGCTCAGC TCCTGTGCTG TCTGAGAGCC
ACGAGAGGGG ACGATGTATA CGGTGAAGAT ACTTCGACAA CAGCGCTCGA AAAACGGATA
GCGAAACTGA CCGGTAAGGA GGCTGCCATG TTTGCAGTCA GTGGTACTAT GACGAATCGT
ACGTGTGCCG AAATAAAGGA AAGCGAGTTT CATACAGTAG CGTACTGACG ATGCCTTGGG
TGATAGAACT GGCCATTAGA ACACACATGA AGCAACCGCC GCACAGCGTC ATCACTGACT
GGCGAGCACA TGTCCACAAG ATGGAAGGTA CGTGCGATTT TCTTATACAT CGGATGCTTT
CTGATGGAGC TTCCCAGCCG GTGGAATTGC CATGTTTTCT CAGGCGACTA CCCATCAGCT
TGTACCGGAA AATGGTTTAC ACTTGACCAT GCAGGATATC GAGCCGGCTT TGCAGCTGGG
TACCAATATT CACATTGCTC CTACCAAGCT TATTTGCCTT GAGAATACTT TGTCCGGCAT
GATTTTCCCG CAAGAAGAGA TTGTAAAGAT TGGGGAAATG GCAAGAAAAC ATGACATTGG
TATGCATCTT GATGGCGCGA GGATCTGGAA CGTGGCTGCC GATGTTATCG CGAAGAGGGG
GCTAAATCCC AACAAAGAGG AGGATCTGCA GACTGTGTCA GTGACTTTAT TTTTTCTTCT
CCTGAATATA CTATCCTAAC TTGTCGCATA GTCTTACAGA ACTTATCGCT CCCTTTGACT
CGGCATCGCT CTGCCTCTCC AAGGGCCTAG GTGCACCGAT CGGTTCTGCG TTGGTCGGCT
CTAAAGAATT CATCGATCGC GCTAAGTGGT TCCGCAAGGC TTTTGGGGGA GGTATCCGGC
AAGCTGGTGG GATAGCTGCG TCTGCGGATT ACGCAATAAC CCATCACTTC CCAAGACTTA
TAAAGACACA TGAACTTGCG TCGCGACTGG AGCAGGGTTT GAGAGAGCTA GGCTGTGATA
TCCTGGCGCC AGTGGACACC AGCATGGTAT GCTACGCTTA TTGTCTGTTC ATCATTTCAT
CCCCATGCTA ATGTTGATGG TTTGCACAGG TATTTTTCCA ATCTAAATCC ATTGGACTAC
CCCTGGACGC TGTCATGGCC AGGCTGGCTG CTCTTCCTGA TCCCATTGTT ATTGGTGGTC
AACGTTGCGT CGTCCACCAT CAGATTAGCC CGCAAGCGAT TGAAGATTTT ATTGGCTGTA
TCGCCGAAAT GAAGAAGGAA AAAGAAGAAA AGGGGGAGTA CAAGGTTACT ACGCTAGGGC
AGGAGGAGAA AGACAAGTTG TCTAGATTTG TAAGTCCGGA GATCAAAAAC GAGACAAGCG
AAGCTAGATT GAGGAAGGAG GCTGCTCTGG GGTATTAATA TTCATGTGCC TTGTTGTTTG
TGCAGCTTGT ACTTTAAGAA CTGTCTGTAT AAATGTAACG GGGGCGTATA GGTGGATAGA
CGGCCAAAGA TGCAGTTTTA CGTATCATGT ATAGTGGTTT CGCATGCAGC GTGCAGCTCA
GCCCACGCAT
 
Protein sequence
MVRGACFKYA SLPVRISASI IFPTRTLATT APIAMPVASS KSSADVTPDV GGQANVDQLH 
RVSRDFRSDT ITIPTDAQLL CCLRATRGDD VYGEDTSTTA LEKRIAKLTG KEAAMFAVSG
TMTNQLAIRT HMKQPPHSVI TDWRAHVHKM EAGGIAMFSQ ATTHQLVPEN GLHLTMQDIE
PALQLGTNIH IAPTKLICLE NTLSGMIFPQ EEIVKIGEMA RKHDIGMHLD GARIWNVAAD
VIAKRGLNPN KEEDLQTVLT ELIAPFDSAS LCLSKGLGAP IGSALVGSKE FIDRAKWFRK
AFGGGIRQAG GIAASADYAI THHFPRLIKT HELASRLEQG LRELGCDILA PVDTSMVFFQ
SKSIGLPLDA VMARLAALPD PIVIGGQRCV VHHQISPQAI EDFIGCIAEM KKEKEEKGEY
KVTTLGQEEK DKLSRFVSPE IKNETSEARL RKEAALGY