Gene CNF03900 details

Gene Information

Locus tag: CNF03900
Symbol:
ID: 3258063
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 1137599
End bp: 1139695
Gene Length: 2097 bp
Protein Length: 506 aa
Translation table:
GC content: 50%
IMG OID: 638257509
Product: aldehyde dehydrogenase (alddh), putative
Protein accession: XP_571348
Protein GI: 58268384
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
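
The gene length listed above follows from the start and end coordinates if they are read as 1-based and inclusive (an assumption; the record does not state its coordinate convention). The short Python sketch below checks that arithmetic and compares it with the nucleotides needed to encode the 506 aa protein. The function name gene_length is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
length = gene_length(1137599, 1139695)
print(length)  # 2097

# Nucleotides needed for 506 codons plus one stop codon.
coding_nt = 506 * 3 + 3
print(coding_nt)  # 1521

# Remainder not accounted for by coding sequence.
print(length - coding_nt)  # 576
```

Since the record marks the gene as spliced, the 576 bp difference is presumably made up of introns and untranslated flanking sequence, though the record does not give that breakdown.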


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACATCCCCA TCTCCTCCCT AACCATATCT GTCTGTGCTA TTTATCACAT CCCCGCACGC 
AAGTTAATTA TCAAGATGGC TCCCACATTC ACACATGAAT TCAACCACGC CGGCTACAAG
GGCAAGGTCG AGGTCCCTAC CGGTATCTTT ATCAACGGCG AATGGGCCTC TTCCGTCGAC
AAGAATGCCA AGACCATTGA GTGAGCTTGC GTCTTTTGCC CATTGCAGTC CATAACATCT
AATGTCCTCC CAGCGTCTAC AACCCCACTA CAGGCGAGGT CCTTACCAAA ATTCCCGAGG
CTTTGGAGGC CGACGTTAAC AAGGCCGTCG AGGTCGCCCA CAATGCTTTC AACAACTCTT
GGGGTCTCAG CGTCCCCGGT TTCAAGCGAG GAGAGTACCT CATTAAGATT GCCGAGTTGA
TGGAGAGGGA CCTTGATATC CTTGCTTCTC TTGAAGCTCT CGACAACGGC AAAACTTTCG
GCGCTGCTAA GGGCTTCGAC GTTATCGAAT CTGCCAGAAC ATTTAGGTAC TACGGTGGAT
GGGCGGACAA GATCCACGGC AAGGTTATTG AGGTGAGACA GATTTTGATA CTTTACTGCA
GAGTCTCTGA CCAACTTTCT CCCAGACCTC TTCTTCTAAG CTCACATACA CCCTCCATGA
ACCTGTCGGT GTCTGCGGTC AAATTATCCC CTGGAACTTC CCTCTTCTCA TGTTCTCATG
GAAGATCGCT CCCGCTCTTG CTGCTGGTAA CACTGTCGTT ATCAAGCCTT CAGAGCTTAC
TCCTTTGACC GCCATGTACA TGACTAAGCT CTTCAATGAG GCTGGTCTCC CCAAAGGTGT
CATTAACGTT GTCGTTGGGT AAGTAGACTC CTCAGTTGAT TCGTTTCCCA TTCTGACTCC
TTGCTAGTTA CGGCCAGACC GTCGGTAATG CCCTTGCTGG TCACCCTGCC ATTGACAAGG
TCGCTTTCAC TGGTTCCACC GCCGTCGGCC GAAAGGTTAT GGAGGAGGCT TCCAAGTCCA
ACATCAAGAA GGTGACCCTT GAGCTCGGTG GAAAGAGCGC CAACATCGTC TTCGAGGACG
CCGACTTCGA GGAAGCTGTC AAATATTCCG CTCAGGGTAT CTTTTTCAAC CACGGTCAGA
CCTGTGTGAG TAGCTTTTGC TTTGAACAGT GAATTTCATG CTGACCAATC ATAGTGTGCC
GGTTCTCGAA TCTACGTCCA GAAGCCTATC TATGAAAAGT TCGTCAAGGC CTTCAAGGAG
CAAACCTCCA AGCTCAAGGT CGGAGACCCC TTCGACCCTA ACACCTATCA GGGTCCTCAG
GTTTCTCAGA TTCAGGCTGA GCGAATCATG AGCTATGTCG ACCACGGCAA ACAGGAAGGT
GCTACTGTCA TCACTGGTGG CAAGCGATGC GGTGACAAGG GTTACTTCAT TGAGCCTGTA
AGTCGCATTC CATCCATACT GTCGGTCGGC ATTTTATAAT GCGGGACGGT GCCCGGGGTA
TTGATCGGTT CCGGAATCCG GTCCAATGTT ACACCGGGAC CATGATCATA CCACTCCTTT
AAGCCGATTG CACTGTATGC TGACATTTGA TTCCTTCCGC TTCTATTAGA CTGTTTTCGG
CGACGTCACC GCCAACATGA AAATCGTGAA GGAGGAGATC TTTGGCCCCG TCGTCGTTGT
TTCTCCGTTT GAGACCGAGG AGGAGGCTCT TGAACATGCC AACGACTCTG TCTACGGTCT
TGCTTCCGCT GTCTTCACTT CCAACATCTC TCGAGCTACC CGAGTCGCCA GTAAGCTCAA
GGCTGGTACC GTTTGGATCA ACTGTTACAA CGAGCTTCAT CCTCAAGTGC CCTTCGGTGG
TTTCAAACAG TCTGGTCTCG GTCGAGAGTT GGGAGAGTAC GCTCTCGAGA ACTACACCGA
AATCAAGGCT GTCCAAATCA ACGTCGGTGC CAAGTGCGCT ATCCCTACTT AAGCGTTGAT
TTAAATCAAA CAAAAAAGGC GGCAGAAGTG GATTCTGAAG TACTTGGTGT AAAAGATCTT
TTCGGGTTTT AAAACTTTTG TATATATTAG TACTGGTCCA TGCAGCAGTA TTTGTCA
 
Protein sequence
MAPTFTHEFN HAGYKGKVEV PTGIFINGEW ASSVDKNAKT IDVYNPTTGE VLTKIPEALE 
ADVNKAVEVA HNAFNNSWGL SVPGFKRGEY LIKIAELMER DLDILASLEA LDNGKTFGAA
KGFDVIESAR TFRYYGGWAD KIHGKVIETS SSKLTYTLHE PVGVCGQIIP WNFPLLMFSW
KIAPALAAGN TVVIKPSELT PLTAMYMTKL FNEAGLPKGV INVVVGYGQT VGNALAGHPA
IDKVAFTGST AVGRKVMEEA SKSNIKKVTL ELGGKSANIV FEDADFEEAV KYSAQGIFFN
HGQTCCAGSR IYVQKPIYEK FVKAFKEQTS KLKVGDPFDP NTYQGPQVSQ IQAERIMSYV
DHGKQEGATV ITGGKRCGDK GYFIEPTVFG DVTANMKIVK EEIFGPVVVV SPFETEEEAL
EHANDSVYGL ASAVFTSNIS RATRVASKLK AGTVWINCYN ELHPQVPFGG FKQSGLGREL
GEYALENYTE IKAVQINVGA KCAIPT
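
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before lengths or base composition can be computed. The Python sketch below shows one way to do that and to compute a GC percentage. The helper names clean_sequence and gc_percent are hypothetical, and the two sample strings are the first gene sequence line and the last protein sequence line from this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence and last line of the protein sequence.
first_gene_line = "CACATCCCCA TCTCCTCCCT AACCATATCT GTCTGTGCTA TTTATCACAT CCCCGCACGC"
last_protein_line = "GEYALENYTE IKAVQINVGA KCAIPT"

print(len(clean_sequence(first_gene_line)))    # 60
print(len(clean_sequence(last_protein_line)))  # 26
```

Passing the full gene sequence to gc_percent gives a figure that can be compared with the GC content listed in the record; how the record rounds that value is not stated.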