Gene CNF01050 details


Gene Information

Locus tag: CNF01050
Symbol:
ID: 3258411
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 320522
End bp: 322222
Gene Length: 1701 bp
Protein Length: 405 aa
Translation table:
GC content: 47%
IMG OID: 638257228
Product: hypothetical protein
Protein accession: XP_571546
Protein GI: 58268780
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0001] Glutamate-1-semialdehyde aminotransferase
TIGRFAM ID:
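
The Gene Length value can be cross-checked against the Start bp and End bp values. The record does not state its coordinate convention, so the sketch below assumes 1-based, inclusive coordinates, which reproduces the listed 1701 bp. The helper name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


gene_length = span_length(320522, 322222)
print(gene_length)  # 1701, matching the Gene Length field
```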


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATCCC TTACGCAGGC CCTCAAACGT GCCCAACAGA CCTATGCCAA CCGTAACCGC 
AAGTCTTATG AGGCACATCA GGCTGCCTCT CGGCATCTTC CAGGTGGAAG CACTCGTTCG
TCAATTTTCA TTCACCCCTT CCCTCTGTGC ATTCAGAAGG GCCAAGGTGT GAAAATCACG
GATCTGGACG GACACGAATA TGTGGACTTG TGAGTAGTAA TATCACGTGG TAAAATACAA
TCGTCTCACT GATTTTTTCC TGCAGTGTGT CTGATTTCAC GAGCGGCATA TATGGAAAGA
GCCATCCTGT TCTTATAGAT GCTATCAAAG AGGCTCTCGA CAACGGACTT CAGGTCTGTA
TGCAATCAAT CAATCATTAT TGACATCACG TTAATGTGCT TGTAGCTCGG TGCTCACACC
CTTGCAGAAA CGACCCTCGC ATCTCATTTC ACATCCCGAT TTCCCTCCAT TGAACTCGTC
CGATTTGCCA ACTCTGGTAC AGAGGCTAAT ATTTTGGCGA TATCTACTGC TATCAAACAC
ACTGGTAGAA AGAAGGTGCT GGTTTTTGAA GGGGGGTATC ATGGGAGCGT CATGTCGCAT
TTTCATATGA AGGGAGGAGA TGGGGACGAT TTGAAGGTGC CCTTCGTAAG TTTTGAAGCC
TGCGGGGGCG AGGAACTTTT GTCTCACTTA TGCATAGGAT TTTGTGGTAT GCTCGTATAA
TGATCGCAAG GCAACCGACG CTCTCATATC TCAGCACCCA TCTGAAATAG GAGCTATCCT
TGTTGAACCC ATGCTTGGCG CTGGTGGATG TGTGAGCACA TTCTGCCTCT ATCCCCATCT
TGCTGAGATT TTTAGCCAAA AGCATTGATT AACTTAACTT ACAGATCCCC GGTGATCCTG
CATTCCTTCA ATTCCTCCGG GAGAAAGCTT CTTCGATCGG CGCCGTGCTC ATTTTCGATG
AAGTTCAAAC TGCTAGACTT TCGACTGGAG GGCGCCAAAA GATTCTGGGC ATCACACCCG
ACTTGACTAC CGTTGGCAAG TTCTTTGGTG GGGGTTTCGC CTTTGGAGCC TTTGGGGGGA
AGAAGGAAAT TATGGAGAAG TATGTGCCTC TTTCTCAAAG TTATCGCAGG ATAGGTCGAC
GAGTATGTCA CAATGCTGAT CAGCATATGG CAATAGATTT GACGCGAGGA AGGGAGGTGC
AATATCACAT GGAGGGACCT TCAACAACTC CCCGCTGACA ATGGTTGCCG GTGCCACAGC
GATGGAGAAG ATCTTGACTG AGGATGCGCT GAAGAACTTG AATAAGTTGG GTGATTGGAT
GAGAGAAGAG ATCAACAGGA TGTTCACAAG CGATGGATCG CCTTTCCTGG TAAGTCGAGT
GTCCTTCAGA AATATTGTTT CCTCTCGGCT CATGGGTTTC GTTTGTCAGA TGACTGGGCT
AGGTTCCATC AATCAATTCC ATTGCACTCT CTCATCAAAC CAAAATCAGG TTCTGGATCT
TCTTTTTTTC TACCTTCTTG AGAGGGGATT CTGGATAGCT CAGCGAGGAC TCGTCAGTCT
GAGTTTTGCC ATGACCAAGG AAGACGTGCA AAGGTTTATC GAGGCGGCAA TAGAAGCTAC
ACAAAAGGTG AAAGCAACGC TGAAGTAGAA AGTGCCGACG CGCAGTTCCA TATTTCTACA
TATCATACAT GACCAATGGT C
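
The gene sequence is printed in groups of 10 bases, 60 per line, so it has to be joined before its length or GC content can be compared with the Gene Length and GC content fields. A minimal sketch of that clean-up follows. The function names are hypothetical, and the sample is only the first printed line, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence (spaces and line breaks) into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used as a small sample.
sample = "ATGACATCCC TTACGCAGGC CCTCAAACGT GCCCAACAGA CCTATGCCAA CCGTAACCGC"
seq = clean_sequence(sample)
print(len(seq))                    # 60 bases on a full line
print(round(gc_fraction(seq), 2))  # GC fraction of this sample only
```

Passing the complete gene sequence text to `clean_sequence` instead of the sample allows the same two checks to be made against the whole record.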
 
Protein sequence
MTSLTQALKR AQQTYANRNR KSYEAHQAAS RHLPGGSTRS SIFIHPFPLC IQKGQGVKIT 
DLDGHEYVDF VSDFTSGIYG KSHPVLIDAI KEALDNGLQL GAHTLAETTL ASHFTSRFPS
IELVRFANSG TEANILAIST AIKHTGRKKV LVFEGGYHGS VMSHFHMKGG DGDDLKVPFD
FVVCSYNDRK ATDALISQHP SEIGAILVEP MLGAGGCIPG DPAFLQFLRE KASSIGAVLI
FDEVQTARLS TGGRQKILGI TPDLTTVGKF FGGGFAFGAF GGKKEIMEKF DARKGGAISH
GGTFNNSPLT MVAGATAMEK ILTEDALKNL NKLGDWMREE INRMFTSDGS PFLVLDLLFF
YLLERGFWIA QRGLVSLSFA MTKEDVQRFI EAAIEATQKV KATLK
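
The listed lengths are consistent with the gene being spliced: a 405 aa protein needs fewer coding nucleotides than the 1701 bp gene span provides. The arithmetic can be sketched as below. It assumes three nucleotides per residue plus one stop codon, and that the listed span contains the entire coding sequence. The function name is hypothetical. The remainder is sequence in the span that does not code for this protein; the record itself does not say how it divides into introns and other regions.

```python
def coding_nucleotides(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return codons * 3


gene_length_bp = 1701    # Gene Length field
protein_length_aa = 405  # Protein Length field

cds_nt = coding_nucleotides(protein_length_aa)
noncoding_nt = gene_length_bp - cds_nt
print(cds_nt, noncoding_nt)  # 1218 483
```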