Gene CNL04980 details

Gene Information

Locus tag: CNL04980
Symbol:
ID: 3254958
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 397749
End bp: 399725
Gene Length: 1977 bp
Protein Length: 558 aa
Translation table:
GC content: 50%
IMG OID: 638253970
Product: homoserine O-acetyltransferase, putative
Protein accession: XP_568037
Protein GI: 58261254
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
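
The start, end, and gene length values above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; the function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record above.
print(gene_length(397749, 399725))  # 1977
```

Under that assumption the result agrees with the listed gene length of 1977 bp.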


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAATT TATCGTTGTT GTTCCATCGT CCTGCAAAAC CCAGAATCGG CTCGTCTCCA 
AGTCATTCTT CTTCGCTCTA CATCTCCAGG ATGTCGGATA ACGCTCCCAC ACCTCAAAAA
ATACGAGACA CAAATCCATA TGCCTCTCTC ATCTCTCAGC AAATCGCGAT CATCCCTTCG
TTCACCCTAG AGTCAGGTGT CACTCTTAAT AATGTTCCAG TGGCATACAA GACCTGGGGT
AAACTTAACG AAAAAGCCGA CAACTGTTTA GTCATCTGTC ATGCTTTGAC AGGTAGTGCT
GATGTCGAAG ATTGGTACGT CACTCCGTTC GCCTTGCGGA GTAGTGAGGG CTAATCGTCT
CCTTACAGGT GGGGACCGTT GCTTGGTCTC AACAAGGCCT TTGACCCGAC CAGATTTTTC
ATCTTCTGTG GAAACGTTAT AGGTTCACCC TACGGCACTA TTTCCAGTGT CACTACCAAC
CCCGAGACTG GCAAGCCTTT TGGTCCCGAG ATGCCCGGAA GTAGCGTCAA GGATGATGTT
CGGTATGCCA TGTCTACATA CTAGTTTACT GGGTGCTGAC AAATGACAGA TTGCATTACA
TAATTCTCAA ATCTCTTGGT GTGAAATCGG TGGCAGCCGT CGTTGGTGGA TCCATGGGTG
GTATGACTGT TCTTGAATAC CCACTCAATA CCCCTCCTGG GTTTGTCAGA GCCATTATCC
CCCTTGCGAC TTCAGCTCGT CATTCAGCTT GGTGTATTTC TTGGGGAGAA GCACAGCGTC
AATCTATCTA CTCCGATCCA GACTACAAAG ACGGTTACTA TTACGAAATT GAGGAGGAAG
GAGGCAAAGT TGACCTGGCT CGACAGCCAG CCAGGGGTCT GGCTGCGGCT AGAATGGCGG
CTTTGTTGAC TTACAGGAGT AGAGACAGCT TTGAAAGCCG ATTCGGCCGA CGTGCCGGCG
GCGGTAAATC GTCAGTGCCC AAGGGTGGTG TACGAATCAT GGGTGGTCAA GAGACGACCG
ACCCTAGCGT CCCCAGTGAG AGCGATCTCG CTGCCAAGTC CCCCAGCTGG AGAGCCTGGA
GGGAGCATAA CGACGGGCAC AGAAGCTCTG GCGCAAGACC GATATCTCGT AGCGGGAGCG
AAGGCCCTAA CCGTGGAGAG GGTGATGCGG CTCAGGCTGA GGTTGTAAAG ACTCAAGAAG
TGAAGGCCAA CGGGAATAAA ATTGGAACTG GCGGAGAAGC ACCGCCCAAA ATCTTTTCTG
CGCAAAGCTA TCTTCGCTAC CAGGGAGACA AGGTGAGACT TCCTTAACTG GAATATGCGA
GCATCGTTGA CGTTAGCGCA GTTTACTGGT CGATTTGATG CCAACTGTTA CATCCACATC
ACCCGTAAAC TCGACACCCA CGATCTGTCC GCTCCTTCCC GTGACACTTC TCTGTCCTCA
CTCTCTTCTG GTCTTCCCTC GTCCGCCGAC GCAACAGAAG AAGAGCTCAA TGCCCGTTTG
ATCCACGCTC TTTCTCTTGA ACCTCCCGCT TTGGTCATCG GCATTGAGTC CGATGGCTTG
TTCACCACTT CCGAACAACG CGAGCTTGCA GCTGGGATCC CCGATGCAGA GCTTGTTGTC
ATTCCTTCCC CTGACGGACA TGACGGTTTC TTATTGGAGT TTGAAGCCAT TAACGGATGG
GTTGAAGGAT GGCTGAAGAG AAAGATGCCC GAGTTCTACG AGAAACGAGT GATCGATCCC
GAAGATTATG TACAGGGAGA AGAAGGATTT GACATCAAAA AGGAAAGCGT ATTCGGCGAG
GCCGAGGCAG ATGTTACGAG GTGGTAATTT TTTTGGTCAG TGGCGCGTGG TTGGGATATT
GTGATATAGA ACTGGCTTCA AATCAATTGT ATAAAGGGAC TGCAAGAGTA CAGTACCCAA
CTACTGTACT ACTAAAAACA TACATATATC ATCTCTCCAA CAGAGGACAC ATCATGA
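
The gene sequence is printed in blocks of 10 bases, so it has to be joined before it can be measured. The following is a minimal sketch of how the length and GC percentage could be recomputed from that layout. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the excerpt is only the first line of the sequence, so its GC value need not match the whole-gene figure listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short excerpt.
excerpt = "ATGGGTAATT TATCGTTGTT GTTCCATCGT CCTGCAAAAC CCAGAATCGG CTCGTCTCCA"
seq = clean_sequence(excerpt)
print(len(seq))                   # number of bases in the excerpt
print(round(gc_percent(seq), 1))  # GC percentage of the excerpt
```

Pasting the full block of sequence lines into `excerpt` would give the values for the whole gene.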
 
Protein sequence
MGNLSLLFHR PAKPRIGSSP SHSSSLYISR MSDNAPTPQK IRDTNPYASL ISQQIAIIPS 
FTLESGVTLN NVPVAYKTWG KLNEKADNCL VICHALTGSA DVEDWWGPLL GLNKAFDPTR
FFIFCGNVIG SPYGTISSVT TNPETGKPFG PEMPGSSVKD DVRLHYIILK SLGVKSVAAV
VGGSMGGMTV LEYPLNTPPG FVRAIIPLAT SARHSAWCIS WGEAQRQSIY SDPDYKDGYY
YEIEEEGGKV DLARQPARGL AAARMAALLT YRSRDSFESR FGRRAGGGKS SVPKGGVRIM
GGQETTDPSV PSESDLAAKS PSWRAWREHN DGHRSSGARP ISRSGSEGPN RGEGDAAQAE
VVKTQEVKAN GNKIGTGGEA PPKIFSAQSY LRYQGDKFTG RFDANCYIHI TRKLDTHDLS
APSRDTSLSS LSSGLPSSAD ATEEELNARL IHALSLEPPA LVIGIESDGL FTTSEQRELA
AGIPDAELVV IPSPDGHDGF LLEFEAINGW VEGWLKRKMP EFYEKRVIDP EDYVQGEEGF
DIKKESVFGE AEADVTRW
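
The listed protein length can be checked against the printed layout. This sketch assumes, as the layout above suggests, that each of the nine full lines holds six blocks of 10 residues; `residue_count` is a hypothetical helper, not part of any database tooling.

```python
def residue_count(text: str) -> int:
    """Count residues in a block-formatted protein sequence, ignoring whitespace."""
    return sum(len(block) for block in text.split())


# Nine full lines of 60 residues each (assumed from the layout),
# plus the shorter final line copied from the record.
full_lines = 9
residues_per_full_line = 60
last_line = "DIKKESVFGE AEADVTRW"

total = full_lines * residues_per_full_line + residue_count(last_line)
print(total)  # 558
```

The total agrees with the protein length of 558 aa given in the gene information.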