Gene CNC04430 details

Gene Information

Locus tag: CNC04430
Symbol:
ID: 3256636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 1341937
End bp: 1343634
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table:
GC content: 51%
IMG OID: 638255664
Product: amino-acid N-acetyltransferase, putative
Protein accession: XP_570013
Protein GI: 58265714
COG category: [E] Amino acid transport and metabolism
COG ID: [COG5630] Acetylglutamate synthase
TIGRFAM ID:
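
The start, end, and length values in the table above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1341937
END_BP = 1343634
GENE_LENGTH_BP = 1698
PROTEIN_LENGTH_AA = 565


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1343634 - 1341937 + 1 = 1698 bp, and 1698 / 3 - 1 = 565 aa, which agrees with the reported gene and protein lengths.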


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.171523
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGACAG GGAGTTCCGG GAAGGCATTC ATCTTATCAA TCTTGCAAGC TTCTCCTTCA 
GCCAGAGACT CTCGTTCTTA TCTGTCTTCA TTTGCTCCTC CTCAGCCTGC CGACATTGCT
ACTGCAACCC CTGCCGCCAC TCCATCAGAC GGTGCTCAAC CTCCTGCCCA AAACCCTCTT
GTCAATGCCC TTCTCAATCC TATTCTTCGT CGGCCTGCCC TTGTCAAAAT TCAAGGTCCA
TTTACCGACG CGCAACTTGA ATCCATTTGC CGCGGCATGG CGCATCTTCA AAAATTAGGA
CTCGTTTCTG TCATCGTCGT TGACCGTGAT GACTTGCCGT CTACGGAATC TTCCGACCGT
TACGAAGCAC AGAGACAAAG GGCGATTGTC AGGCATGAAG TCGAAAGGGT TGTGCATTTC
CTCTCAAGGC ATAGGGCAGC CGCCAGACCA GTCTTTTCAA CTGTTGCAAG GATCGCAGAC
CCTGAGCTGG AGCCAGAGGA GGCACAAAAG GGTGTATTTG TTGAAGAGGA AGGACTTGAT
CACGTGCGGA GGGCCGTGGG TGAGGGCGAA ATTCCCGTAT TACTGCCCGT CGCACTCGAC
TCTGGCTGTC GTTCCCGGAG GATCCCAGCC AACAGAGTGC TTTTGGCTCT TGCTTCTGCA
ATGTCAACAC ACACTTCCAG CCCCGTGGAC CTTACTCCGA GGAGGTTACT GGTGATCAAT
CGTGAAGGCG GTATCCCTTC TTATGCTCGA CAAGGTCTGC CACACTTATA TATCAATCTC
GCGTCCGAGT TTTCCTATAT CAACCGTACA TTTCAACCCC AATGGAATGA TTCCCATCCT
ACTGCCTTGT CAAACCTCTT TCTCGCCAAT GGCTGCCTCG CCCACATGCC TCGTGAAGCG
TCTGCTTTGA TCGTCTCCCA TCGATCTCCC GCAGCCTTGA TTGCGAATTT AATCACCAAC
AAGCCCGCAC ACTCTGCTTC TTTGCCTCAT GCCCTGCTTG TCGAGTCTGA GGGTCGTATC
ACTCGTGATA CACCAACACT CATCCGTAAG GGCCTTCCAG TTCGCGTCTT GCGCAGCATG
GAAGAAGTCG ACCAAGACAA GCTCACACAT CTGCTTGAAA CCTCTTTCAA ACGCACACTT
GATCGCGAAG GGTTCTACAA CCGTTTAAAG AATGATCTTG ACTTTGTGAT TGTGATTGGC
GATTATGCCG GTGCTGCTGT TTGTACCCTT GAAGGCAAAC CCGTTTCTGA TTCATTCGCT
TACCCCCCAA ATCATCCCGA ACCTATATGC TACCTTGACA AATTTGCCGT TCATCCTTCA
CACCAAGGCG ATGGTACAGT TGATTTCTTG TGGGTCGCTC TTCGTGATGA GACGTACGGT
CTCGGTCAGT TGGATGCCTC AAACCCGTCT ATCGGTTCGT TGAGAGGTGT CGGCAGGGGT
AGAGATCTTG TCTGGAGGAG CAGAAGTGAT AATCCCGTCA ACAAATGGTA TTACGAGAGG
TCAAGTGGCT TCCTGAAGAC AAGGGACGAG AAGTGGAAGG TATTTTGGTG TGATGCGGAG
CAGAGGCTGG GAGAGATTTG GCGAGAGAGG GAATTTGGCG GAGGAAGATT GGTTAGAGTT
GTGGAAAAGG AGGAAAAGGG AAGGGTGAAA TGGTGGGAAG AGGTCATCGG AGCGATCCCA
TCAGCTTGGT CGGCGTAA
 
Protein sequence
MLTGSSGKAF ILSILQASPS ARDSRSYLSS FAPPQPADIA TATPAATPSD GAQPPAQNPL 
VNALLNPILR RPALVKIQGP FTDAQLESIC RGMAHLQKLG LVSVIVVDRD DLPSTESSDR
YEAQRQRAIV RHEVERVVHF LSRHRAAARP VFSTVARIAD PELEPEEAQK GVFVEEEGLD
HVRRAVGEGE IPVLLPVALD SGCRSRRIPA NRVLLALASA MSTHTSSPVD LTPRRLLVIN
REGGIPSYAR QGLPHLYINL ASEFSYINRT FQPQWNDSHP TALSNLFLAN GCLAHMPREA
SALIVSHRSP AALIANLITN KPAHSASLPH ALLVESEGRI TRDTPTLIRK GLPVRVLRSM
EEVDQDKLTH LLETSFKRTL DREGFYNRLK NDLDFVIVIG DYAGAAVCTL EGKPVSDSFA
YPPNHPEPIC YLDKFAVHPS HQGDGTVDFL WVALRDETYG LGQLDASNPS IGSLRGVGRG
RDLVWRSRSD NPVNKWYYER SSGFLKTRDE KWKVFWCDAE QRLGEIWRER EFGGGRLVRV
VEKEEKGRVK WWEEVIGAIP SAWSA
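
The sequences above are laid out in space-separated blocks of ten characters across multiple lines, so they need to be joined before any analysis. The following is a minimal sketch of how one might do that and compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = (
        "ATGCTGACAG GGAGTTCCGG GAAGGCATTC "
        "ATCTTATCAA TCTTGCAAGC TTCTCCTTCA"
    )
    seq = clean_sequence(first_line)
    print(len(seq), seq[:3])
    print(round(gc_fraction(seq), 3))
```

Applied to the full gene sequence, the same two functions would give a length and GC fraction that can be compared against the reported Gene Length and GC content fields.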