Gene CNA04590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA04590 
Symbol 
ID3253316 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1219554 
End bp1220786 
Gene Length1233 bp 
Protein Length291 aa 
Translation table 
GC content47% 
IMG OID638252779 
Productconserved hypothetical protein 
Protein accessionXP_566833 
Protein GI58258841 
COG category[R] General function prediction only 
COG ID[COG0824] Predicted thioesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCTGC GATGCCGCTT CTCCTGCATT CGTAACTATC GAGCTGCGAC ATCGAACAGA 
GCATTCAAAC GCAGCATCAT GTCACACATT CGGGCTGTCC ATTCCAAGCC CACTCTGGCT
GTCACAGATG ATGACGCTGC TCGTCTCCAG GTACGATTAC TCAAAGTCTA TGAAGTTTGG
CTGATTTACA ACAACGCTTT AGGAGATATA CGCTAAATTT CAGGACCCAT CATCTCATTA
CTACATTGCC CCAGGCTCGT TCGGTCCAGA ACACGAGGAG GATCATAGGA CCTCCCATGT
GCCAGATCCC GCATATTCTC TTGATGCTAT AACGAATGCA GAGGCATCAT CACCTTTCCA
AGGCTCTCAG GAACAGAGAG AGCAAATGGA TGACAGACCT TGGCTGAACG AAGATAGTGG
AAGGAAAGAG CAGGCTCTCG AGTACTTCAA GCAGCAGAAC CATGATACCA CCGGGGTCTT
GACCTGGCCT GTGGCTTGGG GTGACCAAGA TAGCTTCCAG TGGGTGCAAA TATCTTCTTA
GCTTCAGAGA GGGTACTGAT CCTTTGGGCA GACATGTGAA CAATGTCCAG ATACTGCGAT
GGGTGGAGTC GGCTCGTATC AGGTACTGTG AGAGCTGGGC CGGAAAACTG GGAAAGAAAA
CTGTCTACGA TATGCTGGTG TGTCGCATTA GGAGTAGTGC ATGTGGAATG GCGCTTATGC
TGTACTACAC TAGCGCGCCA AGGGTACCGG CATCATCCTC AAAGAGGTGT CAATCAAGTA
TAAGGCGCCG ATTACCTATC CCGATACAGT GATCCCTGTT TTCCTTTTAG CCACATTGCT
TAATTCTGAC TCAATTCCAG CTCATGATTT CCAATTGTAT TCACTCAGTC AACGTGGAGC
GAGCCAGCTA TGGCCATCGA CACATCATTT GGTCTTTGAA GGATCAGCAT GTAAAGGCTG
TCAGCGATTC GTGAGTCAAG TTCATTGCTC GTATTATCAC ATACTCAAAT ATGGTCCAGG
TCAATTGTGA TTTATGATTA TGACAACCTG AGGAAAGGTG TTATGAGCGA TCAATTGAGA
GAGCTATTGC TGAGCGTAGC CGGGGATAGT AAATGGAGCG AAGAAGATAC AAAATAAATA
TCAAGATGTT AGAGACATAC AGTGGGTAGG AATATTTGAT CTTGTCAGCG CTGATCACGA
AAAACTCGAT AGCATGCATT GATCTTTTAT GAT
 
Protein sequence
MALRCRFSCI RNYRAATSNR AFKRSIMSHI RAVHSKPTLA VTDDDAARLQ EIYAKFQDPS 
SHYYIAPGSF GPEHEEDHRT SHVPDPAYSL DAITNAEASS PFQGSQEQRE QMDDRPWLNE
DSGRKEQALE YFKQQNHDTT GVLTWPVAWG DQDSFQHVNN VQILRWVESA RIRYCESWAG
KLGKKTVYDM LRAKGTGIIL KEVSIKYKAP ITYPDTLMIS NCIHSVNVER ASYGHRHIIW
SLKDQHVKAV SDSSIVIYDY DNLRKGVMSD QLRELLLSVA GDSKWSEEDT K