Gene CNM01740 details

Gene Information

Locus tag: CNM01740
Symbol:
ID: 3255175
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006682
Strand:
Start bp: 524470
End bp: 525901
Gene Length: 1432 bp
Protein Length: 255 aa
Translation table:
GC content: 46%
IMG OID: 638254328
Product: hypothetical protein
Protein accession: XP_568312
Protein GI: 58261804
COG category: [R] General function prediction only
COG ID: [COG1011] Predicted hydrolase (HAD superfamily)
TIGRFAM ID: [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
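
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon. The function names span_length and coding_length are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values taken from the record above.
gene_len = span_length(524470, 525901)
cds_len = coding_length(255)  # assumption: one stop codon is counted

print(gene_len)            # 1432, matching the reported gene length
print(cds_len)             # 768
print(gene_len - cds_len)  # 664 bp not accounted for by coding sequence
```

Under these assumptions, the 664 bp remainder would be consistent with the gene being marked as spliced, since the genomic span would then include intron sequence.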


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTCGGTTA TCGATCGATG GGGTAAAGGT GTTTAGGTGT GAAACATGTG CTATGCACAC 
CATATCGATG TAACTAGACT ATGTTTTTCG TATTGACAAG GACAAGAATC TTAACCACGA
TTCCGAACAT TAGAAATATG GCCTCATCAA CGAATTCGAA ACCGGTTCGC TTGGTTCTCT
TTGACGTCTT TGGTGAGTCT AGTCCAATCA TAGAACGACG TAGGAGGCTC ATCCATACAA
ATGATGACTA GATACCCTCT GTACTCCAAG GCTACCCATC CACGAACAGT AAATCTCATT
ATTTTTGCTG CATACGCTTC TCGCGTCTGA CTTATGCGGC AGGTACCACG AAGAAGCTAT
CAGAGGTGGG CTCTCATCTG CAAGTATAAC CCCACAGAGC GTCCGTAATG CATTCAAACC
CGGTATATCT TCTTTTGGAC TAATTGAGAT CGACTAAACT GACTGTCGGT TCAGCCTTTA
AAACTGTCGA TGCCCAATAT CCTTTGTATG GTAAACATTC GACGCCTCCA TTGACTCCTG
AGGAATGGTG GACAAGAATC ATCTATGAGA CCCTTAGGGA AGCCGGAGCT TCCAAGCGAG
GTGAGTTATG ATCCTTTCGC TAATTTCCAT GTAGTAGCTA TGACTGACTC AGATGATTCT
GCTGTCAGAA TTGGATGGGA AGATTGATGC GATCGGACCT GCTTTGATGA GTCGTTTTGA
GAGTGATCTC GGGTATCGAA ACTTTCCAGA GACTATCGCT TGCCGTAAGC TCCCAGCTCG
TTGTTTGATT TGAATCCTTG GACTAATACT AGGCTAGTAA AAGAGCTTAA GGAGCTAGAA
ATCAAGACCT CGGTAGTATC CAATGCTGAT CCTCGTATTC GTGAGCATCG AACTCTTAGC
CCGGGCCTCT TCTCAAATTC ACCTATGATT CGTAGTCAAA ACCTTGGATT CACTTCAGAT
CTTACCCCTT CTTACCTGTT CTCCCACCCT ATCATGGGAT GTCGAAGCTG CCAAGCCATC
TGCTACCATC TACGAGAAAG CATGCGAGAT ATCTGATGAA AAAGTGGGAG AAGGTATCAT
CATGGTTGGC GACGAACTCA AAGCGTAAGC CCATATTCCA TCAGGTCAAT GGCGAATGAC
TATTGAAGCC TCATGCTGTT CTAGTGATTT CCATGGTGCC ACGTCGGCTG GGATCGAGGC
TCGTCTTATA CGGAGACCAG GAGAATGGAG TGATGGTGCT GTCAGAGATG CTAAAGAGGA
ATTGGGCGGG GTGAACGTCG TTTCTAGCTT GGAAGACATT GTTAAAGAGG TCAAGCAAAG
GAACGTAGGT GGCTGAGGTA CAGAGGTGTA TACTACGAGT CTGTGGTCTC AAAGGTCAGC
ATGACAGCTA CTCGCAATTG GACCTCCTCG GGATAATCGG CAGCACAAAT AG
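
The GC content reported in the gene information can be recomputed from a sequence laid out as above (blocks of ten bases separated by spaces). This is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases; the function name gc_content is a hypothetical helper, and how the database itself rounds the value is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a sequence; whitespace and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# The first two blocks of the gene sequence, used as a small example:
# 9 of these 20 bases are G or C.
example = "ATGTCGGTTA TCGATCGATG"
print(f"{gc_content(example):.0%}")  # 45%
```

Passing the full gene sequence to the same function gives a value that can be compared against the 46% listed in the record.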
 
Protein sequence
MSVIDRWGKE IWPHQRIRNR FAWFSLTSLV PRRSYQRWAL ICKYNPTEPF KTVDAQYPLY 
GKHSTPPLTP EEWWTRIIYE TLREAGASKR ELDGKIDAIG PALMSRFESD LGYRNFPETI
ACLKELKELE IKTSVVSNAD PRILKTLDSL QILPLLTCSP TLSWDVEAAK PSATIYEKAC
EISDEKVGEG IIMVGDELKA DFHGATSAGI EARLIRRPGE WSDGAVRDAK EELGGVNVVS
SLEDIVKEVK QRNHK