Gene CNG02670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG02670 
Symbol 
ID3258653 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp747495 
End bp749525 
Gene Length2031 bp 
Protein Length492 aa 
Translation table 
GC content50% 
IMG OID638257889 
Productconserved hypothetical protein 
Protein accessionXP_571930 
Protein GI58269548 
COG category[I] Lipid transport and metabolism 
COG ID[COG3425] 3-hydroxy-3-methylglutaryl CoA synthase 
TIGRFAM ID[TIGR01833] 3-hydroxy-3-methylglutaryl-CoA-synthase, eukaryotic clade 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.250048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGCTCTATA GCACTTTACA CTCCTTACCC GCTCCCATAA TCCCCATTCC ACCGAAAATG 
TCCAACTTTG ATATCCCCGC TAGGCCCAGC AACGTCGGTA TCCTCGGCAT GGAGATGTAC
TTCCCCAAGA GGGTGAGTGT CGCCACTATC AACTTACTCC TCTTCTGCAG CTAATTTCGG
CGCATCTAGT GCATTTCTGA GGAACAGCTC GAGGAGTTTG ACGGCGTTGC CAAGGGAAAG
TACACTATTG GTCTGGGTAT GGGCCACATG GCTTTCACTG ACGACAAGTG AGTTTGTCAA
GTAATCGCGA TGGTCAAAAT AGCTGACAGT GCAATAGGGA GGACATCAAC TCTGTCGCCT
TGACCGGTAA GCACAAAGTA TAGGCATTGA TGAGACAATG CTAACTTTTT GCAGTCGTTT
CTTCTCTTCT TAAAAAATAC AATGTCGACC CCAGGTCTAT CGGTCGCTTG GACGTTGGTA
CCGAGACCCT TATCGACAAG TCCAAGTCTA CCAAGACTCT TCTCATGAAC CTCTTCGCCG
AGTCTGGTAA CACTGACATT GAGGGTATTG ATTCCAAGAA CGCGTGCTAC GGCTCTACCG
CCGCCCTTTT CAACGCTGTC AACTGGATTC AGTCTGAAAG CTGGGACGGA AGGAATGCTA
TTGTCATGTG TGGCGACATT GCCATTTACA AGGAGGGAAG TGCTAGGCCT GTGGGTGGCA
TGGGTGCTTG TGCCATGTTG ATCGGTCCCG ATGCGCCTTT GGTGGTTGAG CGTGAGTGCT
AGTCAAGCCC AGAGAGATAT TGTTTGCCAA CTCATTTGCA GCCGTCCACG GTACTTACAT
GGCCAACACC TGGGACTTTT ACAAGCCCGA CCTTTCCGCG GAATACGTAT GTGCAATTCT
CATGTCGTAC GATAGGTCAC TGACGCTGCT ATAGCCCACC GTTGACGGGC CCTTAACCAT
TGCGGCATAC CTCGGTGCCC TTGACAACGC CTACTCTACT TACGTCCAGA AGGCGGAGGC
TTCCCAGGCT CGTGCCGCCA AGAAGCTCTC TCTTGCTTCT GTGACCGCTG CCGTTTCCGA
AGTTGCTAAC GGTATCGTCG GAGCCGTCAA TGGCCACGCC AATGGCCATG CCGAGACCAA
GGAAGACGGT ATCGCCAAGT TTGACTATGT CTGCTTGCAC AGCCCTTACG GCAAGCTTGT
CCAGAAGGGT CACGCCCGTA TGTTCTACAA CGTAAGTCAA TACATTCCGT TCGAACCATA
TGGAAACTAT GAAACTTATC CTTTTGGCAG GACTACCTCC GAAACCCCTC TCATCCCGCT
TTCGCCAACG TCCCTGAGGA CGTCAAGTCC CTCGACAAGA CTAAGACCTA TACCGACAAG
GTCATTGAGA AGACTTTTAT TGGTATCGCT GGCGACCATT ATAAGTCTGC TGTTATCCCT
GGCAAGGACT GTGTCTCTCG ATGCGGTAAC ATGTACACTG CTTCTCTTTA CGGTGCCCTC
GCCTCTGTCG TCTCTTCCGC TCCTGAAGGT ATCGAGATTG GCAAGCGAAT CGGCATGTAC
GCCTTTGGTT CTGGTTGTGC CGCTTCTTTC TACGTTCTCA AGGTCAACGG TTCTACCAAG
GAAATTGCGG ACAAGTTGAA CTTGAAGGCG AGATTGGCTG CTATGGACGT CAGGCCTTGT
CAGGAATATG TTGATGCTCT CAAGGTAACT ACTCATGTTC ATTTCTTGTT GGGGACAATG
ATTGGCTGAC GATGTCACAG CTCCGAGAGG AGAACCACAA CGCTGTCAAG TACGCTCCTC
AAGGCTCTCT TGACAACATC TGGCCTGGTG CCTACTACCT CGAGGGTGTT GACGATCTCT
ACCGACGAAC TTACCTTCAA AAGCCTGAAT CTGCCCAAGT ATAGAGCGTA TTGTTTGTTA
TAGAGGGTTA TTCTGAAATG TGGCTAGACG GACAATCTGT TCGGTTGCTT TTGGGGACTT
TACATGTAGT TTATATACCG AGTGCATAAT GGATATCATT GGCATAGTTT G
 
Protein sequence
MSNFDIPARP SNVGILGMEM YFPKRCISEE QLEEFDGVAK GKYTIGLGMG HMAFTDDKED 
INSVALTVVS SLLKKYNVDP RSIGRLDVGT ETLIDKSKST KTLLMNLFAE SGNTDIEGID
SKNACYGSTA ALFNAVNWIQ SESWDGRNAI VMCGDIAIYK EGSARPVGGM GACAMLIGPD
APLVVEPVHG TYMANTWDFY KPDLSAEYPT VDGPLTIAAY LGALDNAYST YVQKAEASQA
RAAKKLSLAS VTAAVSEVAN GIVGAVNGHA NGHAETKEDG IAKFDYVCLH SPYGKLVQKG
HARMFYNDYL RNPSHPAFAN VPEDVKSLDK TKTYTDKVIE KTFIGIAGDH YKSAVIPGKD
CVSRCGNMYT ASLYGALASV VSSAPEGIEI GKRIGMYAFG SGCAASFYVL KVNGSTKEIA
DKLNLKARLA AMDVRPCQEY VDALKLREEN HNAVKYAPQG SLDNIWPGAY YLEGVDDLYR
RTYLQKPESA QV