Gene CNF04100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF04100 
Symbol 
ID3258216 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1187850 
End bp1189520 
Gene Length1671 bp 
Protein Length482 aa 
Translation table 
GC content48% 
IMG OID638257528 
Productconserved hypothetical protein 
Protein accessionXP_571711 
Protein GI58269110 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.075511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATA CAGCTGCTAC GCAAAGCTTT ACCTTCCTTT CTACTCACAT TGATGACCTA 
TCAGTCTGCC ATAATATGGC AACCTCCTTT ATCCTTCTTC CCACTTCGCT GTGTTATGAA
ATTGAATGGA AAGATACATT CGTCCCTGAC GAAACATTCA TGATGAATGG CCATCCCTCG
ACATCCACAT CCAAAAAAGA AGATAGACCT TTAGAAATAC TAATCATCGG TGCGGGAGTT
GCAGGAGTGA CAGCTGCATA TGCTCTTCGC CAAAGTCAGG CATACCAAGA AGGAAAGTTG
GTCGTAAGGC TTGTCGAAAA AAGGAGTCGT AAGTCCACTT AGGTAGGTTG GCTGGCTGTG
GAGAGCTGAT GGGATGAGAT ATGTGAAGAG GAAACATGGG GAGGGAGAAT GGGATTTCCT
ATGCATTTGA CCAAGGTCAG TCATGTTTTC TCAGCTTACT TACCAGAACC ACTCACATTG
GCAACGTACA CAACGTAACA GGCCGCACGT AAAGCCCTAG ACGATCTGCT CATCCCTTCA
CACAGCACTA AACTTCTTGT CCTTCGACAA AAGATCCCCA TTCTACATGA CGGCTTGACT
GTGCTTTCAT ACAGCGGTAA AATGGTGTAT CGGATGGTTC GCGATGTTCG AGGATGGGGG
ATGGTGGAGA GAGCCGACTT GATTAGTATT CTGAAAGAAG GAGCAGGGGA AGTGGAGTGG
GACATCGAGG CGCTTGTAGG TGAGCCTGGG ATGGAAAGAG GGATCGAGGT TTGTCTGAAA
GGGAAAAAGG AGGAGGTGGT AAGACCTGAT TTGATTGTCG GTAAGCCTCT CATTGTTCTA
TTTCAAGGCC TGGCTTCTGG ATTAACCGCT TACATGATAA AGGTGCCGAT GGGATGTTCT
CGGCCATTCG GCATTGCCTG TACTCTGATT CCCAAATGGT TGAGGATAAA CTACCGGGAG
GTTTCAGCAA GCTCCCCCAA ACGATCATAA ACCTTCGAAC AACCTCGCCT GCCATGCGAA
GATGGGTTCA CGACCCAAAT GGCATGAACT TGTTATACGG CGAATCCTTT TCTGCCACCA
TGATGCCTCT TTCATTCCCT AGTATTTACG TCGCACTCAC CATCCCCTCA CAATGGCTCA
ACCCTTCATC CCAGGTTAGA ATGAAGGGTG AAGAAATAAA GCTGGAGCCT ACGGTGCATG
GGAAGTTTCT GAGACAGTTG GAACGTGATC CAGGATGGGA AAAGAAGGAA ACGTACCCGT
TATGGAGTGC CACTAGCACG GTAGGGGGTA AAGGAAGAGT AGTACTAGTG GGTGATGCAG
CTCATGGGAT GCCGCCATTC TGCGGGGCGG GAGCTAGTGC TGGGGTCATA GATGCCGTAG
AACTTGCCAA AGTCATTGTG GATCATCTAA ACGGTAAGTC ATTCAGTCAT TAGGTCAGTA
CTCAAAGGAG TAGATCCAGT AAACAATCTC GACGATGTAT TGCGGGGATT CCGAGAGAGC
ATGAAGAAAC GCAATGACCC AATTATACGC CAATCCAAGA GGATTCTGTG GCTGGTACAA
GCCGAGCGAT GGTATGAGAA TGCAATCCGG CGGGCAGTCT TTTTTATACT GGACCTGGGA
GAGAGAATAA GTGCGCAGCG GGGCAGGAAG GTTGCTGCTG CAAGACCGTG A
 
Protein sequence
MSDTAATQSF TFLSTHIDDL SVCHNMATSF ILLPTSLCYE IEWKDTFVPD ETFMMNGHPS 
TSTSKKEDRP LEILIIGAGV AGVTAAYALR QSQAYQEGKL VVRLVEKRSQ ETWGGRMGFP
MHLTKAARKA LDDLLIPSHS TKLLVLRQKI PILHDGLTVL SYSGKMVYRM VRDVRGWGMV
ERADLISILK EGAGEVEWDI EALVGEPGME RGIEVCLKGK KEEVVRPDLI VGADGMFSAI
RHCLYSDSQM VEDKLPGGFS KLPQTIINLR TTSPAMRRWV HDPNGMNLLY GESFSATMMP
LSFPSIYVAL TIPSQWLNPS SQVRMKGEEI KLEPTVHGKF LRQLERDPGW EKKETYPLWS
ATSTVGGKGR VVLVGDAAHG MPPFCGAGAS AGVIDAVELA KVIVDHLNGV DPVNNLDDVL
RGFRESMKKR NDPIIRQSKR ILWLVQAERW YENAIRRAVF FILDLGERIS AQRGRKVAAA
RP