Gene CNG02390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG02390 
Symbol 
ID3258736 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp683885 
End bp685222 
Gene Length1338 bp 
Protein Length367 aa 
Translation table 
GC content50% 
IMG OID638257859 
Productconserved hypothetical protein 
Protein accessionXP_571964 
Protein GI58269616 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGTCC CCGTCCCTCA AATCTTCCAG TACGACTACG TGCCCGAGAC CAAGGAGGAT 
CTTGAATGGG CTGATCGTGA GTCTGCTCTC TTTTCTATTT TGTCTTGTGG GCGCACAGAC
TGACTCTTCA TCTTAGTCGC CACCATTGAT CTATCCAAAT TCAACAATCC TGAAGGCCGC
AAGGAGCTCG CCCAAACCCT CCTCGAGGCT ATCCGCACTA AGGGATTCTT CTACGTCATC
AACTTTGGCA TTCCCCAGGA GAAGGTTGAC CGCCAGTATG CCCTTGGTAG CAAGTTTTAC
GATCTGCCCC TTGAGGAGAA ATCCAAATAC GTCCCTGACT TGGAGAATGG CGAATACAAC
GGGTACAGGC CCGCTGGCAG GAGTGTACTT GGAGGGGGTA TCCGGGACAG GATTGAAGTC
TATAACATCC CCAGTAAGTC ACATTCCTCT GTTCATTTGT CTGTCAATTT CCCGTCCAGA
TCCCACTGCT GATATTCAAT CAGAATTTGA TGGCTATCAT GAGCGTAACC ACCCCGACGT
CATTGAGCAG AACATTCATG AAATCGAGGA ATTCGCTCGC TCTCTCCATA CCAACGTCCT
CGACCCTCTT CATGTTCTCG TCGCTCTCGC TCTTGAACTT CCTGAGGACT ACTTCACCAA
TCTTCACAAG TACTCCGACC CCTCCGAGGA TCATCTCCGA TACATGATGT ACCGGCATTT
CTCTCCTGAA GAGACTAAGA TCATTGAGTC CAACGATGGT CTTTACACCC TCGGTCACAC
CGACTTGGGT ACTTTGACCT TGCTTTTCAG GCAACCAGTT GCCGCTTTAC AGATCAGGGA
CCATGAGACT GGCAACTGGA AGTGGGCCAA GCCCTTGGAT GGTAGCCTTA CCGTCAACAC
TTGTGATGCC CTATCTTTCT TGACTGGCGG CTACATCAAA AGTACCGTCC ACCGAGTGAG
TAATCTTTTA ACGCAGTTTC GAGTGCCGGA GCTGACGCAG CGATTTAGGT GAGCATTCCG
CCCAAAGACC AGCAGCAGTA CGACCGTCTC GGTCTCCTCT ATTTTGCCCG TCCTCAAAAC
GATTTACCCC TCGCCACTGT TGACAGCCCC TTGCTGAAGA GAGAGGGTTT CGACAAGAAT
GAGTTTGAAC GAGGCGGTTA CAAAGTGCCG ACCATGGGTG GTAAGTCACA ATCTCGCCAT
CACCAAAACT TCCGTTGATG TTTTCCCGTA GAATTTGTAC AAGTCAAGCA AAAGTGGCAG
CAAACCAAGC GAGTTGCACA CCGAGAAGGT GATGGTTCCC AGATTCTTCC TGGTTTCGAA
GGAAAGTATC ACGACTAA
 
Protein sequence
MPVPVPQIFQ YDYVPETKED LEWADLATID LSKFNNPEGR KELAQTLLEA IRTKGFFYVI 
NFGIPQEKVD RQYALGSKFY DLPLEEKSKY VPDLENGEYN GYRPAGRSVL GGGIRDRIEV
YNIPKFDGYH ERNHPDVIEQ NIHEIEEFAR SLHTNVLDPL HVLVALALEL PEDYFTNLHK
YSDPSEDHLR YMMYRHFSPE ETKIIESNDG LYTLGHTDLG TLTLLFRQPV AALQIRDHET
GNWKWAKPLD GSLTVNTCDA LSFLTGGYIK STVHRVSIPP KDQQQYDRLG LLYFARPQND
LPLATVDSPL LKREGFDKNE FERGGYKVPT MGEFVQVKQK WQQTKRVAHR EGDGSQILPG
FEGKYHD