Gene CNM02230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM02230 
Symbol 
ID3255244 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp680339 
End bp682437 
Gene Length2099 bp 
Protein Length515 aa 
Translation table 
GC content46% 
IMG OID638254379 
Productpeptidyl-diphthamide biosynthesis, putative 
Protein accessionXP_568269 
Protein GI58261718 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1736] Diphthamide synthase subunit DPH2 
TIGRFAM ID[TIGR00272] diphthamide biosynthesis protein 2
[TIGR00322] diphthamide biosynthesis protein 2-related domain 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TATAACAGTT AGAACAGCAA TATGTCGGAC GCATTCTCAA CCCCAGCAGA TCACGCTTTA 
TCCCACCCTG AACTGGAAGC GATCTTGGAG AATGCCCAAG CCGGACCATC TAGCATAGGA
GATGGAGCGG AAGGAATGAG CATAGAAGAA GCTTTCGAAG TGGATGAGAC TGTCAGGAGA
GTGCTTGAGG GTGATTATAA GACTGTAAGT AGTGAGACAT ATATATGAAA GAGTCGTTTT
CCCAGGAACA TCAACGTTTG ATGGTTGCGA CGGAAATGAA ATGTCATGCA TGACCAAAGC
ATAGCTGACA AATGCCTCTT ACAGATTGGG TTACAATTCC CCGATGAACT CTTGCCATCT
TCAGTCTCCG TCTATCGGGC TATCCAGACT CGCATAGCAC ATACCGGAGC TCAGGCATAC
GTGCTTGCCG ATAGCACGTA CGGGAAGTCA GTCGTTTTCA GCGTAGTAAC TTGGAATGAA
GCTAACACTA TCGAAAGTTG TTGTCCGGAT GTCTTGAGTT GTCTTCATCT CCCAGCAGAT
TTCTTAGTAC ATTATGGACA TGCTTGTCTC ACTCCGTAAG CTAGATTGGA TTCCTTTGAG
TCTGCAAGCT GATGAAACAG AACGGACGCT CTCCCTGTTC ACTACGTCTT TCCCCGGCAA
AAGCTCGACG TCAAGCAAGC TGTGGAGTCA TTGTTAGCGG CGAGCAAGAA CGAACTAGAC
GGTGACGGCA GGAAAGGTAT AGTGGTTGTG TGGGATGTGT CATATGATTG GCTAGCGAGT
GGGTTTTAAT TTATCTATAT CCATCTCTAC GAAGCAATCT AACGGAAATA CAGATGATAT
CAGGGATACA TTTTCTCAGG ACTCAACTAT CCAGATCAGT TTTGCCTCAA TTCAAAAGCC
TACTCTTGCT TCACAGAAGG GACTCAAGGA TGTAAAGGGT AAGACACCTG CTCTCAGGAG
CGTAGAACCA CCTCAGGGGT TGGAAATGAA TGATTGTGTC TTATGGTATA TTGGTGAAGA
GGGAAGATCC TGTATGAATC TACAGATGAC ACATGCCAAC AATCCTGTGA GCATACGCTC
GACCCATTAT AGATAATCTA TATCTGATAA ATCAACAGCT CTTCATCTAC TCACCTTCTT
CCCAGTCTGT ATCGCCTCTT CACCGCACCA CTTCTCGTCT TCTCTCACGC CGTCTCTTCG
CTCTCCATCA AGCGCTTTCC GCCGACGTAT TCGGCCTTAT TGTTTCCAAC ATCGGTCTCG
CGTCTTCCAA ACCCCTCCTT GCACAACTGA GAGAGGATCT GAAGAGAGCA AAAAAGAAGA
GTTACACCCT GAGCGTAGGC AGGCTGAACC CAGCAAAGCT TGCAAATTTT GCAGAGATAG
AGTGTTTCGT GTTGGTTGGT TGCGCTGAAG GTGGTGTTGT TGACTCGAAG GTGTGTTCTC
TTTCGTCTAA CAGAAATAAT AAGCTGATGA TTTGCTGATG CAGGATTTTT TGAGACCTAT
CATTACACCC TGGGAATTGG AATTGGCACT TCAAGGCCCA GACCACGTAT GGGCACCGGA
GAATTGGACT TTGGACCTGG GTACCGTTCT CAAAGGTGAT GCTCATCCTG CTTTCATAAT
TCACATGCTA AGACCCATCA AATAGATGCC CAAGAGCGTG AAATAAAAAT CAAGCAAGAT
TCTTCCACCG CTGACAGTGA CGACGATTCA CTCGAATTTT CACTTATTAC CGGAACAATG
CGAACAAAAA AACGCTTTGC GTTTGGAAAC GGCACCCACA CTCTTGAAAA CAACAAGTTA
TTGGGAGACG GTGGTGTGCA AGACCTGACC TTGCGTAACC AAAACTTCTC ACTGTCCAAG
CTTGAATCCG CCGGAAGTAC CTTTTTGGCG TCAAGAGAAT TCCAAGGATT AGAACCAAGA
TATGGAATGG ATGAGCCTAG TGTTCTGGAG CAAGGGAGGA GTGGGGTCGC GAGAGGCTAT
ACAGAGGAGA AATAGACAAG TCCTCTTGGA CGTGTCTTCA GAGAGCGTTT TGACTATAAT
TCAATGTACC TCTCGGCATT GCATCGTATC GCATTTCGTT AACAATGTAC CACTTGTAT
 
Protein sequence
MSDAFSTPAD HALSHPELEA ILENAQAGPS SIGDGAEGMS IEEAFEVDET VRRVLEGDYK 
TIGLQFPDEL LPSSVSVYRA IQTRIAHTGA QAYVLADSTY GNCCPDVLSC LHLPADFLVH
YGHACLTPTD ALPVHYVFPR QKLDVKQAVE SLLAASKNEL DGDGRKGIVV VWDVSYDWLA
NDIRDTFSQD STIQISFASI QKPTLASQKG LKDVKGKTPA LRSVEPPQGL EMNDCVLWYI
GEEGRSCMNL QMTHANNPLF IYSPSSQSVS PLHRTTSRLL SRRLFALHQA LSADVFGLIV
SNIGLASSKP LLAQLREDLK RAKKKSYTLS VGRLNPAKLA NFAEIECFVL VGCAEGGVVD
SKDFLRPIIT PWELELALQG PDHVWAPENW TLDLGTVLKD AQEREIKIKQ DSSTADSDDD
SLEFSLITGT MRTKKRFAFG NGTHTLENNK LLGDGGVQDL TLRNQNFSLS KLESAGSTFL
ASREFQGLEP RYGMDEPSVL EQGRSGVARG YTEEK