Gene CNH00550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH00550 
Symbol 
ID3259136 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp1024776 
End bp1026939 
Gene Length2164 bp 
Protein Length475 aa 
Translation table 
GC content48% 
IMG OID638258430 
Productconserved hypothetical protein 
Protein accessionXP_572249 
Protein GI58270186 
COG category[L] Replication, recombination and repair 
COG ID[COG0266] Formamidopyrimidine-DNA glycosylase 
TIGRFAM ID[TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCACATCCA CCTCAGTTGA CATGCCAGAG CTCCCTGGTA AGTTCATGGT CAGGTATCCA 
GCTATTGCTA AGCTATTACA GAGGTTGAAC GAGCTCGGAA ACTTATTGAA GACTCCTGCA
AGGGGTACAA GATCGCTTCA GTCGATGCTC AAGAAGACAG TATCATCTTT ACCGGTGGAA
CAGACCATAA CGAATTTGCC AAAGAGATCG CAGGAAGGTC TATAACAGGA TGTGAGCGGA
AAGGAAAAAC GTGCGTGCGG GCAGACCAGG TGTCAGCTGG ATTTGCGACT GATGAAGGAT
GGCATAGGTT CTGGATGACC TTGTCGGGTG AAGGACGATA CCCGGTGATG CATTTCGGCA
TGACAGGTAT GATACAGCTC AAAGGTCAAG AACCAACTTG GTATCGGAGA AGGCCCAAAG
AAAGTGCGGA TGTCTGGCCT CCCAGAGTGA GTCTTGTTTT CAACACCGTT TATATGAGCT
AACAAGAAGA TAGTTCTATA AGTTGTATGT CATCAAGTTA CACTGAATTA TGCAGCTAAA
CAAAATATAG CGTACTAAAG CTTGAACCTC AAGAAGGCTC CATCGCCGAT GAACCTCGAG
AACTTGCTTT CATAGATGGC AAGTCACCCC GCGACTCTTC CAACGTTACG CTAATGCCGC
CCCAGGGCGT CGGCTTGGTC GTCTTCGTTT GGTATCTGAT CCTGTATCTT CCCATCCTCC
CGTCTCGGAG TTAGGTTTCG ACCCTATACT CAATCATCCA ACTTTGGAAG AGTTTACAAA
GCTGCTGGTG AACAAGAAAG GTACTGTGAA AGGCGTCATC ATGGACCAAG CATTCAGTGC
GGGTGTTGGC AATGTGAGTT TTCCCATAGA AGCGAAACGC GCAAGCTAAC CCGTTCATTG
CAGTGGGTTG CAGACGAGTG AGCCTTAAAA AAGCTCCTTA ATTCCACTAA CCCGCCCTAG
GGTACTTTAC CAGGCCCGTA TCCACCCTTC TTGCCCCATT CCCGCATTAT CCGAACAAAA
CATTAGAGAT CTCCACCACC AACTCCGTGC TGTGCCTCTC ACCGCAATAT CTGTCAATGC
TGATTCGAAA CTTTTTCCTT CCGACTGGCT CTTCCGCTGG AGGTGGAGTA AAGGTACAAC
TCAGAAAAAG CAGATGGAGA AGGACAAAAA GAGTAAAGGG AAGAAGGTCG TGGACGGTGA
GGGGGGTGAG GACGTGGAGC CGGAGGATAA AGAGTTTTTG GAGCTTGTAC GTTGATGATA
TTTACCTTCC TCGGGCAAGT TGACTGAACA ATCGCTTGAT AAGCCCGATG GATCACCAGC
CACGATCAAA TTTATTGAAG TCGGTGGACG AACGACTGCA CTAGTCGAAG AGCTGCAAAA
GATGCCGGAA GGAGTCGAGA TCAAGCCGAA AATCAGTAAA GGCGGCAAAA GGGCCGGAGC
GAAGAGGAAG AAGGTCGCTA AGGAGGAAGA ATCGGTGAGT TCCTCTGGTC ATGATCACAA
GACAAAATGT TAACCTGTGG CATCATAGGA TGAAGGGAGT GAACTATCCG ACCAGAGTGA
TTTAGAAGAG CAGAAACCCA AAAGACCTCT GACGGCAAGA CAAAAAGCAG CCGCAGAGAA
AAAGGGGATC AACAGTAATT TAGCCTCGCA GACGAAAACT GAAGATAATA AACCTTCTGC
TGCAAAACGA CGAGCCATTG GAGCAAGGAC TGGGGGGTCC AAAAATCATC CGGATTCGGA
GGTGAAGAAG GAAAAAGGGA GTACAGAAGT GAGTATGATC GATTGCTTTA GGTGTATGAT
GGACTGTTGA CTGCATTGAC GTCTTGTCTA TCTTTCTTCC ACTCGAAATC CATTGCGTCT
CTATTACTCG CCGTGCAACC CTTGGACACC CTCTTCTCCC CGCTACAATC ATTATCGACT
CACAGTGTCG TAAATCCAAG AAGCAAGTGG AGGCAGTTGG CGAAAAACTT GCAAAGATCA
AGCCTCAAAA GGGTAGAAAG AAGGCAGCTT CAACTCCAGA GTCTTCAGAA CTGTCCGATC
TTGTCGAATA GTTCAAAGGC ACAAAATGGT GATTGAGTGC TCCGTTTGTT TTGACTAGAT
GAGGCCATAA GACGCACGTA GACTATACGC TGTACGTTGT AAGTGCAGAA GAATAACTTT
TGCG
 
Protein sequence
MPELPEVERA RKLIEDSCKG YKIASVDAQE DSIIFTGGTD HNEFAKEIAG RSITGCERKG 
KTFWMTLSGE GRYPVMHFGM TGMIQLKGQE PTWYRRRPKE SADVWPPRFY KFVLKLEPQE
GSIADEPREL AFIDGRRLGR LRLVSDPVSS HPPVSELGFD PILNHPTLEE FTKLLVNKKG
TVKGVIMDQA FSAGVGNWVA DEVLYQARIH PSCPIPALSE QNIRDLHHQL RAVPLTAISV
NADSKLFPSD WLFRWRWSKG TTQKKQMEKD KKSKGKKVVD GEGGEDVEPE DKEFLELPDG
SPATIKFIEV GGRTTALVEE LQKMPEGVEI KPKISKGGKR AGAKRKKVAK EEESDEGSEL
SDQSDLEEQK PKRPLTARQK AAAEKKGINS NLASQTKTED NKPSAAKRRA IGARTGGSKN
HPDSEVKKEK GSTECRKSKK QVEAVGEKLA KIKPQKGRKK AASTPESSEL SDLVE