Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH00550 |
Symbol | |
ID | 3259136 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 1024776 |
End bp | 1026939 |
Gene Length | 2164 bp |
Protein Length | 475 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258430 |
Product | conserved hypothetical protein |
Protein accession | XP_572249 |
Protein GI | 58270186 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCACATCCA CCTCAGTTGA CATGCCAGAG CTCCCTGGTA AGTTCATGGT CAGGTATCCA GCTATTGCTA AGCTATTACA GAGGTTGAAC GAGCTCGGAA ACTTATTGAA GACTCCTGCA AGGGGTACAA GATCGCTTCA GTCGATGCTC AAGAAGACAG TATCATCTTT ACCGGTGGAA CAGACCATAA CGAATTTGCC AAAGAGATCG CAGGAAGGTC TATAACAGGA TGTGAGCGGA AAGGAAAAAC GTGCGTGCGG GCAGACCAGG TGTCAGCTGG ATTTGCGACT GATGAAGGAT GGCATAGGTT CTGGATGACC TTGTCGGGTG AAGGACGATA CCCGGTGATG CATTTCGGCA TGACAGGTAT GATACAGCTC AAAGGTCAAG AACCAACTTG GTATCGGAGA AGGCCCAAAG AAAGTGCGGA TGTCTGGCCT CCCAGAGTGA GTCTTGTTTT CAACACCGTT TATATGAGCT AACAAGAAGA TAGTTCTATA AGTTGTATGT CATCAAGTTA CACTGAATTA TGCAGCTAAA CAAAATATAG CGTACTAAAG CTTGAACCTC AAGAAGGCTC CATCGCCGAT GAACCTCGAG AACTTGCTTT CATAGATGGC AAGTCACCCC GCGACTCTTC CAACGTTACG CTAATGCCGC CCCAGGGCGT CGGCTTGGTC GTCTTCGTTT GGTATCTGAT CCTGTATCTT CCCATCCTCC CGTCTCGGAG TTAGGTTTCG ACCCTATACT CAATCATCCA ACTTTGGAAG AGTTTACAAA GCTGCTGGTG AACAAGAAAG GTACTGTGAA AGGCGTCATC ATGGACCAAG CATTCAGTGC GGGTGTTGGC AATGTGAGTT TTCCCATAGA AGCGAAACGC GCAAGCTAAC CCGTTCATTG CAGTGGGTTG CAGACGAGTG AGCCTTAAAA AAGCTCCTTA ATTCCACTAA CCCGCCCTAG GGTACTTTAC CAGGCCCGTA TCCACCCTTC TTGCCCCATT CCCGCATTAT CCGAACAAAA CATTAGAGAT CTCCACCACC AACTCCGTGC TGTGCCTCTC ACCGCAATAT CTGTCAATGC TGATTCGAAA CTTTTTCCTT CCGACTGGCT CTTCCGCTGG AGGTGGAGTA AAGGTACAAC TCAGAAAAAG CAGATGGAGA AGGACAAAAA GAGTAAAGGG AAGAAGGTCG TGGACGGTGA GGGGGGTGAG GACGTGGAGC CGGAGGATAA AGAGTTTTTG GAGCTTGTAC GTTGATGATA TTTACCTTCC TCGGGCAAGT TGACTGAACA ATCGCTTGAT AAGCCCGATG GATCACCAGC CACGATCAAA TTTATTGAAG TCGGTGGACG AACGACTGCA CTAGTCGAAG AGCTGCAAAA GATGCCGGAA GGAGTCGAGA TCAAGCCGAA AATCAGTAAA GGCGGCAAAA GGGCCGGAGC GAAGAGGAAG AAGGTCGCTA AGGAGGAAGA ATCGGTGAGT TCCTCTGGTC ATGATCACAA GACAAAATGT TAACCTGTGG CATCATAGGA TGAAGGGAGT GAACTATCCG ACCAGAGTGA TTTAGAAGAG CAGAAACCCA AAAGACCTCT GACGGCAAGA CAAAAAGCAG CCGCAGAGAA AAAGGGGATC AACAGTAATT TAGCCTCGCA GACGAAAACT GAAGATAATA AACCTTCTGC TGCAAAACGA CGAGCCATTG GAGCAAGGAC TGGGGGGTCC AAAAATCATC CGGATTCGGA GGTGAAGAAG GAAAAAGGGA GTACAGAAGT GAGTATGATC GATTGCTTTA GGTGTATGAT GGACTGTTGA CTGCATTGAC GTCTTGTCTA TCTTTCTTCC ACTCGAAATC CATTGCGTCT CTATTACTCG CCGTGCAACC CTTGGACACC CTCTTCTCCC CGCTACAATC ATTATCGACT CACAGTGTCG TAAATCCAAG AAGCAAGTGG AGGCAGTTGG CGAAAAACTT GCAAAGATCA AGCCTCAAAA GGGTAGAAAG AAGGCAGCTT CAACTCCAGA GTCTTCAGAA CTGTCCGATC TTGTCGAATA GTTCAAAGGC ACAAAATGGT GATTGAGTGC TCCGTTTGTT TTGACTAGAT GAGGCCATAA GACGCACGTA GACTATACGC TGTACGTTGT AAGTGCAGAA GAATAACTTT TGCG
|
Protein sequence | MPELPEVERA RKLIEDSCKG YKIASVDAQE DSIIFTGGTD HNEFAKEIAG RSITGCERKG KTFWMTLSGE GRYPVMHFGM TGMIQLKGQE PTWYRRRPKE SADVWPPRFY KFVLKLEPQE GSIADEPREL AFIDGRRLGR LRLVSDPVSS HPPVSELGFD PILNHPTLEE FTKLLVNKKG TVKGVIMDQA FSAGVGNWVA DEVLYQARIH PSCPIPALSE QNIRDLHHQL RAVPLTAISV NADSKLFPSD WLFRWRWSKG TTQKKQMEKD KKSKGKKVVD GEGGEDVEPE DKEFLELPDG SPATIKFIEV GGRTTALVEE LQKMPEGVEI KPKISKGGKR AGAKRKKVAK EEESDEGSEL SDQSDLEEQK PKRPLTARQK AAAEKKGINS NLASQTKTED NKPSAAKRRA IGARTGGSKN HPDSEVKKEK GSTECRKSKK QVEAVGEKLA KIKPQKGRKK AASTPESSEL SDLVE
|
| |