Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF00820 |
Symbol | |
ID | 3258426 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 268801 |
End bp | 270790 |
Gene Length | 1990 bp |
Protein Length | 469 aa |
Translation table | |
GC content | 47% |
IMG OID | 638257206 |
Product | expressed protein |
Protein accession | XP_571247 |
Protein GI | 58268182 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3663] G:T/U mismatch-specific DNA glycosylase |
TIGRFAM ID | [TIGR00584] mismatch-specific thymine-DNA glycosylate (mug) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACAATGCCA AGCGACAGCG AAAATGCCCC AAGCCCAGCA TCCAAAGCGT TTGCAGACCG TCTAGCCAAA TACGCCTACG TCTCAACCTC TTCTTCGCCT TCACCTCGGA TGAACCCTCT CAGACGCTCC ACACGATCAC AAACCTCAAT TTCAGACGCC GCTAAGCCGT CTATACCTTT AACAACATCT CAAGAGAGGA TATACGATAG AGGATCACCC CTGAAAAAAG TAGCTTCATC CTCGTCTTCT TCCATATCGT TGTCCGCGAC AAAGAAGCGC TCAGCAAAAT CAATCTCAAA GGAGGACGAC GAGTTTCAAG ATGTCGATAC TGAAAAAGAG ACCGATGATG AGGATTACAA AGGAGGGAGA TTAAATTCGG GGAATCTGAA AGGCGGGGGA AAATCACCGA TGAAGAAAGG AATCAAGAAG CCAAGGGGTT ACGCTGCTCC CGAAATGTAT GAGCATCTCA GACCGCTGAA TGACCTGTTA GTTAAGGGAC AAGATTGTGA GTGGTACCTC CTAAACCCGT TGAAAAGGCT CCTGTATCTG GTGTGTTCAA AAAGGGACGA TCGGGACTTG GAAAGCTAAT ACAGTGCATT TTTGGTGGCA GTGGTGTTTT GTGGTATCAA GTGAGTCTTT GGTCCACTTT ATCCCTTCCG GACATTTGAT TTCAAGTACA AGACTGACAA ACTATCTACT GTAGTCCAGG TGGGACAAGT GTCTCAATAT ATATGGTCCC AAGGGTCTCA CAAGACGAAT AGGGAAAATG TCTTCTACTT TGGGCCATCA TTTCGCCCAT CCAACCAACA AGTTTTGGGT ACGTATTGTG ACTCCCAGTT TGTCCATGGT CTTTCCAAGC TTTAATTTTG ACAATACCAC CGTTTTGCAC CATTATATAG AAAGCTCTTT ATCAGTCGGG TGAGACTCCC TATAACTTCA TCGTACGAAC ATAGGATAAT GTTAATTCGT AATCTCCAAC AGGGCTTACT TCAAGATTGA TGTCGCCAAC TGAAGACTAT AAAGTCGTTG ACGAGTATAA CTATGGTCTC GTGCGTCTGC GTGTCGGCTG CTATTTTCCT AGTTTCTTTG AGATCTAACA CCTTTTGCAG ACCAATCTTG TGGGCAGACC GACTTCCGAG GTACGTTCGC AGTCTTCTAC TACGCAATGT TCGGCACTAA TCACAAGAAC TCTAAATTCC CTCCTTCAAC AATAACGGTA CAGCAAAGTG AATTATCTAC CCTGGAAATG AAGCTGAACA CTATCAACTT GTTACAAAAG TTCATAAAAT ACCAGCCCAG TGTCGTCTGT TTTGTGGGGA AGAAGATCTG GGATGTATTT GAGAGTGTAG TTGGGAAGAC GGCCGAGTTT GACGAGACTG TGAAACAAAA GGTCAAGCTG GAGGGTGAAG TTGAAAATGG AGAAGGCAGC GGTGACGGGG GAAAAGGAAG GAGCGTGATG ACACTCGCTC CGACACCGGA AAGAGGAACG GTCAAGATCG AGCCGGCCGA ACTGGGTGAT CAGCCGTCCC TACTTCCACT CTCGGCCAGT CCATCGAAAA CCCCGCAACT TACACCGGCC CAAACTGCTT CACTCTCTCC AGCCAAAGCT AGAGCGAAAA AAGGAGCTAT CAAGCCTCCA AAAACACCCT TTAGCTTCTC CCAACCTCGT GCACTGAGAA TTCCGCATCC TCCTGAAGAC AATGGAGGTG AAATCAAGTA TACGTATTTC TTTGTGGTAC CGAGTACTTC GGGGCTGGAA AGGACACCTG TGAGTCACGG CCAATCCCCC AAGGGTGTTG TTATCAGGTC TATTGACCGA TGTGAAAGTT TCCGGAACAA GTGGCCAATT TTGCAGCCCT CAAGGCATTG GTGGATGAGC TGAAAAAGGG TAGATCCCTG CAGGGCGATT TTCTGAATAT CAGCGTGGAA GGTGTAGAGG GGACTGTGGA AGATATGCGG CGGGCAGCTA TCTTAAAAAA TGCTCTATGA
|
Protein sequence | MPSDSENAPS PASKAFADRL AKYAYVSTSS SPSPRMNPLR RSTRSQTSIS DAAKPSIPLT TSQERIYDRG SPLKKVASSS SSSISLSATK KRSAKSISKE DDEFQDVDTE KETDDEDYKG GRLNSGNLKG GGKSPMKKGI KKPRGYAAPE MYEHLRPLND LLVKGQDLVF CGIKRIGKMS STLGHHFAHP TNKFWKALYQ SGLTSRLMSP TEDYKVVDEY NYGLTNLVGR PTSEQSELST LEMKLNTINL LQKFIKYQPS VVCFVGKKIW DVFESVVGKT AEFDETVKQK VKLEGEVENG EGSGDGGKGR SVMTLAPTPE RGTVKIEPAE LGDQPSLLPL SASPSKTPQL TPAQTASLSP AKARAKKGAI KPPKTPFSFS QPRALRIPHP PEDNGGEIKY TYFFVVPSTS GLERTPFPEQ VANFAALKAL VDELKKGRSL QGDFLNISVE GVEGTVEDMR RAAILKNAL
|
| |