Gene CNF00820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF00820 
Symbol 
ID3258426 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp268801 
End bp270790 
Gene Length1990 bp 
Protein Length469 aa 
Translation table 
GC content47% 
IMG OID638257206 
Productexpressed protein 
Protein accessionXP_571247 
Protein GI58268182 
COG category[L] Replication, recombination and repair 
COG ID[COG3663] G:T/U mismatch-specific DNA glycosylase 
TIGRFAM ID[TIGR00584] mismatch-specific thymine-DNA glycosylate (mug) 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AACAATGCCA AGCGACAGCG AAAATGCCCC AAGCCCAGCA TCCAAAGCGT TTGCAGACCG 
TCTAGCCAAA TACGCCTACG TCTCAACCTC TTCTTCGCCT TCACCTCGGA TGAACCCTCT
CAGACGCTCC ACACGATCAC AAACCTCAAT TTCAGACGCC GCTAAGCCGT CTATACCTTT
AACAACATCT CAAGAGAGGA TATACGATAG AGGATCACCC CTGAAAAAAG TAGCTTCATC
CTCGTCTTCT TCCATATCGT TGTCCGCGAC AAAGAAGCGC TCAGCAAAAT CAATCTCAAA
GGAGGACGAC GAGTTTCAAG ATGTCGATAC TGAAAAAGAG ACCGATGATG AGGATTACAA
AGGAGGGAGA TTAAATTCGG GGAATCTGAA AGGCGGGGGA AAATCACCGA TGAAGAAAGG
AATCAAGAAG CCAAGGGGTT ACGCTGCTCC CGAAATGTAT GAGCATCTCA GACCGCTGAA
TGACCTGTTA GTTAAGGGAC AAGATTGTGA GTGGTACCTC CTAAACCCGT TGAAAAGGCT
CCTGTATCTG GTGTGTTCAA AAAGGGACGA TCGGGACTTG GAAAGCTAAT ACAGTGCATT
TTTGGTGGCA GTGGTGTTTT GTGGTATCAA GTGAGTCTTT GGTCCACTTT ATCCCTTCCG
GACATTTGAT TTCAAGTACA AGACTGACAA ACTATCTACT GTAGTCCAGG TGGGACAAGT
GTCTCAATAT ATATGGTCCC AAGGGTCTCA CAAGACGAAT AGGGAAAATG TCTTCTACTT
TGGGCCATCA TTTCGCCCAT CCAACCAACA AGTTTTGGGT ACGTATTGTG ACTCCCAGTT
TGTCCATGGT CTTTCCAAGC TTTAATTTTG ACAATACCAC CGTTTTGCAC CATTATATAG
AAAGCTCTTT ATCAGTCGGG TGAGACTCCC TATAACTTCA TCGTACGAAC ATAGGATAAT
GTTAATTCGT AATCTCCAAC AGGGCTTACT TCAAGATTGA TGTCGCCAAC TGAAGACTAT
AAAGTCGTTG ACGAGTATAA CTATGGTCTC GTGCGTCTGC GTGTCGGCTG CTATTTTCCT
AGTTTCTTTG AGATCTAACA CCTTTTGCAG ACCAATCTTG TGGGCAGACC GACTTCCGAG
GTACGTTCGC AGTCTTCTAC TACGCAATGT TCGGCACTAA TCACAAGAAC TCTAAATTCC
CTCCTTCAAC AATAACGGTA CAGCAAAGTG AATTATCTAC CCTGGAAATG AAGCTGAACA
CTATCAACTT GTTACAAAAG TTCATAAAAT ACCAGCCCAG TGTCGTCTGT TTTGTGGGGA
AGAAGATCTG GGATGTATTT GAGAGTGTAG TTGGGAAGAC GGCCGAGTTT GACGAGACTG
TGAAACAAAA GGTCAAGCTG GAGGGTGAAG TTGAAAATGG AGAAGGCAGC GGTGACGGGG
GAAAAGGAAG GAGCGTGATG ACACTCGCTC CGACACCGGA AAGAGGAACG GTCAAGATCG
AGCCGGCCGA ACTGGGTGAT CAGCCGTCCC TACTTCCACT CTCGGCCAGT CCATCGAAAA
CCCCGCAACT TACACCGGCC CAAACTGCTT CACTCTCTCC AGCCAAAGCT AGAGCGAAAA
AAGGAGCTAT CAAGCCTCCA AAAACACCCT TTAGCTTCTC CCAACCTCGT GCACTGAGAA
TTCCGCATCC TCCTGAAGAC AATGGAGGTG AAATCAAGTA TACGTATTTC TTTGTGGTAC
CGAGTACTTC GGGGCTGGAA AGGACACCTG TGAGTCACGG CCAATCCCCC AAGGGTGTTG
TTATCAGGTC TATTGACCGA TGTGAAAGTT TCCGGAACAA GTGGCCAATT TTGCAGCCCT
CAAGGCATTG GTGGATGAGC TGAAAAAGGG TAGATCCCTG CAGGGCGATT TTCTGAATAT
CAGCGTGGAA GGTGTAGAGG GGACTGTGGA AGATATGCGG CGGGCAGCTA TCTTAAAAAA
TGCTCTATGA
 
Protein sequence
MPSDSENAPS PASKAFADRL AKYAYVSTSS SPSPRMNPLR RSTRSQTSIS DAAKPSIPLT 
TSQERIYDRG SPLKKVASSS SSSISLSATK KRSAKSISKE DDEFQDVDTE KETDDEDYKG
GRLNSGNLKG GGKSPMKKGI KKPRGYAAPE MYEHLRPLND LLVKGQDLVF CGIKRIGKMS
STLGHHFAHP TNKFWKALYQ SGLTSRLMSP TEDYKVVDEY NYGLTNLVGR PTSEQSELST
LEMKLNTINL LQKFIKYQPS VVCFVGKKIW DVFESVVGKT AEFDETVKQK VKLEGEVENG
EGSGDGGKGR SVMTLAPTPE RGTVKIEPAE LGDQPSLLPL SASPSKTPQL TPAQTASLSP
AKARAKKGAI KPPKTPFSFS QPRALRIPHP PEDNGGEIKY TYFFVVPSTS GLERTPFPEQ
VANFAALKAL VDELKKGRSL QGDFLNISVE GVEGTVEDMR RAAILKNAL