Gene CNH03370 details

Gene Information

Locus tag: CNH03370
Symbol:
ID: 3259050
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 145821
End bp: 148874
Gene Length: 3054 bp
Protein Length: 928 aa
Translation table:
GC content: 57%
IMG OID: 638258147
Product: ubiquitin-protein ligase, putative
Protein accession: XP_572505
Protein GI: 58270698
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
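
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it does reproduce the listed 3054 bp). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 145821
END_BP = 148874
PROTEIN_LENGTH_AA = 928


def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


span = gene_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)
print(span)        # 3054
print(cds)         # 2787
print(span - cds)  # 267
```

Under these assumptions, 267 bp of the span do not code for the 928 aa protein. That is at least consistent with the record marking the gene as spliced, although the record itself does not list intron positions.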


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.307571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAGCGAGCAC TGGAACACGC AGCATGGCAC CCAGCCACTC CCACCCCGCC CACTACCGCC 
GTCCGCCATC GCCGCTCCCC GATCTGGGTG CCCCTCTCTT CTCCCTCGCC CCGTCCGCGT
CGGCGTCTGC GTCGTCTCGC ACTGCGCGGC CACACGGGGA CACCGACAGC GTACAAGACG
TAGCCGAGAT GTCACTCGAA AGCCTAACCC TCTCGCCGCC GACCACAGCA ACAACCATCA
CCACGACAGT AGCGCCGCCG ACGCCGACGA CGCCCACATC TAGTAGTACA GCGGCGGCCA
CACATCCCGT CGCAACAGGA GGCACAGCTG CAGAGTATCC AGAGACGGCG GCTGCTGGAA
TCGCAACAGG CGCAGTGTTC ACGGGAATAG GTATCAACCC TCCTACACCG GCACCGAGTC
CCGGCCCGCA TATCCGTGGA TTTGACGATA GTGATACCCA TATACGAGAG CATCTCGAAC
AACTGCTTTA TCACGGTGAT ACAACCACCG ATCCCCACGG CCAAAACCTC GATCATGTGC
AAGTGCAAGC AAAACCGCAT GCAAAACCGC AAGCGCAAGG TAAAACCCTA CCCCATGAGC
CGTACCCCTC ACCATCTCCA GCACCAACAC CAACGCCAAC GCCACAAACC GTCCTATCCC
AATACACCTC CCTCCCACCC CTCCTTCGCT CCAAATTCCT AAACCTCCTT ATCCCCCATC
TAGCACTGCA TGAAGCGCTC TCCCTTTCGC GCAAGATTGA ACCGCTTTTG AGAAGGGACT
TTTTGAGAGA GTTGCCGTGG GAGGTGGCGT TGCATGTCTT GAGCTTTGTA AGTCGCGTCT
TTTTCATTTT CTTTTCCCCC ACAGTTAGAT GGGGTGAGCT GAGTTCGATT TATTGGGAAA
CAGGTAGACG ATCCTCAAAC ACTCGCTCGA GCGGCGCAAG TCTCAAGATA CTGGAATACA
CTCTTACAAG ACGAGACGAC ATGGCGCGAT CTACTGGCTA GACATCATCA TCGGCATCGG
TATGCGAGTG GTGGTGGGAT TGGGATTTCG GGTCGGGACC CGCCGCAGCA TGCTCGGGCG
ACGTCCAGAG AGGGAATGGA AGGGATGACG GGTAAAGAGG AGGAAGAAAA GGGGTGGGAT
GTGGGTTCGG GATTAGGACC GCCTGCTGAT CCAAACGCCG CCACCAGCAC CAACACCAAC
ACCAACACCA ACACCAGCAG CAGACGGGGA GGATCTACCT CCTCAAGTCA AGCGGGGACA
AGCAAACAAC GGTCAAAAAC GCGAGACAAC GCTTTGGGAA CCGGAAGCGG GACCAAGCAA
GCATGCTGCA TCAAGCGTGT CACCCCGTTC GGTCTCGAAA AACGCGTTCT CAACCTCTCT
AGCGGTCTCA CCGGTCTCCC TCCTCCCATC TCCCCCTTCT CCCCCTCCCC CTCCCCCTTC
CCCTCCCCTT CCCCTTCCTC TCAACACTCT TACAAAACCC GCGTCAAACT CGCCTATCTC
ACCGAAACAA ACTGGCTCAC CGCCGGCGTC CTCCTCGCCA AACACCTCTC CGCCGACGAC
TCGGTCGTCA CCACGCTGAG CTTTGACGAA ACCTGGATAG TGGTAGGCAT GGCGAATAGC
AAGATACATG TGTTTAGCGC GTTTAATGGC GGGTGGAAGA GGAGTTTGGA AGGGAGGGCG
GCGCATACGC AGGGTGTTTG GGCGATGGTG CTGGTCTCCC CACCGCCGCG AACGGCAACG
GCAACGCAAA CGGCAACGGC AAATCAGATG CAGATGCAGG AGCAGGATAT GAAGCGGATG
AGGGGAACGA GAGGGATGGG GTGGTATGGC GCCCAGGCAT CGGCATCGAC GTCGACGTCG
ACATCACCAA ACGGTTGGTT CGGCATCAAC CCCCCATCCA GTTATCCACA CCCCCAAACC
CAAACCCAAA CCCAAACCCA AAGCGAGACA CAAGAGAACG AGACGCCGTA CCAGCATGGC
CGCGAGAATC AAAGCAGAGG CCAAGACCAA GGTGGAAATG AAAATGAAGA TGAAGACGAG
AATGAGGGAA GCAATTTATG CGGTAGCGTA AGAGGGTGGC AGGGGTTGAA AACGAGTTTG
GTAGTCTCGG GCGGTTGTGA TAAGCAAGTC AAAGTATGGG ATGTCGAAAC CGGGTACGTT
TCCCCGACTT CTCCCCCCCC CCCCCCCCAA CCACATGCTG ATCGGCGCAA CTTTTTTTAG
CCAATGCATC CACTCCCTCC CCGGCCATAC CTCCACCATC CGCTGTATCA AAGTCCTCCC
CCACCGGCCT ATCGCCGTCT CCGGCTCCCG CGACTACACT CTCCGCGTAT GGGATATCCA
GCGGGGGCGA TGTCTGCATA CTCTGAGAGG CCATACGAAG AGCGTGAGGT GTGTAGAGAT
TTGGGGGAAT ATGGCTGTTT CGGGGAGTTA TGATAATACC GCCAAGGTAA GTCCAAAACT
CCACTCTCTC TCTCTCTCTC GTACCTTGTC CATACGCTAA CATGGAATAC ACAGTTGTGG
AACCTCGATA CAGGCGAATG CCTCCAAACA TTCACCGGCC ACTACTCCCA AATCTACTCC
ATCGCATTCA ACGGTTCCCT CGTCATCACC GGCTCCCTCG ACTCCACCGT CAGGGTCTGG
TCCCCGACCA CTGGCGAATG CCTCGCCCTC CTCCAAGGCC ACACCGCCCT CGTCGGCCAA
CTCCAACTGT CCGGCTCGAA ACTCGTGACC GGCGGTTCTG ACGGCCGGGT CATCATATTC
GACCTCTCCT CCATGTCCTG CATCCACCGT CTGTGTGCGC ACGATAACAG CGTGACGTGT
TTACAGTTTG ATAAGCGGTT CATCGTCTCC GGCGGGAACG ATGGGAGGGT GAAGCTGTGG
GATGTGAAGA CGGGCGGGTT TGTGAGGGAG CTGACGAAAC CTTGTGATGC GGTGTGGAGG
ACGAGTTTCA GAGGCGAGAG GATTGTGGTG CTTTGTCAGA GAGAGGGGAG GACCTGTTTG
GAGGTGGTCA GTTTTAGACC GGGAGAGGGG GAGAGAAGGG GGAAGGGGAT ATGA
 
Protein sequence
MAPSHSHPAH YRRPPSPLPD LGAPLFSLAP SASASASSRT ARPHGDTDSV QDVAEMSLES 
LTLSPPTTAT TITTTVAPPT PTTPTSSSTA AATHPVATGG TAAEYPETAA AGIATGAVFT
GIGINPPTPA PSPGPHIRGF DDSDTHIREH LEQLLYHGDT TTDPHGQNLD HVQVQAKPHA
KPQAQGKTLP HEPYPSPSPA PTPTPTPQTV LSQYTSLPPL LRSKFLNLLI PHLALHEALS
LSRKIEPLLR RDFLRELPWE VALHVLSFVI SRYWNTLLQD ETTWRDLLAR HHHRHRYASG
GGIGISGRDP PQHARATSRE GMEGMTGKEE EEKGWDVGSG LGPPADPNAA TSTNTNTNTN
TSSRRGGSTS SSQAGTSKQR SKTRDNALGT GSGTKQACCI KRVTPFGLEK RVLNLSSGLT
GLPPPISPFS PSPSPFPSPS PSSQHSYKTR VKLAYLTETN WLTAGVLLAK HLSADDSVVT
TLSFDETWIV VGMANSKIHV FSAFNGGWKR SLEGRAAHTQ GVWAMVLVSP PPRTATATQT
ATANQMQMQE QDMKRMRGTR GMGWYGAQAS ASTSTSTSPN GWFGINPPSS YPHPQTQTQT
QTQSETQENE TPYQHGRENQ SRGQDQGGNE NEDEDENEGS NLCGSVRGWQ GLKTSLVVSG
GCDKQVKVWD VETGQCIHSL PGHTSTIRCI KVLPHRPIAV SGSRDYTLRV WDIQRGRCLH
TLRGHTKSVR CVEIWGNMAV SGSYDNTAKL WNLDTGECLQ TFTGHYSQIY SIAFNGSLVI
TGSLDSTVRV WSPTTGECLA LLQGHTALVG QLQLSGSKLV TGGSDGRVII FDLSSMSCIH
RLCAHDNSVT CLQFDKRFIV SGGNDGRVKL WDVKTGGFVR ELTKPCDAVW RTSFRGERIV
VLCQREGRTC LEVVSFRPGE GERRGKGI
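
The sequences above are displayed in space-separated blocks of 10 residues, 60 per line. To compare the gene sequence with the listed Gene Length and GC content, the blocks first need to be joined. The following is a rough sketch of one way to do that; the helper names are hypothetical, and the demo input is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a short demo input.
demo = clean_sequence(
    "CAGCGAGCAC TGGAACACGC AGCATGGCAC CCAGCCACTC CCACCCCGCC CACTACCGCC"
)
print(len(demo))         # 60
print(gc_percent(demo))  # GC percentage of this one line only
```

Pasting the complete gene sequence into `clean_sequence` gives a length and a GC percentage that can be checked against the 3054 bp and 57% listed under Gene Information.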