Gene Information |
Locus tag | CNI00640 |
Symbol | |
ID | 3259495 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 152846 |
End bp | 157420 |
Gene Length | 4575 bp |
Protein Length | 1209 aa |
Translation table | |
GC content | 47% |
IMG OID | 638258548 |
Product | conserved hypothetical protein |
Protein accession | XP_572952 |
Protein GI | 58271592 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5077] Ubiquitin carboxyl-terminal hydrolase |
TIGRFAM ID | |
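
The coordinate and length rows above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported 4575 bp but is not stated in the record. The helper names `gene_length` and `max_coding_bp` are hypothetical, introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def max_coding_bp(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


length = gene_length(152846, 157420)   # values from the Start bp / End bp rows
coding = max_coding_bp(1209)           # value from the Protein Length row

print(length)            # 4575, matching the Gene Length row
print(length - coding)   # 945 bases not accounted for by codons
```

Under these assumptions, the 945 bases left over would correspond to introns and any untranslated sequence, which fits the "Is gene spliced | Yes" row.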
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.334737 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATTTCCAAGA CTGTCTCCCA TTTTATTGTC CAGTACTCTA CCAGCAAGTC GCTTTTATGG TGAAAGCCTT GCCAAAACCC TGCCAGGACT GGGACTGGGT CGGTACAGAG GTCCGAACGC CAGGTCAGAT CACGCTCGAA CATCGACGGC GTGCAGCTGG GCTTGTTACA AGCGTCGTTT GTCCTCGCGA CCTCACACAC CTTGGAAGAG AGACTACCAA GGATGATAGT GAGGTAAAAG GGAATGGAAA AGGAACTGGG GAGAAGAAGG GCACAGGATG TAGAGCAAAA AATTGCAAAT CGAATTACAT GTGTTACAAT AACCTTGGAA CGGAAAAGGT AAACTAAGGT TACTGATCGG CTGGGAGCTA ACAGTGCAGC TTTTGGAACC TGATGCGAAA GCAGAATTCG TTATTTCAAA CCTAGGAGAT GTTCCACAAG AGCGTAATGG GCCCGCTGGT TTAAGAAACC TCGGGGCGAC GTGCTATGTA AACCTTTTCT TTCCTCTCCG TGTACTTGCC CCATTCTAAT AAGCTTCTCA GGCAAACGCC TTCTTGCAAT TGTGGTTCCA CAACGTGCCT TTTCGTAATG CAGTCTATGC TTGCGTGACT ACAGAGGTCG GCCTTTTGCG TAAATGATCA GGAAGATCGT TGACAATGTT ATGATATGAA AATAGACCAC ACCACTCTAC CAATTAGCTC TCATCTTCGC TAAGCTGGAA TATGGCGAAA AGAATGTAGT GGATCCCATG GGTCTGATAG ACGCTCTGCG ACTCAATATG GGTGATCAAC AAGATGCAGC CGAGTACATT CTTACTCCCA GTCTCGCACG GGTCGTAGGC TGACAACACG ATAGGTTCTC AAAACTGTTC ATGTCATTGA TCGCATCAGA ATTCTCAAAA CATTCCGACC CTAAACTCAA AACGTTGGTT AAAGATCAGT TTGAAGGGAC AATGCAGTAT ATTACCCAGT GTGAATGTGG GTACGAAAGT ATCTCTGAGA GTAAGCCCTT TTTACTCCCC AAACGCTAGA CGTTCCTCCT GATTTTGGGA AGCCACTTTC CTCGAGATTG AACTCTCGCT GAAAGACAAT ACAACCCTTC AATCCCGCCT GGACGAATTC ACATGCCCTG AAATCCTTGA TGGGGACAAC AAATACTCTT GCCCCTCCTG CCTCTCTAAA CGCCGTGCAA CCCGTCGTCA GCTGCCCGTC ACCCTCCCTC CCGTTATCCA CTTCTCTCTT CTCAGGTTTG TCTTCGATCT CAAAAGCATG TCGAGAAAGA AGAGTAAAGC GTCGATAAAG TATCCGAAAG AGGCGGTGCT TGGGAATTCG GTATATGAAT TAAAAGGAAT TATATCTCAT CAAGGGACAA GCGTGAGTTT GGTCTGTAGG AGATGGTATT TTGAAACTGA TTACGGCTCC TAGGCGTATC ATGGTCATTT CGTTTGCGAG ACATATGATG AAAGCAATGA TACCTGGTAT ATCTGTAACG ACGAGTTGGT GCAGCCTAAA CCTGTCCGGC CGCACAAGAA AATCAAGCTT GAGAAACCTG GCGATGACAA GGGCAAGTTA GAATCATCCA AAGATGCCTA CATGCTCGTC TACAAACGGC GAGATGGCCA TGTGTCTCCC CAATTTCCAC CAGCGATTGT CATGGAGAAG GTTAAGGAGG AGAATAGAGG GTTGAGGGAA GAATTAAATA AAGTGGGGGT GAGGAAGGAA GTTTTGGAAG ATGAGTGGGA GCATCTAAAG GGGGCGAAGG TGGATGTTAT CAGGGATCTT CCTGGGGCAT GTTTTCTTCC 
TTCTCTTTAC TACCCACTGA TACGACTATG GCTAACTTTT TTTTTTCGAT AGACAGACTA CATTGTTCCT CGCGACGCCC TCGCCAAATG GATCCAGTCT CCCTCCTTTC AAGATCTCTA CAAGCCATTC GACTACTCAT CCATCCTCTG TGCTCATTCG CAAGTGGATC CCCTCAAATC TTCTGATATA CGTACAATAT CCGCGCTAGC TCATGATAAA CTTTTGTTAT ATACTTCTCT TCCCGAGATC GACGTCTGCC TAATATGTGT TGCGGAAGGT TTTGTCGCAA GGTCGAGTAT AACGGAGCAG CAGTCGGCGC TTGAGGCATT CGACGAACTC AATGCCAAGG CCGAGTTGGA AGAAGGGGGT GAAGAAGAAC GGTGGTGTCT TCCCAAAACG TGGCTCATCC ACTGGCGAAC AGGCAAACTT CCTCCTCAAA CCTTGCCCAC ACACTCTTCC TACACCCTCT TATGCCCCCA CAACGCCCCC TTACCTTCAT CTTCCGCCCC GCCCGTCACC TTCATAACCT CCTCTGCCCT TTCCCTCCTC CACTCCATCT TTGGCTCCTT CCCTTCGTTC CAACCCGGAA CACCCCCCTG CCCCGAGTGT TCCTTCGAGG CAGACCAAAA TGCAGAATCG TTGGCACAAT GGAAAACGGA TGTGAAGCTT GATAAATCTA TCAAACGGCA CCTTGACCCG CGGCCGCCTG CTTTTGGATT AGATTATTAT GTACTTCCAA AAGAGTTTAT TGAGAAATGG GAGGTGTATA TGAAGACTCC AGCGGGGGAG AAGCCAGAGT TGGATATGGG TCTGGGACGA GGGAGATGTG AGCATGGATT GTTGGACTGG GATCCGCAGA TGGAGAAACA GAGGGTGATT AGTGAGATCG GATGGGAGAT GCTTTGTCAG AAGTAAGTCA TTGGCGGGAA GAGTGACCAT TCCCTACTTT TATGTTTCCG TCAAAGAAAG AAAAAAGGCT AATAATTGAT TGAAGGTACG GTGAGAAAGA GCCAATTAAA GTACAATTTG GTGCCAACCC CCCGGAAGGA AAGAAAGTGA ACATTGCTTC TTTCACTCCT GCTGTCTGCG AGCCATGCAG GATTATCAGG TAAGTTCAAC TGCACGGCCT TGGATATTGC ATGGCAGGAC AGCTGACCTT GGATGGGTTA AAGATTATCA TCATACGACG AACTCGAGAT ACCAATCGTC TTCGCTCCTG GGCCGCCCAC ATCTTACAGC ACCCCCGCCT CTACAGGCAA TACCAGCGGG TCAAAGTCAG GATCAGGATC AAGCCGTAAC ACATCAAGGA CCCTACGGTC CCGGCTTAAA ACGCTATACA TCCAGGCCAC GAGACAAAGT ACAATCAAGG ATCTCAAAGT ATCCATTCTT TCTCAAACAG GAATCACCCC GCTTCTCCAG AAGATTTATT ACAAACGTCG AACCCAACCC CAAGAGCAAG AGGAATGTTC GGGGAAAGGG AATGAAGAAG AAAAAGAGCT AGATAATGAT TTGACGATTG GCAAGCTGGG ATATTTGAAA GGGGAGGAGT TGATATTGGT TGAAGTGAAA GAAGAAGGGA ATCTGGATGA TGATGATGAC ACTGTAGATG GAGGTAATGG AGGCAAGGGA AAAAATGGAA AGAATGAAGG ATTCGGAGGG ACGGCATTGT TGGCGAGGAT TGCATGTCCG GATTGTACTT ATGAAAATGA TGGGGCAGCG GAATGCTGTG AGATGTGCAT GAGAGTGAGT ATTTTCCCCC AGGCTTGTGT TGCCACAGGA CGCTTTGCTA ATGGATTTGC TGAATTTTTT 
CTAGCCATTT AAATATGATT GAACGTTGAT CCGTTATGGG CATGGCTCAG GAGTTCACGA GAAGTTGTTA TGCTTTACGA ACCATCCTCT CCATTACATA CAGGTATAGA TTGCAAACCA TGCTAAGATT GTATAAACAG GATATGTATG ACAGTATTCG GACTTGCTCT CTTTATACCC CCGGAAGCGA TTACGCGCAG GAAGAGAATC GGCGAGTAAC CATCCTGGAT CTGGCGAGTA ACTCCATCGA ATGTTCGGCC CTGCAAGCAT GGGTGGGGAC TACTGGTGGG ATGACGGAAG AGGATGGCGA GTCAGGATGG GAGGAATGGA TTGTGGCTTT CAAGGATGCT GCGATGTCGT TAGAATGGAC GATGAAAGAA TTTCGTTCAC TTTACCGTCT GCAATTCAAC AACTTGGTCT ACTTTCAATC ACAAGCTACC CAACATCGCC GCCATTTGAT GGGTACCGAC CTCTATCCCA GCGACAAGAA TATGGCCGAT TTCTATCGCA CTGTTTGTCC TAAGAATCTG TTCCTTTGAT GACAAAGCGG AGTTTAACAA TGGAAGTTTG AATGAGGTTC GTACTCAACT ACTACAGTTA GAAGTTAGCA AGTACTTGGA GGCGAAAAAG GAGCAGCCTG CACCCACTCG TGCCAAGGCT CACAATCCGG TGACCCCACA GACCCAACCC ACACCTGGCT CACCCTGTCA GCTATTATTT ATTATAAACC TCAAAATCTT TTACCTTACT ATCTCTCTCC AGCTGGTAAA CACATTCGCG AAATCTTCAC CAATAAGAAC CGGTGCAACG ACTGCAGCCA AGTTGGTCAC AACTACAAAA CCTGTCCAAA GCGACTCTAC TCGCCCGGTA TCATCCGTCC GATAAAAATC GTCCAAATCC TCGTTACTCA CCTTCAACGC CATCACCCCT TTACATCCAC TTCTCTTTCT GCATCTTCTT TTTAG
|
Protein sequence | MVKALPKPCQ DWDWVGTEVR TPGQITLEHR RRAAGLVTSV VCPRDLTHLG RETTKDDSEV KGNGKGTGEK KGTGCRAKNC KSNYMCYNNL GTEKLLEPDA KAEFVISNLG DVPQERNGPA GLRNLGATCY ANAFLQLWFH NVPFRNAVYA CVTTETTPLY QLALIFAKLE YGEKNVVDPM GLIDALRLNM GDQQDAAEFS KLFMSLIASE FSKHSDPKLK TLVKDQFEGT MQYITQCECG YESISETTFL EIELSLKDNT TLQSRLDEFT CPEILDGDNK YSCPSCLSKR RATRRQLPVT LPPVIHFSLL RFVFDLKSMS RKKSKASIKY PKEAVLGNSV YELKGIISHQ GTSAYHGHFV CETYDESNDT WYICNDELVQ PKPVRPHKKI KLEKPGDDKG KLESSKDAYM LVYKRRDGHV SPQFPPAIVM EKVKEENRGL REELNKVGVR KEVLEDEWEH LKGAKVDTDY IVPRDALAKW IQSPSFQDLY KPFDYSSILC AHSQVDPLKS SDIRTISALA HDKLLLYTSL PEIDVCLICV AEGFVARSSI TEQQSALEAF DELNAKAELE EGGEEERWCL PKTWLIHWRT GKLPPQTLPT HSSYTLLCPH NAPLPSSSAP PVTFITSSAL SLLHSIFGSF PSFQPGTPPC PECSFEADQN AESLAQWKTD VKLDKSIKRH LDPRPPAFGL DYYVLPKEFI EKWEVYMKTP AGEKPELDMG LGRGRCEHGL LDWDPQMEKQ RVISEIGWEM LCQKYGEKEP IKVQFGANPP EGKKVNIASF TPAVCEPCRI IRLSSYDELE IPIVFAPGPP TSYSTPASTG NTSGSKSGSG SSRNTSRTLR SRLKTLYIQA TRQSTIKDLK VSILSQTGIT PLLQKIYYKR RTQPQEQEEC SGKGNEEEKE LDNDLTIGKL GYLKGEELIL VEVKEEGNLD DDDDTVDGGN GGKGKNGKNE GFGGTALLAR IACPDCTYEN DGAAECCEMC MRDMYDSIRT CSLYTPGSDY AQEENRRVTI LDLASNSIEC SALQAWVGTT GGMTEEDGES GWEEWIVAFK DAAMSLEWTM KEFRSLYRLQ FNNLVYFQSQ ATQHRRHLMG TDLYPSDKNM ADFYRTLEVS KYLEAKKEQP APTRAKAHNP VTPQTQPTPG SPSGKHIREI FTNKNRCNDC SQVGHNYKTC PKRLYSPGII RPIKIVQILV THLQRHHPFT STSLSASSF
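
The sequence fields above are printed in space-separated blocks of ten residues. A minimal sketch of how such a field could be cleaned up and its GC fraction computed is shown below. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the Gene sequence field, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the Gene sequence field.
sample = "ATTTCCAAGA CTGTCTCCCA"
print(len(clean_sequence(sample)))  # 20
print(gc_fraction(sample))          # 0.45
```

Running `gc_fraction` on the complete Gene sequence field gives a value that can be compared with the GC content row.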
|