Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH02710 |
Symbol | |
ID | 3259267 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 359085 |
End bp | 362189 |
Gene Length | 3105 bp |
Protein Length | 773 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258214 |
Product | hypothetical protein |
Protein accession | XP_572444 |
Protein GI | 58270576 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1236] Predicted exonuclease of the beta-lactamase fold involved in RNA processing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCACATCTCC GTCGCAACAC CCACACCAGT TCCAGATGAT TCCAAGACGA CACCACTTCA AGCCGGCTCC CCAACCAACA GTACAAGTTC TCCAACCTCC GGACGAAGAT GCCCCCTCGC TCACAATCAC TATGCTCGGC GCAGGCCAGG AAGTGGGCAG GTCCTGTTGT GTCATAGAGC ACAGAGGAAA GAAGATTGTA TGCGATGCCG GCCTGCATCC AGCACAGCCT GGTATAGGAG CTCTACCATT CATCGATGAA CTTGATTGGT CGACTGTGGA TGCGATGTTG ATCACTCAGT AAATCCTGTC CAAAATTTCG AGCCTGTCAC ATATATGAGC TGACCGCCTA TTTGTTAGTT TTCATGTCGA TCATGCAGCC GCTTTGCCGT ATATCATGGA GAAGGTATAC TTTGGTCTTT TGAGATCATG GTGGAAACTT GCTGACAGTA TATCACAGAC CAATTTCAAA GACGGTAACG GCAAAGTGTA CATGACGCAC GCTACAAAAG CTATCTATGG ATTGACCATG ATGGACACTG TGCGATTGAA GTAAGTTCAA TCTCTCTTTT CGACTCCCAT CCACTCCTGA CCCATATTAT CCCTTGCAGC GATCAAAATC CAGACACTTC CGGTCGCCTA TACGACGAAG CCGACGTCCA ATCATCCTGG CAATCCACCA TAGCAGTCGA CTATCATCAA GATATTGTTA TCGCTGGTGG TCTACGTTTC ACCCCCTACC ATGCCGGCCA TGTCCTTGGA GCGTCCATGT TCCTCATCGA GATTGCTGGG TTGAAGATCC TGTATACAGG AGACTATTCA AGGGAGGAGG ACCGACATCT GGTGATGGCG GAGATTCCAC CCGTGAAACC TGATGTGATG ATTTGCGAGA GCACGTTTGG CGTGCATACA TTACCAGACA GGAAGGAGAA GGAGGAACAA TTCACAAGTA AGCACCATCA AGAAAATTAC CCTTTCTTCG TTTGCCTGAC TAACGCGAAT GATCAATAAC AGCGTTGGTC GCCAACATTG TCCGAAGAGG TGGCCGATGC CTCATGCCCA TCCCCTCCTT CGGAAACGGC CAAGAACTCG CCCTTCTCCT CGACGAATAC TGGAACGACC ACCCCGAACT TCAAAACATC CCTGTCTACT TTGCATCCTC TCTTTTCCAA CGCGGCATGC GTGTCTACAA AACCTACGTC CACACTATGA ATGCCAATAT CCGATCACGG TTCGCCAGGA GAGATAACCC CTTTGACTTT AGGTTTGTCA AGTGGTTGAA AGATCCGCAG AAGCTTAGAG AGAATAAGGG TCCTTGTGTG ATCATGTCTT CACCTCAGTT TATGAGTTTT GGACTCAGTC GTGATCTGTT GGAAGAGTGG GCGCCGGATT CTAAGAACGG GGTGATTGTC ACTGGGTACT CCATCGAAGG TACTATGGCC AGGGTACGTA TCATCATTTT TCCCCTTTCT TCCTGGTTTC AATCTGCTCT TCTCTGACAA AAAAAATCAT TTTTAGACTC TCTTGAGCGA ACCGGACCAC ATCGAATCCC TCAAAGGAGG CAACGTCCCC CGCCGCTTAA CAGTTAAAGA AATCTCTTTC GGCGCTCACG TCGATTATGC TCAAAATTCA AAATTCATCC AAGAAATCGG TGCTCAGCAC GTTGTCCTCG TGCATGGAGA GGCTTCGCAG ATGGGAAGAT TGAGAGCGGC GTTGAGAGAT ACATATGCGG CCAAGGGGCA GGAGATTAAT ATCCATACGC CAAAAAATTG TGAACCTCTG ACTCTTACTT TTAGACAAGA GCGGATGGTC AAAGTGAGTA TTCTCTTTCC CCTTCCTTTT GAAACACTTC CCCCCTCATC AGATAATTCG TTAATCACTC AAATTCTCCG CCAGGCTATT GGCTCCTTAG CAGCTACTCG CCCTGAACAC GGTACCTCCG TCAAAGGTCT TCTCGTTTCC AAAGATTTCT CTTACACTCT CCTTTCCCCG GCCGATTTAC ATGATTTCAC TGGCCTCTCA ACGAGCACGA TCATCCAAAA ACAGGGAGTG GCGATAAGTG TAGATTGGGC GGTGGTGAGG TGGTATCTGG AGGGGATGTA TGGGGAAGTG GAGGAAGGTG TTGAGGAAGA GGGGAAAGCT GCTTTTATTG TGAGTATTTT GTTCTCATTT ATCTAATTAT ATTAGTTTTT CAATTTCCAA TATACATTTG CCTTAAACAT AATAAACTTC CTTTTCTGCC AATCTATTCT GGTCTGGTGA GCTGATTGAA ACTATCTTTC CCAATAGATA ATGAACGGAG TTCAAGTGGT GCAGATATCT CCAACCGCCG TAGAACTACG ATGGAAGTCA AGTTCAAGTA ACGATATGAT TGCCGATTCG GCTTTGGCTT TGTTGTTGGG TATAGATGGG AGCCCTGCTA CAGCTAAGCG TAAGTGTATT TTTCTTCTAT TCATCTATAC TGTTGTATAC TACTTGCCGC CAGGTGCGGG GGCTGATTTG CTTTCCGGGG ATGTTTAGTC ACCGCATCAC CAAACAAACA CGCTTGCAAC CATTCCAATT CCCATTCCCA TACCGACCTG TATCCCCACA CCTACCCGGG CGACAAGTCC GCTAAAGACG TAGCTTCCAA CCCCGAATTT GAGAGATTAC GCATGTTCCT CGAAGCGCAT TTCGGGCATG TAGAGGGACC GAATTTGAGA CCACCTCTTC CTCCGGGAGC GGATGGGGAT GGAAATGATG ATAAGGACAA AGATGGGGAC GATTGGTTGA CTATGGATGT GAAGCTTGAC AATCAGACAG CGCGGATAGA TCTAATTTCC ATGGTAAGTC TTCAAGCACT GACTTTTCCT TTTTAATCAA CGCTCTGTCA TGTTTAGCTG ACACTTCCGT CTCTCTCTCT GCCATTAGCG TGTGGAGTCT GAATCAGCTG AGCTTCAGAA ACGGGTGGAA ACAGTGTTGG AGATGGCGTT GACGACTGTC AAGTCTCTGT CACAAACGTT TTTGGGAGGG GGGCTGGACG TTGATATGGT GAAAGTAGAG CCTAACGAGA GCGATAGTTG AATGTAGCAT CGTTTGCATG GATTCCAAAC CTTCCACTGA GGATT
|
Protein sequence | MIPRRHHFKP APQPTVQVLQ PPDEDAPSLT ITMLGAGQEV GRSCCVIEHR GKKIVCDAGL HPAQPGIGAL PFIDELDWST VDAMLITHFH VDHAAALPYI MEKTNFKDGN GKVYMTHATK AIYGLTMMDT VRLNDQNPDT SGRLYDEADV QSSWQSTIAV DYHQDIVIAG GLRFTPYHAG HVLGASMFLI EIAGLKILYT GDYSREEDRH LVMAEIPPVK PDVMICESTF GVHTLPDRKE KEEQFTTLVA NIVRRGGRCL MPIPSFGNGQ ELALLLDEYW NDHPELQNIP VYFASSLFQR GMRVYKTYVH TMNANIRSRF ARRDNPFDFR FVKWLKDPQK LRENKGPCVI MSSPQFMSFG LSRDLLEEWA PDSKNGVIVT GYSIEGTMAR TLLSEPDHIE SLKGGNVPRR LTVKEISFGA HVDYAQNSKF IQEIGAQHVV LVHGEASQMG RLRAALRDTY AAKGQEINIH TPKNCEPLTL TFRQERMVKA IGSLAATRPE HGTSVKGLLV SKDFSYTLLS PADLHDFTGL STSTIIQKQG VAISVDWAVV RWYLEGMYGE VEEGVEEEGK AAFIIMNGVQ VVQISPTAVE LRWKSSSSND MIADSALALL LGIDGSPATA KLTASPNKHA CNHSNSHSHT DLYPHTYPGD KSAKDVASNP EFERLRMFLE AHFGHVEGPN LRPPLPPGAD GDGNDDKDKD GDDWLTMDVK LDNQTARIDL ISMRVESESA ELQKRVETVL EMALTTVKSL SQTFLGGGLD VDMVKVEPNE SDS
|
| |