Gene CNH02710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH02710 
Symbol 
ID3259267 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp359085 
End bp362189 
Gene Length3105 bp 
Protein Length773 aa 
Translation table 
GC content48% 
IMG OID638258214 
Producthypothetical protein 
Protein accessionXP_572444 
Protein GI58270576 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1236] Predicted exonuclease of the beta-lactamase fold involved in RNA processing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCACATCTCC GTCGCAACAC CCACACCAGT TCCAGATGAT TCCAAGACGA CACCACTTCA 
AGCCGGCTCC CCAACCAACA GTACAAGTTC TCCAACCTCC GGACGAAGAT GCCCCCTCGC
TCACAATCAC TATGCTCGGC GCAGGCCAGG AAGTGGGCAG GTCCTGTTGT GTCATAGAGC
ACAGAGGAAA GAAGATTGTA TGCGATGCCG GCCTGCATCC AGCACAGCCT GGTATAGGAG
CTCTACCATT CATCGATGAA CTTGATTGGT CGACTGTGGA TGCGATGTTG ATCACTCAGT
AAATCCTGTC CAAAATTTCG AGCCTGTCAC ATATATGAGC TGACCGCCTA TTTGTTAGTT
TTCATGTCGA TCATGCAGCC GCTTTGCCGT ATATCATGGA GAAGGTATAC TTTGGTCTTT
TGAGATCATG GTGGAAACTT GCTGACAGTA TATCACAGAC CAATTTCAAA GACGGTAACG
GCAAAGTGTA CATGACGCAC GCTACAAAAG CTATCTATGG ATTGACCATG ATGGACACTG
TGCGATTGAA GTAAGTTCAA TCTCTCTTTT CGACTCCCAT CCACTCCTGA CCCATATTAT
CCCTTGCAGC GATCAAAATC CAGACACTTC CGGTCGCCTA TACGACGAAG CCGACGTCCA
ATCATCCTGG CAATCCACCA TAGCAGTCGA CTATCATCAA GATATTGTTA TCGCTGGTGG
TCTACGTTTC ACCCCCTACC ATGCCGGCCA TGTCCTTGGA GCGTCCATGT TCCTCATCGA
GATTGCTGGG TTGAAGATCC TGTATACAGG AGACTATTCA AGGGAGGAGG ACCGACATCT
GGTGATGGCG GAGATTCCAC CCGTGAAACC TGATGTGATG ATTTGCGAGA GCACGTTTGG
CGTGCATACA TTACCAGACA GGAAGGAGAA GGAGGAACAA TTCACAAGTA AGCACCATCA
AGAAAATTAC CCTTTCTTCG TTTGCCTGAC TAACGCGAAT GATCAATAAC AGCGTTGGTC
GCCAACATTG TCCGAAGAGG TGGCCGATGC CTCATGCCCA TCCCCTCCTT CGGAAACGGC
CAAGAACTCG CCCTTCTCCT CGACGAATAC TGGAACGACC ACCCCGAACT TCAAAACATC
CCTGTCTACT TTGCATCCTC TCTTTTCCAA CGCGGCATGC GTGTCTACAA AACCTACGTC
CACACTATGA ATGCCAATAT CCGATCACGG TTCGCCAGGA GAGATAACCC CTTTGACTTT
AGGTTTGTCA AGTGGTTGAA AGATCCGCAG AAGCTTAGAG AGAATAAGGG TCCTTGTGTG
ATCATGTCTT CACCTCAGTT TATGAGTTTT GGACTCAGTC GTGATCTGTT GGAAGAGTGG
GCGCCGGATT CTAAGAACGG GGTGATTGTC ACTGGGTACT CCATCGAAGG TACTATGGCC
AGGGTACGTA TCATCATTTT TCCCCTTTCT TCCTGGTTTC AATCTGCTCT TCTCTGACAA
AAAAAATCAT TTTTAGACTC TCTTGAGCGA ACCGGACCAC ATCGAATCCC TCAAAGGAGG
CAACGTCCCC CGCCGCTTAA CAGTTAAAGA AATCTCTTTC GGCGCTCACG TCGATTATGC
TCAAAATTCA AAATTCATCC AAGAAATCGG TGCTCAGCAC GTTGTCCTCG TGCATGGAGA
GGCTTCGCAG ATGGGAAGAT TGAGAGCGGC GTTGAGAGAT ACATATGCGG CCAAGGGGCA
GGAGATTAAT ATCCATACGC CAAAAAATTG TGAACCTCTG ACTCTTACTT TTAGACAAGA
GCGGATGGTC AAAGTGAGTA TTCTCTTTCC CCTTCCTTTT GAAACACTTC CCCCCTCATC
AGATAATTCG TTAATCACTC AAATTCTCCG CCAGGCTATT GGCTCCTTAG CAGCTACTCG
CCCTGAACAC GGTACCTCCG TCAAAGGTCT TCTCGTTTCC AAAGATTTCT CTTACACTCT
CCTTTCCCCG GCCGATTTAC ATGATTTCAC TGGCCTCTCA ACGAGCACGA TCATCCAAAA
ACAGGGAGTG GCGATAAGTG TAGATTGGGC GGTGGTGAGG TGGTATCTGG AGGGGATGTA
TGGGGAAGTG GAGGAAGGTG TTGAGGAAGA GGGGAAAGCT GCTTTTATTG TGAGTATTTT
GTTCTCATTT ATCTAATTAT ATTAGTTTTT CAATTTCCAA TATACATTTG CCTTAAACAT
AATAAACTTC CTTTTCTGCC AATCTATTCT GGTCTGGTGA GCTGATTGAA ACTATCTTTC
CCAATAGATA ATGAACGGAG TTCAAGTGGT GCAGATATCT CCAACCGCCG TAGAACTACG
ATGGAAGTCA AGTTCAAGTA ACGATATGAT TGCCGATTCG GCTTTGGCTT TGTTGTTGGG
TATAGATGGG AGCCCTGCTA CAGCTAAGCG TAAGTGTATT TTTCTTCTAT TCATCTATAC
TGTTGTATAC TACTTGCCGC CAGGTGCGGG GGCTGATTTG CTTTCCGGGG ATGTTTAGTC
ACCGCATCAC CAAACAAACA CGCTTGCAAC CATTCCAATT CCCATTCCCA TACCGACCTG
TATCCCCACA CCTACCCGGG CGACAAGTCC GCTAAAGACG TAGCTTCCAA CCCCGAATTT
GAGAGATTAC GCATGTTCCT CGAAGCGCAT TTCGGGCATG TAGAGGGACC GAATTTGAGA
CCACCTCTTC CTCCGGGAGC GGATGGGGAT GGAAATGATG ATAAGGACAA AGATGGGGAC
GATTGGTTGA CTATGGATGT GAAGCTTGAC AATCAGACAG CGCGGATAGA TCTAATTTCC
ATGGTAAGTC TTCAAGCACT GACTTTTCCT TTTTAATCAA CGCTCTGTCA TGTTTAGCTG
ACACTTCCGT CTCTCTCTCT GCCATTAGCG TGTGGAGTCT GAATCAGCTG AGCTTCAGAA
ACGGGTGGAA ACAGTGTTGG AGATGGCGTT GACGACTGTC AAGTCTCTGT CACAAACGTT
TTTGGGAGGG GGGCTGGACG TTGATATGGT GAAAGTAGAG CCTAACGAGA GCGATAGTTG
AATGTAGCAT CGTTTGCATG GATTCCAAAC CTTCCACTGA GGATT
 
Protein sequence
MIPRRHHFKP APQPTVQVLQ PPDEDAPSLT ITMLGAGQEV GRSCCVIEHR GKKIVCDAGL 
HPAQPGIGAL PFIDELDWST VDAMLITHFH VDHAAALPYI MEKTNFKDGN GKVYMTHATK
AIYGLTMMDT VRLNDQNPDT SGRLYDEADV QSSWQSTIAV DYHQDIVIAG GLRFTPYHAG
HVLGASMFLI EIAGLKILYT GDYSREEDRH LVMAEIPPVK PDVMICESTF GVHTLPDRKE
KEEQFTTLVA NIVRRGGRCL MPIPSFGNGQ ELALLLDEYW NDHPELQNIP VYFASSLFQR
GMRVYKTYVH TMNANIRSRF ARRDNPFDFR FVKWLKDPQK LRENKGPCVI MSSPQFMSFG
LSRDLLEEWA PDSKNGVIVT GYSIEGTMAR TLLSEPDHIE SLKGGNVPRR LTVKEISFGA
HVDYAQNSKF IQEIGAQHVV LVHGEASQMG RLRAALRDTY AAKGQEINIH TPKNCEPLTL
TFRQERMVKA IGSLAATRPE HGTSVKGLLV SKDFSYTLLS PADLHDFTGL STSTIIQKQG
VAISVDWAVV RWYLEGMYGE VEEGVEEEGK AAFIIMNGVQ VVQISPTAVE LRWKSSSSND
MIADSALALL LGIDGSPATA KLTASPNKHA CNHSNSHSHT DLYPHTYPGD KSAKDVASNP
EFERLRMFLE AHFGHVEGPN LRPPLPPGAD GDGNDDKDKD GDDWLTMDVK LDNQTARIDL
ISMRVESESA ELQKRVETVL EMALTTVKSL SQTFLGGGLD VDMVKVEPNE SDS