Gene CNH01820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH01820 
Symbol 
ID3259172 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp609141 
End bp613541 
Gene Length4401 bp 
Protein Length1099 aa 
Translation table 
GC content47% 
IMG OID638258306 
Productconserved hypothetical protein 
Protein accessionXP_572356 
Protein GI58270400 
COG category[L] Replication, recombination and repair 
COG ID[COG1948] ERCC4-type nuclease 
TIGRFAM ID[TIGR00596] DNA repair protein (rad1) 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAAGTCAAA ATCGCTGCTT TGTTCCATAA ATGGGGTATT GAAGGTCTTC CAATACTCTG 
GCTTTCCATT ACATATCCAC ACAATGTCCA CAAAGCCACG GCGACACATC CCTTATCTTC
CTTTCCACAA AAATCTCATT CGCACTCTCT GCCGACCTCA GCAGGACGAC CTCTTACTCA
TTGCAAAGGG ACTGGGTCTC CGTCGCATCG TCTGTGCCCT TCTAAAGACT TATGATAGAA
AAGAAGATTT GGTGCTTGTT GTCGGTGCAA CTCCAGCCGA CGAGGCAGGG ATAGGAGATG
AACTGGGCAT TATGGGCGTG CGGGATCCGG GGTTCAGAGT TGTCGGCTAC GAAATGAGCG
TCAAGGAGAG GTAAGCGGCA TGAGTGCTAG TAAGGATAAA CCCTCTGACG TAGTAATAAG
GGAGGAAATG TATAGACATG GTGGATTATT TTCAGTCACT AGCAAGATCC TTGTCAATGA
CTTATTAAGT GAGTTTGCAA ATGTTCCGGA GTTGTCATTA AACTGGAACT GAAGTCTGTG
CATAGAGGGC ACTGTGCCTG TCAAACTAAT AACGGGCCTT GTTATCCTTC ACGCTGAGAG
AATATCACAT GGGAGTCAAG AAGAGTTTGC CGTGCGGTTG TTTCGGCGTG AAAACCAGGC
AAGCACCTTG TCGGATATTA CCAATCATGT GCTAACGATC ACTGCTAGAG CGGCTTTTGC
AAAGCCTTTT CAGATGAACC GGAGGTTTTC GCCCATGGGA TGTCACCCCT TCGAGACATG
CTTATCAATC TCAATATGAC TAGTGTTTTG ATTTGGCCCC GGTAGGTAAC TTTGCCATTA
TCTACTAGCA GTGTTCTCGG CATAAGCTAA TATCATCTAC GACAGGTTCA ATGAGGTCGT
GAAGGTAGAT CTTTCGAGTC GACGGGCGGA CGTGGTTGAA ATGTATGTGC CTATGACGGA
CCTCATGCGG CATTGTCAAG ATGCCATCAC GGAATGCATG GAAGCTATGC TTGTCGAGCT
TAAGCGTGAT CATTCTCTTG TAAGTCTATA ATGATTTATG AGCCACCGGC AAATAGCTTA
TTCTTACAGA ACCTTGATCT GGAAGATATA AACGTTCGAA ATGCACAGTT CAAAAACTTC
GATACCATTG TGCGGATGAA ATTGAAACCA GTATGGCATA AAGTAGGGGC AAAAACCAAG
ATCCACGTCG CAGCTCTTAC AGAACTGCGA AATCTACATA CGTGGGTGTC CTATAGATTG
CACTGCTCGA GATCTAATGA TAGGTTCAGA TGGTTATTGG AGTATGACTC TGCGACCTTT
GCGTCATATA TCAACACTCT CCAACGACAG CACTTTCAGG CCGAAAGATT GACAACTGGA
GCAGGCCGAC ATATCCACGA CTGGTTCAAC GCCAAAGCAG CTTCTCAACT TGTGGAGGCT
TCACAAGCAA GGGTTTCAAG AAGAAAAATG GTCATGGACA ACATCCCTGG TCCGGAAGAA
GATGCCGATA GAAGAGAAGT CAGCATTAGC AGGGATGAAG GAGTGGATTC TCAGGAAGGG
GAGTATGGGC TGGAGGAAGA AGCTTTGAGA GAACAGGAGC TCGCAGAACA ACAAAGACGA
GAAATCCAGG GAATTGTTCC TGATGACGAT GAAGAGGAAA TCATGGAGAT TTTCGCTACC
CAGACTCAAA CTCTTCCTCA GCATCACAGA CCAAATGATG CCGACAATGA CCTGGAACAA
GGGCCAATGG CCCAAGACGA GAATTCGGAC GAAACCCTTC GCTCTGCGGA AGGCGTGCCA
CCTCCGGTCT TCCGTCCGGT GATGCTGAGC GTTAAAGACG ATTTGGGTAG AAGCGTGGAG
AAGAGACTAA AAAAGGGCTA TGAGGCCGTG TTAGAGGAAC AACCTAAATG GAGTGTGTTG
GCAAAGGTCT TAAAGGAGAT TGAAGATACC ATTGCCAGTG TTCAAGTGTC CCACGCCGGT
ATGTTCCTCG TAACAGTCCT ATCGGTCATG ACAGTTGATG ACACCTTTAA TCAGACTCCC
CGGGAACGAA CATCATTCTT GTCATGACCT CTTCCGATCG TACTTGTCTT CAACTTCGTC
AATACCTTAC TACTATGTCC CGTACGGATC CTCCTTTTGG ACCCAACGCT GGTAGGAAGA
TGATGGAATC TCTCTTCCTT TCGAATTGGC AGCATGAAAA AAATGGTGAA AAGCTTGGGA
GCGCTGGAAC TGGGATGCAT AGGTCAAACG AAGATGAAAT CAGGGTCCGA GGAGATATCG
AAAGTAAGAG AGTGGAGGAG CAGAGAAGAG CAGAGAGGAC TCGGGGACGG GGAAGGGGCG
TGCCAAGTTA CAAGAGACGG AGACAGAGGG GTGGAGCTGC GGCCCCAGCG CCCAGATTAG
CCGAGATGGA GAAGTGAGTA TAAAAAGATG ATGTGCATAA AACTCCCTAC ATACTGACTC
GGTCCGCAGA GAGCACAAGG AAGCTATGAT GAAAGCTCAT TCTGCATTCG GTGGTGGTGA
GAATGATGAG GACACCCAGA TGCAGTGGGC CCTCGGGGAG TCGTAAGTAG ACGTTTACGT
TTGGAGGACA AGCAATGGGC TTACGCCTAT GAAGCACACG GTCCGCATCA TTTAATGATC
TGTCAACGCC ATCTACTAGT CTATCCTCTT CCGACCCATC GTCTGCCTTG TTGGCATCCA
CGGGTATCCT CGACGAAGAC GATCTGCAGC CCGTATCCGG ATCTCAGACG ATTGCCGAGG
CCCAATATGG TCTTTTGCCA GAAAACTTTG AAGAAGCCTA TGGTCTTATA GCACCAGAAG
GCGCGGTAAT TATCAGACCA TATGGAGGAG AAGACGATGA TATATTATTA CAAGAGTTGA
GGCCTAGGTT CGTCGTCATG TACGAGCCCA ACTTGGCATT CATTAGAAGA CTAGAGGTAA
GTGGCGGGAG TAGATCTGAT TAATGATTCA TCTAATGGGA TATCAGGTGT ACAGAAACTG
CAACCCCGGG CTATCATTAC GGGTATATCA GATGATCTAT ACCAACTCGT TTGAGGAGGA
TCGTTTCTTG TCAACCATCC AAAGAGAAGC CGAAGCGTTC AAAAAACTCA TTGACGATAG
GCAGGTTAGT CGAGTTTCAT TCCTTATGAA TAAAACATAA TAATTGACCG CGATGCTGTA
GTCGATGGTG ATCCCGATAT ACAACAACAA CCCCCGAGCT CCTATGCGTG ATACCGTTAC
TCGGTCTAAA ACTACTTATT CATCACGTAA TGCCGGCGGT GGTGAGAGTG CCGAAGAAGC
AAGGGTGAGT CGTCGCGACG TCATATATAC CGGCTTCCGC TTATCCTTCA TTAAGATCAT
CGTTGATATC CGAGAGATGG GCGCTCTTCT CCCTTCCCTT ATTGATTCGG CCGGTATCAA
GGTTGTCCCT TCAACTCTAA CTGTTGGAGA TTACATCTTG TCTCCTAAGA TGTGCGTTGA
AAGAAAGAGT TTAGCGGATT TGGAAGGCAG TTTCAATAAT GGCCGATTGT CAGTTGCTTT
ACTTATCCTC AAACTTGCGT TGCTGACTCC TCAAAGACAC ACCCAATGCG AAGCTATGAC
CTCTCACTAT GAGACTTGCA TCCTGCTCAT CGAGTTTGAC GAAGATAAAT TCGGTATGCG
AGTAGGTCTA TCTTCACTTC CATCCATTCG TCTCATACTG ATCCACTTTA TTTAGACTAA
AGAGGACGCG CGTCGAGAAG CTGCAGGTCG AGCGAATGAT CCCGATGAAA CATGGCGAGA
CACTTTCTAT CTTCAATCTA AACTGGCTCT TCTTGCCCTT CATTTCCCTC GTCTTCGTAT
TATCTGGTCT TCGTCACCCC ATGAATCAGT CAAGATACTA TCTGATCTCA AACTGAATCA
CGACGAACCC GATGAGATCA CAGCTACACT CAAAGGGTCT AGTGAGGGTG AACAGGGGGT
AAGAAGCGGG GTCGAAAACG CGGCGGCGGT TGAGATGTTG AGATCCATCC CAGGAGTCAG
CGGGAGAAAT TTGAAGTTTG TGATGAGCAA GGTGGAGAGC ATCAAGCATT TAGTGTCCAT
GAGCCGAGGA CAGCTCAAAG AGATTCTAGG AGAGGAGGGA GGAGAGAAAG CTTGGGAGTT
TCTACATCAT GACCCGAGGT ATTCGCGATG ATTCGTTGAA AGCCCAACGT TGGTTGTGCA
ATTGCAGATG GAGTATATGG GACATGTTTC TATTTTCTAC GGGATTCTAT GCGGTTCATG
TTTAGTGTAT GTGCTCTATT TATTGGGCTA GCAACAGCTC GTCGAACTTA TGCTTACCAT
CTAACCTCAG ATCATCACAT TTTAGGTCTG TCAAATTCAT CAGCGGTGCG ACATGTACAT
GAAGTCCTTC TTGTTGCTGT A
 
Protein sequence
MSTKPRRHIP YLPFHKNLIR TLCRPQQDDL LLIAKGLGLR RIVCALLKTY DRKEDLVLVV 
GATPADEAGI GDELGIMGVR DPGFRVVGYE MSVKEREEMY RHGGLFSVTS KILVNDLLKG
TVPVKLITGL VILHAERISH GSQEEFAVRL FRRENQSGFC KAFSDEPEVF AHGMSPLRDM
LINLNMTSVL IWPRFNEVVK VDLSSRRADV VEMYVPMTDL MRHCQDAITE CMEAMLVELK
RDHSLNLDLE DINVRNAQFK NFDTIVRMKL KPVWHKVGAK TKIHVAALTE LRNLHTWLLE
YDSATFASYI NTLQRQHFQA ERLTTGAGRH IHDWFNAKAA SQLVEASQAR VSRRKMVMDN
IPGPEEDADR REVSISRDEG VDSQEGEYGL EEEALREQEL AEQQRREIQG IVPDDDEEEI
MEIFATQTQT LPQHHRPNDA DNDLEQGPMA QDENSDETLR SAEGVPPPVF RPVMLSVKDD
LGRSVEKRLK KGYEAVLEEQ PKWSVLAKVL KEIEDTIASV QVSHADSPGT NIILVMTSSD
RTCLQLRQYL TTMSRTDPPF GPNAGRKMME SLFLSNWQHE KNGEKLGSAG TGMHRSNEDE
IRVRGDIESK RVEEQRRAER TRGRGRGVPS YKRRRQRGGA AAPAPRLAEM EKEHKEAMMK
AHSAFGGGEN DEDTQMQWAL GESLSSSDPS SALLASTGIL DEDDLQPVSG SQTIAEAQYG
LLPENFEEAY GLIAPEGAVI IRPYGGEDDD ILLQELRPRF VVMYEPNLAF IRRLEVYRNC
NPGLSLRVYQ MIYTNSFEED RFLSTIQREA EAFKKLIDDR QSMVIPIYNN NPRAPMRDTV
TRSKTTYSSR NAGGGESAEE ARIIVDIREM GALLPSLIDS AGIKVVPSTL TVGDYILSPK
MCVERKSLAD LEGSFNNGRL HTQCEAMTSH YETCILLIEF DEDKFGMRTK EDARREAAGR
ANDPDETWRD TFYLQSKLAL LALHFPRLRI IWSSSPHESV KILSDLKLNH DEPDEITATL
KGSSEGEQGV RSGVENAAAV EMLRSIPGVS GRNLKFVMSK VESIKHLVSM SRGQLKEILG
EEGGEKAWEF LHHDPRYSR