Gene CNB04120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB04120 
Symbol 
ID3255755 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp1209390 
End bp1212435 
Gene Length3046 bp 
Protein Length924 aa 
Translation table 
GC content48% 
IMG OID638255057 
Productendonuclease, putative 
Protein accessionXP_569237 
Protein GI58264162 
COG category[L] Replication, recombination and repair 
COG ID[COG1948] ERCC4-type nuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.367281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCTC GGAAAAAATG CGGCAATCCG CTATTCCTCC AGTGGATGGA AGGTGAGCAC 
CTACTCTTCT ATGAAGCTAC AATAGACCAG AGCTGACTGT CTTTAGAGAT CCGTGATGCT
GCGCGTGAAA AAGGATCTAA ATCTGCTGAA ACTTATTCCA AAGCCTGTCG CTCTCTTGAA
TTTTGCCCTG TTACCTATGA TCGGCCTCGT GATCTTGCCA TCTTGGCGCA CATCGGAGAA
AAGACAATAG CGCAGCTAGA AAATAGGTGG ATAGAATACC GGAAGAGCCA TGGTTTGGAT
GTACCAGCAG AGCCAGAGAG TGAGTGTCTG TTCAAGTCGT CGGTTAAACC ATTTGGGTAT
TCATGCGTCT TAGAACTTTC TACAGCAGAG CCTAAAACAA AAGACAAAGG CAAGGGTTGT
GCAGCCCCAG ATGTTGATGG TCCAATGTCT GGCACCTCCC AAGAGACCAC AAAGAAAACT
CGCAAAACCA CCGCAAAGGC ATACATTCCT ACTCAAGGCT CTGGCGCTTA CGCTATTCTG
TTGGCTCTCA TCCTTGCGAT TGACAGGCCC GAAGTCACAA CTCAGGTCTT CTTGACAAAG
TCTGAGATTA TTCGTACCGC CCAAGAATAT TGTGACACGT CGTTCGAACA TTCAGAGAAG
GGAACATACT TTACTGCTTG GAGTGGAATG AAAACGTTAG TGAACAAAGG CTACGTCTAT
GTAACGGGAA ATCCTCATAA ACATTGCTTG ACGGAGGAGG GATAGTGAGT ATTTTGTTCG
TACTGATAAG ATCACGTAAT TGATGCTTTA TAGCGATGTG GCACTGGCCA TCCGGAATCT
GAGGCCAGAG TTTTCCCACA TGAAGAAGCA TCCATTTTCG CATGCCCCTG CTCCTGGGAC
GTCAAACAGA GTGACAGAAC TCCCTAGAAA TCGCGCAATA ACGGCGTTAG ATCTATATAA
CGGGCCTTCT ATTGTTCCTT CTACCCTCTC GACAGAATAC GTCCCGCCCG CCAATGCTCA
TTCTTCGCCA GCTTCCCGAC TCGCATCTTT TGACGCTGTG GCATCCAAAC TGACTGCAGG
GGAAAGATTT AATTTTTGGT ACATCACGCC TTCTGGTTCA CGAACCCCTC TCATGACATC
TGCCCATCTT CGCCTTGACC CCGAGCAATT CGTCAACCTT CGTCGTATCG AGTTCAAGTA
CTCGCAGCGC AACCACCCAT TTGCTGCGCA GTTGCGATTG ATGGACGCTC CTACTACTGC
AAAATTGAGG GACAAAAGTG GTGTGCCTAC ATTGTATGCA TATCTCATCG AAGCAGATGC
TCCACCCAAG TGTAGCATGT TTGATACGGA AAGCAGTCAA AGCAGGGCAA AAGCGAGCGG
GAAAGACAAT GCCAGCAATG CAGGCAGCAG TCCGCTGGGC AGCAGCCCAG CTCCCATCTC
AAGATCAAGA AGTGGTTTGG GAGGGAGAAA TGACTCTTGT GCTAGCCTTT CCGATGGCGG
GTCTCGATTG TCCGCATCGG CAGGCAAACC GACCGACCCG TTCTATTTCG ATGTTCGCAC
TTTGGCCCTT CAAAAGAATC CCTCTATGCC TTCGAATAGT GCTTCGACTT CCCAGTCTGT
GAGCAGATCG ACTTCTTCAT CACGTCCTCT TTCCAATTCG GGACCATCCC CCTATCAGAA
TCCGTACGAT GCCATACTGG GTCAAAATCC TTCTAGCCCT CCAGTATCGA CATTGTCGCG
GACCATTACC GCGCCCGCAA GCACATCATC CCAACCACGA ACATCCTCTC TTTCCACCCT
AAACATTGGC TCAAGTTCCA CTGTACCTAG TATTTCCAAT CGCTCATATT CTTCTGCCGC
GGTTTTACCG TCTCGTCCAG TCGTTCCAAG GATGGGACCG CGTCTTTCTA ACCATGTACC
CAGTCCAATC CCGGCACCCG AGCGTTTTGA TGATACTATC ATCCCACCGC CAGATTTGCA
TTCCAAATCA CTTCCTCCAT TTACCATTTC TAATGCCATC GTTTTCCCGC CAGGCTCATA
TGATATTATC CTTATCATTG ATACTCGTGA AGTCGAGTCC TCGAAAACGA AAAATAGGGA
CAAAATCGCG GAGACATTGG AGGCAAAAGG TATCAGAGTT GAGACCAGAG CTTTGCGATT
GGGCGACATG TGTTGGGTTG CCAGGAGGAA AGATGGGCTG GGTGGCGAAG AAGACGAATG
TGTGTTAGAT TATGTGGTAG AAAGGAAGAG ATTGGATGAT TTGGTCAACT CCATCAAAGA
TGGGAGATAT ACTGAGCAGT GTGTAAGTGC GTCACTCTCA AAGTAATGAC GTGCTGACTG
TGCCCAGTTC CGACTGTCGA ACGCATGTCT GAGCAACGTC TATTACATCG TTGAAGACTG
GCAAGTGAGC GAAAGAATGG AACAGAGCGG TCTTGCCATC ATGACCGTCA AGTCTCAAGT
TCAGGTCCAT AATCGGTTCT TCCTCAAGGA GACACATACT CTGAACGAAA CTATTGACTT
TCTCGCAACC ATGACCAGAG TCATTATCTC TTCCCATCGC ACCAAAGCGT TACACGTTAT
CCCTACTCAT TTCCTTTCTC GACCTTCGTT CAAACCCCTT CAGGATCATC TGCAGCTCAA
ACATCCAAAT ATCAAGTTCC ACACGTCCTT CATCGCTTAT CAAGAACTAA ATGACAAATC
GGCGAGTCAG ACGTTGAAGG AAAAGTTTGC AAAGATGATG ATGTGTGTAA AAGGCATGAG
CGCCGAGAAA GTATCTGCAT TGCTTGATGA ATGGGAAACG CCGCGGGTCA TGTGGGAGGA
TATGAAGGAA AGAGATCGAC AGCCGGATGA TTCGGAACCT CCGGGACAAC CTAGAGGTAA
GAAGAGGAAG GGCGGGAAGG GTTCTTTTTT TGCAGAAAGA GTGCAGGGGG AGGCGAGGCG
AAAGATTGGT GATGCTTTGA GTGAAAGTGT GAGTTTCATG TGGAAGATTG CGGATCTTGG
CCTTTGGCTG ACAGCCGCTC AGCTTTGGAC TGCCTTGATG GGATAG
 
Protein sequence
MPSRKKCGNP LFLQWMEACR SLEFCPVTYD RPRDLAILAH IGEKTIAQLE NRWIEYRKSH 
GLDVPAEPET EPKTKDKGKG CAAPDVDGPM SGTSQETTKK TRKTTAKAYI PTQGSGAYAI
LLALILAIDR PEVTTQVFLT KSEIIRTAQE YCDTSFEHSE KGTYFTAWSG MKTLVNKGYV
YVTGNPHKHC LTEEGYDVAL AIRNLRPEFS HMKKHPFSHA PAPGTSNRVT ELPRNRAITA
LDLYNGPSIV PSTLSTEYVP PANAHSSPAS RLASFDAVAS KLTAGERFNF WYITPSGSRT
PLMTSAHLRL DPEQFVNLRR IEFKYSQRNH PFAAQLRLMD APTTAKLRDK SGVPTLYAYL
IEADAPPKCS MFDTESSQSR AKASGKDNAS NAGSSPLGSS PAPISRSRSG LGGRNDSCAS
LSDGGSRLSA SAGKPTDPFY FDVRTLALQK NPSMPSNSAS TSQSVSRSTS SSRPLSNSGP
SPYQNPYDAI LGQNPSSPPV STLSRTITAP ASTSSQPRTS SLSTLNIGSS STVPSISNRS
YSSAAVLPSR PVVPRMGPRL SNHVPSPIPA PERFDDTIIP PPDLHSKSLP PFTISNAIVF
PPGSYDIILI IDTREVESSK TKNRDKIAET LEAKGIRVET RALRLGDMCW VARRKDGLGG
EEDECVLDYV VERKRLDDLV NSIKDGRYTE QCFRLSNACL SNVYYIVEDW QVSERMEQSG
LAIMTVKSQV QVHNRFFLKE THTLNETIDF LATMTRVIIS SHRTKALHVI PTHFLSRPSF
KPLQDHLQLK HPNIKFHTSF IAYQELNDKS ASQTLKEKFA KMMMCVKGMS AEKVSALLDE
WETPRVMWED MKERDRQPDD SEPPGQPRGK KRKGGKGSFF AERVQGEARR KIGDALSESV
SFMWKIADLG LWLTAAQLWT ALMG