Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB04120 |
Symbol | |
ID | 3255755 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1209390 |
End bp | 1212435 |
Gene Length | 3046 bp |
Protein Length | 924 aa |
Translation table | |
GC content | 48% |
IMG OID | 638255057 |
Product | endonuclease, putative |
Protein accession | XP_569237 |
Protein GI | 58264162 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1948] ERCC4-type nuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.367281 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTCTC GGAAAAAATG CGGCAATCCG CTATTCCTCC AGTGGATGGA AGGTGAGCAC CTACTCTTCT ATGAAGCTAC AATAGACCAG AGCTGACTGT CTTTAGAGAT CCGTGATGCT GCGCGTGAAA AAGGATCTAA ATCTGCTGAA ACTTATTCCA AAGCCTGTCG CTCTCTTGAA TTTTGCCCTG TTACCTATGA TCGGCCTCGT GATCTTGCCA TCTTGGCGCA CATCGGAGAA AAGACAATAG CGCAGCTAGA AAATAGGTGG ATAGAATACC GGAAGAGCCA TGGTTTGGAT GTACCAGCAG AGCCAGAGAG TGAGTGTCTG TTCAAGTCGT CGGTTAAACC ATTTGGGTAT TCATGCGTCT TAGAACTTTC TACAGCAGAG CCTAAAACAA AAGACAAAGG CAAGGGTTGT GCAGCCCCAG ATGTTGATGG TCCAATGTCT GGCACCTCCC AAGAGACCAC AAAGAAAACT CGCAAAACCA CCGCAAAGGC ATACATTCCT ACTCAAGGCT CTGGCGCTTA CGCTATTCTG TTGGCTCTCA TCCTTGCGAT TGACAGGCCC GAAGTCACAA CTCAGGTCTT CTTGACAAAG TCTGAGATTA TTCGTACCGC CCAAGAATAT TGTGACACGT CGTTCGAACA TTCAGAGAAG GGAACATACT TTACTGCTTG GAGTGGAATG AAAACGTTAG TGAACAAAGG CTACGTCTAT GTAACGGGAA ATCCTCATAA ACATTGCTTG ACGGAGGAGG GATAGTGAGT ATTTTGTTCG TACTGATAAG ATCACGTAAT TGATGCTTTA TAGCGATGTG GCACTGGCCA TCCGGAATCT GAGGCCAGAG TTTTCCCACA TGAAGAAGCA TCCATTTTCG CATGCCCCTG CTCCTGGGAC GTCAAACAGA GTGACAGAAC TCCCTAGAAA TCGCGCAATA ACGGCGTTAG ATCTATATAA CGGGCCTTCT ATTGTTCCTT CTACCCTCTC GACAGAATAC GTCCCGCCCG CCAATGCTCA TTCTTCGCCA GCTTCCCGAC TCGCATCTTT TGACGCTGTG GCATCCAAAC TGACTGCAGG GGAAAGATTT AATTTTTGGT ACATCACGCC TTCTGGTTCA CGAACCCCTC TCATGACATC TGCCCATCTT CGCCTTGACC CCGAGCAATT CGTCAACCTT CGTCGTATCG AGTTCAAGTA CTCGCAGCGC AACCACCCAT TTGCTGCGCA GTTGCGATTG ATGGACGCTC CTACTACTGC AAAATTGAGG GACAAAAGTG GTGTGCCTAC ATTGTATGCA TATCTCATCG AAGCAGATGC TCCACCCAAG TGTAGCATGT TTGATACGGA AAGCAGTCAA AGCAGGGCAA AAGCGAGCGG GAAAGACAAT GCCAGCAATG CAGGCAGCAG TCCGCTGGGC AGCAGCCCAG CTCCCATCTC AAGATCAAGA AGTGGTTTGG GAGGGAGAAA TGACTCTTGT GCTAGCCTTT CCGATGGCGG GTCTCGATTG TCCGCATCGG CAGGCAAACC GACCGACCCG TTCTATTTCG ATGTTCGCAC TTTGGCCCTT CAAAAGAATC CCTCTATGCC TTCGAATAGT GCTTCGACTT CCCAGTCTGT GAGCAGATCG ACTTCTTCAT CACGTCCTCT TTCCAATTCG GGACCATCCC CCTATCAGAA TCCGTACGAT GCCATACTGG GTCAAAATCC TTCTAGCCCT CCAGTATCGA CATTGTCGCG GACCATTACC GCGCCCGCAA GCACATCATC CCAACCACGA ACATCCTCTC TTTCCACCCT AAACATTGGC TCAAGTTCCA CTGTACCTAG TATTTCCAAT CGCTCATATT CTTCTGCCGC GGTTTTACCG TCTCGTCCAG TCGTTCCAAG GATGGGACCG CGTCTTTCTA ACCATGTACC CAGTCCAATC CCGGCACCCG AGCGTTTTGA TGATACTATC ATCCCACCGC CAGATTTGCA TTCCAAATCA CTTCCTCCAT TTACCATTTC TAATGCCATC GTTTTCCCGC CAGGCTCATA TGATATTATC CTTATCATTG ATACTCGTGA AGTCGAGTCC TCGAAAACGA AAAATAGGGA CAAAATCGCG GAGACATTGG AGGCAAAAGG TATCAGAGTT GAGACCAGAG CTTTGCGATT GGGCGACATG TGTTGGGTTG CCAGGAGGAA AGATGGGCTG GGTGGCGAAG AAGACGAATG TGTGTTAGAT TATGTGGTAG AAAGGAAGAG ATTGGATGAT TTGGTCAACT CCATCAAAGA TGGGAGATAT ACTGAGCAGT GTGTAAGTGC GTCACTCTCA AAGTAATGAC GTGCTGACTG TGCCCAGTTC CGACTGTCGA ACGCATGTCT GAGCAACGTC TATTACATCG TTGAAGACTG GCAAGTGAGC GAAAGAATGG AACAGAGCGG TCTTGCCATC ATGACCGTCA AGTCTCAAGT TCAGGTCCAT AATCGGTTCT TCCTCAAGGA GACACATACT CTGAACGAAA CTATTGACTT TCTCGCAACC ATGACCAGAG TCATTATCTC TTCCCATCGC ACCAAAGCGT TACACGTTAT CCCTACTCAT TTCCTTTCTC GACCTTCGTT CAAACCCCTT CAGGATCATC TGCAGCTCAA ACATCCAAAT ATCAAGTTCC ACACGTCCTT CATCGCTTAT CAAGAACTAA ATGACAAATC GGCGAGTCAG ACGTTGAAGG AAAAGTTTGC AAAGATGATG ATGTGTGTAA AAGGCATGAG CGCCGAGAAA GTATCTGCAT TGCTTGATGA ATGGGAAACG CCGCGGGTCA TGTGGGAGGA TATGAAGGAA AGAGATCGAC AGCCGGATGA TTCGGAACCT CCGGGACAAC CTAGAGGTAA GAAGAGGAAG GGCGGGAAGG GTTCTTTTTT TGCAGAAAGA GTGCAGGGGG AGGCGAGGCG AAAGATTGGT GATGCTTTGA GTGAAAGTGT GAGTTTCATG TGGAAGATTG CGGATCTTGG CCTTTGGCTG ACAGCCGCTC AGCTTTGGAC TGCCTTGATG GGATAG
|
Protein sequence | MPSRKKCGNP LFLQWMEACR SLEFCPVTYD RPRDLAILAH IGEKTIAQLE NRWIEYRKSH GLDVPAEPET EPKTKDKGKG CAAPDVDGPM SGTSQETTKK TRKTTAKAYI PTQGSGAYAI LLALILAIDR PEVTTQVFLT KSEIIRTAQE YCDTSFEHSE KGTYFTAWSG MKTLVNKGYV YVTGNPHKHC LTEEGYDVAL AIRNLRPEFS HMKKHPFSHA PAPGTSNRVT ELPRNRAITA LDLYNGPSIV PSTLSTEYVP PANAHSSPAS RLASFDAVAS KLTAGERFNF WYITPSGSRT PLMTSAHLRL DPEQFVNLRR IEFKYSQRNH PFAAQLRLMD APTTAKLRDK SGVPTLYAYL IEADAPPKCS MFDTESSQSR AKASGKDNAS NAGSSPLGSS PAPISRSRSG LGGRNDSCAS LSDGGSRLSA SAGKPTDPFY FDVRTLALQK NPSMPSNSAS TSQSVSRSTS SSRPLSNSGP SPYQNPYDAI LGQNPSSPPV STLSRTITAP ASTSSQPRTS SLSTLNIGSS STVPSISNRS YSSAAVLPSR PVVPRMGPRL SNHVPSPIPA PERFDDTIIP PPDLHSKSLP PFTISNAIVF PPGSYDIILI IDTREVESSK TKNRDKIAET LEAKGIRVET RALRLGDMCW VARRKDGLGG EEDECVLDYV VERKRLDDLV NSIKDGRYTE QCFRLSNACL SNVYYIVEDW QVSERMEQSG LAIMTVKSQV QVHNRFFLKE THTLNETIDF LATMTRVIIS SHRTKALHVI PTHFLSRPSF KPLQDHLQLK HPNIKFHTSF IAYQELNDKS ASQTLKEKFA KMMMCVKGMS AEKVSALLDE WETPRVMWED MKERDRQPDD SEPPGQPRGK KRKGGKGSFF AERVQGEARR KIGDALSESV SFMWKIADLG LWLTAAQLWT ALMG
|
| |