Gene Information |
Locus tag | CNA04300 |
Symbol | |
ID | 3253364 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 1156733 |
End bp | 1158671 |
Gene Length | 1939 bp |
Protein Length | 494 aa |
Translation table | |
GC content | 49% |
IMG OID | 638252750 |
Product | art-4 protein, putative |
Protein accession | XP_566780 |
Protein GI | 58258735 |
COG category | [R] General function prediction only |
COG ID | [COG1439] Predicted nucleic acid-binding protein, consists of a PIN domain and a Zn-ribbon module |
TIGRFAM ID | |
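
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon; the record does not state this, but the listed gene length is consistent with that convention. The function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
start_bp, end_bp = 1156733, 1158671
protein_length_aa = 494

length = gene_length(start_bp, end_bp)
# Nucleotides needed to encode the protein, plus one stop codon.
coding_nt = protein_length_aa * 3 + 3

print(length)              # 1939
print(coding_nt)           # 1485
print(length - coding_nt)  # 454
```

The 454 bases not accounted for by the coding sequence fit with the gene being marked as spliced; they would correspond to introns and possibly untranslated sequence, although the record itself does not break this down.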
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAGTTCAG CAAAATGGTT CTCTCATATA GCGCCATCGC AAAACCCACA GCGGACACTC CCAAGGTAAT CTCAATGACT GCCTCCAAAC AAGTCTTCCC CGAAGCTTTT GAGGCATTAC CGGCTCCTGC CGCTGAAGAG TCTCAAGAGC TAGAATCTCA ACCCGAAGCT CTTGCACAGG CGTCCAGTTC TACCATCGAG GCCCCTATTT CGAGCGAAGA CCGAAAAGTC ATTCAGCATT TGATCCTTGA TGCTGGACCT CTGTTGTCTT TGACGCCGCT TCGCCATCTC GCTACGAGCT TCCATACGAC CCCGATGGTT CTTGCGGAAC TGAGAGATCC TAAAGCTAGG GAGCATTGGG AGAGGCTTGC GTTGACTGGG GTCAATGTCA AGGTCGAGTC ACCTACTGCT GAAGCCATGG CCCAGGGTGG GTACTTTGTT GATTATTCAT GATTTACTGT GTGCTGACGA ACATACAGTC ACTGCTTTTG CGAAGAAGAC TGGTGACTTT GCGGTACTTT CACAAACTGA TTTGTCCGTT GCCGCTTTGA CTTATCAGTA TGAAGTGATG GTGAACGGTG TCAAGAGGAT TAGGACTGAG CCCCCTCAAG CGAAGAAGCC TAGTGCCAAG GGGAAGGAGA AGGAGAAGGA CACCAAGCAG GCGGAAAAGA AAGATGAGGA GGTCCAAGAG AACAAAAACG AAAAGGAAGG AAAGCCCGAG GAGGAGGATG TTGAAGTGGA AGAGGCGATT CAGTCTTTGA GCCAAATTAT TATTGAACCT TCAACGAAAA GCGAGCTTTC AGTAAATGAT GTTTCCACCA CCAACCGGTC CCATACCCAG ACCTCAGTCC CAGCTTCTGC CCATGCAGAG GATCCTGAAT CCGAGGGCGA ATGGATCAAC CCCACAAATC TCAGCACCCA TCGGTCTCGA GATCTGGGTC TTATCACTCC TTCTGGATCA ACAGCCAAGC CTCCGGCCGT TGCATGTATG ACCGGTGATT ATGCTGTGCA GAATATATTG TTGGGAATGC GATTGGGATT GGTTGGCGAA GGCGGTAAAA AGATTGGAAA GGTGAAGAGT TGGGTATTAA GATGCCATGC CTGTTTTAAG TGAGTTAGTA ACTTTGCTTT ATCTTGAACA TGTCTGATTC TGTGATAGAG TTTGCAAGGA TCCTAGCAAA CGATTCTGTC CATCATGCGG CAATGCTACG CTTCTTCGTA CTTCCGTTTC AACATCCGCT AAAACTGGTG AACAAAGAGT GCATCTCAAG CAGAACTTCC AATACCGCAC TAGAGGTACC ATCTATTCCA TCCCGGACCC TAAGATGGGC AGAGCCAAGG GTCAGCAAAA GGGGGGAAGC GGCTTAATTT TGAGAGAAGA TCAGAGGGAA TGGCAGGATG CAATGAGGGG AGATAGAATT AAAAAGGAGA AAGAAGAAAG GAAGGCGGCT AAGGGCGCTT TGGAAGGTTG GAATGACCCT GACGTGAGTG TCATCATTCT TCTCCCGCTT TGCCTCTAGC TCACAGCTGG TACAGTGGTT GCCCGAGATG ATCACAGTAG GTATGTCCGG AAAGGGTCGC TCTGGTGGAC ATAACATGCC TTCAATTGGC CACGGGCGTA AGAACCCTAA CCAAGCAAAG AGACGACGAT GAATGGTGTG TCTGCCAAGC GGGTTTGTAG TGTATAGACC GATAGATTTG CGTATATTCT CGCTGTAAGT CGGGAGGTGC ATATATATAC TACAATGTGG CTGACTGTTG AAGAGGATGG ACGACACTTT CCATGTATCA ATGAATGACA 
ATCTCCAAGC TTCTGTGTCG ATATCGGGGC ACTGTGACAA TTGCCTGATC AGATGATGAA GCGCGCCAGT TCATGTTGGT AATGAGTCTG TCGACATGTT CTCTGACTTA GTCCCAGCCG CCCCAGATAC TTATATGCT
|
Protein sequence | MVLSYSAIAK PTADTPKVIS MTASKQVFPE AFEALPAPAA EESQELESQP EALAQASSST IEAPISSEDR KVIQHLILDA GPLLSLTPLR HLATSFHTTP MVLAELRDPK AREHWERLAL TGVNVKVESP TAEAMAQVTA FAKKTGDFAV LSQTDLSVAA LTYQYEVMVN GVKRIRTEPP QAKKPSAKGK EKEKDTKQAE KKDEEVQENK NEKEGKPEEE DVEVEEAIQS LSQIIIEPST KSELSVNDVS TTNRSHTQTS VPASAHAEDP ESEGEWINPT NLSTHRSRDL GLITPSGSTA KPPAVACMTG DYAVQNILLG MRLGLVGEGG KKIGKVKSWV LRCHACFKVC KDPSKRFCPS CGNATLLRTS VSTSAKTGEQ RVHLKQNFQY RTRGTIYSIP DPKMGRAKGQ QKGGSGLILR EDQREWQDAM RGDRIKKEKE ERKAAKGALE GWNDPDWLPE MITVGMSGKG RSGGHNMPSI GHGRKNPNQA KRRR
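
The sequences in this record are displayed in space-separated blocks of ten residues. A minimal sketch for turning such a field back into a contiguous string, so that its length can be compared with the Protein Length or Gene Length entries, might look like this. The helper name `ungroup` is hypothetical, and the example uses only the first three blocks of the protein sequence above.

```python
def ungroup(seq_field: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(seq_field.split())


# First three 10-residue blocks of the protein sequence, as displayed.
example = "MVLSYSAIAK PTADTPKVIS MTASKQVFPE"
clean = ungroup(example)

print(clean)       # MVLSYSAIAKPTADTPKVISMTASKQVFPE
print(len(clean))  # 30
```

Applying the same function to the full protein field and checking that the result has 494 characters is a quick way to confirm the sequence was copied completely.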
|
| |