Gene Information |
Locus tag | CNF03310 |
Symbol | |
ID | 3258186 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 982179 |
End bp | 985007 |
Gene Length | 2829 bp |
Protein Length | 708 aa |
Translation table | |
GC content | 50% |
IMG OID | 638257449 |
Product | conserved hypothetical protein |
Protein accession | XP_571608 |
Protein GI | 58268904 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.472473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGTTGATTAC CCTCCCCACC TCTACACCTC ACCATGTCCC GCAGGCTAGA CCTCTCCAAA TTCGCAGGCT CAGACTCGGA AGGCGAAGAA GATGTCCCCA AAGCAAGTCC ACGTATCTTT TTTAATGTGC TAGTCGTCGC CAACACTGAC CGTGGATTTG TCACAGCATC AACTGGAATT GACTGCACTC AAGAGCAAGA CCTTTTCTCA AGGTATCACC AAAAAAACAA AGCGTGATCT GGAAAAGGAG GCAGAGGAAC GTAAGCGGAT TGAAGAGGAA AAGTACACTG CAAGCCATAC CTTGAATCCG CACCGTCCAC TGACATGGCC TACCTCAGAG CGGCAGCACT GGTAATGGCA GAGATTGAAA GAGAGTTTGA AGGATCAGCT GAGGGAAGCT CGTCCGGTCA CACGGGTATG AGGGGTTTTC GGGGAAGACC CATTGGGCCT GGGGGAGGGT TTGTAAGGGC CGGAGGCGCG CCGATGGGAG CGCCAACGTC TTTTGCGCCC CCTAGAGGTC CAGCCGCTAT GGGCTACGGT AGAGTAAGTC TAGATGTTAC TGACCAAGTT AACTGGGCTG ACATATCGCA GCCAAAACCC ATGCCTGTCA GAGCGCCATC ACCGCCTCCT CCACCCAGTG GACCGAAGCC AAGGGGCAAG AGAGCGATGG ATTCATTCTT GGAAGAAATC AAGCAGTATG TCATCCTTTC AAAAAGCGAT CCCTCTTCTC CTCTTTATCA ATACTGATGG CAAATATAGT AACCAGAACG CCCGCGAACA AAAATTCAGT CAGATCGCAA AGAAGGAGGG ATCTTCAGTA ACTGCCCTTG CAGGTCTGTA CCGGCAATCT GCTATTGATG AAGACTGTGA ACCATCTTGA ACTGATCGAT TCATGTAAAA CTAGCTTGGG AAACTGGCGG AAGTGGCTTC GACCATGAGG TACGTCTCAT ATTGTCTTTT TTTTTTGGTT AGATATTTAG CTAATCCTGG AGTAGAGCAC CAATTTATTC ATTGTGGGTG TTCCAACATG AAGGCCAGCG AATGTACAAT ATCTAACCTT TTTGAGCAGT CAAATTTGCC TCAAGAGATC ACGGAAGAGA TTCTCGGTTT GCACTTTGCA AAGCAGGGAC CGGTGGCGAC TGTCAAAATC ATGTGGCGTG AGTTTCTATA TTATTATTTT CTGTTTTTGT ATATGTCAAG GAGTTGACGT TGTCTGTAGC CCGAGGCGAC GAAGCATTCA GCCAGGCAAG CGCCCGACGG GGTCTCACCG GTTTCGTGTC GTACATGGAA CGGAAAGATG CTGAACGGGC TGTGAAAGAG CTGGACGGGT CGGAATGGAT GGGAAATTCG ATTCGAGTAG GATGGAGCAA GCCTGTTGCA AAGCCCTTGA AAGCTCTTTT CGGTGAGTCA TTTTTTCATA TCTATCACAG AAAAAGAAAA GGAAAAAAGG AAAAAGCCTG AACAGCAATG TAGATATCAC AAGTGACAGT CACAAGCGAA GAAGGTCAAG ATCACGATCA AGGGGCCGCA GCCCTCCTAG AAAGAAATCG CATCGCGCCC GTTCATACTC GTACTCCTCG TCATCTTCTT ACTCTCGTTC CCCTTCGCCC GAACGGACTT GCAAACAAAA GTGGCTGGAT AGTATCCCTG AAGAACATGG GAGGTTTATC AAGACTGTTG CGAACCGCGT AAAGGAACAT GGTAAAGGAT TTGAAGATGT TTTGATGGAA AAGGAAAGGG AGAACCCAAA GTTTGCCTTT TTGTATGACG ACAAGGCATG TGGCTTTTCT TTTTTTTTCT 
TTCCCGAGTA TTGTTGACCA GAGGTACAGC TTCCCGACTA CCACCTTTAC CAATCCACAC TTTCATCACA TTACCGCATT CCGTCTCCAC CCCCCGAAGC ATTCAACGAC GACGGTTACG CCTCCATCTA CTCTTCCGAC TCGGCCGAGG ATTCCGAAAG GGAGCGAACG TCCAAGGGTA AACTTGGCCG ACTTGCGAAA AGGCGGTTTG AGGCCATGTT ACGGGTGATG ACTGGGAAAA GGGCGGAGAT TGCGCGGGGG ATGGAGTTTG CGCTCCGACG AGCCGAAGCG GCGGATGAGA TTGCAGATAT CATCTGTCAG TCTGTCCAGG TGGATTCGAC GCCCGTCCCG CGAAAGATTG CGCGTCTTCA CCTCATCTCT GATATCCTTC ACAACTCTGC GTCGCCTTTG CCCAACGTTT GGCGATACCG CCTCGCGTTT GAGCATAGGC TACCGCCCGT GCTGGCGCAT TTGAATACGG TGGAGAAGAG TTTGATGGTG TATTCGGGCA AGATTAGTGC GGATGTGTTT AGAGGGCAAG TGGGGAATGT ATTGGATATC TGGGAACGAT GGCACGTTGC TTTTTTTTTT TTTGGCTGGT TGGTTGGCGC GCTTTTTGCT GACTTTTTTT GCAAATAGGA TTGTATTCAA CACCGATACT GCGGAACTCT TCCGTGCCGT GTTGGTCGGC GACAAGCCTC TCTCTGCGCT CGTCAAGACC CCCCAGGGCG CATGGGTTGA CAAGGCCAAG TTGGAGGCGG AGGAAGCGGA GAGGAAGAAG AGGGAAAACG AGCAACAAAA AGCTGAAGAG GAGGAAGATA GGTTTGAGCA GAGTGGGTTT AAGAGTAGTT TTAAGAGGAT CAATCCGCAG GCTGGGCCTG CGCCTGCGCC TGAATCAGTA CCGTCAGCTG TCAGATCTGC GTTTGAGGAT GAAGATTTGG ATGGGGAGGT GATGGAAGAT TTAGACGGGG AGGCGATGGA AGATTTGGAC GGGGAGGCGA TGGAGGATTT GGACGGGGAG GCAATGTAA
|
Protein sequence | MSRRLDLSKF AGSDSEGEED VPKHQLELTA LKSKTFSQGI TKKTKRDLEK EAEERKRIEE EKAAALVMAE IEREFEGSAE GSSSGHTGMR GFRGRPIGPG GGFVRAGGAP MGAPTSFAPP RGPAAMGYGR PKPMPVRAPS PPPPPSGPKP RGKRAMDSFL EEIKHNQNAR EQKFSQIAKK EGSSVTALAA WETGGSGFDH ESTNLFISNL PQEITEEILG LHFAKQGPVA TVKIMWPRGD EAFSQASARR GLTGFVSYME RKDAERAVKE LDGSEWMGNS IRVGWSKPVA KPLKALFDIT SDSHKRRRSR SRSRGRSPPR KKSHRARSYS YSSSSSYSRS PSPERTCKQK WLDSIPEEHG RFIKTVANRV KEHGKGFEDV LMEKERENPK FAFLYDDKLP DYHLYQSTLS SHYRIPSPPP EAFNDDGYAS IYSSDSAEDS ERERTSKGKL GRLAKRRFEA MLRVMTGKRA EIARGMEFAL RRAEAADEIA DIICQSVQVD STPVPRKIAR LHLISDILHN SASPLPNVWR YRLAFEHRLP PVLAHLNTVE KSLMVYSGKI SADVFRGQVG NVLDIWERWI VFNTDTAELF RAVLVGDKPL SALVKTPQGA WVDKAKLEAE EAERKKRENE QQKAEEEEDR FEQSGFKSSF KRINPQAGPA PAPESVPSAV RSAFEDEDLD GEVMEDLDGE AMEDLDGEAM EDLDGEAM
|
| |
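The coordinate and length fields in the record above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates (the usual convention for such records, and consistent with the listed gene length) and that the coding sequence ends with one stop codon. The function names `gene_length` and `gc_percent` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the Gene Information table.
start_bp, end_bp = 982179, 985007
protein_length_aa = 708

length = gene_length(start_bp, end_bp)
print(length)  # 2829, matching the "Gene Length" field

# Assumption: one stop codon follows the 708 codons of the protein.
coding_nt = (protein_length_aa + 1) * 3
print(coding_nt)           # 2127 nucleotides needed to encode the protein
print(length - coding_nt)  # 702 bp of the gene span that are not coding
```

The 702 bp difference is consistent with the record marking the gene as spliced: that portion of the span would be introns plus any untranslated flanking sequence. `gc_percent` can be applied to the gene sequence string, with or without its 10-base spacing, to compare against the listed GC content.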