Gene CNA01270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA01270 
Symbol 
ID3253767 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp343821 
End bp345760 
Gene Length1940 bp 
Protein Length429 aa 
Translation table 
GC content51% 
IMG OID638252460 
ProductU1 snRNP 70K protein (short form), putative 
Protein accessionXP_566577 
Protein GI58258329 
COG category[R] General function prediction only 
COG ID[COG0724] RNA-binding proteins (RRM domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCAAAAACA TCCTCAATAA AGCCCCACGA AGGGCCGATA ACCAACATGT CACACCTCCT 
TCCTCCTAAC CTCCTGAAGC TTTTCGCCCC TCGACCTCAG CCGCCCTTCT TAAAACCCCT
GACGAGGGAC GAACGCATCC GAGGTCCAAA CAACCTTGGG GGTGTTGCCG GCCTTGCTAA
GCGTATAAAA GAAGAAGCCG AAGATGCTGA GGTCAAGCAA GGGATGGGCA TGAATCCCGA
AAAAGCTCTC GATGAACAAA AGGAGGAAAA TGTAAAGATG GAATTGAAGC AAGATGGCGA
AGATGGAGAG GTCGCGGAGG ATTCTGGCAA GGAGAAGAAG AAAAAGACAG CTAGAGATAA
GATTGCGGAG ATGGGTATCA TTGGAGAGGA GGCTGTCAAA ATGAGGAAGG AATTGAGGAA
GAAGAGGCAA GAGGAGTACA AGAAGAATGC CGAGAGCAAC TGTGGGTATT ATCTAACCGT
CTATGTAAGG TCACTAACAT TTCTGCAGAT AAACCTCAGG ATGACGCTCA AGCCATCGGT
GACCCTTATA AAACCCTTTT CATCTCAAGG CTAGTAAGTA GATGTTGTAA ACCTATCTGA
CCGTTCTCAC TGACACGATG TTCCACAGTC TAAGAAAGCG AACGAGACAG ACCTTCGCCG
AGAGTTTGAA ATGTATGGCC CTATCGAGCG GATACGTATC GTTCGAAATC GAAAGGGCAA
GAGTAACGGC TACGCCTTTA TTGTTTATGA GCGAGAACGA GATATGAAAG GTGGGCCTTC
AATTGCTGTT CAGAAGGTTA ATAACTTACG ATTGGCTAGC CGCATACAAG GACGCTGAAG
GCATTCCTAT TCACCACAAG AAGATTCTTG TCGATGTCGA GCGTGGACGC ACTGTTAAAG
GATGGAAACC TCAACGTCTC GGTGGAGGTC TTGGTGGCCG TCCCAAACCC GTCGCCCCAG
AAGCCTCCCC TGCACCGTAT GTCGCTCCCA GCAACCTCCG AGGTGGCCGA GGAGGGTTCC
GTGGAGGAGG ACGAGGCGGC GGTGCTGGTT TCCGAGGTGG TTTTCAAGGG AACAGGGGAG
GGTTTGGCGG AGGTAACAGA GGAGGATTTG GCGGAGGGGA TGACAGAGGA GGTTTCGCTG
GACGAGGCGG TTTCCGAGGG GGTTTCCAAA ATCGAGGAGA TCGTGGTCCT GGTGGATTTG
GCCAACAACA GGGAGGGTTT GGCGGTCCAG GAGGCCCAGG TGGATACGGT GGACAGGGAG
GAGGGTACGG TGGCGGTGGT GGCGGCTTCG GGTAAGTTCT CATCCGATAC GCACTGTCCG
TCTGTGGTAC TAACAAGTCA GCAGCGGTCC CGGCCAGCAA AATGGAGTTC AAGGCGGACA
AGGTGGAGGA GGCTTTGGGG GGGGTGGAGG TTACAAACGC GATTACGACA ACGCCGGTGG
ACCTGGTGGT TATGGTGGAG GAGGCGGCGG TGGTGGTGGT GGGTTCGGGG GAGGCGGTGG
AGGTTACCAG GACAGGGATC CCAAGAGGAT GCGGTATTGA GATGTTCTAT TTATTTGATT
CATTTACTCA ATTTTCAGTG CGGTCCGATG AGGAGATATT TGCGGAAGCC GTACGGAGCA
TTGTGGGTGC ATTTGCACAG GTCTAAGTAT GTAAAACAAG GTATAGCATT TTCTCGGGTT
CTGTCTGTTC GTGTGCTTCC TGTATCGACC TACCATCACC ACGTTCTTGG CACTTGAGGA
AGCCTAATGA TAGGAACAAC ATTATTTAAT AAATGGTGGG TATCGGTAGG TGCCTGTACT
CGATCAGCAA GTCACCCCCT TCTTTTATCA CATTATCGAC AACTGAATTG AGGCTAGATA
AAGTGTTGGG AGACTGACCT CAGAGAAGTA GGCGTACTGT ATTTGGATAG GTAGCAGGCA
TATACTTCGC TTCGAAGCAG
 
Protein sequence
MSHLLPPNLL KLFAPRPQPP FLKPLTRDER IRGPNNLGGV AGLAKRIKEE AEDAEVKQGM 
GMNPEKALDE QKEENVKMEL KQDGEDGEVA EDSGKEKKKK TARDKIAEMG IIGEEAVKMR
KELRKKRQEE YKKNAESNYK PQDDAQAIGD PYKTLFISRL SKKANETDLR REFEMYGPIE
RIRIVRNRKG KSNGYAFIVY ERERDMKAAY KDAEGIPIHH KKILVDVERG RTVKGWKPQR
LGGGLGGRPK PVAPEASPAP YVAPSNLRGG RGGFRGGGRG GGAGFRGGFQ GNRGGFGGGN
RGGFGGGDDR GGFAGRGGFR GGFQNRGDRG PGGFGQQQGG FGGPGGPGGY GGQGGGYGGG
GGGFGGPGQQ NGVQGGQGGG GFGGGGGYKR DYDNAGGPGG YGGGGGGGGG GFGGGGGGYQ
DRDPKRMRY