Gene CNI00920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI00920 
Symbol 
ID3259395 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp228034 
End bp229558 
Gene Length1525 bp 
Protein Length401 aa 
Translation table 
GC content52% 
IMG OID638258577 
Productsmall nuclear ribonucleoprotein, putative 
Protein accessionXP_572846 
Protein GI58271380 
COG category[A] RNA processing and modification 
COG ID[COG5200] U1 snRNP component, mediates U1 snRNP association with cap-binding complex 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATTTGAGACA TCCAACATGG GTCGATTAGC CGAAGCACAG AGAAGGCTCC TCGAGGTGAG 
CAAGTTACCG GAGCATATGC GAAATCCTTT CTTACCTTCT GCAGCAAATG ATGGGCCCCG
AAGCAATGGG TATCCAGGCG GTCAATCTCG ACTGGTGGAA CGAAAAAGTC TGTAGGAACT
TCCTCTTCGG AACATGTCTG CACACCTTGT TTGGTAACAC CGTAAGTTAC AATGGCTTAG
GTTAAAGATG TACCGGTCTT ATAGAAGCTG TCATAGAAAA TGGATCTTGG ACCATGCCCC
AAGGTTCATT CTGACCGTAT CCTCAAACAA TTTCGAGAGC ATGCCGAAGC GAACCCCAAC
GATCCCAGGC TCTCTGCTTT CAGACAAGAG CACGAAAACA GCCTGTACTC ATTTGTGGAG
GATTGTGACA GAAGGATAAG GGCGAGTCAG AGAAAGTTGG AAAAGACACC CGAAGAGAAC
AGAAAGACCG TCGATCTCGT GAGTGGGGTC TGCTGGCCAG CTGAGTTGAA ATACTGATAA
GGTGTAGATG AGGGAAATTG GAGAAATTGA GCTTTCTATC CAAGGTGGTA CAGAGGAAAT
TGAGGCGCTT GGTGAGGCAG GAAAGGTTGA GGAGTCTATG GAGAAACTTG CCGCCGTCGA
CGCTCTCAAG GCTATGAAGG CCGAAAAAGA AAAGGATCTT CAGCACCTCA ACGAAAACGC
TGGTGCAAGT GGTCACCAGA AGCTTCGTGT CTGCGAGACT TGCGTATGTT TGATCCCTTT
TCCCCATTGG CTCTCAACTG ACAAATGTAT AAAGGGTGCC ATGTTGAGTG TTCTTGACTC
TGACAAACGT CTTGCCGACC ACTTCGGTGG TAAGCTCCAT TTGGGTTACC ACGAGCTCCG
TAAAATCCTC AGTGCCTTCT CCGAAGCCCG CATGACCGGT CGACCCATAC CCATCATCCC
TCCCAAATCC CCTCGAGCTG ACGGGGATGA GCCTTTACCG TTCTCCGCCC CCGCCATCCC
TGCCACCGCA CCTGCTGCCC CCACTGGTCC TCGTTCTGGG CTAAACCCCC CTATGGGCCC
TTCAGGCAAC GAGCCGCACA CGCCCCACCA CGCTCGTGTT CCCCCTGTTG AAGAGATGCC
CGTTGTGGGA CATGGCGACA AGGTGAAGCG CGAGGCTGGC GAGCTGGTGG AGGATCTGAA
GGAAGAGAGG GCGATGGAAG AAAGGGATAA GGAGAGGGAG AGGTATAGGG AACGAGAGAG
AGATAGGGAT GATAGACATG ACAGAAATAG GTATGACGAT AGGGATAGGG ACAGGTATAG
GGACCGAGAC AGGGAGAGGG ACAGCAGGTA TGGAGGAGAC AGAGATAGGA AGAAGAGCTA
CGAGTAAGTG CTGCATCTGA TTGAAGACGG GTTGTGCTAA TGGTGGCAAC AGCCGCGAGA
GATCGAGGTC TCCTGTCAAG CGACGCGTTC TGTAACAAAC TCTGGGCCTT GTTGAAGTGG
TAAATATGCA GTTTGATCGA CTATC
 
Protein sequence
MGRLAEAQRR LLEQMMGPEA MGIQAVNLDW WNEKVCRNFL FGTCLHTLFG NTKMDLGPCP 
KVHSDRILKQ FREHAEANPN DPRLSAFRQE HENSLYSFVE DCDRRIRASQ RKLEKTPEEN
RKTVDLMREI GEIELSIQGG TEEIEALGEA GKVEESMEKL AAVDALKAMK AEKEKDLQHL
NENAGASGHQ KLRVCETCGA MLSVLDSDKR LADHFGGKLH LGYHELRKIL SAFSEARMTG
RPIPIIPPKS PRADGDEPLP FSAPAIPATA PAAPTGPRSG LNPPMGPSGN EPHTPHHARV
PPVEEMPVVG HGDKVKREAG ELVEDLKEER AMEERDKERE RYRERERDRD DRHDRNRYDD
RDRDRYRDRD RERDSRYGGD RDRKKSYDRE RSRSPVKRRV L