Gene CNN01920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN01920 
Symbol 
ID3255326 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp553618 
End bp556977 
Gene Length3360 bp 
Protein Length999 aa 
Translation table 
GC content51% 
IMG OID638254610 
Producthypothetical protein 
Protein accessionXP_568670 
Protein GI58262520 
COG category[R] General function prediction only 
COG ID[COG0724] RNA-binding proteins (RRM domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.446711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAGG AAGACTTCAT CCCTCTCCAA GGTTCAGCTG GGAAGAAAGA ACGCAAGGAG 
CGTGTCAAGT ACGTACATCG CATACCCATA CAACCCTAGC TAACTCGACC ATCTAAAAGC
ACAACCCTTT TCGTCTCCTC CCTCCCTTAT ACAGCCACCA CCACCGACCT TCTCACCCAC
TTCTCATACA TCGGTCCTGT ACGACATGGT TTCGTCGCTA CCGACAGAGA ATCAGGCAAG
TCAAAAGGTG TCGGTTACGT GACTTTTTCG TTGAAAGAGG ATGCCGATCG GGCAATTCAA
GAGTTGGACG GAGGTTCTTT CGGTGGCAGT AAGAGGAAGA TCCAAGTGAA GTGGGCTGAT
GAAAGGGTGA GTGCCGTTTT TTTTTGATAT GACATGTGCA AAAACTAATG GTTGGGGGAA
CGTTACAAGG CGTCTCTGAA AGACCGAAAA GCCGAGATCA AGGTTTCAAA ACCCATCCCT
GGACAGACAG ACAACAAGTC GACGGACCCC AAGGCTATCC AAACTCTTGT TCTCACTGGG
TTACCATCTG ATATCACCAA GAATGTGCTT TGGAAGAAGA TTAGAAAGGT CAATGACAAG
GCCGAGTTGG TGTTCCCTGT CGAGGCTCAA GAAAACGAGG AGGAGGCTCC CAAGGATACT
GGTACGTTCT TCTCTGAAAC TTTTCAAACG TCTAGGAGCT AATCATATTA TATAAAGCCC
ACATCGTTTT CCCCTCCCAC GGTGATGCTC TCAAAGCTCT TCCCAAGCTT CACGGCCATA
CGTACAAGGG TAACATCTTG TCTTGTGTCC TCAAGAAGCG TCTTGAAAAA CTTTCCGCCA
AAGGAGAAGG CAAAGCTCCC AGCCACGCTG GTAGACTGAT CATTAGAAAT TTGTCTTGGG
ACGTAAGTCG CAGTACTCTT TTTATTTTCG GGAAATTACT AAAGAAGAAT ACTACAGACC
ACTATCCAAG ATCTCCGAAA AGCTTTCCTC CCTTACGGTC CTATCCATTC TATCGATCTC
CCTACCCTTC CCTCCAAACT CCCTCCTTCA TCCGACCCTG CCAAACCGCC GCCCCCTCCG
CGTGCGCGTG GCTTCGCATT TGTCTGGTTC TTGGCCCGAC ACGACGCTGA AAAGGCTATT
GAAGGCACCA ACGGTAAACC AATCAAGAAG GGTCCTGATG GTGAGGGTCG AGTGGTAGCT
GTTGACTGGG CGTTGAGTAA GGAAAAGTGG CAAGAGGCGA CTAAAGGAGA GGAGAAGAAG
GAAGGGGAGA AGGAAAGTTC TGATTCTGGG TCCGATTCTG GATCCGAGTC TGAGTCTAAT
TCGGAATCTG GCGAGGGATC TGATGAAGAA TCATCAGGTG ATGAAGGTAC TTCCGTAGTT
AGCGGCTCAG AAGAAAGCAG CAACACGGAT GAAAATGAAG AGGAGGAGGA GGAGGAGGAA
GAAGAACCTG TCAAGCCTAC CCTCCCCACT GTCGATGTCG GCAGTACCCT CTTCATCCGT
AACCTCCCCT TCGAAACTAC CGAACTTGAG CTTAACACCC TCTTCCGTTC TTTTGGGCCC
TTGCGATATG CCAAAATTAC CATTGACAAG GCAACTGGAC GATCCAGGGG AACTGGCTTC
GTTTGTTTCT GGAAGAATGA GCATGCCGAT GAAGTTATTG AAGAGGCGCA GAGAGTCGCG
ATGGAAACTG GTGCCAATTC TATCCCTGTA TGCTTATTTT TTCTCGCATT ATCCAACAGA
ACTACTGCTC ACACACGCAC TACAGCTTGG TGGCGCAGCC CCCAAGAATC CCTTCGCCCT
CCCTTCCCTC CTTACCGCTG ACCCTTCCTC TTCTCTCGCT TCCCGTCTTG TCCTTCACGG
GCGAACCCTC GACATCACCC GTGCCGTCAC CCGAGAAACC GCCTCCCAAA TGAAGGAAGA
CACTGAACGT CTCCGTAACG CCGCAGACAA GCGAAACACC TACCTCATGC GCGAAGGTGT
CATCTTCCCT AACTCTCCAG CCGCCGAGGG TCTGCCAGAG AGCGAGATTG AAAAACGTCA
GGCGAGCTTC AACTCTCGTA AAGCCTTGTT GCGAGGTAAC CCTTCGCTGT ACATCTCCAA
AACACGTCTA TCTATCCGAC AGTTGCCTCT CTTCGCGACT GATAGGACGC TCAAGCGATT
GGCTATCCAT GCAGTCAAAG CTTTCGACAA GGAAGTTGCT GATGGTGAGC GAGAAGGTTT
GGCTAGGGCA GAAGAGATGG ACGGTACCAT GAGTTCTGCT ATTGCCGCTC GTGGCGACAA
GTCAGGCGGT AAGGGTAAGG GTAAAGGAGG CAAGAAGGAG CGTGAGACAG CTGTCATCCA
ATCCAAGATT GTCCGCCAAA CTGAAAAGCT CGATCCCGTC ACAGGCCAAG GCAGGTCGAA
GGGTTATGGT TTCCTTGAAA TGCGTTCCCA CAAGGATGCA CTTAAGGTGC TTCGATGGGC
GAACAATAAC CCTGAAGTGG GTCCATTGAT GTGGGAGTGG TGGAAGGTTG AGCTCGGAGA
TATGAAGGAG CGAGTGGAAA AGGCTTTGAC GGATGCTAGG AAGAGGGAGG AAGAGCCACA
GAAGGAAACC GCCAAGGAGA GTAAGAAGGG ATTGGAAAGC GTTGAGGAGT TAGAGTCGAG
GTTGAAGAAG CTTGATAGCA GATTGGAAGA AGGCGATTCA AGAAGCGGTG GTGGAATGAG
AGGTGGAAAG ACTCTTATCA TCGAGTTCAG TATTGAGAAC GTCCAGGTAA GTCACTTTTG
AATATCCTTT TGCAAGTCAA GCCAGTGACT GATTCACCGT GATGTGTAGG TTGTGAAGCG
CCGAGTAGAA AAGATCACCA CGGCGAGAGA GAACGGGAAG CGCAAGGCGG ATATAATTGC
AGCTGAAGAC ACCGATGACG ACGAATCTGC CTCTTCCAAG CCCGCTGGCC GATTTGACAA
GCGCGCCAAG CGCGACGGAC GTGATGGTCG TGATCGTCGC GACAGGCATG ATGGTCCCGG
TGGTCGTGAG GGGTTAAAAG GCAACTTTAG AGGCAAGCAA GGTCAAGATC GAGAGCAAGG
CGGCAAGTTT GATAAAGACA GGCCTAGGGG AGGTCAGAAT GGCCCAAGAC GGGATTATAA
CGATCGTGGA GGTCGAAATG AACGAGGTGG TCGCAACGAC CGTGGTGACA GACCTGGTCA
AGTCCAGCCC CGACAACCTG GGAGGAGGGA CCAAAAAAGT GTGAACCAGG TGAAGAAGGA
TGTCAAGACC AAGGAAGCTG GGGAGAAGAG AGGTATTGAG AAGTTCGGAA GTCAACTGGG
TAGCATGATT GGGAGGAAGA GGAAGATGCG CAAGGGTGCA AAGTAGGTGG TGATTGTCCT
 
Protein sequence
MAEEDFIPLQ GSAGKKERKE RVNTTLFVSS LPYTATTTDL LTHFSYIGPV RHGFVATDRE 
SGKSKGVGYV TFSLKEDADR AIQELDGGSF GGSKRKIQVK WADERASLKD RKAEIKVSKP
IPGQTDNKST DPKAIQTLVL TGLPSDITKN VLWKKIRKVN DKAELVFPVE AQENEEEAPK
DTAHIVFPSH GDALKALPKL HGHTYKGNIL SCVLKKRLEK LSAKGEGKAP SHAGRLIIRN
LSWDTTIQDL RKAFLPYGPI HSIDLPTLPS KLPPSSDPAK PPPPPRARGF AFVWFLARHD
AEKAIEGTNG KPIKKGPDGE GRVVAVDWAL SKEKWQEATK GEEKKEGEKE SSDSGSDSGS
ESESNSESGE GSDEESSGDE GTSVVSGSEE SSNTDENEEE EEEEEEEPVK PTLPTVDVGS
TLFIRNLPFE TTELELNTLF RSFGPLRYAK ITIDKATGRS RGTGFVCFWK NEHADEVIEE
AQRVAMETGA NSIPLGGAAP KNPFALPSLL TADPSSSLAS RLVLHGRTLD ITRAVTRETA
SQMKEDTERL RNAADKRNTY LMREGVIFPN SPAAEGLPES EIEKRQASFN SRKALLRGNP
SLYISKTRLS IRQLPLFATD RTLKRLAIHA VKAFDKEVAD GEREGLARAE EMDGTMSSAI
AARGDKSGGK GKGKGGKKER ETAVIQSKIV RQTEKLDPVT GQGRSKGYGF LEMRSHKDAL
KVLRWANNNP EVGPLMWEWW KVELGDMKER VEKALTDARK REEEPQKETA KESKKGLESV
EELESRLKKL DSRLEEGDSR SGGGMRGGKT LIIEFSIENV QVVKRRVEKI TTARENGKRK
ADIIAAEDTD DDESASSKPA GRFDKRAKRD GRDGRDRRDR HDGPGGREGL KGNFRGKQGQ
DREQGGKFDK DRPRGGQNGP RRDYNDRGGR NERGGRNDRG DRPGQVQPRQ PGRRDQKSVN
QVKKDVKTKE AGEKRGIEKF GSQLGSMIGR KRKMRKGAK