Gene CNB01070 details


Gene Information

Locus tag: CNB01070
Symbol:
ID: 3255804
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 326201
End bp: 327441
Gene Length: 1241 bp
Protein Length: 332 aa
Translation table:
GC content: 49%
IMG OID: 638254758
Product: no arches protein, putative
Protein accession: XP_569113
Protein GI: 58263406
COG category: [R] General function prediction only
COG ID: [COG5084] Cleavage and polyadenylation specificity factor (CPSF) Clipper subunit and related makorin family Zn-finger proteins
TIGRFAM ID:
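
The gene length listed above is consistent with the start and end coordinates if both are read as 1-based, inclusive positions. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic:

```python
# Coordinates copied from the record above.
start_bp = 326201
end_bp = 327441

# Assumption: both coordinates are 1-based and inclusive,
# so the span covers (end - start + 1) positions.
gene_length = end_bp - start_bp + 1

print(gene_length)  # 1241, matching the "Gene Length" field
```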


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.0222813
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGCAG CCTCGAATTC CGCACCGTTA GATCCAAAGC TCGGACGAGC AGCAGATTTT 
GTTCGCCCTG ACTTCCACCA AGTCAATCTC GACCTGGAAA ACTACCTCAA GACTGAGCGC
AACTTCAAGC TCGATGCAGG TATGTCATTA ACGACTGCGT ATAACTTCTA TGCTGATATC
CTCTGTAGAC CAACAAATAT GTCCTCTGTC CATCACGCCT CTCGGCTGTC CTCTTCCGCC
TTCACAATGT CCTTATCGTC ACACTACCCC CTCCCAACTC AATTTCAAGC CACCACCTCC
TCTCCCGGCT CACCCTCGAG AGCGAGAAAA GAAGCTAACG GTATGCAAAC ACTACCTTCG
AAACCTCTGT AAAATGGGAG ACAATTGCGA GTACACCCAC GACTTTAACC TTCGCACCAT
GCCAGTGTGT ATATGGTTTG TCAAACAAGG CAAATGTGAG CTGGGAGGAG AGTGCCTGTA
TTTCCACCCC AGAGACAGAA GAGTTGAGTG TCCGGATTAC AACAGAGGAT TCTGCGTGCT
AGGTCCTAAT TGTCCGAGGA AGCATATAAG GAGGAGGCTG TGTGATGCCT ATGCCGCTGG
ATTTTGCCCT GATGGCAAGG ACTGCAAATT AGCTCAGTAA GCTGTTTTCG CGCGCCATTT
TTTCCCCATC ATCGCTAATC TCTCCAACTA GCCCGTCTCC CAACCGACCG CCTGCAGAAT
CATATATCAA CCCTATCCCA CCTGACCCCG AAGCCTTCAA TGGCCCACCA CCCCAACTGC
CTGCTGGCTA TGGTCGTTGG CGGGAATACA AATATGACCC CAATGCAGTG GTTGTTCCAG
CTGCGGCGTG GGTTGAGGGT GGAAGTTTAT CTGGTTGGCG AGCTGGAGGA TTTCTGTCTG
CGAATGCAAG ACGAGATAAC CAAAGGAATA GAGACAATGA TGATGAGGGT GGACGCGGCA
GCGGAGGAGA GAGAAAAGGT GGCTGGCAAA AAGATCTTAG CACAGTGCTT TGTTTCGTAA
GTCTTATACT GTTACGATTC AATAGCTTGC GCTGATTACC AAAACAGAGG TGCAATCAGT
ATGGCCACTT TGCCAATAAC TGTCCTAATC AATATGTGCC TGGAGACCGG GGAGGCGGTA
GACGACGGGA ATGATAAGGG ACATCAGATG CTTCGAAAGT TTTAGAGATA TGTTGTAGCA
TACTAGTAAT GCAGTGACAG TAATGCAAAA AATCGCGATG T
 
Protein sequence
MAAASNSAPL DPKLGRAADF VRPDFHQVNL DLENYLKTER NFKLDADQQI CPLSITPLGC 
PLPPSQCPYR HTTPSQLNFK PPPPLPAHPR EREKKLTVCK HYLRNLCKMG DNCEYTHDFN
LRTMPVCIWF VKQGKCELGG ECLYFHPRDR RVECPDYNRG FCVLGPNCPR KHIRRRLCDA
YAAGFCPDGK DCKLAHPSPN RPPAESYINP IPPDPEAFNG PPPQLPAGYG RWREYKYDPN
AVVVPAAAWV EGGSLSGWRA GGFLSANARR DNQRNRDNDD EGGRGSGGER KGGWQKDLST
VLCFRCNQYG HFANNCPNQY VPGDRGGGRR RE
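
The sequences above are printed in space-separated blocks of ten residues, so the grouping has to be removed before they are passed to other tools. The sketch below is one way to do that and to check the result against the listed protein length. The helper name `clean_sequence` is hypothetical and is not part of any IMG tool.

```python
# Protein sequence copied from the record above, grouping preserved.
PROTEIN_TEXT = """
MAAASNSAPL DPKLGRAADF VRPDFHQVNL DLENYLKTER NFKLDADQQI CPLSITPLGC
PLPPSQCPYR HTTPSQLNFK PPPPLPAHPR EREKKLTVCK HYLRNLCKMG DNCEYTHDFN
LRTMPVCIWF VKQGKCELGG ECLYFHPRDR RVECPDYNRG FCVLGPNCPR KHIRRRLCDA
YAAGFCPDGK DCKLAHPSPN RPPAESYINP IPPDPEAFNG PPPQLPAGYG RWREYKYDPN
AVVVPAAAWV EGGSLSGWRA GGFLSANARR DNQRNRDNDD EGGRGSGGER KGGWQKDLST
VLCFRCNQYG HFANNCPNQY VPGDRGGGRR RE
"""


def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


protein = clean_sequence(PROTEIN_TEXT)
print(len(protein))  # 332, matching the "Protein Length" field
```

The same helper works on the gene sequence, since it uses the same ten-character grouping.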