Gene CNB02310 details

Gene Information

Locus tag: CNB02310
Symbol:
ID: 3255883
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 668499
End bp: 670047
Gene Length: 1549 bp
Protein Length: 444 aa
Translation table:
GC content: 46%
IMG OID: 638254882
Product: conserved hypothetical protein
Protein accession: XP_569319
Protein GI: 58264326
COG category: [R] General function prediction only
COG ID: [COG0724] RNA-binding proteins (RRM domain)
TIGRFAM ID:
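
The listed gene length follows from the coordinates if Start bp and End bp are read as 1-based and inclusive. That reading is an assumption, though it matches the 1549 bp shown. A minimal sketch of the arithmetic, using a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
length_bp = gene_length(668499, 670047)

# Bases needed to encode the listed protein, stop codon not counted.
coding_bp = 444 * 3

print(length_bp, coding_bp)
```

The 444 codons account for 1332 bp, fewer than the 1549 bp genomic span, which is consistent with the gene being marked as spliced.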


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value: 0.935562
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGGCCG AAATCATCTT GAAGCAGGCT AACCAAGACG TTCAGGTCCA GGTCAACATA 
TCAGACCTGC CCCAGCTTTT GGCCCAGGTT CACAGAGTGC CTCCGGACTA GTCGATTACA
GCAACATCTT CATTAAAAAT CTGGATTCCG ACATCAACTC GTTCTATCTT GAAGAGACTT
TCAGTCAGGT ATGAGTTATT TCAAGGGCTT AATTCTTCTC AGTGTGACGT TGGCTGAATT
TACCTCGGGT AGTTTGGTCG AGTAATCAGT GCTCGTGTGA TGCGTGATGA CCATCAGCGC
AGCAGAGGTT ATGGTTTTGT CAGCTTCTAC ACTCCTGAGG AAGGTTTGTT GCGGTCTAAA
ATGCTGAGAT TCTATCTCTA ATGCATCTAT TAGCTGCTTG CGCTGTCAAA GCTATGAATG
GGACTCAATT TGGTCGACAG GTCCTTTCTG TAACTCTTCA TGAGCCTCGT AAGCTTAGGC
CAGAGAAGAT TGCCGAGCGC GTTGCGCAAG GTCTCATCCA CCGTCAAGCT GTGCCTCCTC
ATGTTGCGAG GCGATCCTCT AACCCTGTCA AATCTCGTCG ATCATTTCGC GAGATTAGTA
TTTTTGATCG TTCCAACGAC TCGGACCCTT CGGATGATAT TCGCCTTCTC AGTCCGGAAA
GTCGGAAGGC GGTGCTTGAG AAAAGACTCA TTGCTCGAGT TCGAGTATAT GCAAAGAAGC
GGTCGGTTTC AGAAGAAATG ATTCAACCTA TCGTCAATTC ACTCTTACCG TCTGATTTGG
CGTTGATCCC TCTGCTGCAC AATCATACAC AGTTGGATAA CAGGATAGCA GAAACGCTTT
CTTCCATTCA GGAAGTTCCT GAAGTGCCTG CAGCAGCTCA AAGACCGACC GAAAGCGACA
TTGACGATCT TCGTGTACAA ATAGAAAAGA TCGATTCCGA TGACGCCGAT AACGTAATGC
GTGTCTTCAT GGACATATTG AAGGCAGAAG ATTGGGAAAG GGGACTGGGT AATAAGGCAG
AGGTAGCAAT TAAATACGGG GACGCGAAAC GACTCTTACT GAAGGAGAGG AATGAGAGAC
AAAAGGAGTC CAGTGAGGAT ATCAGCGACA AAGACGCAAT GACCGGTGAC CCAATATTTG
AAACTGGGCT GTCTGATGAA CAAGATCCCA CTATAATACC CCTGGAAACA ATTACCGTCC
CACTGCTCGC TTCCCTTTCG GCGAAAAAGA TTATCTGGAA TCTATCGTCG TCATATGGCT
CTGATATACT GCTGAAATTG GGATTGAAAG CTCCGAGCGA TCAAGAAAGA AATAGTTTGT
TAGCATGGCG AGCAAAGATT ATGAACAAAG GAAACGTCGT CATGAGGGGA GAGATGGTTA
GGATGTTGGA GAGGCATGCT GTGGTGCGTA CTATTCGCGT TTGAAATTCC CGAAATTGAA
ACTGACTTGG AATGGACAGG TTGATGGTCT GAAGCGCAGC CAGAGAATCA AAACTCTGCG
AGAGTTTGTA AATGCAGAGG ATGATGATGA GTCTTTATGT GAATTGTAA
 
Protein sequence
MWAEIILKQA NQDVQVQSAS GLVDYSNIFI KNLDSDINSF YLEETFSQFG RVISARVMRD 
DHQRSRGYGF VSFYTPEEAA CAVKAMNGTQ FGRQVLSVTL HEPRKLRPEK IAERVAQGLI
HRQAVPPHVA RRSSNPVKSR RSFREISIFD RSNDSDPSDD IRLLSPESRK AVLEKRLIAR
VRVYAKKRSV SEEMIQPIVN SLLPSDLALI PLLHNHTQLD NRIAETLSSI QEVPEVPAAA
QRPTESDIDD LRVQIEKIDS DDADNVMRVF MDILKAEDWE RGLGNKAEVA IKYGDAKRLL
LKERNERQKE SSEDISDKDA MTGDPIFETG LSDEQDPTII PLETITVPLL ASLSAKKIIW
NLSSSYGSDI LLKLGLKAPS DQERNSLLAW RAKIMNKGNV VMRGEMVRML ERHAVVDGLK
RSQRIKTLRE FVNAEDDDES LCEL
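
The sequences above are printed in space-separated blocks of ten residues. A minimal sketch, with hypothetical helper names, of how such block-formatted text might be joined into one string and how a GC fraction could be computed from it. How the record's own GC content figure was derived is not stated, so this is only one plausible calculation:

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence shown above, used as a short example.
first_line = "ATGTGGGCCG AAATCATCTT GAAGCAGGCT AACCAAGACG TTCAGGTCCA GGTCAACATA"
seq = join_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `join_blocks` gives a string whose length can be compared with the Gene Length field.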