Gene CNB02850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB02850 
Symbol 
ID3255619 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp844033 
End bp847150 
Gene Length3118 bp 
Protein Length940 aa 
Translation table 
GC content50% 
IMG OID638254935 
ProductSer/Thr protein kinase, putative 
Protein accessionXP_568873 
Protein GI58262926 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCG AGATACCTGC GGCTGATGAC GACGGAGAGC TTCACCCGCC TGCGTCTCCA 
GAGAGTTCAT TCTACGTACG TACAGACATA AACTCTATAG CCTAGTTGAC CAAATCCCAA
GGCTATCAAA ATTGTTGATC GGAACCCCAA AAGAAAACGT CTCACTGGTC TCGGTCGACA
CAAAGGGTCC TCGGGAGGGG CTAAGCTGCT GAACGAGAAC GAGTGAGTTA ATCTAACATA
GATCAATACT GACCATCGTC AGAATCCGCA AGGAGATTGC AATATTCAAA AAGGTTAACC
ATCCTAATGT TGTGAGGATG AAGGAGATTA TCGATGACCC GGAGTCAAGC AAAATCTACA
TGATCCTCGA ATGGTGTCAG AATGGAGAAA TCAAATGGAA GGATGGAGAA GGTCTTCCGG
CCTTGACTGT AGGCGAGACT CGTAAGATCT TTCGAGATAC GCTACTCGGG CTGGAATATT
GTGAGTGTAA AGCTTTTTTG CCAATGTAAT TGTGGCTGAG TGATTGCCAG TGCATCATCA
AGGCATAATT CATCGTGACA TCAAACCGAG CAACCTCCTA CGGTCTGGAG ACAACACCGT
CAAAATATCG GATTTTGGTT GCTCTCATTT CTCTGAAGCT CTGAGAGCAG CGGCAGCGCA
ACCAGGTCCT GAAGGCGACG CTTACGTTGA TGACATTGAA CTCGCAAAGA CAGCTGGATC
ACCCGCCTTC TTTGCTCCTG AAATGTGCTA TTCGGGCCTT GACTCAGAGG GCCCCTCCGG
ACCTCCCAGC CATCCAAGTC CAAATCAAGA GGTACCATCC TTTATTATCC GACCTCCCTC
ATCTGTCGAT ACTTCTGCGG ACACTTCTTC AAGCCTAATG AGTTTACCGC GCTCATCCAC
TACTTTCCCT CTCAAACCCA CTGATAGCAA CGATTCGGCC GGCTCTCGTC GCCCTCTGTC
TTTCAGATCA CAATCCTCTT CGGCTACTAT CCAGCGTCGT GAAAGGCTTC CTATAACCAA
CGCTATCGAC GTGTGGGCAC TTGGTGTCAC TCTTTATTGT TTGCTGTTTG GCAAAACGCC
CTTTAATGCA CCTAACGAAT ATCTTTTGAT GCAAGTCATC GTGTCGGAAT CATATATCGT
CCCCCCTTTC ATGGGCAAAG ATCGTCTCCC CACTGGAACA GGAGGGCTCC CTGCGGCAGA
AGAAGCGGTA GAATGTTTGG ATCTGTTGAA AAGACTTTTG GAGAAAGATG CGGGAAAAAG
GATCACACTT GAGCAAGCAA AGGTGAGTCG ATACAATATC GTACATGACT GACAATCAGC
AACATGGTTT CACTCTCCAT GGATTAGCCG ATTCTGCTAC TTGGCTCGCC AAGACAGATC
CGCACACTCA CACATTCGTT ACCGTTTCAA ATGACGAAGT TGCGGCGGTT ATTACCAAGT
CTGTCCGTTT TCGCGATCGT TTCCGAAAGG GCATCAAGAC GATCTCGCAG AAACTGCAGC
TACTTGGAAC CGGGCGCACG CGTAGCCGCA GTTTTGGAGA TGGAGAATCA ACAGGAACTA
CTAATACGGA GTATTCATTG ACGCCAGCAG GGTCTGGGTC CGGGACCCCA AGAAGCAATA
AACTCAGTGC ACTTCTCCCA GTCTCTACTC CGAATAAAGA TGTGTCACCC ATGAACAGCC
CTCTTCCTCA TACAAGCATA GGACGCCGCT TTTCGCTTCT CAATGGCAGA TCACCTGATT
ACCCCATTTC TCCCCAGACC ACGACTCCGG TGGCTCAGAT GGTCTCTCCA GGGCTGTCCA
AGAACCCGTC TGCGCTCGAT TTCAAAAACG AACCGGCTGG GCGAGTAACC AGCAACCCAG
GGCCTATTGG GCTTGGATCC ATGATCCATC GACGACTGTC TTCTCACCTC GTCCCACAAG
CCAAACCTAG CATTGTGCCT TTATTGGACG AAACAACACA TTCTCCTCGT CCAGCTACGT
CGTCAGTGTC TCTAGACAAA TTTAAACTCT CTTCAGAGCA GCAATCGATA AGTGGAAGCT
TTCGTCGCCG GAAATCCATT GAGGCGGAGA TGGAAGGCCG ACGCCGGTCT CATAGTAATG
CGTCCAGTAT CAGCAGCAAA CTTGCCAGAT TATTACGCAC CGGAAGCCAA CGGTCACATC
CTCGTGTCTT GGAGAAAGAA AGTCTTGCCG GGTCGGATAC GGAAGATCTA GCTGTTGAGC
CGTCTGTGGG GTCTTCATCG CCGGCGGATG CGCTGGGGCG GATGAGTCTC GATGGCTCTC
TTCCTCGTCA GAGCCTGGAA CATATTGAAA CAGGCTCACG TTCATCAAGT CAAGGATATA
TACCCAGCCC TGATCGTCCG TTACCTGCTG GCTGGGAGTC CAGATTCCGC ACCAATGCAC
CTCGGCGTGG GTCCAACTTG AGTGAGGAAT TCACTAATCG AGTGGCAGAA GAAGAGATTG
ACTGGGATGG GTCAATATCA GATGAAGACG ATTATGATGA TACGACATCT CAGTCTGTCG
CCGCACCAAC GCCTGTAGCG GGCATGAATC CTCTCTGGCG AAGGGCACGC AATGACAATC
TTGGTGTTGA TGCTCAATCA TCCGCGCCGA TAGTATCTGC GGCGCCTTCT CTTGAGCCAA
TACCGGACGG TTCGCCCTCT GCGCCAACTA CACTTCCTGC CCGCTCCCCA TCGAAGCCTA
CTCTCCCCGA GATCAACACA GGTTCCTCTG ACCCTTTATA TCAAACCTCT TCACGAACGA
GTTCACGACT TTCACCTTCC CCTTTCCGCA ACAACTTTGC CGAGAAAGCA CGAAGCCCCT
TTGCGGGACA CCATGATGAT ACACGAAACA GCCCAAGAAG AAAGGGCTTG AATTGGCAGA
CGAGCGCACC GGGTCTGGTA GATAATGACG AAGACGAGGG CCTAGCGATT TCTTTTGGGG
GTAAACGGGG CCGGAAAGGG TCAGTTCAGA AGCCTACGAT GAGCGAAGAC AAATGACGCC
TATTCTTGTA TGAAAGCATA GACATAGAAC AGCCACCCTA GTTTTAATTG TCCCGTCTGT
CCGACATGCA GCATGATTCT GAGCAAGGCT AATAATATGT ACATAATTAC TGTGTATG
 
Protein sequence
MTTEIPAADD DGELHPPASP ESSFYAIKIV DRNPKRKRLT GLGRHKGSSG GAKLLNENEI 
RKEIAIFKKV NHPNVVRMKE IIDDPESSKI YMILEWCQNG EIKWKDGEGL PALTVGETRK
IFRDTLLGLE YLHHQGIIHR DIKPSNLLRS GDNTVKISDF GCSHFSEALR AAAAQPGPEG
DAYVDDIELA KTAGSPAFFA PEMCYSGLDS EGPSGPPSHP SPNQEVPSFI IRPPSSVDTS
ADTSSSLMSL PRSSTTFPLK PTDSNDSAGS RRPLSFRSQS SSATIQRRER LPITNAIDVW
ALGVTLYCLL FGKTPFNAPN EYLLMQVIVS ESYIVPPFMG KDRLPTGTGG LPAAEEAVEC
LDLLKRLLEK DAGKRITLEQ AKQHGFTLHG LADSATWLAK TDPHTHTFVT VSNDEVAAVI
TKSVRFRDRF RKGIKTISQK LQLLGTGRTR SRSFGDGEST GTTNTEYSLT PAGSGSGTPR
SNKLSALLPV STPNKDVSPM NSPLPHTSIG RRFSLLNGRS PDYPISPQTT TPVAQMVSPG
LSKNPSALDF KNEPAGRVTS NPGPIGLGSM IHRRLSSHLV PQAKPSIVPL LDETTHSPRP
ATSSVSLDKF KLSSEQQSIS GSFRRRKSIE AEMEGRRRSH SNASSISSKL ARLLRTGSQR
SHPRVLEKES LAGSDTEDLA VEPSVGSSSP ADALGRMSLD GSLPRQSLEH IETGSRSSSQ
GYIPSPDRPL PAGWESRFRT NAPRRGSNLS EEFTNRVAEE EIDWDGSISD EDDYDDTTSQ
SVAAPTPVAG MNPLWRRARN DNLGVDAQSS APIVSAAPSL EPIPDGSPSA PTTLPARSPS
KPTLPEINTG SSDPLYQTSS RTSSRLSPSP FRNNFAEKAR SPFAGHHDDT RNSPRRKGLN
WQTSAPGLVD NDEDEGLAIS FGGKRGRKGS VQKPTMSEDK