Gene CNF00140 details

Gene Information

Locus tag: CNF00140
Symbol:
ID: 3258468
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 41651
End bp: 44989
Gene Length: 3339 bp
Protein Length: 781 aa
Translation table:
GC content: 53%
IMG OID: 638257135
Product: conserved hypothetical protein
Protein accession: XP_571284
Protein GI: 58268256
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
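
The gene length follows from the start and end coordinates. The sketch below is a minimal illustration that assumes the coordinates are 1-based and inclusive; that assumption is consistent with the listed length but is not stated in the record. The function name `span_length` is hypothetical. It also compares the span with the number of bases needed to encode the 781 aa protein plus a stop codon, which gives a rough upper bound on how much of the span is coding in this spliced gene.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

# Values taken from the record above.
gene_len = span_length(41651, 44989)

# Bases needed for 781 codons plus one stop codon (assumes a standard
# triplet code; the record leaves the translation table blank).
coding_bp = 781 * 3 + 3

# Whatever remains is non-coding within the span (introns and possibly
# untranslated sequence); the record does not break this down further.
noncoding_bp = gene_len - coding_bp

print(gene_len, coding_bp, noncoding_bp)
```

The computed span matches the listed gene length of 3339 bp, and the gap between span and coding bases is in line with the record marking the gene as spliced.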


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.505175
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCCCT CGCGCGACTC GGCACTCGCC TCCGCATCCG CCTCCCGCGC CCAGAGCTAC 
AAGGGGTCCC CCAGCATCAA CCCGCGGTAC TCGACCGGCA ACCCTGCGTC CACGCCGCCG
TTAGCCGCGA ACGGCGTCCC CCCACCGCGG CCTAATCGCG CAGGCACGCT GCCGCTCGAC
CTCTCGCTCG ATAGGGATCC CAGCCCGCAA CCTGCGTCTG CCCGCTCACC CGCCTCCCAG
CTGCCGCCCG TCCTGCCGTC CCCCGCCGTA TCCCCCGGCG TATTCTCCCC GCCGACGCTG
GGCCAGCCAT TCGCCGCCCC TGTTGGCCCC GCGCCTGGCA ACCCGTATTT CCCCAGCGCG
ACCGCCGCGA TCGAAAAGGG CATGGAGGAC GTCAAGATGT CCGGCCCCGT CGGGGTCGGT
GTACCCATGG GTGTCGTCGA ACCGAGAGAA AAAGAGTTGC CACGTGAGCC CGGGTCGGCG
GCGATGGGCG GGAGAAGCAG GAGTGGGACC GGGAGGAGCA GTAAGGATAA AAAGAGCATG
TTTGGGTTCG TCTCTGGTAC GTCTCTTTTC TCTCTTTCCC GAAGCGTTTT CTCGGGAGAG
GAGATGGAGT GCTGACGAAC GAGGTAGATT TACTAGGCAA GGACAAGCCG CCAGTGATTT
CGAAACCGTA TGATCCGGTG CATGTCACCC ATGTCGGATT TGATTTCCAA ACCGGAAAGT
GTGCGTCCGG CCTTCTTTTT CTGGTATTGT GTGCACTGAC AAGCGCGATA TGTAGACACC
GGTATGCCCC CCAAATGGCA GCAAGTCCTC GACGACAATG GCATCACCCA AGACGAGCAG
GAACGAAACC CGAACGGCGT CATGGCCGTC GTGCAGTACC TGAAACATCA AGACGAAGGC
GAAGATGAAG AAGAGGAAGT ATGGGCAAAG ATGAAGAATG CGCAACCGCC TGCGTTTCCT
CCACCCTCTG CCGCGCCGTC GCAGCCGACA ACGCCGGGAG GAGGGGTCGG GAGTAGAGAG
ATGAGTAGGG AACCGAGTAA TGAGGCGCTC GGCGCAGCGG GAACACAGGT GGTCGATTTC
ACGTGTCCGA GAATGGCTCC TGCTCCGCCT ACCAAGCCGT CCCTCAACCG AATGCTCGTG
TGTTTTTTTT TCCAATTGTA GATGGAAAAA GTACTGACAG ATATAGTCGG AACGACATGC
GCCGGCTTCG CACCGACCGG CCGAACTCAC CACCCCTGCC CCTCTGGCAG CCGCTCCTAG
GGTGACACAG GCGTATTCTT CCGCATTATC GCATTCACCC CACCTCCCAC CTCCTCATCC
CACGTCGAGC GCTCCGCCGA CCGCGCCCGC GCCGCATCTC GATAGGTCGT ACTCCCAGCG
TGCGCCGGTG TCTGGTACAA AGACAAAAGT GTTGGATCGG GCGAATACGA CGAGGTCGCC
GGGATCGAGT GCGGGGATAG CGCAGGGCGC GGGCACGGGC GTGGGTTTGA CGAAATCGCA
AAGTCAGTCG GGACATAAAT CGCGCGATCC ATCGAGAGAA CCAAAAGATT CAGCTTCTTC
CTCTGGCGGA TTATCGAGAA ACCAAACGAC GACGCGACAG CAGCAAGGGG CGACTCCCCG
TAGAAGGGAA AAGGAGAAGA AGGAGAATGA AGACGTGATA AGGCAGTTGA GGATGATATG
TACACCCGGA GACCCGAATT TGGTGTACAA GAATTTCAGA AAGATTGGTC AGGGGTGCGT
CCGATTGGTA TGCAGATGAA AACACGGAGG CTAACACGAA AACAGTGCGT CGGGCGGTGT
ATACACCGCG ATAGATCGTC AAACTCTGCC GGTCGCTATC AAACAGATGA ACCTCGAAAA
ACAGCCGAAA CAGGATCTCA TCATCAACGA AATCCTCGTC ATGCGCGAAT CTGCCCATCC
AAACATTGTA AACTTTAAAG ACTCGTACCT CTGGCAAGGC GATCTGTGGG TGGTGATGGA
GTACATGGAA GGAGGCAGTC TGACAGATGT GGTGACGGCG CATTGTATGA GTGAAGCGCA
GATTGCGAGT GTGAGCAGAG AGGTTTGCGA GGGCCTGAGA CATTTACATA GTAAAGGGGT
GATACATCGC GATATCAAGA GTGATAACAT CTTGTTATCC CTGAATGGTG ATGTCAAGCT
TAGTAAGTTT GTTTCGATCT CTTATTTTTA TTCACTGCTG ACGATAAGAA ATAGCCGACT
TTGGTTTCTG CGCGCGTATT GCCGACCCGA CGACGACGAA GCGGACGACC ATGGTGGGCA
CACCGTATTG GATGGCGCCA GAGGTGGTGT TGCGGAAAGA GTATGGGCCG AATGTTGATA
TTTGGAGTCT GGGTATTTTG GCGATCGGTA TGTCGTCTGG GTCGGGACTT GATTGGGCTA
ATTTTTTTTT TTATTTAGAA ATGCTCGAAG GCGAACCGCC ATACCTTACC GAAAACCCGG
TGAGGGCGCT CTACCTCATC GCTACAAACG GTACACCCAA AATCAAGGAT TGGGACAAGC
TTTCGACCGT GTTTAGGGAT TACTTCAAGG TCACCCTCCA GGTTGATCCG GCCAAGAGAC
CGACGGCGGC GGCGATATTA AAGGTGAGTC GCGCGTCGTG TGTTACCCAA AAAAGATGGA
AAGAGGCTGA TGGAGAAATG ATTTTTAAAA GCATGAATTC TTCAAGCATA CAGCCCCGTT
GATATCATTA GCGCCTATGA TCCGATCGTC GCGCAAGAGT TAGACCGGCG ACTCTTGCTC
GTATCGTCAT ATCATCGTAG ATCCCACCCA TCGACCATCG CCCCTTCCCT TGTCATTTTT
ATTCTTATTT TTTTTTGGCC GCTACTGCAT TTGCACAAAG GCGGCCTCTG TATATTCAAG
ATCGAACCAC ATCCGAGCAA AAGGAGAAAC AAAAAGAGGG AGAAAGGAAA AAAAAAGACC
TTCAACTGTG TATACTTTCA CCACCAATCA CCTTTTTTGC AAAAGCATTT TTTTTTTCAG
CCTCGACCGA AAAGAAGAAA CGCTCGGCAG TGCAATACGT GTTAATCTCT CTTCTTTTTG
TCATTCTTTC GTTTCAGCTT TGCGTGGATT TCCCTGTAGA TTTTTATATT TCATTTTGGA
TTCGGGCTTC TTCCCAACCT CTTTTTTTTT TTTTTTTCCC CGTTTCGTTT TCTTCTTCTT
CAGAGGTGAG GGGAAGATGT GGAGAGGAGA AATATTGATG ACGGGTTAAA GATTGTTTTA
GATCAGGGTG TGGAAGAGAA GGGGATATGA GGGAGATTTG TGTGAATATA TATATATATT
TTAGACTTAT CGGGAAATGC GATATTTGTG CTCAATATA
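
The sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be checked against the record. The following is a minimal sketch of that cleanup; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)

# First line of the gene sequence listing, copied as displayed.
first_line = "ATGACCCCCT CGCGCGACTC GGCACTCGCC TCCGCATCCG CCTCCCGCGC CCAGAGCTAC"

seq = clean_sequence(first_line)
print(len(seq))                      # number of bases on that line
print(round(gc_fraction(seq), 2))    # GC fraction of that line only
```

Passing the full listing through the same two functions gives values that can be compared with the Gene Length and GC content fields in the record.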
 
Protein sequence
MTPSRDSALA SASASRAQSY KGSPSINPRY STGNPASTPP LAANGVPPPR PNRAGTLPLD 
LSLDRDPSPQ PASARSPASQ LPPVLPSPAV SPGVFSPPTL GQPFAAPVGP APGNPYFPSA
TAAIEKGMED VKMSGPVGVG VPMGVVEPRE KELPREPGSA AMGGRSRSGT GRSSKDKKSM
FGFVSDLLGK DKPPVISKPY DPVHVTHVGF DFQTGKYTGM PPKWQQVLDD NGITQDEQER
NPNGVMAVVQ YLKHQDEGED EEEEVWAKMK NAQPPAFPPP SAAPSQPTTP GGGVGSREMS
REPSNEALGA AGTQVVDFTC PRMAPAPPTK PSLNRMLSER HAPASHRPAE LTTPAPLAAA
PRVTQAYSSA LSHSPHLPPP HPTSSAPPTA PAPHLDRSYS QRAPVSGTKT KVLDRANTTR
SPGSSAGIAQ GAGTGVGLTK SQSQSGHKSR DPSREPKDSA SSSGGLSRNQ TTTRQQQGAT
PRRREKEKKE NEDVIRQLRM ICTPGDPNLV YKNFRKIGQG ASGGVYTAID RQTLPVAIKQ
MNLEKQPKQD LIINEILVMR ESAHPNIVNF KDSYLWQGDL WVVMEYMEGG SLTDVVTAHC
MSEAQIASVS REVCEGLRHL HSKGVIHRDI KSDNILLSLN GDVKLTDFGF CARIADPTTT
KRTTMVGTPY WMAPEVVLRK EYGPNVDIWS LGILAIEMLE GEPPYLTENP VRALYLIATN
GTPKIKDWDK LSTVFRDYFK VTLQVDPAKR PTAAAILKHE FFKHTAPLIS LAPMIRSSRK
S