Gene CNM01800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM01800 
Symbol 
ID3255166 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp547367 
End bp549666 
Gene Length2300 bp 
Protein Length480 aa 
Translation table 
GC content52% 
IMG OID638254334 
Productcytoplasm protein, putative 
Protein accessionXP_568308 
Protein GI58261796 
COG category[S] Function unknown 
COG ID[COG1723] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.389549 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCCCCCCCA CGATGCCGTC CACGGCCCAG AACACGCTGC CCCTGCCGTA TGCGCAGGTG 
GCGGCACAGC CAAGGCGAGA GCAGAGACAG CCCACACGGT GAGCAGCGAG TCCAGCAGCC
AGAGAAACCC GCTCACGCCC CGCAGAACAT CCAAGCTCGG CAGCAGTACG TCCCAGCACA
CACCCCTATG CATCGCCCAC TGACATGTCT GCCAGAGCTC AAGGTCCTCC CGACCCAGCC
AGAGACCCCC ACCATCCCAG AGGAGGAAGA GGACGACGAT GGGACAGGCC GCGCACTTGC
CGATCACGAC GAGTCTGAAG GCGTCGAGGT GCGTCTGCTT GGCTTTGCAG AACGGCCATG
CATGCTGTCG ACGATCGCAG GCGCTGACAG CAGAGACGCA GTTCTACACG CCCATCTCCC
AGATCCCAAA GGGCACAGCC CGTCGAGACG CCCAGCGTCT CACCAAGTCC GAAAAAGCCA
AGCTCCCGCG GGTGACAGCG TACTGTACCG CAGCGTGCGT CCATACTCCT GCCCGCTGCC
CAGATTCAAC GTCCTAACGA CTCGACTGAC TTGTAGCACG TACAACCTGC AGGCGATGCA
AGCCTACCTC GCCTCCCGCC CGGCATACCA CCGCACCCAC CCGCGCATGT TCGACACCGA
ATGTCTACAC ACGCCATACC TGCCCCCTCC CACCCTGGGC CCGCACGGCA TGTCAGCCCA
TAGAAATTCT CCGAGGCTGA AACCTGTCTC TGGAGCTGGT CATGTACCAG AAGGGGATTT
GCTCAATTTA GGGAATGACT ATTCATCAAC GGCTCATAAA CGTGCCAGCT CGCCAAGCCG
ATCTAACCAG AACCAAAACC AAAACGAGGT AAAGCGCCAA CCGGGGTTCT CTAAACGACC
CGGTAGCGGG CGCAAGAAAT CCTCTTCTGG GTCAACGTCC AAAGACACCG GCGCGGATGG
GATGACGGAT AGTGAGAGGG AAGAGGATGA TGACTTTGAA GACGAATGGA TCCCGGACGT
GTTCCTGTTC GAGTATGGGT GCGTCGTGCT TTGGGGAATG ACGGAAAGGG AGGAGAAAAA
GTTCTTGGCT AGCATGTGTG TCTTTTAATG TGCTATTCAT AGGACGTACG AGCTGACGAG
GATAACAGAA AGAGGTTTGA AATTGAGAGA TTATCAGCCG AAGACGTCGA GATGGAGGAC
CTCAATTTTT ACTATGCCGA TTACTCTCGG TACGTCGCTT TCCACTCCAT ACACCTCTCC
CAAATCACAA ACGCTGAGAC TTTTGCCAAC TTGTTTCAAG TATATACAAC GACGTAATCA
CGTTGCGTAA AGGATCTTCC TACATGTACG TCCTCTCCCC TAATGCCTTT TTTTTTTACC
TCTCGCATAC CTGCTTCGAC TAACACGATT CTACATGATG ACAGGACAAA ACTCTCCCTT
TCACATGCGC TCTCCCAATC CGTCAAGATA TCACTATTCG AAGAACTCAT CATGGGTACG
ATCGAGCAAA CGAAAGATAT CCCCAAGAGC CTTTCCGAAA CTGGAAAAAT TGGCGTGCGT
CCCTTCCCCA TCTTGGGTTA TCATCAAGTT TGCAAGTTTA CTAATCGTCT TCTTCCTTTC
CTTTTCTTCT CTGTATGATG ATTGTAGTTG CCAAGAAGTG AGATTATGAA GCAGATCGGA
AACCTTTTCA TTCTGCGTAT CAATATCAAC CTCGTCGGGT CTATCCTCGA TTCTCCCGTA
AGCCGTTTTC TTCCCCCCTC CCTCTTTTGC ACCCCACTAA TCCCTGCCTT TTTTTTCCCT
CTCCCTCTAC CTCCCCCCCC GATCTGAAAC GACAGGAATT CTTCTGGACA TTCCCCGACC
TCGAACCACT CTACAACGCC GCCCGGTCAT ACCTCGAAAT CGGCCAACGT GTCGAACTCT
TAAACGCCCG TGTAGATGTT TTGCAGGATA TGCTCAAGTT GTTGAAGGAG AGTGTGAATT
CGAGTCATGG AGAGAGGTTG GAGGCTATTG TCATTTTCTT AATGTCCGTC CTTACCCAAC
ACGACTTGTT ACTGATGCTG ACGAATGGAA ATGTCCAGTG GAATTGAGAT TGTCCTCGGT
ATCATCACCA TTCTTGTCGA TCTCAGTTTC TCGTAGAAAT CCCGACGACT CAAAAAATCG
ACATCGAGTC GAAACTGAGT CGAGATCAAG ATTGCGTCAA GAAGCCCAGA TCTGTAACAA
ATGAAAAGGT TTGAGCAAGG ACGGACAGCT AGGGAAGGGG CAGTGTAAAT GCAGTTAAAT
AGATGCTGAT ACAGCGAAGC
 
Protein sequence
MPSTAQNTLP LPYAQVAAQP RREQRQPTRT SKLGSKLKVL PTQPETPTIP EEEEDDDGTG 
RALADHDESE GVEFYTPISQ IPKGTARRDA QRLTKSEKAK LPRVTAYCTA ATYNLQAMQA
YLASRPAYHR THPRMFDTEC LHTPYLPPPT LGPHGMSAHR NSPRLKPVSG AGHVPEGDLL
NLGNDYSSTA HKRASSPSRS NQNQNQNEVK RQPGFSKRPG SGRKKSSSGS TSKDTGADGM
TDSEREEDDD FEDEWIPDVF LFEYGCVVLW GMTEREEKKF LASIKRFEIE RLSAEDVEME
DLNFYYADYS RIYNDVITLR KGSSYMTKLS LSHALSQSVK ISLFEELIMG TIEQTKDIPK
SLSETGKIGL PRSEIMKQIG NLFILRININ LVGSILDSPE FFWTFPDLEP LYNAARSYLE
IGQRVELLNA RVDVLQDMLK LLKESVNSSH GERLEAIVIF LIGIEIVLGI ITILVDLSFS