Gene CNI01850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI01850 
Symbol 
ID3259592 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp512306 
End bp515187 
Gene Length2882 bp 
Protein Length815 aa 
Translation table 
GC content50% 
IMG OID638258669 
Productconserved hypothetical protein 
Protein accessionXP_572845 
Protein GI58271378 
COG category[R] General function prediction only 
COG ID[COG1287] Uncharacterized membrane protein, required for N-linked glycosylation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACCCA AGCCAGCCAA GACATCTTCC AAGACTGCGA AGTTGCCGAC TCCTGCAGCC 
ACTCCTACCC GTCAACCTTC GGTCGATACC CTCTCAAAAC CGGCGGTGGT TGACACGCCT
CCGACGCCGG CATCTCCTGC CCAGGTCAGC GCTCCAGTTC CTCCCAAGAA GTTTGCATAT
CCCATCCCAT CACGCCTCTC TCCGTCGACT ATCAACAACA CGGAAAGCTT GTTGCGCTTC
ATCATCCTGG CGTTAATATG CGGTGCTGCG ATTGGAAGTC GATTGTTTGC GGTGATCAGA
TTCGAATCTG TCATCCACGA ATTGTGAGTC AAATCTATCC GCAGGATTCT CTCAATGCTA
ACGTCTGTGT CTTCATTTCA GTGACCCCTG GTTCAACTAG TAAGGCCATT CTTCGAGGTC
AAACTTCCCT ACTGATTTAT TCCTAGCCGA GCCTCGAAAG TTCTTGTTAA CAAGGGTTTC
TACGAGTTCT GGAACTGGTT TGACCCCTCC GCTTGGTACC CTCTCGGCAG AACTGTCGGT
ACCACGCTCT ATCCTGGCTT GATGGTCACG TCTGGACTGA TTTGGCATGC TCTTCGGGCA
ATCAATATGC CCGTGGACAT TCGCAATGTC TGTGTCCTCC TTGCACCTGG ATTTTCTGGA
TTGACTGCCT GGGCGACTTA TCTGTCAGTG TAGCAAATTA TCTGATCATT GCATACACTA
ACATGAGAGC GTTGTAGTTT CACCACTGAA ATGTCTACAC CATCAGCTGG TCTATTGGCG
GCCGCTTTCA TTGGCATTGT ACCCGGATAC ATCTCTCGAT CTGTCGCCGG TTCTTATGAC
AACGAAGCCA TTGCCATCTT CCTCTTGATG AGCTCCTTCT ACTCTTGGAT TAAGGCCGTC
AAAACCGGTA GCTCATTTTG GGGTATGATC ACTGCCTTGT TCTACGGGTG GATGGTCGCT
GCATGGGGTG GTTACGTTTT CATCACCAAC AGTATGTCGC TCGGCCCTCA ATTGAATTGT
CTATTTACTC TTTTGCAGTG ATTCCATTGC ACGCCTTTGT TCTCATTGTC ATGGGCAGGT
TCAACAACCG GCTTTATACC GCTTACTCTT CCTGGTATGT CATTGGAACT ATCGCCTCCA
TGCAGGTCCC CTTTGTGGAG TTCCTCCCCA TCCGAACCTC TGAGCACATG GCGGCCTTGG
GTGTTTTCGG TCTTGTACAG CTGATCGGAT TCGTCGAAGT CGTCCGACGA CTCGTGCCTG
GCAAGCAATT CCAGCTCCTT CTCAAAGCTT TTGTCGTGGC CGTATTCTGC CTCAGTTTTG
CTGCCCTCGT CACTTTGACT TTCTCTGGAT GGATCGCCCC CTTCGCCGGA AGATTTTATT
CTCTTTGGGA TACTGGCTAT GCGAAGGTCC ACAGTGAGTC AAATGCCATA CCTTCCGGGA
TCTATATTTA TTGTCAACAT AGTGCCCATT ATTGCCTCCG TCTCCGAACA CCAGCCCACC
GCTTGGCCCT CATTCTACTT TGACCTCGAA ATGCTTATCT TCTTTTTCCC TGCCGGTGTC
TTCTGGTGTT TCAAGGAGCT TCGCGATGAG CAGATCTTCA TCATCATTTA TGCCGTTCTC
AGTGCCTATT TTGCCGGTGT CATGGTTCGA CTTATGCTTG TCATCACGCC TGTTGTCTGT
GTTTCCTCCG CCATTGCGTT CTCCAAACTT CTCGAGGCGT ATATTGACCC CGTCATCCCC
GAAAGCGACG AGGAAGCTGG CGAGTCTCAG ACGCAGGTTG TCTCCAAGTC CAAGGCGAAG
AAGATGGCCG CTGCCAACGC CAATAAGAGC GGGTTCTCTT TCACAGGTAT TTTGAGCGGC
AAGTCTGTCT CCGGCATCTT TGGTCTCGAC ACTCGATTTG CTGTGGTTTC CATTCTCTCT
GTCTTCCTCT TCATCTTTGT CCTTCACTGC ACATATGTGA CTTCAACAGC GTATTCTTCG
CCTTCAGTGG TACTTGCATC GCGAAACCCG GATGGTAGCC AAAATATCAT TGATGATTTC
CGAGAGGCTT ACTACTGGAT TCGCCAAAAC ACCGCCGAAG ACAGCGTCAT CATGTCCTGG
TGGGATTACG GCTACCAGAT CGCTGGTATG GCTGATCGCC CCACCCTTGT TGATAACAAT
ACCTGGAATA ACACCCACAT TGCCACAGTT GGTAAGGCCA TGGCTTCCAA CGAAGATGTC
GCATATCCTA TCTTGAGGAA GCATGATGTC GATTACGTTC TTGTGATCTT TGGGGGCTTA
TTGGGCTACT CTGGTGACGA TATCAACAAG TTTTTGTGGA TGGTTAGGAT CTCACAAGGT
GAATGGCCTG ACGAGGTGCA GGAAGTCAAC TACTTTACTC AAAGAGGGGA GTATGCTGTC
GATGACAGGG CGTGCGTCTA TTTTTGTATT ATTGCCTGGC GAAATTTGCT GACAATCTGG
GCAGCACCCC TACTATGAAG AACTCTCTCA TGTACAAAAT GTCTTACTAC CGGTACGTCA
AAATACATTG TCGATAGATG AACATCGTCT GACATATCTC AGCTTCCCCG AGCTTTATGG
TGGACACCCG GCTCAAGACA GGGTTCGAGG CCAAATTATC CCTCCTAACA GTGTTACTCT
TGATACCCTT GGTAAGTTTA GGGTTTGAAG ATAAGTGTAT AAGACGCTAA TGTCGTCACA
TTCCTGCAGA CGAAGCGTTC ACATCCGAAA ATTGGATCGT CAGGATCTAC AAGGTCAAGA
AGGAAGATCC CATTGGACGA GACCACAAGG CCGTTACTGC CTGGAACGGG GGTAAGAAGT
TGAAGAAGAG TCCTAGTGCC AGTGAGGGCG TGAAGCGGGG CAGCGGGAGA CCTAGCATGT
GA
 
Protein sequence
MAPKPAKTSS KTAKLPTPAA TPTRQPSVDT LSKPAVVDTP PTPASPAQVS APVPPKKFAY 
PIPSRLSPST INNTESLLRF IILALICGAA IGSRLFAVIR FESVIHEFRA SKVLVNKGFY
EFWNWFDPSA WYPLGRTVGT TLYPGLMVTS GLIWHALRAI NMPVDIRNVC VLLAPGFSGL
TAWATYLFTT EMSTPSAGLL AAAFIGIVPG YISRSVAGSY DNEAIAIFLL MSSFYSWIKA
VKTGSSFWGM ITALFYGWMV AAWGGYVFIT NMIPLHAFVL IVMGRFNNRL YTAYSSWYVI
GTIASMQVPF VEFLPIRTSE HMAALGVFGL VQLIGFVEVV RRLVPGKQFQ LLLKAFVVAV
FCLSFAALVT LTFSGWIAPF AGRFYSLWDT GYAKVHMPII ASVSEHQPTA WPSFYFDLEM
LIFFFPAGVF WCFKELRDEQ IFIIIYAVLS AYFAGVMVRL MLVITPVVCV SSAIAFSKLL
EAYIDPVIPE SDEEAGESQT QVVSKSKAKK MAAANANKSG FSFTGILSGK SVSGIFGLDT
RFAVVSILSV FLFIFVLHCT YVTSTAYSSP SVVLASRNPD GSQNIIDDFR EAYYWIRQNT
AEDSVIMSWW DYGYQIAGMA DRPTLVDNNT WNNTHIATVG KAMASNEDVA YPILRKHDVD
YVLVIFGGLL GYSGDDINKF LWMVRISQGE WPDEVQEVNY FTQRGEYAVD DRATPTMKNS
LMYKMSYYRF PELYGGHPAQ DRVRGQIIPP NSVTLDTLDE AFTSENWIVR IYKVKKEDPI
GRDHKAVTAW NGGKKLKKSP SASEGVKRGS GRPSM