Gene CNI01870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI01870 
Symbol 
ID3259382 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp516126 
End bp519254 
Gene Length3129 bp 
Protein Length840 aa 
Translation table 
GC content50% 
IMG OID638258671 
Productconserved hypothetical protein 
Protein accessionXP_573000 
Protein GI58271688 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGTCTCTCAA ACGACGCGAG AATCCCTGCG GCCCCAATGT TCTGCAGCGA TGCAGGGCCC 
AGACTTGCTC TTACGACACG TCCACTGTCT CCTATTGTTG CTGACACAGC TCGCACCAAA
CAAGGAGCCC ACAGCTCGCC ATGCAGATCC CATCAGCCAT CCGCAGAACC ATCCCTTCTT
TAAGAAGTGC ACATGAGAGC AACATCTTCC TCTCCTCTCT ATCGTTTTAC CCGTACAAAC
GTGTCGTGAG TAGCATACTA ACTGCGCCAC CGACCATCAT CCGGCCATTC AAGTCAATCA
CAACAATTAT TTACCTGCCT ACGGCCAACC ACAGGAGGAA GAGATAACGA GATGGCGCCC
AGAGACAGTT CGCGATTCTC TTTCAAGTCT CGCTTTTCCC TTTTATCCCT TCTCCCTGCC
CGGCATCCAC CTCTTTTTGT CGATCCCCCT CGAATGCCAC CTACTCATCC GGGTCCACGG
CACGAGACAA TCATTTCAGA TACGGTTTCG CCCGTCCACA GCGAGTATCC CGATCTGGGA
ACACCCTTCC CGCCCTGCCG AAAAAAGAGC TATGACAGGA GTCCATTGGG GTGGGGACAC
GGTGAAGAGG GGGAGCCAGT GGAGGAAAAG AAGAGGAGAG CAAAGAGACC ACCCCCGCTC
AACCTGGAGA GAACGCATAT GATGTATCCA CCTTCATCCG TGGACGTCGT CATCGACCCT
GGTACGCCGT TTAACCTTGA CTCTCCTCCG CCTCAGAAGC AAGTGCCTTC ATTACCCAAG
AAGATCAAGC CGAGAAAGAA GCGGCAGTTG GTCGAAGACC CTTTCGAAGT AGCAGAGGTC
GAGATAGGGC ATCGGTACCC ATCGTGGAAA GATGGCAAGG CCGACATTCG TCCAGGACAG
GTCATTCCCC CTGGGCTTAT GCCTTCTCTG GTTGACTTGG ACGTCAAGCC CAAGCCCAAC
CGTCGTGGCA CGGCGGTTGA CACGCCTGAC AATTATGAAT CGGTTCTTCA CAATGTGCTT
CTCACCCCCA CATACATCGT TCCTTCGCCA CAAGTCAATT CCACACCGAC GCCTTCTCCT
TACGATCCTT CCGAGTTCAT CGAAAAGTAC GACCGCCCAC GCACTCTGCT GGACAGAGCA
ACAGACACTA TCTCAAACGC TGCTAAACGA ACTTCAAAAT GGATGCCTGG CAAGAGTATC
CTTCGTAATG GTTCCAGCGG TGACGATGGA GCTACAAACA AGTCACTGAG GGCATTGAAA
GAGAGAGAAC TTCAGGAGAT GGCTCGCTTT CGTCAGAACG CTAGGCCAGG TGTGCGGTTA
GCCGTACCTG GCGAGGCGGC ATATTCGAGT GGTTCAAGCC CGGCGAGAGA GATGAGCAAC
ATCTATTCAA GAAAAAGTCC TGGCTGGCTT GGTTCAAGAG AAAGAATAAT AGGCTACGAT
GGAGATGGAA AGTATCCTGT AGTGGTTGGT TCCAGTAACA ACGGATGGAG AGCGGGGAAA
AGGGATGAAG AGCACGCAAA GAGGAATAAG AGAATATGGA AGGTGTGTAT AATGTTTCTT
CCAGCATGAT GAGGTTCTTA AAGCTGATTC ATGAATATCC TAGATTACAA TTATTGTTGC
GATCTTGTTC CTTACTGCTT TGACGATTGG CCTTTGCACT TCACTCCTTC GCAAGTCCTC
ATCTTCATCT TTGGACACTT CATCATCTAG CGACGCCGCC AGTGCGGATA ACAGTTCTGG
ATCCGCAACA GTCACTTCTT CAGCAGCTTC TGCCTCTTCA ACGTCATCTC AAACACTCAG
TACCTGCCTT AACCTCTTCA CTTCGTCCGC TCCCACATCT CCAGCTTCTT ACCCTTGTTC
CGACTGTGTG CAGGTCCTTC AGTCTACTAC TAATGATTTT TCTGAACCGA TGGTTAACGG
CAATTCCACT GGTGTGGGGT CCGCACTTCA GTTCTGTGCC ATGATGGACA TCTACAGTCA
GATTGAAAAT ACTAGCCAGC TAAGTAAGTG GGGTGAAGAT GCGAGTCCTT GTGGATGGGA
TGGGATTGGG TGTGACTCCA GGGGAAGGAT AACAAGTTTG TCTCTGCAAT ATCCCAACGT
ACCCACAGCG CTTCCAGATA CTCTCGGAAA TGTTTATGCT TTGAAAGCAC TACATCTTCT
TGGTAATTCA TCTGTACCTA GTGAGTTGTG TTCATCTGTC AATCTCAAGA ACTTTCTAAA
CAGTTTTATC ACTTTTTAGC TGGTGATTTT CCAAGCTCTT TACTATCTCT TCCCTATTTT
CAGACATTGG ACCTTGAGTA CACTGCGATC ACCGGCAGGA TCGATACGGC ACCCTTCAAC
TCTGCAACAG GGTTGGTCAC TTTGATGCTG GTCAGCAATT CCCAGCTTGG CACATCTATG
CCTGACCTGT CATCCAACAC CAAACTCGTC ACTGCCGCTG TCACAGGACA AGGGTTGACC
GACGCCGAAG CGGACAAATT GCCCTCTTCA TTGACCTACC TGTGAGTAAA CTTTATATCT
CGAAAAGGCG AAATTCTAAC AGGTCATCAC AGCGATTTGT CTTACAACTC GCTCAGCGGT
CAGGTACCTT CGTTCAGCCA ACTTGCGTCC CTCAAAACTT TGTATCTTCA AAACAACGAT
TTCACTTCTG CCCCCGATTC CATTCCATCG TCCCTGACTA CCATGTCTTT CACGTCGAAT
TACCGGCTAT CCGGCACCAT GCCATCTTCA GTGTGTTCGA GCACCGTGCT CACATCATGT
GATCTTCGAA GCACGAATCT GACTGCGGGC ACGACATCGT CAAGCAGTCG TTCGAGCTCG
AGCACCTCCC TGAGTTTTGT AGCTGCCAGC AGTACCATCA CAAGTATAGC CTCAACTTCA
AGCTCAAGTG TTAAAGTCAG CGGCAGTTCC ATGTCAAGCG CTAGCAGCAG CAGCGGTAGC
AACTCAGCTG CCAGAGATAC AAACGTGACG AGCAGCACAA TGATAGGAGT GTCAGCTAGG
GCGAGTACAG AAGAAGGGAC ATGCGGGGTT TGTCAATTTA ATTAACCAAA AATTCGTATT
CTCGAACTGC ATGATTTACA TATCTATAAC CAGCACATAC ATGGTGGAGC GCTCTTATCT
ATCTATTGG
 
Protein sequence
MAPRDSSRFS FKSRFSLLSL LPARHPPLFV DPPRMPPTHP GPRHETIISD TVSPVHSEYP 
DLGTPFPPCR KKSYDRSPLG WGHGEEGEPV EEKKRRAKRP PPLNLERTHM MYPPSSVDVV
IDPGTPFNLD SPPPQKQVPS LPKKIKPRKK RQLVEDPFEV AEVEIGHRYP SWKDGKADIR
PGQVIPPGLM PSLVDLDVKP KPNRRGTAVD TPDNYESVLH NVLLTPTYIV PSPQVNSTPT
PSPYDPSEFI EKYDRPRTLL DRATDTISNA AKRTSKWMPG KSILRNGSSG DDGATNKSLR
ALKERELQEM ARFRQNARPG VRLAVPGEAA YSSGSSPARE MSNIYSRKSP GWLGSRERII
GYDGDGKYPV VVGSSNNGWR AGKRDEEHAK RNKRIWKITI IVAILFLTAL TIGLCTSLLR
KSSSSSLDTS SSSDAASADN SSGSATVTSS AASASSTSSQ TLSTCLNLFT SSAPTSPASY
PCSDCVQVLQ STTNDFSEPM VNGNSTGVGS ALQFCAMMDI YSQIENTSQL SKWGEDASPC
GWDGIGCDSR GRITSLSLQY PNVPTALPDT LGNVYALKAL HLLGNSSVPT GDFPSSLLSL
PYFQTLDLEY TAITGRIDTA PFNSATGLVT LMLVSNSQLG TSMPDLSSNT KLVTAAVTGQ
GLTDAEADKL PSSLTYLDLS YNSLSGQVPS FSQLASLKTL YLQNNDFTSA PDSIPSSLTT
MSFTSNYRLS GTMPSSVCSS TVLTSCDLRS TNLTAGTTSS SSRSSSSTSL SFVAASSTIT
SIASTSSSSV KVSGSSMSSA SSSSGSNSAA RDTNVTSSTM IGVSARASTE EGTCGVCQFN