Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI01870 |
Symbol | |
ID | 3259382 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | - |
Start bp | 516126 |
End bp | 519254 |
Gene Length | 3129 bp |
Protein Length | 840 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258671 |
Product | conserved hypothetical protein |
Protein accession | XP_573000 |
Protein GI | 58271688 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGTCTCTCAA ACGACGCGAG AATCCCTGCG GCCCCAATGT TCTGCAGCGA TGCAGGGCCC AGACTTGCTC TTACGACACG TCCACTGTCT CCTATTGTTG CTGACACAGC TCGCACCAAA CAAGGAGCCC ACAGCTCGCC ATGCAGATCC CATCAGCCAT CCGCAGAACC ATCCCTTCTT TAAGAAGTGC ACATGAGAGC AACATCTTCC TCTCCTCTCT ATCGTTTTAC CCGTACAAAC GTGTCGTGAG TAGCATACTA ACTGCGCCAC CGACCATCAT CCGGCCATTC AAGTCAATCA CAACAATTAT TTACCTGCCT ACGGCCAACC ACAGGAGGAA GAGATAACGA GATGGCGCCC AGAGACAGTT CGCGATTCTC TTTCAAGTCT CGCTTTTCCC TTTTATCCCT TCTCCCTGCC CGGCATCCAC CTCTTTTTGT CGATCCCCCT CGAATGCCAC CTACTCATCC GGGTCCACGG CACGAGACAA TCATTTCAGA TACGGTTTCG CCCGTCCACA GCGAGTATCC CGATCTGGGA ACACCCTTCC CGCCCTGCCG AAAAAAGAGC TATGACAGGA GTCCATTGGG GTGGGGACAC GGTGAAGAGG GGGAGCCAGT GGAGGAAAAG AAGAGGAGAG CAAAGAGACC ACCCCCGCTC AACCTGGAGA GAACGCATAT GATGTATCCA CCTTCATCCG TGGACGTCGT CATCGACCCT GGTACGCCGT TTAACCTTGA CTCTCCTCCG CCTCAGAAGC AAGTGCCTTC ATTACCCAAG AAGATCAAGC CGAGAAAGAA GCGGCAGTTG GTCGAAGACC CTTTCGAAGT AGCAGAGGTC GAGATAGGGC ATCGGTACCC ATCGTGGAAA GATGGCAAGG CCGACATTCG TCCAGGACAG GTCATTCCCC CTGGGCTTAT GCCTTCTCTG GTTGACTTGG ACGTCAAGCC CAAGCCCAAC CGTCGTGGCA CGGCGGTTGA CACGCCTGAC AATTATGAAT CGGTTCTTCA CAATGTGCTT CTCACCCCCA CATACATCGT TCCTTCGCCA CAAGTCAATT CCACACCGAC GCCTTCTCCT TACGATCCTT CCGAGTTCAT CGAAAAGTAC GACCGCCCAC GCACTCTGCT GGACAGAGCA ACAGACACTA TCTCAAACGC TGCTAAACGA ACTTCAAAAT GGATGCCTGG CAAGAGTATC CTTCGTAATG GTTCCAGCGG TGACGATGGA GCTACAAACA AGTCACTGAG GGCATTGAAA GAGAGAGAAC TTCAGGAGAT GGCTCGCTTT CGTCAGAACG CTAGGCCAGG TGTGCGGTTA GCCGTACCTG GCGAGGCGGC ATATTCGAGT GGTTCAAGCC CGGCGAGAGA GATGAGCAAC ATCTATTCAA GAAAAAGTCC TGGCTGGCTT GGTTCAAGAG AAAGAATAAT AGGCTACGAT GGAGATGGAA AGTATCCTGT AGTGGTTGGT TCCAGTAACA ACGGATGGAG AGCGGGGAAA AGGGATGAAG AGCACGCAAA GAGGAATAAG AGAATATGGA AGGTGTGTAT AATGTTTCTT CCAGCATGAT GAGGTTCTTA AAGCTGATTC ATGAATATCC TAGATTACAA TTATTGTTGC GATCTTGTTC CTTACTGCTT TGACGATTGG CCTTTGCACT TCACTCCTTC GCAAGTCCTC ATCTTCATCT TTGGACACTT CATCATCTAG CGACGCCGCC AGTGCGGATA ACAGTTCTGG ATCCGCAACA GTCACTTCTT CAGCAGCTTC TGCCTCTTCA ACGTCATCTC AAACACTCAG TACCTGCCTT AACCTCTTCA CTTCGTCCGC TCCCACATCT CCAGCTTCTT ACCCTTGTTC CGACTGTGTG CAGGTCCTTC AGTCTACTAC TAATGATTTT TCTGAACCGA TGGTTAACGG CAATTCCACT GGTGTGGGGT CCGCACTTCA GTTCTGTGCC ATGATGGACA TCTACAGTCA GATTGAAAAT ACTAGCCAGC TAAGTAAGTG GGGTGAAGAT GCGAGTCCTT GTGGATGGGA TGGGATTGGG TGTGACTCCA GGGGAAGGAT AACAAGTTTG TCTCTGCAAT ATCCCAACGT ACCCACAGCG CTTCCAGATA CTCTCGGAAA TGTTTATGCT TTGAAAGCAC TACATCTTCT TGGTAATTCA TCTGTACCTA GTGAGTTGTG TTCATCTGTC AATCTCAAGA ACTTTCTAAA CAGTTTTATC ACTTTTTAGC TGGTGATTTT CCAAGCTCTT TACTATCTCT TCCCTATTTT CAGACATTGG ACCTTGAGTA CACTGCGATC ACCGGCAGGA TCGATACGGC ACCCTTCAAC TCTGCAACAG GGTTGGTCAC TTTGATGCTG GTCAGCAATT CCCAGCTTGG CACATCTATG CCTGACCTGT CATCCAACAC CAAACTCGTC ACTGCCGCTG TCACAGGACA AGGGTTGACC GACGCCGAAG CGGACAAATT GCCCTCTTCA TTGACCTACC TGTGAGTAAA CTTTATATCT CGAAAAGGCG AAATTCTAAC AGGTCATCAC AGCGATTTGT CTTACAACTC GCTCAGCGGT CAGGTACCTT CGTTCAGCCA ACTTGCGTCC CTCAAAACTT TGTATCTTCA AAACAACGAT TTCACTTCTG CCCCCGATTC CATTCCATCG TCCCTGACTA CCATGTCTTT CACGTCGAAT TACCGGCTAT CCGGCACCAT GCCATCTTCA GTGTGTTCGA GCACCGTGCT CACATCATGT GATCTTCGAA GCACGAATCT GACTGCGGGC ACGACATCGT CAAGCAGTCG TTCGAGCTCG AGCACCTCCC TGAGTTTTGT AGCTGCCAGC AGTACCATCA CAAGTATAGC CTCAACTTCA AGCTCAAGTG TTAAAGTCAG CGGCAGTTCC ATGTCAAGCG CTAGCAGCAG CAGCGGTAGC AACTCAGCTG CCAGAGATAC AAACGTGACG AGCAGCACAA TGATAGGAGT GTCAGCTAGG GCGAGTACAG AAGAAGGGAC ATGCGGGGTT TGTCAATTTA ATTAACCAAA AATTCGTATT CTCGAACTGC ATGATTTACA TATCTATAAC CAGCACATAC ATGGTGGAGC GCTCTTATCT ATCTATTGG
|
Protein sequence | MAPRDSSRFS FKSRFSLLSL LPARHPPLFV DPPRMPPTHP GPRHETIISD TVSPVHSEYP DLGTPFPPCR KKSYDRSPLG WGHGEEGEPV EEKKRRAKRP PPLNLERTHM MYPPSSVDVV IDPGTPFNLD SPPPQKQVPS LPKKIKPRKK RQLVEDPFEV AEVEIGHRYP SWKDGKADIR PGQVIPPGLM PSLVDLDVKP KPNRRGTAVD TPDNYESVLH NVLLTPTYIV PSPQVNSTPT PSPYDPSEFI EKYDRPRTLL DRATDTISNA AKRTSKWMPG KSILRNGSSG DDGATNKSLR ALKERELQEM ARFRQNARPG VRLAVPGEAA YSSGSSPARE MSNIYSRKSP GWLGSRERII GYDGDGKYPV VVGSSNNGWR AGKRDEEHAK RNKRIWKITI IVAILFLTAL TIGLCTSLLR KSSSSSLDTS SSSDAASADN SSGSATVTSS AASASSTSSQ TLSTCLNLFT SSAPTSPASY PCSDCVQVLQ STTNDFSEPM VNGNSTGVGS ALQFCAMMDI YSQIENTSQL SKWGEDASPC GWDGIGCDSR GRITSLSLQY PNVPTALPDT LGNVYALKAL HLLGNSSVPT GDFPSSLLSL PYFQTLDLEY TAITGRIDTA PFNSATGLVT LMLVSNSQLG TSMPDLSSNT KLVTAAVTGQ GLTDAEADKL PSSLTYLDLS YNSLSGQVPS FSQLASLKTL YLQNNDFTSA PDSIPSSLTT MSFTSNYRLS GTMPSSVCSS TVLTSCDLRS TNLTAGTTSS SSRSSSSTSL SFVAASSTIT SIASTSSSSV KVSGSSMSSA SSSSGSNSAA RDTNVTSSTM IGVSARASTE EGTCGVCQFN
|
| |