Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC00570 |
Symbol | |
ID | 3256477 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 159150 |
End bp | 162208 |
Gene Length | 3059 bp |
Protein Length | 749 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255275 |
Product | conserved hypothetical protein |
Protein accession | XP_569346 |
Protein GI | 58264380 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTAAAGTCA AGAATCCTCA TTTCAGGATG TACAACGGGC AACGTGAGTC TTTGGCGAGC GATGAGAGCG TACGGCTAAC CGAACGCAGC TCCCTTCCCA GGCAATGGCC TCCCCCCTTT CCCTCCTCAG ATGGTTCCTA ACGGCACTAT GCCTGCTGGC TTTCCGCCTT TCGCTCCTCC TTTTCCACCT CCTTTTCCAG GTATGCGACC ACCTTCAGGT CCAGGAATAC CTTCTCCAGC GACTCCAGGT ATGATGCCTT CTCCAGCGCC TATGTCAGGT ATGCAGACTC CAGGTATGGC TTTCCGACCA CCTCCAATGG GATATGCTCC TCGACCAGCC CACCCTGGTC TGGGATCGAC TCCTCATGGT TTGCCGCAGC CTCCGCACAT GCCTCCTAGT ACACCAAAAC CGGATGTAAA GACTACAAAG GTATTTGTCG GTGGCATCGC TCCAGGAATA ACAGATGAGA CTTTGGAGAG TTTGCTTAAT GTACGTCCAT ATCACTTCTC TTTCGGTCAA ATACTCATTT AACTAGGCAT GTGGACCTTT GCATGAACTT AAACGGGTTA TCGGTGCGAG TGGCAAGCCT CAAGCATTCG GGTTTGCGAT GTTTGAAAAC CCCGAAGTCG TCTTGCGTTG TATCAGATGT TTAAACGGTG TCGAATTACC CGACATGACG CCAGAGGGTA GAAGAGACAG GAAGCCAGCG AAGAAACTGA TTGTGAAAGC GGATGAGAAA ACCCAGGCAT TTTTGGAAGA ATTTGAATCG ACTTTGGGTC GAAGCGATGT AAGTCTTTCG TGCTTCTTAC AGGTCACGCT GACAGTACAA AAAGTCGGAC GAAGAAGCGG ACGCATCAAG CCGGAAATCA ATTCAACACA TCATCGCTCT GCTCACAGAT CCCAACGCCC AACATCCCGA CGGCCCTTCG GCTGGTAACA ACGGTGGCCA ATCTCCTGTA CAAGTGGTCG TCCCAGCTCA TCTCCAAGAT CTCCAAGAAG GTGACCTTCC TGAAGAACAA CGTGTAGCCG TCCTTGGTCA GATCGCAGTC TTCCGAGAGA ATGCGGCAAA GAGAGAGAGG GAGAAAAAGT TGATGGAGGA GGAAAAAGAG AGGTACAAAG CAATGCAGAG TCAGGGCGGT GGGCAGCGAC AGGCGCCGAG TGGGTATGGT TATGGTAACC GTGGTTTGGC CAAACAGCAA CAGCAAGCGG AGAGACAGTG GGGATCACAA TCCGCGAGCC CAGTACCGTG GAAAGGTGCT GGTCCAAGCC GGTATGGTCC CGGAGAGAGG GATCCTCAGG CTTACGATAA GCCAGTTTCA TTTGTAAAAG CTGAGACGAC GGAAGGTAAG GCGGAAAGTG GACGAACTGA TGAAGAGGAG GAAGAGCTCA GGAGGAGGAA AATTCAGATG GATAAGGATA TTGCTTTGCG TGATGTACGT TCATTGTTTT TTTCTTCATT GCTAAACACC GTTGACCAAT TGACGATAGG CCGAGCGAAG AGTAGAAGCT CGTGAAAGGA CACGACTCGA CGCGCTATCC CGCGAAATGA ACTACCGTCA GACCCAAAAA GATCTCGTGG CCCGCTCCCG TGCCCGACAA GAAGAGTTGT ACGCCCGTTA CGACGATGAT GAGTATATTG AACGATCCGA GCGTGAGCGA GATCGACGAG ATCGACCTGA TAGGGAAAGG GATGTTCGCG AGCTGTTCTA CGTTGATCGC CCTCAGTGGC GAGCGAGGAG ACAAAAATTT AGGCAGAACG AATATCAAGC GGATCTGCGA GATCGTCAGC ATGAAGAGGA CGAGCGTCAA GCCCTTGAGC GCGAGTCGGA AGAGTTCTTG AAGAAGCAGA TGGCTGAACT CGCTGCTCTT GAGCAATCTC AGCGTGCTCA AGGTCTCCTC ACTGAGGATG CTGCACCTAT CAAGCTCGCA ATCAATCCTT CTGCCCTTCC ACCTCCTCCT CCGAAGGAAG AGAAGAAACA TGTCGTTGCG CCCAGGCCGG GGGTCCAGTT TGAGGGTGAG GATGATGATG AGGAAAGTGG TAGGAAGAAG AAGGGAACGT TTGTGAGGTT GGATGAAGAA GAGGAAGGTG ATGGGTTAAA TGAGGCTGAG AAGAGGGCAC GGAGGAATGC CAGGTTGTTG GATGTCAAGA AGGGAGTTCC CAATGACAGG AGAAGTATCT GGAAGTTCCC CGTTGAGTGG GCTGCTGTTG GTGAAGTGCG TTCTCGATAC AAATACATGT TCGAGGAATC AGCTGATATT TTTATCCGGA AAACAGACTC TAATACAAAA CAAGATCAAA CCGTTCGTTC ACGAAAAGAT TCGAAACTTC CTTGGCGAGT TGGATGAAGA TCTCGCAGAC TTCGTCCTTG AGCATCTTCG TGACCGCAAG GGCGCTGATG ACCTTGTTGA CGGTCTTGAG CCTGTAAGCT TCCTCTACTT GCACGGCCAT TTGATTAAAA TCGCTGTACT AATTTGTCAT CGTTTCAGAT CTTGGCTGAA GACGCGGAAC CCTTCGTCCT TCAATTATGG CGCCAACTTA TTTTCGAAAG TTTGGCATTC AGAGAGGGTA TTGACACTGG TTCTATGATG ATCTAGAAGA AAATCACTCA ACAATATATG AACCTGTGTG GGAGCTGGGA AGGAGGATGT ACGAATGGAT TTTTGCGAAA GGGATGATGA AAAGTGGATT AAGGGTATGT CAATGCATGA CGTGAAAAAG GAATTTTGAT TACGATTTCA CATTGAGGAG CGCCTAGCGA ACAAATTGAG CATGGTCATA CGGGATGAGC GCTTGTGGAT TGATATTGGG ACAAGGTTGT TAACAAAGAC GGGCGAGTAC ATCCCCCTTC CTATACTTCT GTCTGGGTTG GGCGTATGTG AAGGAAAGCC TTGAAATGAT AAATTGATTT TGTTCCAGCT TACTCAAAGT TTTATGCTCA GAAAAGATCC TTGTGGCGAC GTGAAACAGT TCATTGAGAC TGTAAGTGGT CCTGCTATCT GAGACGTGTC GTGTGTCAAT CAGTGGGCGA TCTCTTTGAC ATGTATTTTC ATAGTCACT
|
Protein sequence | MYNGQPPFPG NGLPPFPPQM VPNGTMPAGF PPFAPPFPPP FPGMRPPSGP GIPSPATPGM MPSPAPMSGM QTPGMAFRPP PMGYAPRPAH PGLGSTPHGL PQPPHMPPST PKPDVKTTKV FVGGIAPGIT DETLESLLNA CGPLHELKRV IGASGKPQAF GFAMFENPEV VLRCIRCLNG VELPDMTPEG RRDRKPAKKL IVKADEKTQA FLEEFESTLG RSDSDEEADA SSRKSIQHII ALLTDPNAQH PDGPSAGNNG GQSPVQVVVP AHLQDLQEGD LPEEQRVAVL GQIAVFRENA AKREREKKLM EEEKERYKAM QSQGGGQRQA PSGYGYGNRG LAKQQQQAER QWGSQSASPV PWKGAGPSRY GPGERDPQAY DKPVSFVKAE TTEGKAESGR TDEEEEELRR RKIQMDKDIA LRDAERRVEA RERTRLDALS REMNYRQTQK DLVARSRARQ EELYARYDDD EYIERSERER DRRDRPDRER DVRELFYVDR PQWRARRQKF RQNEYQADLR DRQHEEDERQ ALERESEEFL KKQMAELAAL EQSQRAQGLL TEDAAPIKLA INPSALPPPP PKEEKKHVVA PRPGVQFEGE DDDEESGRKK KGTFVRLDEE EEGDGLNEAE KRARRNARLL DVKKGVPNDR RSIWKFPVEW AAVGETLIQN KIKPFVHEKI RNFLGELDED LADFVLEHLR DRKGADDLVD GLEPILAEDA EPFVLQLWRQ LIFESLAFRE GIDTGSMMI
|
| |