Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB02270 |
Symbol | |
ID | 3255595 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 657696 |
End bp | 660558 |
Gene Length | 2863 bp |
Protein Length | 799 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254878 |
Product | peroxisome targeting sequence binding protein, putative |
Protein accession | XP_569156 |
Protein GI | 58263492 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.655025 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTCCCACAC ATACCGCCAT GTCTGCTTTC CTATCGGGCG CTTCGGTCCA GTGTGGACCT ACATCTGCTC TCAAAAACGT CTCAGACAGG ATCAACGTTG ACCGTTCTCT ACAACAGGTG CGCTGATGAG TGTCGATCAC TTTGCAAGCT TATTTATAAT GAGCTAACTT ATCCCGTGGT AGGATCGACT AGCCTACACC TCGAATGCCT CTGGATCATC ATCAAAGGTG GGTATGGAAG TATGCATTAC ATCCTATATT GATAGCTTGC ATAGCAGCCA TTCAGAGCTC AACTTGCTCA GCTGCCAGTC AACCAGTCCC CGAAGCAAGT TCCAGGACCT CCATCAACTT TTGACCTCTC TTCACTGAGG CAACAGCTTT CTCCTGTTCC ATCAACTTCT CATGCCTCAG ATTGGGCCAG CGATTTCGTC CCTCTGACAG GACCGACAGC ATCGAAATAC ACCCGTTCTG TAGCTTCCAC AAAGAGCGGT TGGCAGGAAG AGTTTAGCCA GCATGTTGCA GCCGCGCCTC CACTTGGAAC CACAAGATCT CAGAAACCCC TATATAATGC TGGTCTTGCA CCATGGGAGG TTCCAGAAAC TCAATATCAA CTTCAGCGCC CGGCATTGAT GCGACCAAGC TTTCGGTCGC ATGACATTCC GATTTCAGAA GCAAGCCTGA CATTTCCTCG ACCTGATATC CATGCAGCGG CTCCCGTTAG TAAATATGAA GAGCAAATTG GTCAACCAAC CGGATCAGAA ACTCTGCCTC TTGATGAATC TCAACAATTA CTAGCTCGGA CGGCCCGGTC CTTTGTAAGT AATCTGGAGA CCCAGTCAGA CATTTTATCT GCCAATCCCA AATTTGCACA AAGTAAATTC TTGTCTCTTC TGAGAGGCTT AGGTGACGAA CAGGTTGTGG TCAAAGAAGG GCAAGAGGTA AAAGGTGAGG AGGTGGGTGA GGGTGCGACA TTCGTTGAGC GGAACATTGT TGGAAACAAT TGGGCGGAAG GTTTTGCCAA GCAAAAAGAG AAATCAATAC CGCAAGAAGC CCCCACCTTG GCTGAAGCTG AATACCTCGA ACGAAGATCG CCTTATCCTC CAGGTCAAAA AGAGTATCCA GCCCTGAATT CTTGGGTTCC GGCTTTGCCA ACGCACACAT TGGTTCCTCC CCAGACAGCT CCTCAAGCTG CCCGACTAGC CGCGAATAAT GGTGCTTTGT GGGATCAACA GTATCATGAT CAAGAAGCCC TCATACAATC TTCGGAATCG CCCGCACCCG AGCCAAGAAA GAATGTCCAC TTTGACGAAC ACCCCGCTTC TCGAGAAAGG AGTGGTGTAC CCAGCACTCT CGAGGAAGCC ATTTCGTCTC CTGGCAACAT CCCCGGCGCA GGTTGGGGTT GGAATGAACA AGGGCTAACT CACGACTTTG ATGAAGATGT TTTTGAAGAG TTCAATGGTC AACTCAGACG GGCCCAAGAA AGTTTGGAAG GCGGAGTGGG GAAGCAAGAA AGTTGGGATA GGCTGCAAAG TGATTGGGAG GAATTTCAAA GGACAGAACC TGGCGTTGCA CATTTCAGAG GCATGGGGAC TGGCGACCAG TCAGAAAGAT ACATGTTCCA AAGTCGAAAC CCTTATTCGA CCGATGAAGA GGAGCTATAT TTCGAAGTGT CTAGAGACTC ACCTACCTTA AAAGTGAGTT GCATATCCCC TTCTTCTGTT TGATTCAGCC GTCTGACGTA CATCCCAGGG TATACTTGAG CTTGAATCTG AAGTACAGAA AGATTCTACC TCACATGAAG CATGGTATGC CCTCGGCCTC AAGCAACAAG AGAATGAGCG CGAAGACCAA GCTATCTTGG CCCTATCCAA AGTCATTCAA CTGAATCCGC AGTATCGGCC AGCGTATCTT GCCCTCGCTG TCAGCTATAC AAATGAAGGG GAAAATGAGG CCGCGTGCAC CATGCTGGAG GATTGGATTA GACTGAAGGA CAGCAAAAAT ACAACCGGCG CTGATGGACA AAAAGGAAAA GACAGAAACA AACTCATTGA GAGTTTAATA GAAATTGCGA GGCAAACGCC GCACGAGATT GATGCTGATG TTCAGGTTGC ACTCGGGGTG CTGTTCAATA TGAGTGGGGG CCAGGTAAGT CGTCTTTCTT GTCAGCCATT GCGCGATTAC TTATGACCAT CTGGGACAGG ATTATTCCAA AGCTGAGGAT TGCTTTTTGG CGGCCTTGGA GGCCCGGCCT GAAGTAAGGC AGCTTGCTTC CTGTAATACA TCTCGCAGAT GCTAACTGCC CCTATAGGAT TGGTTGTTGT ACAACCGTCT CGGTGCAACA CTTGCGAACA GCGGAAGATC CAGCGAAGCT GTACAGTACT ACCATCAAGC TCTGAGGTTG CATCCCGGTT TTGTCCGAGC TTTGTAAGCA TTTCACCACT ATATCTCGGG TGATGCGAGC TTTCTAACGC AGCTCTTCTC TTCAGGTTCA ATCTTGGCAT CGCATACATG AATCTCGGTG AATATCAAAC TGCCGCCCAA TCAATCCTCG ACGCCCTGAG GCTCCAGCAC TCTGAGGCAA GTGAAGCCTA TGCATACGGC CAGAATGGTG GAGGTGCAAA AGGTGTTACG AGCGAGACGC TATGGAATAG TCTAAAAGGC GCCTGCTTCT AGTAAGTGCT ATATTCACCT CTCCAAACGT TTTTGATTCC TGTGTTAACG GATGGGTATC AAGTATGAAT CGACAGGACC TTGTAAAAAT CGTAGAAAAG AGAGATTTAT CTGGTGAGTA GCCCTATCTG ATGTGCGCCA TCGCTAAATT TTTGTCAGGT CTGCCATTGA AATTTGTAGA TGAAGAGCTC TAG
|
Protein sequence | MSAFLSGASV QCGPTSALKN VSDRINVDRS LQQDRLAYTS NASGSSSKQP FRAQLAQLPV NQSPKQVPGP PSTFDLSSLR QQLSPVPSTS HASDWASDFV PLTGPTASKY TRSVASTKSG WQEEFSQHVA AAPPLGTTRS QKPLYNAGLA PWEVPETQYQ LQRPALMRPS FRSHDIPISE ASLTFPRPDI HAAAPVSKYE EQIGQPTGSE TLPLDESQQL LARTARSFVS NLETQSDILS ANPKFAQSKF LSLLRGLGDE QVVVKEGQEV KGEEVGEGAT FVERNIVGNN WAEGFAKQKE KSIPQEAPTL AEAEYLERRS PYPPGQKEYP ALNSWVPALP THTLVPPQTA PQAARLAANN GALWDQQYHD QEALIQSSES PAPEPRKNVH FDEHPASRER SGVPSTLEEA ISSPGNIPGA GWGWNEQGLT HDFDEDVFEE FNGQLRRAQE SLEGGVGKQE SWDRLQSDWE EFQRTEPGVA HFRGMGTGDQ SERYMFQSRN PYSTDEEELY FEVSRDSPTL KGILELESEV QKDSTSHEAW YALGLKQQEN EREDQAILAL SKVIQLNPQY RPAYLALAVS YTNEGENEAA CTMLEDWIRL KDSKNTTGAD GQKGKDRNKL IESLIEIARQ TPHEIDADVQ VALGVLFNMS GGQDYSKAED CFLAALEARP EDWLLYNRLG ATLANSGRSS EAVQYYHQAL RLHPGFVRAL FNLGIAYMNL GEYQTAAQSI LDALRLQHSE ASEAYAYGQN GGGAKGVTSE TLWNSLKGAC FYMNRQDLVK IVEKRDLSGL PLKFVDEEL
|
| |