Gene Information |
Locus tag | CNL03830 |
Symbol | |
ID | 3254741 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | - |
Start bp | 46764 |
End bp | 49705 |
Gene Length | 2942 bp |
Protein Length | 732 aa |
Translation table | |
GC content | 48% |
IMG OID | 638253854 |
Product | clathrin binding protein, putative |
Protein accession | XP_567943 |
Protein GI | 58261066 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5096] Vesicle coat complex, various subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCCATAACAT CCTTTATTTT ACACTGGTCC AGCTCGCATC TGCTATATTT CAATCTCCAG CTTACACACC ACTTACAGTA CTATCGCATA TCCCTTACAC ACACATTTAA TCCCTGACAA AAAACGTTCA AAATGGCTAT GGCCCCGCCC CGCAAAGGTT AGCCCCATCC CTAATTCACG TCACATGTAT TCGCTAACCA TGCAATCTTC AGGCGAGAAC TGGGAACTTC GCCAACAGCT GAACTCAGAG TACAGGGATA AGCGGGCAGA TGCTATCAAG AGAGTCATTG CGAACCATAC CATTGGCAAG GACTGTAGTG GCTTGTTTCC CGATGTCGTC AAGAATATGG TGAGTGTTTA ATAACTCTTT GCTGGAGTCA AGGAGCAAGG GAGTACAGGT AAGAGCTGAT GTTCTGACCG CAGCAAACAG ATGATCTGGA GCAAAAGAAA CTCGTATATT TATATCTCAT GAACTATGCG AAGACACAAC CTGAACTCGT TATCCTTGCT GTCAACACTT TCGTCAAGGT GGGTTTGAGT TTGACTTATT CCATCTCTTC TGCTAACCTG CTTTATTAAA GGACACTGCC GACCCCAATC CTCTTGTTAG AGCTCTTGCC ATCCGCACCA TGTCCATCCT TCGCGCCGAG AAAATCCTCG ATTACCTCGC TTCACCTTTA TCTCGATGCC TCAAAGATGA GAACCCCTAC GTCCGAAAGA CTGCTGCTCT GTGTGTCGCC AAAGTGTTTG ATTTGAAGCC AGAGCTGGCT ATCGAATACG GGTTCATTGA AACTTTGAGA GATCTTATCG GCGATGGTAA TCCTATGGTA TGTCATGCAA TGATGAGCGT ACGAAGATGT CTGATCAATT GACAGGTTGT TGCAAATGCC GTTGCCGCAC TCGGGGATAT TCACGAGGCT TCTCTTAACC TTCCTTCTTC TCAGCCCGGC TCGCCAAATG ACGATGAATC TCCTAGCAGT GTCCGTCCCA ATCAATCCCT TTTCATCATT GACCCCGCTA CCCTTACCAA ACTCCTTGTC GCTTTGAACG AATGTTCTGA ATGGGGTCGT ATTGCCATCC TCACCACCTT GGCGAGATAT AGGACAAATG ACGAGAAGGA GAGTGAACAT ATCTGTGAAA GAGTGATGCC CCAGTTTCAG CACGTGAATG CGGCAGTTGT GTTGGGTGCG GTAAAGGTGA TCATGATTCA TATGAAAAAT GTCACCAAGG AAGACCTTTT GAAGTCTCTT ACTCGAAAAA TGGCTCCTCC ATTAGGTGAG TTCAACATTT TTTATTTTTT ATTTTTAATT CAAAAATGAA TGGTTTGACC AACAACGAAC AGTCACCCTT ATCTCCTCAC CGCCCGAGGT GCAATGGGTC GCACTTCGTA ACATCAACCT CCTCTTGCAG AAACGTCCCG ATATCCTTGC CAGCGAGATG CGCGTTTTCT TCTGCAAATA TAACGATCCT TCCTACGTCA AGGTGGAGAA ACTTGAGATT ATGGTTAGAT TGGCGAACGA GAAGAATGTG GACACTTTGC TTGGGGAGCT GAAGGAATAC GCGTCGGAAG TTGATGTCGA TTTTGTTCGC AAGGCTGTCA GGGCGGTTGG CCAAGTTGCT ATCAAGATCG ATGAGGCTGC CGGGCGATGT GTCGAAGTAT TGATGGAACT GATCGAGACA AGAGTCAGCT ATGTCGTACA GGAGGCTGTC ATCGTCGTCA AGGTAAGCTT CACTGCATGG GTATATCTGG ATGAGGCTAA TTCTTTGACA GGACATCTTC CGAAAATACC CCCACTCATA 
CGAAGGTATC ATCCCGGCGC TCTGTGCTAA TCTTGAGGAA TTGGATGAGC CTGAGGCCAA GGCCTCTTTG ATTTGGCTCA TCGGCGAGTA CGCAGAGAAG ATTGAGAATG CGGACGAGTT GCTGGGAGCG TTCTTGGAGA CTTTCAGTGA GGAAAGCTAT CCTGTACGAC TCATCATGAT TTCACCATTC ACTCGTTGAC TCATCATCTC TGCAGGTTCA ACTTCAAACC CTTACTGCGA TTGTCAAGCT GTTCCTCAAG AAACCTGATG AAAGCCAGGC TATCGTCCAA AAAGTGCTTC AAGCTGCTAC AAAGGACTGT GACAGCCCAG ACGTCAGAGA TAGGGCCTAC ATATACTGGA GATTGCTGTC ATCCGACCCC GCCGCTGCCA AGGTAAGTCA ATATCCACCT CTTGCTTCAT GCGAAATTCT AATGGATGTA ATATAGTCTG TTGTTTTGTC AGTCAGACCA CCTATCAGTC TTCCTCAAAC GACTGTCGCT CCTGCAATCC TTGAGGAGCT TATTGGCGAA ATCTCGACAT TGGCGAGTGT GTACCACAAG CCTGCGGCTA CGTTCATAGG CAAAGGTCGT TTGGGTGCAG ACGAGATGCA CAAGAAGAGC TTGGAGTACG TCTATCTCTC ATTCAGTCCT TATAGTGATG ACCTAACCTA TGCTGGACAG TGCCGAGGAC GATGTCTCAC GCGAAAAGGC TCTTCAAGCT GTCGTTGCCG GTAACCAAGC CGAAAACCTG CTCGACTTTG ACGACGAACC TACTCCTACC AACGGCGAAT CTTCCATCCC TGCTCCGGGT GCGGGTTTGG GCATCTCTTC TCAGGCTATT GCGAGCGCAG CCAAGAGCAC GAACCCGTTA GATGAATTAA TGGACTTGTT CTCCACTGCT AGCATGACAA CACCAGTCGT TCAACCTGGT CAGCCTGCAG CTCAGGCTCA AACGTCAGCT CAGAGTTCAG GGGGTTTGGG CGGGTTAAAT GGATTGGCAG GGCTGTCTAG CCCACCGCAG AGTGTATCGC CGCAACCCGG AGCTCCGCAA AGCCAGAAAC AACAACAGCA GGCGGCGGCT CAAGATGATT TGTTGGGGTT GTTTTAGATG AATAACTGAA AATGTATGTT GATATGGGTC TT
|
Protein sequence | MAMAPPRKGE NWELRQQLNS EYRDKRADAI KRVIANHTIG KDCSGLFPDV VKNMQTDDLE QKKLVYLYLM NYAKTQPELV ILAVNTFVKD TADPNPLVRA LAIRTMSILR AEKILDYLAS PLSRCLKDEN PYVRKTAALC VAKVFDLKPE LAIEYGFIET LRDLIGDGNP MPGSPNDDES PSSVRPNQSL FIIDPATLTK LLVALNECSE WGRIAILTTL ARYRTNDEKE SEHICERVMP QFQHVNAAVV LGAVKVIMIH MKNVTKEDLL KSLTRKMAPP LVTLISSPPE VQWVALRNIN LLLQKRPDIL ASEMRVFFCK YNDPSYVKVE KLEIMVRLAN EKNVDTLLGE LKEYASEVDV DFVRKAVRAV GQVAIKIDEA AGRCVEVLME LIETRVSYVV QEAVIVVKDI FRKYPHSYEG IIPALCANLE ELDEPEAKAS LIWLIGEYAE KIENADELLG AFLETFSEES YPVQLQTLTA IVKLFLKKPD ESQAIVQKVL QAATKDCDSP DVRDRAYIYW RLLSSDPAAA KSVVLSVRPP ISLPQTTVAP AILEELIGEI STLASVYHKP AATFIGKGRL GADEMHKKSL DAEDDVSREK ALQAVVAGNQ AENLLDFDDE PTPTNGESSI PAPGAGLGIS SQAIASAAKS TNPLDELMDL FSTASMTTPV VQPGQPAAQA QTSAQSSGGL GGLNGLAGLS SPPQSVSPQP GAPQSQKQQQ QAAAQDDLLG LF
|
| |
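The Gene Length and GC content fields above can be checked against the coordinates and the sequence text. The following is a minimal sketch, not the database's own method: it assumes that Start bp and End bp are 1-based inclusive coordinates, and that GC content is the fraction of G and C bases over the full gene sequence. The function names `clean_sequence`, `span_length`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that split the sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the Start bp and End bp fields of the record.
print(span_length(46764, 49705))  # 2942

# First two 10-letter blocks of the Gene sequence field, used as a small input.
example = clean_sequence("TCCATAACAT CCTTTATTTT")
print(len(example), gc_fraction(example))  # 20 0.25
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would give a value to compare with the reported 48%, subject to the assumptions stated above.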