Gene CNL03830 details


Gene Information

Locus tag: CNL03830
Symbol:
ID: 3254741
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 46764
End bp: 49705
Gene Length: 2942 bp
Protein Length: 732 aa
Translation table:
GC content: 48%
IMG OID: 638253854
Product: clathrin binding protein, putative
Protein accession: XP_567943
Protein GI: 58261066
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5096] Vesicle coat complex, various subunits
TIGRFAM ID:
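
The length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates, which the record does not state explicitly, although that convention reproduces the listed gene length. The helper names `span_length` and `coding_nucleotides` are hypothetical and used only for illustration.

```python
# Values copied from the record above.
START_BP = 46764
END_BP = 49705
PROTEIN_LENGTH_AA = 732


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_length_aa + 1) * 3


gene_length = span_length(START_BP, END_BP)
coding_length = coding_nucleotides(PROTEIN_LENGTH_AA)

print(gene_length)                  # 2942
print(coding_length)                # 2199
print(gene_length - coding_length)  # 743
```

The 743 bp not needed to encode the protein are consistent with the gene being marked as spliced. The record does not say how that remainder divides between introns and any non-coding flanking sequence.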


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCCATAACAT CCTTTATTTT ACACTGGTCC AGCTCGCATC TGCTATATTT CAATCTCCAG 
CTTACACACC ACTTACAGTA CTATCGCATA TCCCTTACAC ACACATTTAA TCCCTGACAA
AAAACGTTCA AAATGGCTAT GGCCCCGCCC CGCAAAGGTT AGCCCCATCC CTAATTCACG
TCACATGTAT TCGCTAACCA TGCAATCTTC AGGCGAGAAC TGGGAACTTC GCCAACAGCT
GAACTCAGAG TACAGGGATA AGCGGGCAGA TGCTATCAAG AGAGTCATTG CGAACCATAC
CATTGGCAAG GACTGTAGTG GCTTGTTTCC CGATGTCGTC AAGAATATGG TGAGTGTTTA
ATAACTCTTT GCTGGAGTCA AGGAGCAAGG GAGTACAGGT AAGAGCTGAT GTTCTGACCG
CAGCAAACAG ATGATCTGGA GCAAAAGAAA CTCGTATATT TATATCTCAT GAACTATGCG
AAGACACAAC CTGAACTCGT TATCCTTGCT GTCAACACTT TCGTCAAGGT GGGTTTGAGT
TTGACTTATT CCATCTCTTC TGCTAACCTG CTTTATTAAA GGACACTGCC GACCCCAATC
CTCTTGTTAG AGCTCTTGCC ATCCGCACCA TGTCCATCCT TCGCGCCGAG AAAATCCTCG
ATTACCTCGC TTCACCTTTA TCTCGATGCC TCAAAGATGA GAACCCCTAC GTCCGAAAGA
CTGCTGCTCT GTGTGTCGCC AAAGTGTTTG ATTTGAAGCC AGAGCTGGCT ATCGAATACG
GGTTCATTGA AACTTTGAGA GATCTTATCG GCGATGGTAA TCCTATGGTA TGTCATGCAA
TGATGAGCGT ACGAAGATGT CTGATCAATT GACAGGTTGT TGCAAATGCC GTTGCCGCAC
TCGGGGATAT TCACGAGGCT TCTCTTAACC TTCCTTCTTC TCAGCCCGGC TCGCCAAATG
ACGATGAATC TCCTAGCAGT GTCCGTCCCA ATCAATCCCT TTTCATCATT GACCCCGCTA
CCCTTACCAA ACTCCTTGTC GCTTTGAACG AATGTTCTGA ATGGGGTCGT ATTGCCATCC
TCACCACCTT GGCGAGATAT AGGACAAATG ACGAGAAGGA GAGTGAACAT ATCTGTGAAA
GAGTGATGCC CCAGTTTCAG CACGTGAATG CGGCAGTTGT GTTGGGTGCG GTAAAGGTGA
TCATGATTCA TATGAAAAAT GTCACCAAGG AAGACCTTTT GAAGTCTCTT ACTCGAAAAA
TGGCTCCTCC ATTAGGTGAG TTCAACATTT TTTATTTTTT ATTTTTAATT CAAAAATGAA
TGGTTTGACC AACAACGAAC AGTCACCCTT ATCTCCTCAC CGCCCGAGGT GCAATGGGTC
GCACTTCGTA ACATCAACCT CCTCTTGCAG AAACGTCCCG ATATCCTTGC CAGCGAGATG
CGCGTTTTCT TCTGCAAATA TAACGATCCT TCCTACGTCA AGGTGGAGAA ACTTGAGATT
ATGGTTAGAT TGGCGAACGA GAAGAATGTG GACACTTTGC TTGGGGAGCT GAAGGAATAC
GCGTCGGAAG TTGATGTCGA TTTTGTTCGC AAGGCTGTCA GGGCGGTTGG CCAAGTTGCT
ATCAAGATCG ATGAGGCTGC CGGGCGATGT GTCGAAGTAT TGATGGAACT GATCGAGACA
AGAGTCAGCT ATGTCGTACA GGAGGCTGTC ATCGTCGTCA AGGTAAGCTT CACTGCATGG
GTATATCTGG ATGAGGCTAA TTCTTTGACA GGACATCTTC CGAAAATACC CCCACTCATA
CGAAGGTATC ATCCCGGCGC TCTGTGCTAA TCTTGAGGAA TTGGATGAGC CTGAGGCCAA
GGCCTCTTTG ATTTGGCTCA TCGGCGAGTA CGCAGAGAAG ATTGAGAATG CGGACGAGTT
GCTGGGAGCG TTCTTGGAGA CTTTCAGTGA GGAAAGCTAT CCTGTACGAC TCATCATGAT
TTCACCATTC ACTCGTTGAC TCATCATCTC TGCAGGTTCA ACTTCAAACC CTTACTGCGA
TTGTCAAGCT GTTCCTCAAG AAACCTGATG AAAGCCAGGC TATCGTCCAA AAAGTGCTTC
AAGCTGCTAC AAAGGACTGT GACAGCCCAG ACGTCAGAGA TAGGGCCTAC ATATACTGGA
GATTGCTGTC ATCCGACCCC GCCGCTGCCA AGGTAAGTCA ATATCCACCT CTTGCTTCAT
GCGAAATTCT AATGGATGTA ATATAGTCTG TTGTTTTGTC AGTCAGACCA CCTATCAGTC
TTCCTCAAAC GACTGTCGCT CCTGCAATCC TTGAGGAGCT TATTGGCGAA ATCTCGACAT
TGGCGAGTGT GTACCACAAG CCTGCGGCTA CGTTCATAGG CAAAGGTCGT TTGGGTGCAG
ACGAGATGCA CAAGAAGAGC TTGGAGTACG TCTATCTCTC ATTCAGTCCT TATAGTGATG
ACCTAACCTA TGCTGGACAG TGCCGAGGAC GATGTCTCAC GCGAAAAGGC TCTTCAAGCT
GTCGTTGCCG GTAACCAAGC CGAAAACCTG CTCGACTTTG ACGACGAACC TACTCCTACC
AACGGCGAAT CTTCCATCCC TGCTCCGGGT GCGGGTTTGG GCATCTCTTC TCAGGCTATT
GCGAGCGCAG CCAAGAGCAC GAACCCGTTA GATGAATTAA TGGACTTGTT CTCCACTGCT
AGCATGACAA CACCAGTCGT TCAACCTGGT CAGCCTGCAG CTCAGGCTCA AACGTCAGCT
CAGAGTTCAG GGGGTTTGGG CGGGTTAAAT GGATTGGCAG GGCTGTCTAG CCCACCGCAG
AGTGTATCGC CGCAACCCGG AGCTCCGCAA AGCCAGAAAC AACAACAGCA GGCGGCGGCT
CAAGATGATT TGTTGGGGTT GTTTTAGATG AATAACTGAA AATGTATGTT GATATGGGTC
TT
 
Protein sequence
MAMAPPRKGE NWELRQQLNS EYRDKRADAI KRVIANHTIG KDCSGLFPDV VKNMQTDDLE 
QKKLVYLYLM NYAKTQPELV ILAVNTFVKD TADPNPLVRA LAIRTMSILR AEKILDYLAS
PLSRCLKDEN PYVRKTAALC VAKVFDLKPE LAIEYGFIET LRDLIGDGNP MPGSPNDDES
PSSVRPNQSL FIIDPATLTK LLVALNECSE WGRIAILTTL ARYRTNDEKE SEHICERVMP
QFQHVNAAVV LGAVKVIMIH MKNVTKEDLL KSLTRKMAPP LVTLISSPPE VQWVALRNIN
LLLQKRPDIL ASEMRVFFCK YNDPSYVKVE KLEIMVRLAN EKNVDTLLGE LKEYASEVDV
DFVRKAVRAV GQVAIKIDEA AGRCVEVLME LIETRVSYVV QEAVIVVKDI FRKYPHSYEG
IIPALCANLE ELDEPEAKAS LIWLIGEYAE KIENADELLG AFLETFSEES YPVQLQTLTA
IVKLFLKKPD ESQAIVQKVL QAATKDCDSP DVRDRAYIYW RLLSSDPAAA KSVVLSVRPP
ISLPQTTVAP AILEELIGEI STLASVYHKP AATFIGKGRL GADEMHKKSL DAEDDVSREK
ALQAVVAGNQ AENLLDFDDE PTPTNGESSI PAPGAGLGIS SQAIASAAKS TNPLDELMDL
FSTASMTTPV VQPGQPAAQA QTSAQSSGGL GGLNGLAGLS SPPQSVSPQP GAPQSQKQQQ
QAAAQDDLLG LF
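
Both sequences are printed in blocks of ten characters, so the whitespace has to be removed before they can be measured or analysed. The following is a minimal sketch of that step together with a GC-fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the code assumes the sequence contains only the letters shown in this record (no ambiguity codes).

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a blocked nucleotide listing."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "TCCATAACAT CCTTTATTTT ACACTGGTCC AGCTCGCATC TGCTATATTT CAATCTCCAG"

print(len(clean_sequence(first_row)))  # 60
print(round(gc_fraction(first_row), 2))
```

Applied to the full gene sequence, the cleaned length and GC fraction can be compared with the 2942 bp and 48% listed under Gene Information. Applied to the protein sequence, `clean_sequence` gives a string whose length can be compared with the listed 732 aa.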