Gene CNH03830 details


Gene Information

Locus tag: CNH03830
Symbol:
ID: 3259143
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 13287
End bp: 16228
Gene Length: 2942 bp
Protein Length: 755 aa
Translation table:
GC content: 48%
IMG OID: 638258099
Product: clathrin binding protein, putative
Protein accession: XP_572541
Protein GI: 58270770
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5096] Vesicle coat complex, various subunits
TIGRFAM ID:
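
The length fields above can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding region ends in a single stop codon. Neither assumption is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 13287
end_bp = 16228
protein_length_aa = 755

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Whatever remains is non-coding: introns and any untranslated regions.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 2942
print(coding_bp)       # 2268
print(non_coding_bp)   # 674
```

Under these assumptions the computed span matches the listed gene length of 2942 bp, and the gap between the span and the coding length is consistent with the gene being marked as spliced.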


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCCATAACAT CCTTTATTTT ACACTGGTCC AGCTCGCATC TGCTATATTT CAATCTCCAG 
CTTACACACC ACTTACAGTA CTATCGCATA TCCCTTACAC ACACATTTAA TCCCTGACAA
AAAACGTTCA AAATGGCTAT GGCCCCGCCC CGCAAAGGTT AGCCCCATCC CTAATTCACG
TCACATGTAT TCGCTAACCA TGCAATCTTC AGGCGAGAAC TGGGAACTTC GCCAACAGCT
GAACTCAGAG TACAGGGATA AGCGGGCAGA TGCTATCAAG AGAGTCATTG CGAACCATAC
CATTGGCAAG GACTGTAGTG GCTTGTTTCC CGATGTCGTC AAGAATATGG TGAGTGTTTA
ATAACTCTTT GCTGGAGTCA AGGAGCAAGG GAGTACAGGT AAGAGCTGAT GTTCTGACCG
CAGCAAACAG ATGATCTGGA GCAAAAGAAA CTCGTATATT TATATCTCAT GAACTATGCG
AAGACACAAC CTGAACTCGT TATCCTTGCT GTCAACACTT TCGTCAAGGT GGGTTTGAGT
TTGACTTATT CCATCTCTTC TGCTAACCTG CTTTATTAAA GGACACTGCC GACCCCAATC
CTCTTGTTAG AGCTCTTGCC ATCCGCACCA TGTCCATCCT TCGCGCCGAG AAAATCCTCG
ATTACCTCGC TTCACCTTTA TCTCGATGCC TCAAAGATGA GAACCCCTAC GTCCGAAAGA
CTGCTGCTCT GTGTGTCGCC AAAGTGTTTG ATTTGAAGCC AGAGCTGGCT ATCGAATACG
GGTTCATTGA AACTTTGAGA GATCTTATCG GCGATGGTAA TCCTATGGTA TGTCATGCAA
TGATGAGCGT ACGAAGATGT CTGATCAATT GACAGGTTGT TGCAAATGCC GTTGCCGCAC
TCGGGGATAT TCACGAGGCT TCTCTTAACC TTCCTTCTTC TCAGCCCGGC TCGCCAAATG
ACGATGAATC TCCTAGCAGT GTCCGTCCCA ATCAATCCCT TTTCATCATT GACCCCGCTA
CCCTTACCAA ACTCCTTGTC GCTTTGAACG AATGTTCTGA ATGGGGTCGT ATTGCCATCC
TCACCACCTT GGCGAGATAT AGGACAAATG ACGAGAAGGA GAGTGAACAT ATCTGTGAAA
GAGTGATGCC CCAGTTTCAG CACGTGAATG CGGCAGTTGT GTTGGGTGCG GTAAAGGTGA
TCATGATTCA TATGAAAAAT GTCACCAAGG AAGACCTTTT GAAGTCTCTT ACTCGAAAAA
TGGCTCCTCC ATTAGGTGAG TTCAACATTT TTTATTTTTT ATTTTTAATT CAAAAATGAA
TGGTTTGACC AACAACGAAC AGTCACCCTT ATCTCCTCAC CGCCCGAGGT GCAATGGGTC
GCACTTCGTA ACATCAACCT CCTCTTGCAG AAACGTCCCG ATATCCTTGC CAGCGAGATG
CGCGTTTTCT TCTGCAAATA TAACGATCCT TCCTACGTCA AGGTGGAGAA ACTTGAGATT
ATGGTTAGAT TGGCGAACGA GAAGAATGTG GACACTTTGC TTGGGGAGCT GAAGGAATAC
GCGTCGGAAG TTGATGTCGA TTTTGTTCGC AAGGCTGTCA GGGCGGTTGG CCAAGTTGCT
ATCAAGATCG ATGAGGCTGC CGGGCGATGT GTCGAAGTAT TGATGGAACT GATCGAGACA
AGAGTCAGCT ATGTCGTACA GGAGGCTGTC ATCGTCGTCA AGGTAAGCTT CACTGCATGG
GTATATCTGG ATGAGGCTAA TTCTTTGACA GGACATCTTC CGAAAATACC CCCACTCATA
CGAAGGTATC ATCCCGGCGC TCTGTGCTAA TCTTGAGGAA TTGGATGAGC YTGAGGCCAA
GGCCTCTTTG ATTTGGCTCA TCGGCGAGTA CGCAGAGAAG ATTGAGAATG CGGACGAGTT
GCTGGGAGCG TTCTTGGAGA CTTTCAGTGA GGAAAGCTAT CCTGTACGAC TCATCATGAT
TTCACCATTC ACTCGTTGAC TCATCATCTC TGCAGGTTCA ACTTCAAACC CTTACTGCGA
TTGTCAAGCT GTTCCTCAAG AAACCTGATG AAAGCCAGGC TATCGTCCAA AAAGTGCTTC
AAGCTGCTAC AAAGGACTGT GACAGCCCAG ACGTCAGAGA TAGGGCCTAC ATATACTGGA
GATTGCTGTC ATCCGACCCC GCCGCTGCCA AGGTAAGTCA ATATCCACCT CTTGCTTCAT
GCGAAATTCT AATGGATGTA ATATAGTCTG TTGTTTTGTC AGTCAGACCA CCTATCAGTC
TTCCTCAAAC GACTGTCGCT CCTGCAATCC TTGAGGAGCT TATTGGCGAA ATCTCGACAT
TGGCGAGTGT GTACCACAAG CCTGCGGCTA CGTTCATAGG CAAAGGTCGT TTGGGTGCAG
ACGAGATGCA CAAGAAGAGC TTGGAGTACG TCTATCTCTC ATTCAGTCCT TATAGTGATG
ACCTAACCTA TGCTGGACAG TGCCGAGGAC GATGTCTCAC GCGAAAAGGC TCTTCAAGCT
GTCGTTGCCG GTAACCAAGC CGAAAACCTG CTCGACTTTG ACGACGAACC TACTCCTACC
AACGGCGAAT CTTCCATCCC TGCTCCGGGT GCGGGTTTGG GCATCTCTTC TCAGGCTATT
GCGAGCGCAG CCAAGAGCAC GAACCCGTTA GATGAATTAA TGGACTTGTT CTCCACTGCT
AGCATGACAA CACCAGTCGT TCAACCTGGT CAGCCTGCAG CTCAGGCTCA AACGTCAGCT
CAGAGTTCAG GGGGTTTGGG CGGGTTAAAT GGATTGGCAG GGCTGTCTAG CCCACCGCAG
AGTGTATCGC CGCAACCCGG AGCTCCGCAA AGCCAGAAAC AACAACAGCA GGCGGCGGCT
CAAGATGATT TGTTGGGGTT GTTTTAGATG AATAACTGAA AATGTATGTT GATATGGGTC
TT
 
Protein sequence
MAMAPPRKGE NWELRQQLNS EYRDKRADAI KRVIANHTIG KDCSGLFPDV VKNMQTDDLE 
QKKLVYLYLM NYAKTQPELV ILAVNTFVKD TADPNPLVRA LAIRTMSILR AEKILDYLAS
PLSRCLKDEN PYVRKTAALC VAKVFDLKPE LAIEYGFIET LRDLIGDGNP MVVANAVAAL
GDIHEASLNL PSSQPGSPND DESPSSVRPN QSLFIIDPAT LTKLLVALNE CSEWGRIAIL
TTLARYRTND EKESEHICER VMPQFQHVNA AVVLGAVKVI MIHMKNVTKE DLLKSLTRKM
APPLVTLISS PPEVQWVALR NINLLLQKRP DILASEMRVF FCKYNDPSYV KVEKLEIMVR
LANEKNVDTL LGELKEYASE VDVDFVRKAV RAVGQVAIKI DEAAGRCVEV LMELIETRVS
YVVQEAVIVV KDIFRKYPHS YEGIIPALCA NLEELDEXEA KASLIWLIGE YAEKIENADE
LLGAFLETFS EESYPVQLQT LTAIVKLFLK KPDESQAIVQ KVLQAATKDC DSPDVRDRAY
IYWRLLSSDP AAAKSVVLSV RPPISLPQTT VAPAILEELI GEISTLASVY HKPAATFIGK
GRLGADEMHK KSLDAEDDVS REKALQAVVA GNQAENLLDF DDEPTPTNGE SSIPAPGAGL
GISSQAIASA AKSTNPLDEL MDLFSTASMT TPVVQPGQPA AQAQTSAQSS GGLGGLNGLA
GLSSPPQSVS PQPGAPQSQK QQQQAAAQDD LLGLF
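
The sequences above are printed in groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool. It also assumes that ambiguity codes (the gene sequence contains a Y, meaning C or T) should be left out of the GC calculation, which may differ from how the GC content field on this page was computed.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among unambiguous bases (A, C, G, T) only.

    Assumption: ambiguity codes such as Y are excluded from both counts.
    """
    unambiguous = [base for base in seq if base in "ACGT"]
    if not unambiguous:
        raise ValueError("sequence has no unambiguous bases")
    gc = sum(1 for base in unambiguous if base in "GC")
    return gc / len(unambiguous)


# First line of the gene sequence above, used here as a short excerpt.
first_line = "TCCATAACAT CCTTTATTTT ACACTGGTCC AGCTCGCATC TGCTATATTT CAATCTCCAG"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # 0.4
```

The 0.4 shown here applies to the 60-base excerpt only. To check the whole gene, paste the full gene sequence block into `clean_sequence` and compare the resulting length with the Gene Length field.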