Gene CNE01310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE01310 
Symbol 
ID3257860 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp359441 
End bp362816 
Gene Length3376 bp 
Protein Length1031 aa 
Translation table 
GC content51% 
IMG OID638256719 
Productspliceosome complex protein, putative 
Protein accessionXP_570716 
Protein GI58267120 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATATCCATAA AAAACCAGGA GCCAGACAGC ATGGCATCCA CATCCCTCGT AGACAGCCTC 
ACTTCCCACT TCCCACTCAC ACTCCCCATC CCCACCCCCA TCACCCACCC ACACCTCATC
CCCTCCGCAG ACCTCCCTGT CGAAGAAGAC CTCCTGCACA ACCCGGAAAA CCTCCGTTCA
TGGCTCTCCT ACATCCACAA CGTCAAGGAG AAGATCGCTG CCGATGAGCC TGCCAAGGGT
GGCGTGCTTT CCCCGGAGGA GGAGATTCTC GGGCCGTTGG CGAGCAAAAA TGCAAGAGAC
GGATTGCAGC GATTGGTATC GATCTACGAG CGAGCCATCG CCGTTTTCCC GACCAGCTAC
AAGCTCTGGA AAGCGTATTA CCTTACCCGT CAATCATATG TGTTGGGAGA ACTAACCAAT
GATGCCAAAG AAGCGCGATC CCAGCAAGCC AAACGAGGTG CCGCGTACAA GACCAACGTT
CGAGAGCTTT TAGACGGAGC CGAAGAAGCC CACGAGTGGA CAGGCGGTCT CGACCCCGTG
GTCGGTTATG CCGAATGGAG GAGCTTGGTT GCGACCGGTG AGAGGATGAT CATGTGTCTT
CCCAACTTGC CAATTCCATG GCTTCTTCAT TTGGGCGTCT TGTTGCACCC TAAGTGTCCT
TCTGTTTTCA AGAACGGCTC GTACGCGAGG AGGGCGTTTG ATAGGGCGTT GAGGACGTTA
CCTCCCAGTT TGCATGGAAG AGTCTGGGGT CTATATCTCC GCTGGGCCGA GATTGTTGGC
GGGGACGCTG GTGAGAGGGT CTGGAGAAGA TACCTCAAGG TCAGTTTCTG TCCTTTCCAT
TGTGATTGAC ACTGACATCC GACCCAGGTC GACCCGAGCT TAACAGAACG ACACATCACC
TATCTCCTCG AAGCTGAAGA GCCTCGCCCT CTCGCTGCAG CCAAATACCT CCTTTCCATC
GCTCGCCGCG CACAACAAAA TCTCTACTCT TCTCTGGAGG GCAAATCTCC TTACCAATTG
TTTGTCGACT TTTTGGAACT TGTGGAAAAG TATGCCGATC AGATCGGTAT GGACGAAGAA
GGTACACTCG AGTTGCAAAG AACAAAACGT GCAGTGGAGG AAAAGGTTGA TGGGGAGCAG
CCCCAGGTGG AAGGGCAGGA GCAACAGCCT CAGGAAGAGC CCGCTAGCAT CAACGGTCGT
CTAATGCGCA TTGCGGGCCC CCCTGTTCCA CTTGAGCAAG GAAAACTCTT CAAACCTGTC
AACGCTGCTT CAGCCCAAGC CCCCACTCAA TTGACCTACG ATGAAGATAC CGATCCATCA
AACCCTCGAT TACTTGATGT GGAAGGTATC GTCGAAAGGG ATGGTCTTCA GGTCTACAAG
GACCAAGCTG GTCGATTGTG GACTGGTTTG GCTACTTATT GGATCAAGAG AGGAGAGTTT
GAAAGGGCCA CTGCGACGTT TGAGAGGGGT CTTGCAGCTG TGGTGACTAT TCGAGACTTT
ACTCAGATTT TCGACGCCTA CGCCGAATTC TCAGAGACTA TGATCTCCAC TCTCATGGAT
GCTCTTGCTG ACGAGGATAA TCTTGAAGAT GAAGACTTTG ACGCGGAAGA GACTGAACAG
GAGTTGGATG AGCGAATGAA GAGTTTCGAA GAGTTGATGG ACCGCCGACC CTTCCTTGTC
AACGATGTGC TCCTCCGTCG AAACCCCAAT GAAGTGGTCG AGTGGGAAAA GCGTATTGCC
CTTCATGGCG ACGATGACGC GAAGGTCGTT GAAGCGTACG TCAAGGCACT CGATACCATC
AACCCCCGAA AAGCTACCGG ACCTCTCTAC CCCTTGTATG TCAACTTCGC CAAGTTCTAT
GAAGAAGGTG GAAGCAAGGA CGACAATGGA GAGCCAAGGA ATGAGCCTGA TTTGGAGCAA
GCGAGAAAGA TCTTTGAGCG AGCCACAAAG GTGCCTTTCA AGGCGGTTGA TGAATTGGCA
GAGGTTTGGT GCGAGTGGGC TGAGATGGAG TTGAGAAATG AGTAAGCGTC CTCTTGCATG
TCGAAGGGCA ATGCTAATAT GTGCGGTTAG GAACTACGAG GAAGCCATTC GATTGATGCA
GCGGGCGACA ACTGTCCCTA AGAACACCAA AATCAATTAT TACGATGATG TGAGCAGGCT
CCTGTTCTCT GCATGACCCA TCTAACGTTC GTATAGAACA TCCCTCCCCA GTCTCGACTT
TTCAAATCCC TCAAACTGTG GTCCTACTAC TCTGACCTTG AAGAGTCTAT TGGCACCGTC
GAATCCACCA AAGCAGTATA TGACAAGATC ATGGAGCTCA AGATAGCCAA CGCCCAAGTC
ATTGTCAATT ACGCCACGTT TTTGGAGGAG AACAAGTACT TTGAGGAAAG TTTCAAGGTG
TACGAGCGAG GAATTGAGCT GTTCCACTTC CCCATCGCTT TCGAGATCTG GAATATTTAC
CTCTCCAAAT TTGTTAAGCG ATACGGCGGC AAGAAACTCG AACGTGCTCG AGACTTGTTT
GAACAAGCCC TCGAAAACTG CCCTGAAAAG TTCTGCAAGC CTCTCTATCT TATGTACGCC
AAACTCGAAG AGGAGCATGG TCTTGCCAAG CGAGCAATGG GTATCTACGA CCGCGCTGCG
TCGACCGTTC AAGACTCTGA CAAGTTTGAG ATGTACACCA TCTACATCGC CAAGGCTACT
GCCAACTTTG GCCTTCCTGC CACTCGACCA ATTTATGAGC GGGCCCTCGA ATCTTTGCCT
GACAAGCAGA CAGCGGAGAT GTGTCGACGA TTCGCGAGGA TGGAAAGGAA GCTGGGTGAG
ATTGATAGGG CGAGGGCGAT CTACGCGCAT GCGAGCCAAT TCTGTGACCC CAGAATTGAG
CCCGAATTCT GGCAAGAATG GAATGATTTC GAGAGTATGT CTCATTACTT TTTTCTTCAG
TGTATTTTTA TGATTGACTT TGCAACTCTA GTCGAAACTG GATCTGAAGA CACGTTCCGG
GAAATGCTCC GTATCAAACG AGCTGTTCAA GCGTCTTTCA ACACCGAGAC CTCGTTCATT
GCTGCCCAGG CTGCTGCCGC GTCCAAGGGT ACTGAAAAAC CTACTGATAC TTCAGCTCAG
GAAGCACAAG ACGCCGCCGA CCCTATGGCT GCGATGGAGC GTGAATTGAG CGCGGCTGGT
GCTGATGGCG CAAGAAAGGG TGGAGCACCG GCGTTTGTCG CGAGTACGTT GAATAAGACT
AATGCCAACG GAATTGATGA AGGAGGAGAA GAAACAGGAG AAATGGCGAA CCCTGATGCG
ATCGTCATGG ATGAGGATGA GTTCTAAGGA AAGATACCTA GAGAAAGGAA TAATCGATGC
ATCTTTACTA TCTGTT
 
Protein sequence
MASTSLVDSL TSHFPLTLPI PTPITHPHLI PSADLPVEED LLHNPENLRS WLSYIHNVKE 
KIAADEPAKG GVLSPEEEIL GPLASKNARD GLQRLVSIYE RAIAVFPTSY KLWKAYYLTR
QSYVLGELTN DAKEARSQQA KRGAAYKTNV RELLDGAEEA HEWTGGLDPV VGYAEWRSLV
ATGERMIMCL PNLPIPWLLH LGVLLHPKCP SVFKNGSYAR RAFDRALRTL PPSLHGRVWG
LYLRWAEIVG GDAGERVWRR YLKVDPSLTE RHITYLLEAE EPRPLAAAKY LLSIARRAQQ
NLYSSLEGKS PYQLFVDFLE LVEKYADQIG MDEEGTLELQ RTKRAVEEKV DGEQPQVEGQ
EQQPQEEPAS INGRLMRIAG PPVPLEQGKL FKPVNAASAQ APTQLTYDED TDPSNPRLLD
VEGIVERDGL QVYKDQAGRL WTGLATYWIK RGEFERATAT FERGLAAVVT IRDFTQIFDA
YAEFSETMIS TLMDALADED NLEDEDFDAE ETEQELDERM KSFEELMDRR PFLVNDVLLR
RNPNEVVEWE KRIALHGDDD AKVVEAYVKA LDTINPRKAT GPLYPLYVNF AKFYEEGGSK
DDNGEPRNEP DLEQARKIFE RATKVPFKAV DELAEVWCEW AEMELRNENY EEAIRLMQRA
TTVPKNTKIN YYDDNIPPQS RLFKSLKLWS YYSDLEESIG TVESTKAVYD KIMELKIANA
QVIVNYATFL EENKYFEESF KVYERGIELF HFPIAFEIWN IYLSKFVKRY GGKKLERARD
LFEQALENCP EKFCKPLYLM YAKLEEEHGL AKRAMGIYDR AASTVQDSDK FEMYTIYIAK
ATANFGLPAT RPIYERALES LPDKQTAEMC RRFARMERKL GEIDRARAIY AHASQFCDPR
IEPEFWQEWN DFEIETGSED TFREMLRIKR AVQASFNTET SFIAAQAAAA SKGTEKPTDT
SAQEAQDAAD PMAAMERELS AAGADGARKG GAPAFVASTL NKTNANGIDE GGEETGEMAN
PDAIVMDEDE F