Gene CNL04540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04540 
Symbol 
ID3254871 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp262704 
End bp267562 
Gene Length4859 bp 
Protein Length1274 aa 
Translation table 
GC content48% 
IMG OID638253925 
Productmicrofilament motor, putative 
Protein accessionXP_568003 
Protein GI58261186 
COG category[Z] Cytoskeleton 
COG ID[COG5022] Myosin heavy chain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATGTA TCACCACACT CACATTAAGC AAAACTGAAA TTCGCCAAAG GCTCCTTCAA 
AGAAAGCAGG AAAGAAGGGC GCCGTCGGCG GCTTCTTATC AGGAGCCTCT AAGCCTCAGA
AGGTCCAAAA GGTCAGCATT CGTCATTTGC CTATCTATGT TTATGAGCTA ATCTAAGTAT
CCACAAGGCC GACTGGAGTG AAGGGTTCAC AAAGAAGAAG GCCGCAGGTG TTCCCGACAT
GACTTTGTTG AGCACAATCA CTAACGAGGC TATCAACGAC AACCTCAAAG TACGATTCCA
GAATCAAGAG ATCTATGTAT GTATTCCATT GTCAACCCTT AACGAAAGCT AACTTTGCCA
CAAGACATAT ATTGCCCATG TTTTGATTTC TGTCAATCCA TTCCGAGGTA CTTCGATTAC
TCAGCCGTCA CGTTACTTGC TAACAACGAG TAGACCTTGG TATCTATACG AATGACGTCC
TCAACTCATA TCGAGGCAAA AACCGTCTTG AAATGTCCCC TCATGTCTTT GCTATCGCCG
AATCAGCCTA TTATCGTATG ACAACCGAAA AAGAGAACCA ATGTGTCATT ATTTCTGGTG
AATCAGGTGC AGGCAAGACC GAAGCCGCCA AGAGGATCAT GCAATATATC GCCGCGGTGT
CAGGAGGCGC CGAGGGCGGC GGTATTGAGG GAATTAAGGA AATGGTTTTG GCCACGAACC
CTCTTTTGGA GAGTTTCGGT TGTGCAAAGA CTCTGAGAAA CGATAATTCC AGTCGACACG
TAGGTCTGTT AATAGGCCGT CTGTCCTAGC TGACGTGGCG TAGGGTAAAT ACCTTGAGAT
TATGTTCAAT GGCATGGGGC AGCCAGTCGG TGCTCAAATC ACCAACTATC TTCTGGAAAA
GGTCAGCCTT TTCAGCTATC AATCTAGGAA AGGACGCTGA AAGCGATTCT CAGAACCGGG
TGGTCGGACA AATCGACGAC GAGCGAGACT TTCACATCTT CTATCAGTTC ACCAAGGGTG
CTAGTGCCAA AATGAAGGGT CAGTTTAGTT TTTCGTCTTG AGTCGCCTGC TAACGATACG
CGTAAAGAGG CATTTGGTTT ACAAGGCCCT GAGGCGTACG CATATATCAG TCGAAGTGGT
TGTCTGGATG TCAAGAGTAT TAACGATGTG TCCGATTTTC AGGAGACTTT GGTGTGTCCC
TAATCTCTTT TGGAAGTTGT GTGTTGATTA ACAATGTACA AGCGAGCAAT GCAAGTCATC
GGTCTCACAT CGGATGAACA AGATTCGATT TTCCGTATCC TTGCTACCAT CCTTTGGCTC
GGTAACATCG ACTTTGTCGA GGGCGACGAC GGCAACGCTG CAATCTCTGA TTCCGGTGTG
GCGGACTTTG CAGCTTATTT GTTGGAAGTT GATTCGGCGC AATTGCAAAA AGTGCTGTTG
ATGAGAATTA TGGAGACGCA AAGAGGCGGA AGGAGGGGTA GTGTTTATGA GGTTCCTCAA
AACGTAGCCC AAGCTTCTTC CGGAAGAGAT GCCCTTGCAA AGGCCTTGTG AGTGACGCTA
GCGTTTGAAG CTTGAACGCG AGCCCTGACT GTAAATTAGA TACAACAATT TGTTCGAATG
GATTGTCAGC CGAGTCAACA TTTCAATGAA GCCTCAAACT CCTTCTCAAT ACGTCATTGG
TGTGCTTGAT ATCTAGTATG TTCCTGAATA CAGCACCGTC ATTTTCTGCT GACAACTATA
AACAGCGGTT TTGAAATTTT CCAAGTTAGT TACAGATTGC AGCCATTAAA GATTTGCTGA
CTCTGTAAAA GGATAACTCA TTCGAACAAC TTTCTATCAA CTATGTCAAT GAAAAGCTTC
AGCAAATCTT CATCGAACTG ACATTAAAAG CCGAACAAGA GGAGTATGTC CGAGAACAAA
TCAAGTGGAC TCCTATCAAG TGTAGGTTGA CGAAGTTTCT GGCAATTGGT GACGATTGCT
CATCTCTTCA TAGTCTTTGA CAATTCCGTG GTTTGCTCTC TCATTGAAGA CCGTCGTCCC
GCCGGTATTT TCGCCACTCT CAATGACGCC ACTGCTACTG CCCACGCCGA TCCGTCTGCT
GCCGACAACT CTTTCATTCA GAGATCGAGT ATGCTCGCTT CCAACCCTAA TTTCGAAGCA
CGAGGCAACA AGTTCCTTAT CAAACATTAT GCTGGTGATG TGCTCTATAC TGTAGCGGGG
ATGACTGATA AGAATAAGGA CACTCTCATC AAGGATATCT TGGATCTTAT CGAGGGCAGT
AAGGATCCAT TCTTACATAC CCTTTTCCCC GACAAGGTGG ATCACACCTC CAAGAAGAGG
CCTCCGACCG CAGGTGACAA GATCAAGTTG TCTGCCAACT TACTCGTTGA GAACCTTATG
AAGTAAGCGT ATATTTTATG ACGATTATCT TGAGCTAATG AGTCGACAGC TGTCAACCAC
ACTATATCCG AACAATTAAA CCCAACCAGC ACCGCTCGCC GACAGAATAC GATGACAAGG
CCATTCTACA TCAAATTAAA TACCTTGGTC TTCAAGAAAA TATCCGTGTC CGTCGAGCAG
GTTTTGCCTA TCGAGCCGAA TTCTCCAAGA TGATCCAGCG ATTCTACTTG CTGTCGCCTG
CCACGTCTTA TGCCGGTGAT TACATCTGGA CAGGCGATGA CCGTTCGGGT TGTGAGAGGA
TATTGACGGA TGCCAAGATC AAAAAGGAAG AGTGGCAAAT GGGTGTAACC AAGGCATTTA
TCAAAAATCC CGAAACCTTG TTTTACCTCG AAGGAGAGAG GGATAGGTAC TGGCACACGA
TGGCGTCACG TATTCAACGT GCTTGGCGAG CGTACGTCAG AAGGAAGCAT GAAGCGGCCA
CCAAGATTCA AAGATTCTGG AGGAATCAGC GTGAAGCACT TGTGTACGAA CGTAAGAGAG
ACTATGGCCA CCAAGTGCTC GCCGGGAAGA AGGAGAGGAG GAGGTTCAGC TTGTTGGGCA
TGCGCAAATT TATGGGAGAC TATCTGGACA TAGCTGGAGG GAGTGCTCAG GGAGAGATGC
TGAGAAATGC GGCTACCATA TCTCGTGGGA TCTCGTGTTT TACGACTATG TTCAAGGCTA
ATAAATGCAC TCAGCTGCCG AGCAAGTTCA TTTCAGCTCG AGGGCGGAGC TGCTTGTCTC
CAAACTTGGT AGATCAAGTA AACTGAGTCC GCGATTCCTT ATCATCGTGC GTTGGTCATT
GCGTTATCTA TTCACAGGAA ACTGATTTCA GGATAGACGG ACAAAGCGGT CTACTTTGTT
GTCTCGCAGG CCAGAGATGG CCGAGTTTCC ACAAGTTTGG AGCGCAAAAT CCCGCTGGTG
ACAATCAAGG CGATCTCAAT GACCAACTTG CGAGATGACT TTGTAGTAAG TCTATTGGCA
TGAATGTTTA TATTCCAAGC TAATTAGATG AGTAGGCTCT CAATGTCAAC GCATGTGAGG
AGGGTGACCC CATCTTCACC TGCGTTTTCA AGACAGAAAT GATAACCGTC ATCCTCACCC
TGACGGGTGG GAATATGTCC GTCAACATTG GTCCTACGTG AGTATATTAT CCTAAGAAGC
ATTGATATAG GCTTACATTG CCTTACAGGA TTGACTACGC TAAGAAGAAG GATAAGCGAG
CAGTCATCAA GACGCAGAAG AACGAGGCAG TGAGAGGTGA CGCAACATAC AAGAGTCATA
CCATTCAAGT TGGCTCAGGA GAGCCTCCTA ATAGCTGTAA GCACAAGTAG GCCAATATTA
GAGAAAAGAC CCACTCATAC ACTGTCCAGT GTCGAACCCT ATGCCTCCTC GGAAACCCAA
GGTCAAGAAG GCTGCGAAGA CCGCTTCTTC GGTGAGTGAA TATCTCGACC ACTTGAAGCC
CATACTGACC ATGCGAATAG AGTCGACCCG TAAACTCTGG TCGGCCTGCC GCTGTTGCCC
TCCCTGGTGC CACAAAGCCT GCTGCTCCCC CTGCTCTTTC CAGCATGCCC TCCCACACAC
CTGTTGTTAC GAAACCAACT GCTATCCCTA CGGCAGCCAT CGGTGCCGCG CGCGCACCTC
CATCCATCCC CGGCCGTGCA GCCGCCCCTC CTCCCCCCCC ACCACCGCCA CCACCTGCAG
GTCCCCCTAA GGAGTTCTAC AAGGCTCTTT ATAACTTTAC TGGTCAGGAA GGAGAGATGA
ATTTGGTCAA GGGAGAAGAA GTAGAGGTCA AGGAGAAGGA TGACAATGGA TGGTGGATGG
TCGTAAAGAA CGGGCAAGAA GGTTGGGCCC CGTCAAATTA CCTGAAAAAG GTGGAACAGG
CACCGCCACC CCCTCCTCCC CCACCCCCCC CGTCCCGTCC CGTGGCAGCT CGACCCCCTG
CAGCCTCTTC TGCTCCCACT GCTCCCGCCG TTACCAACGG CTCCGCGGTC CCCTCTTGGA
AAGCGAAGAA CGCTGCATCT GCCACACCTT CCGCAGACTC TACGCCCCCG ACTTCCCGTC
CAGCCTCTTC CGCTTCCAAG GTCCCACCTG CGATTAAGGC GAAACCTTCC ATCCCTGCCA
AACCTGCTAT TCCTGCCAAA CCCCAAGTAG GCGCAAAACC TGCTCCTGCG ATTGGTGGCA
AACCACCTGT GCCTACAGCA CCAAAAGTAC AACCCAAGGC AGCTAGCAAA CTAGGACAAG
TGGCGCAACC TGCGAAAGCT CCTGGACAAT TGGATTTAGC TGCTGCATTT GCGAAGAGAG
CAGCTAGGGC CCAGCAGGAG GAAGACTAGA CCTTGGCTAT CGAAAAATTA GGATGTAACG
AGTTAGAGGT GTAGGATTTG TATGGTGTTT TGGGAAACAA TTATATGTTA CAATACGTA
 
Protein sequence
MAPSKKAGKK GAVGGFLSGA SKPQKVQKAD WSEGFTKKKA AGVPDMTLLS TITNEAINDN 
LKVRFQNQEI YTYIAHVLIS VNPFRDLGIY TNDVLNSYRG KNRLEMSPHV FAIAESAYYR
MTTEKENQCV IISGESGAGK TEAAKRIMQY IAAVSGGAEG GGIEGIKEMV LATNPLLESF
GCAKTLRNDN SSRHGKYLEI MFNGMGQPVG AQITNYLLEK NRVVGQIDDE RDFHIFYQFT
KGASAKMKEA FGLQGPEAYA YISRSGCLDV KSINDVSDFQ ETLRAMQVIG LTSDEQDSIF
RILATILWLG NIDFVEGDDG NAAISDSGVA DFAAYLLEVD SAQLQKVLLM RIMETQRGGR
RGSVYEVPQN VAQASSGRDA LAKALYNNLF EWIVSRVNIS MKPQTPSQYV IGVLDIYGFE
IFQDNSFEQL SINYVNEKLQ QIFIELTLKA EQEEYVREQI KWTPIKFFDN SVVCSLIEDR
RPAGIFATLN DATATAHADP SAADNSFIQR SSMLASNPNF EARGNKFLIK HYAGDVLYTV
AGMTDKNKDT LIKDILDLIE GSKDPFLHTL FPDKVDHTSK KRPPTAGDKI KLSANLLVEN
LMNCQPHYIR TIKPNQHRSP TEYDDKAILH QIKYLGLQEN IRVRRAGFAY RAEFSKMIQR
FYLLSPATSY AGDYIWTGDD RSGCERILTD AKIKKEEWQM GVTKAFIKNP ETLFYLEGER
DRYWHTMASR IQRAWRAYVR RKHEAATKIQ RFWRNQREAL VYERKRDYGH QVLAGKKERR
RFSLLGMRKF MGDYLDIAGG SAQGEMLRNA ATISPAEQVH FSSRAELLVS KLGRSSKLSP
RFLIITDKAV YFVVSQARDG RVSTSLERKI PLVTIKAISM TNLRDDFVAL NVNACEEGDP
IFTCVFKTEM ITVILTLTGG NMSVNIGPTI DYAKKKDKRA VIKTQKNEAV RGDATYKSHT
IQVGSGEPPN SLSNPMPPRK PKVKKAAKTA SSSRPVNSGR PAAVALPGAT KPAAPPALSS
MPSHTPVVTK PTAIPTAAIG AARAPPSIPG RAAAPPPPPP PPPPAGPPKE FYKALYNFTG
QEGEMNLVKG EEVEVKEKDD NGWWMVVKNG QEGWAPSNYL KKVEQAPPPP PPPPPPSRPV
AARPPAASSA PTAPAVTNGS AVPSWKAKNA ASATPSADST PPTSRPASSA SKVPPAIKAK
PSIPAKPAIP AKPQVGAKPA PAIGGKPPVP TAPKVQPKAA SKLGQVAQPA KAPGQLDLAA
AFAKRAARAQ QEED