Gene CNM00140 details


Gene Information

Locus tag: CNM00140
Symbol:
ID: 3255100
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006682
Strand:
Start bp: 30656
End bp: 33816
Gene Length: 3161 bp
Protein Length: 952 aa
Translation table:
GC content: 55%
IMG OID: 638254174
Product: hypothetical protein
Protein accession: XP_568379
Protein GI: 58261938
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00124325
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGTTC CTCTGCACCG CATACGCTTC TACGACCACA CGCCCTCCCC CATCACCGCC 
ATACAATACA CCCCCCTCCC TCTTCCCGCC CCTTCTTCGA CCCCCTCTCA AACCCCACCA
GCCCACCCAG GAGACTGTAT CATCGCAAGA GAAAACGGCC ATGTCGAAAT ATGGAAACAT
GTCTCGGACA AGCAAGTAGA CTCGTATGGG AACTGGGTCC TCTACAAGGT AAGTCTTATC
CATCCGAAAG ACTAGCTAAC CCATGCACAG ACACTCCCAC CAACCCTTAC CCATCCTACC
ATCTCGCAAC TTGCTCTAGT TATCCGTGAC CCTCTCAACT GCTCCACACC TGCCTTGAAC
GATCTTCGAC TGTTCACGTC TAGTTCAGAT TCTGGAGATC TCGTCGAAAG ATGCTTATTC
ACCGGCAAGA TTCTCCAAAC CTACTCCATC CCCTCGGCGC CTATATGGTC TCTTGCAGTA
GCCCCTACAC ACGACCTCCT CTGTCTTTCC ACCACCTCTC CCAACCTCCA TTTCCTCTCT
ATTCCACCTC CGACTATGTT TGACCCCTCG CCCTCCCTTG AACCGCCTCC TCCCCATCTC
CTTCGAACAG ACGCTCTCCC CTCTCGAACC CGAACAACCT CTATCGCTTT CGGTCCTCCT
ACGTTGACCC AGCTTCCGGA CGGTACAGCC GAATGGCGCA ATACGACATT GGTGACTGGA
AACTCGGATT CATCATGGAG GAAATGGGAG ATCCCCGCGC CTGCAGACGG GTCCAGGCAG
GGTCCCAATC GAGTGGTCTT GAAGGGACGA GCTGTGGTGG AGAAAGTGCA GAAAGCTGGA
AGAGGTGGAC GTAAGGCTAC GGGTGCTGCT GGCGGACAAA AGCAGACCAT TGTCTGGTCC
ATTGGTATCC TGCCGTAAGT TCTTGCTTTT TTTTTTCTTC TTTTTCCACG TAACACACTA
ACACCCCACG TAGAGATGGA ACCGTCGCCA CAACCGACTC TCTCGGCTCT CTTATATTCT
GGGACCCCCT CTCCCTCGCT CAGCGTCAAC ATTTCCGCGC TCACAAAGCC GATGCCATGT
GTCTCGCCAT CGGTCCCGGT GGTTCAACCG TCTTCACATC CGGCCCCGAC CAGCGTGTTT
GCCAATTCGT TCGGGCTCGA GCTCCTGGAG GAGAATGGGT CCTGGTCAGC GCCAAGAGGT
TACACGCCCA CGACGTGAGG GCTCTTGCCG TTTGGCCGCC GTACGTCCCT GTCCCTATCA
CTACAGACAC CAAACATGGC ACTAGCACTG TCGGAGCTGG GCTCGCCCCC GTGCTCGCCT
CGGGCGGTTG GGACATGTCC CCCACCTTCA CCCCCGCCTC CCACCCTTCC TCACCTCCCC
TCCCTTCCCC CTTGGCTCGC CCCTCCCAAT CCCAAACCCT CCCCACATTT GAATCCACCC
ATCCTCGACG GATGGGCTAC CTCTCCTCCG GCCTTCTCTC CTCCTCTTCC CCCATCACAT
TCTCCCCCTC TGCTCGCCTG GTCGTCGGGA AACGCGCCCG AGGCGTGGGC ATCTGGAGAG
TCCACCCCAA CGAGAACGGT TGGGAAAAAC TGCTCGAAAT GGAACTTCGC CTCCGCACCA
CTATTATCGC TACCACCATC TCGGAACACG GGAAATACCT CGCCGTCTCC GATCTGTACG
AGACGAAACT CTTCAAACTC GTCCCAACCT CTTCCGGCCT GAAACCCACC CGTCTCCCTC
TCCTTCCTGC CCTCCTCTCT TCCCCGCTTC TTCACCATCT AAACACCCAA TTAACGACTC
AAGGCTGTGG CTCAACGAGT ATGGTGTTCA CCCCCGATGG AGGCAGGGTG GTGTTGGGTC
TCGTAACCGG ACAAGTGCTG ATCATCGAGC TAGCCGAGGA TGAGGAAAAT GTAGAGGTGG
AAGTTGTCAA GTGTTTTGAG CGGAAAGAGA GGGTGGTGAG GGGTAGAGTG ATCAAGGGCA
AGAATGTCAA CGGCACTGGC GTGAACGGCA ACAATACCGC TCCTGATATC GACGTCTCTA
TGACAGAAAA AGCCGAGGAG CAAGAATCCA ACTCTGGCTC TGAATCCGAA TCCGAATCTG
AATCCGATTC ACCGTCAACT TTCCACTCCA ATGGCGCTCA CAAGCAAACT CAAAACGAGT
GGATCTCGAC TCTCGCAGTG AGCGAAGACG GTCAGTGGCT TGGTGTGGCG GATCTAGAAG
GCCGTGTCGA AGTTTTCAAC CTCGACAGTT TGCAGGTAAG CCCTTTTTTC CCCCTCTCTC
AAGGTCCCAG ACTAATGCCA TATCCACAAA CAGCTTCACT CCACTCTCCC CACCCTCCCT
CACCCACCTA CAACCCTTTC CTTCCCCTCC CTCCCCTCTC CTTCCCCATA CCTCGCCATC
CTCTCCCCTA CCAACACTCT TTCACTATAC AACCTCGACC AAAGACGGTT CATTCCGCTC
CCAAGTCTGG GGAAAGGCGA ATTGGAAAAG TTTGGGAATG TTTTAAGCAA GATGCAGACG
CCGGTTATGG GTATGGTTTG GAGGCCGTCA AGGTTCTCCC TCCCTGTTGG GCCCCGAGGA
GATGGAGAAG GAAAAGCATT GCTTTGGGGA ACAGATTACC TCGTGACCCT GCGCGTGAGC
AGGGACATGC TCCACCCCAC TCCGAACGCC CACGTCAATG GCGAGGTGGC CTCCATGCCC
AACCACTCGG CTTCTACCAC CACCATCTCA TCGATTAATG GAAAAGGAGG AGAATCAAAG
AGTTCAAGGA AGAAACGAGC GCGCGAAGCG CGCCAGGCGA AAGCCAGCCA TGGGCAAGGG
GAAGAAGGAG GAGCGGGAGT GGGACGGGAC CTTGAGGAGA AGAAGGAAGA ATATTACAAG
ATTATCGGCG ATCGATTCAA ATCCATCTTG TCTGTCGGGT GGCTCGTTGG TTCACACCAG
TCCCAAGGTG AAGAAAGGGG AGAGGAGGGG GAGCTGGAAG TAGGTGTGGT AGAAAGGCCT
TGGGGCGATT TCGTGGCAGA GTTGCCGGGT GTGTTCTGGA GTGGGTCATA TGGTTCGAGC
TAAGAGCCCA ACGGGTCGTC CCAAGAAAAG GGTTTATGGA GAGAGAAAAC GGGGAGAGGA
AGGGGAAGAT ATTATAGGTT ATAGATCCAT GTATAAGGGC T
 
Protein sequence
MSVPLHRIRF YDHTPSPITA IQYTPLPLPA PSSTPSQTPP AHPGDCIIAR ENGHVEIWKH 
VSDKQVDSYG NWVLYKTLPP TLTHPTISQL ALVIRDPLNC STPALNDLRL FTSSSDSGDL
VERCLFTGKI LQTYSIPSAP IWSLAVAPTH DLLCLSTTSP NLHFLSIPPP TMFDPSPSLE
PPPPHLLRTD ALPSRTRTTS IAFGPPTLTQ LPDGTAEWRN TTLVTGNSDS SWRKWEIPAP
ADGSRQGPNR VVLKGRAVVE KVQKAGRGGR KATGAAGGQK QTIVWSIGIL PDGTVATTDS
LGSLIFWDPL SLAQRQHFRA HKADAMCLAI GPGGSTVFTS GPDQRVCQFV RARAPGGEWV
LVSAKRLHAH DVRALAVWPP TVGAGLAPVL ASGGWDMSPT FTPASHPSSP PLPSPLARPS
QSQTLPTFES THPRRMGYLS SGLLSSSSPI TFSPSARLVV GKRARGVGIW RVHPNENGWE
KLLEMELRLR TTIIATTISE HGKYLAVSDL YETKLFKLVP TSSGLKPTRL PLLPALLSSP
LLHHLNTQLT TQGCGSTSMV FTPDGGRVVL GLVTGQVLII ELAEDEENVE VEVVKCFERK
ERVVRGRVIK GKNVNGTGVN GNNTAPDIDV SMTEKAEEQE SNSGSESESE SESDSPSTFH
SNGAHKQTQN EWISTLAVSE DGQWLGVADL EGRVEVFNLD SLQLHSTLPT LPHPPTTLSF
PSLPSPSPYL AILSPTNTLS LYNLDQRRFI PLPSLGKGEL EKFGNVLSKM QTPVMGMVWR
PSRFSLPVGP RGDGEGKALL WGTDYLVTLR VSRDMLHPTP NAHVNGEVAS MPNHSASTTT
ISSINGKGGE SKSSRKKRAR EARQAKASHG QGEEGGAGVG RDLEEKKEEY YKIIGDRFKS
ILSVGWLVGS HQSQGEERGE EGELEVGVVE RPWGDFVAEL PGVFWSGSYG SS
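
The derived fields in this record (gene length, GC content) can be cross-checked against the coordinates and the printed sequences. The following is a minimal Python sketch and not part of the source database. The helper names (clean_sequence, gc_fraction, span_length) are hypothetical, and the sketch assumes that the coordinates are 1-based and inclusive and that sequences are printed in whitespace-separated groups as in this record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups over several lines."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# First printed line of the gene sequence in this record.
first_line = "ATGTCAGTTC CTCTGCACCG CATACGCTTC TACGACCACA CGCCCTCCCC CATCACCGCC "
dna = clean_sequence(first_line)

print(len(dna))                   # 60 bases on the first line
print(span_length(30656, 33816))  # 3161, matching the reported gene length
print(f"{gc_fraction(dna):.2%}")  # GC fraction of the first line only
```

Applied to the full gene sequence, gc_fraction should land near the reported 55% GC content, assuming that value was computed over the same span.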