Gene Information |
Locus tag | CNM00140 |
Symbol | |
ID | 3255100 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006682 |
Strand | + |
Start bp | 30656 |
End bp | 33816 |
Gene Length | 3161 bp |
Protein Length | 952 aa |
Translation table | |
GC content | 55% |
IMG OID | 638254174 |
Product | hypothetical protein |
Protein accession | XP_568379 |
Protein GI | 58261938 |
COG category | |
COG ID | |
TIGRFAM ID | |
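The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the protein length excludes the stop codon. The helper names `span_length` and `coding_length` are illustrative and not part of the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_len = span_length(30656, 33816)   # Start bp / End bp from the record
cds_len = coding_length(952)           # Protein Length from the record

print(gene_len)            # 3161
print(cds_len)             # 2859
print(gene_len - cds_len)  # 302
```

The 302 bp difference is the non-coding part of the gene span (introns plus any untranslated sequence), which agrees with the record marking the gene as spliced.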
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00124325 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGTTC CTCTGCACCG CATACGCTTC TACGACCACA CGCCCTCCCC CATCACCGCC ATACAATACA CCCCCCTCCC TCTTCCCGCC CCTTCTTCGA CCCCCTCTCA AACCCCACCA GCCCACCCAG GAGACTGTAT CATCGCAAGA GAAAACGGCC ATGTCGAAAT ATGGAAACAT GTCTCGGACA AGCAAGTAGA CTCGTATGGG AACTGGGTCC TCTACAAGGT AAGTCTTATC CATCCGAAAG ACTAGCTAAC CCATGCACAG ACACTCCCAC CAACCCTTAC CCATCCTACC ATCTCGCAAC TTGCTCTAGT TATCCGTGAC CCTCTCAACT GCTCCACACC TGCCTTGAAC GATCTTCGAC TGTTCACGTC TAGTTCAGAT TCTGGAGATC TCGTCGAAAG ATGCTTATTC ACCGGCAAGA TTCTCCAAAC CTACTCCATC CCCTCGGCGC CTATATGGTC TCTTGCAGTA GCCCCTACAC ACGACCTCCT CTGTCTTTCC ACCACCTCTC CCAACCTCCA TTTCCTCTCT ATTCCACCTC CGACTATGTT TGACCCCTCG CCCTCCCTTG AACCGCCTCC TCCCCATCTC CTTCGAACAG ACGCTCTCCC CTCTCGAACC CGAACAACCT CTATCGCTTT CGGTCCTCCT ACGTTGACCC AGCTTCCGGA CGGTACAGCC GAATGGCGCA ATACGACATT GGTGACTGGA AACTCGGATT CATCATGGAG GAAATGGGAG ATCCCCGCGC CTGCAGACGG GTCCAGGCAG GGTCCCAATC GAGTGGTCTT GAAGGGACGA GCTGTGGTGG AGAAAGTGCA GAAAGCTGGA AGAGGTGGAC GTAAGGCTAC GGGTGCTGCT GGCGGACAAA AGCAGACCAT TGTCTGGTCC ATTGGTATCC TGCCGTAAGT TCTTGCTTTT TTTTTTCTTC TTTTTCCACG TAACACACTA ACACCCCACG TAGAGATGGA ACCGTCGCCA CAACCGACTC TCTCGGCTCT CTTATATTCT GGGACCCCCT CTCCCTCGCT CAGCGTCAAC ATTTCCGCGC TCACAAAGCC GATGCCATGT GTCTCGCCAT CGGTCCCGGT GGTTCAACCG TCTTCACATC CGGCCCCGAC CAGCGTGTTT GCCAATTCGT TCGGGCTCGA GCTCCTGGAG GAGAATGGGT CCTGGTCAGC GCCAAGAGGT TACACGCCCA CGACGTGAGG GCTCTTGCCG TTTGGCCGCC GTACGTCCCT GTCCCTATCA CTACAGACAC CAAACATGGC ACTAGCACTG TCGGAGCTGG GCTCGCCCCC GTGCTCGCCT CGGGCGGTTG GGACATGTCC CCCACCTTCA CCCCCGCCTC CCACCCTTCC TCACCTCCCC TCCCTTCCCC CTTGGCTCGC CCCTCCCAAT CCCAAACCCT CCCCACATTT GAATCCACCC ATCCTCGACG GATGGGCTAC CTCTCCTCCG GCCTTCTCTC CTCCTCTTCC CCCATCACAT TCTCCCCCTC TGCTCGCCTG GTCGTCGGGA AACGCGCCCG AGGCGTGGGC ATCTGGAGAG TCCACCCCAA CGAGAACGGT TGGGAAAAAC TGCTCGAAAT GGAACTTCGC CTCCGCACCA CTATTATCGC TACCACCATC TCGGAACACG GGAAATACCT CGCCGTCTCC GATCTGTACG AGACGAAACT CTTCAAACTC GTCCCAACCT CTTCCGGCCT GAAACCCACC CGTCTCCCTC TCCTTCCTGC CCTCCTCTCT TCCCCGCTTC TTCACCATCT AAACACCCAA TTAACGACTC 
AAGGCTGTGG CTCAACGAGT ATGGTGTTCA CCCCCGATGG AGGCAGGGTG GTGTTGGGTC TCGTAACCGG ACAAGTGCTG ATCATCGAGC TAGCCGAGGA TGAGGAAAAT GTAGAGGTGG AAGTTGTCAA GTGTTTTGAG CGGAAAGAGA GGGTGGTGAG GGGTAGAGTG ATCAAGGGCA AGAATGTCAA CGGCACTGGC GTGAACGGCA ACAATACCGC TCCTGATATC GACGTCTCTA TGACAGAAAA AGCCGAGGAG CAAGAATCCA ACTCTGGCTC TGAATCCGAA TCCGAATCTG AATCCGATTC ACCGTCAACT TTCCACTCCA ATGGCGCTCA CAAGCAAACT CAAAACGAGT GGATCTCGAC TCTCGCAGTG AGCGAAGACG GTCAGTGGCT TGGTGTGGCG GATCTAGAAG GCCGTGTCGA AGTTTTCAAC CTCGACAGTT TGCAGGTAAG CCCTTTTTTC CCCCTCTCTC AAGGTCCCAG ACTAATGCCA TATCCACAAA CAGCTTCACT CCACTCTCCC CACCCTCCCT CACCCACCTA CAACCCTTTC CTTCCCCTCC CTCCCCTCTC CTTCCCCATA CCTCGCCATC CTCTCCCCTA CCAACACTCT TTCACTATAC AACCTCGACC AAAGACGGTT CATTCCGCTC CCAAGTCTGG GGAAAGGCGA ATTGGAAAAG TTTGGGAATG TTTTAAGCAA GATGCAGACG CCGGTTATGG GTATGGTTTG GAGGCCGTCA AGGTTCTCCC TCCCTGTTGG GCCCCGAGGA GATGGAGAAG GAAAAGCATT GCTTTGGGGA ACAGATTACC TCGTGACCCT GCGCGTGAGC AGGGACATGC TCCACCCCAC TCCGAACGCC CACGTCAATG GCGAGGTGGC CTCCATGCCC AACCACTCGG CTTCTACCAC CACCATCTCA TCGATTAATG GAAAAGGAGG AGAATCAAAG AGTTCAAGGA AGAAACGAGC GCGCGAAGCG CGCCAGGCGA AAGCCAGCCA TGGGCAAGGG GAAGAAGGAG GAGCGGGAGT GGGACGGGAC CTTGAGGAGA AGAAGGAAGA ATATTACAAG ATTATCGGCG ATCGATTCAA ATCCATCTTG TCTGTCGGGT GGCTCGTTGG TTCACACCAG TCCCAAGGTG AAGAAAGGGG AGAGGAGGGG GAGCTGGAAG TAGGTGTGGT AGAAAGGCCT TGGGGCGATT TCGTGGCAGA GTTGCCGGGT GTGTTCTGGA GTGGGTCATA TGGTTCGAGC TAAGAGCCCA ACGGGTCGTC CCAAGAAAAG GGTTTATGGA GAGAGAAAAC GGGGAGAGGA AGGGGAAGAT ATTATAGGTT ATAGATCCAT GTATAAGGGC T
|
Protein sequence | MSVPLHRIRF YDHTPSPITA IQYTPLPLPA PSSTPSQTPP AHPGDCIIAR ENGHVEIWKH VSDKQVDSYG NWVLYKTLPP TLTHPTISQL ALVIRDPLNC STPALNDLRL FTSSSDSGDL VERCLFTGKI LQTYSIPSAP IWSLAVAPTH DLLCLSTTSP NLHFLSIPPP TMFDPSPSLE PPPPHLLRTD ALPSRTRTTS IAFGPPTLTQ LPDGTAEWRN TTLVTGNSDS SWRKWEIPAP ADGSRQGPNR VVLKGRAVVE KVQKAGRGGR KATGAAGGQK QTIVWSIGIL PDGTVATTDS LGSLIFWDPL SLAQRQHFRA HKADAMCLAI GPGGSTVFTS GPDQRVCQFV RARAPGGEWV LVSAKRLHAH DVRALAVWPP TVGAGLAPVL ASGGWDMSPT FTPASHPSSP PLPSPLARPS QSQTLPTFES THPRRMGYLS SGLLSSSSPI TFSPSARLVV GKRARGVGIW RVHPNENGWE KLLEMELRLR TTIIATTISE HGKYLAVSDL YETKLFKLVP TSSGLKPTRL PLLPALLSSP LLHHLNTQLT TQGCGSTSMV FTPDGGRVVL GLVTGQVLII ELAEDEENVE VEVVKCFERK ERVVRGRVIK GKNVNGTGVN GNNTAPDIDV SMTEKAEEQE SNSGSESESE SESDSPSTFH SNGAHKQTQN EWISTLAVSE DGQWLGVADL EGRVEVFNLD SLQLHSTLPT LPHPPTTLSF PSLPSPSPYL AILSPTNTLS LYNLDQRRFI PLPSLGKGEL EKFGNVLSKM QTPVMGMVWR PSRFSLPVGP RGDGEGKALL WGTDYLVTLR VSRDMLHPTP NAHVNGEVAS MPNHSASTTT ISSINGKGGE SKSSRKKRAR EARQAKASHG QGEEGGAGVG RDLEEKKEEY YKIIGDRFKS ILSVGWLVGS HQSQGEERGE EGELEVGVVE RPWGDFVAEL PGVFWSGSYG SS
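The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how a GC content figure such as the one in the Gene Information table could be recomputed from the gene sequence. The function names are hypothetical, and the database's exact method and rounding are not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same block layout; paste the full gene sequence here instead.
toy = "ATGC GGCC"
print(len(clean_sequence(toy)))  # 8
print(gc_percent(toy))           # 75.0
```

Applying `len(clean_sequence(...))` to the full gene sequence also gives a quick check against the Gene Length field.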