Gene Information |
Locus tag | CNG04120 |
Symbol | |
ID | 3258970 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 1159382 |
End bp | 1162765 |
Gene Length | 3384 bp |
Protein Length | 885 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258035 |
Product | hypothetical protein |
Protein accession | XP_572163 |
Protein GI | 58270014 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
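The reported gene length can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the 3384 bp value listed above. The split into coding and non-coding bases additionally assumes the coding sequence is the 885 codons of the protein plus one stop codon. The function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
start_bp, end_bp = 1159382, 1162765
protein_aa = 885

length = gene_length(start_bp, end_bp)

# Assumption: coding sequence = one codon per residue plus a stop codon.
cds_nt = (protein_aa + 1) * 3

# Bases not accounted for by the coding sequence (introns and/or
# untranslated regions; the record marks this gene as spliced).
non_coding_nt = length - cds_nt

print(length, cds_nt, non_coding_nt)
```

Under these assumptions the coordinates reproduce the listed gene length, and the bases left over after the coding sequence fit the record's "Is gene spliced | Yes" entry.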
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCGGTCTGC AACGTCTCAT CCTCATCTCC GTCTTCCTCA CAAACCTCCA CGGCTGCTCG GCGACCATGA CAGCCCAATC CTTCGACAAG ATCGTAAAGC TTGCGACAAA GCCGAAGAAC GCCCCTCCTA AAGCAAAGTA CATTGATTCA CTAATCGCTG CTACCTATGC GGACGACAGC TCGATCAACG AGATTGCGAT CGTACTCGCA CAAAGGTTGA GAGATACCAA CGGTGTGGTG AGCATTCAAT GGTGCTCTAG AGCAAAGAGT CACCGAGTCC TGACGGCATA CTAGGTCGTG TTCAAGGGCC TCCTTACGTT GCACCAGATG ATACGCACTG GACAGACTGA AGCTCTGCTC GATGTTCTCG CCAGAAACGA TGTCTTGAGA CTCCGGAATA TCTATAGCCA GCGGTTCCAA GGTACGCTTT ATCATCAGGT CTTCCCTATC GGGGCGATAT TGCCAGATGC TGATTGATTT CACTACAGGA TACGTCCCCC CTGCTAGCAT GGGTGCTTAT GCCGACTACC TCGACAACAG AATTAGGGTG TATAGGGATT TAAAGCGAGA CCTTATAAGA GTGCAGACAG AGTCTAACAG GAGGAGTGAT GGGCTGGGTG CTGCTTGTAA GTCGATTTAG GTCGATCTGG ATCTTGTCCA TATGGAGGTA GCCGGGGCGT CAATTGTCGT TTAGCTAACG GGCGTATTAG CAAAAGCAAG AAGATTAAGA CATCTTCCTG TAGAGAAGGG TTTGCTGAGA GAAGTCAAGG TGGTGCAAAG GTTATTGGAC AGTCTCATCA AGTGCAAGGT ACGCTCCATG AATTGTAATG AGATTGGGCT GATGGAAATC CAGTTTTATG ACGATGATTT GAGAGACGAA AACACAGTTT TGGCATTGAG ACTTCTTGTA AAAGACTTGC TTGTCTTGTT CCAGGCCGGT AATGAAGGTG TCTGCAACAT CCTAGGTGTG TTTTGTACTC ACACGCACGT CCCTGGCTGA CATACCGTTG CCCATCTAAC AGAGCACTAT TTCGAAATGT CCAAGGTCGA CGCTACCGAC TCGTTTGAAA TCTATAAATC TTTCATCAAG CAAACCGACA AAGTCGTTGA CTACCTCTCC TTTGCCCGAA AACTTCACCA TGTCTTAAAT GTCCCTGTGC CGAATCTCAA GCATGCCCCA ACAGGGCTCG TCAAGGCGTT GGAAGAGTAC CTCAACGATC CCAACTTTGA GCAAAATAGA ATGGATTACA AGAGGAGTCT GGGAGTCGTC GAGGGAGGTA GTAGGCGTCC GAGTGATACA GAGCCCACAA GGAAAGCTTC GCCCGATAAG AGCACATCGA CATCGACCAA GGCTGCATCT CCTGCGCCGG AAGTTAAGCC TCAAGCTCCC GCGGGAGCTT CGAAAAAGAT CCAAGATTTC TTCGAATCTA TTCAAGCGGA CCAGCAACCT ACCATGTTTG GCGGTGCTCC TCAACAGTAT GTTTCCGAGC CCGTGTAATC ATTTTGATTT TAGACTAAGC ATTATATATA GGATCAACTA CGCTCAGATG ACTGTCAACC AGCATCAGCA GTTCAATCCC TTCCGCCAAT CTATGATGAT GCCCCAGCAA ACTGGATTCA TGCAGCCTCA GATGACCGGC TTTTCTCATC CACAACAACA AGGTTTCCTC CAGCCTCAAC AAACAGGTGC CATGGCGTTT GGAAGACAGT CCATGATGCC TATGTCTACA GGACAACCAG GTGCAGGAGG AGAATTTGGT TTCATTCAGC CACCCCATGC GCAAGCTCAA CAGCCGCAAA 
TGCAGATGCA GATGCAGCCT CAGCAAACTG GATTCCTACA GCCTCAGGCT ACTGGATTTA ACCCTTTCAG GCAGAGTATG ATGCTTACTG GCAATGGTAT GGGTATGGGC GGTTTGAGCG GGCCCATGTC CCAACCTTCT TCTCCTTCGC CTTTTGCCCA ACCATCTCAT CAGACACAAG GACAAGGCCA AATTCAACGT CCAGGGTCGA CGCCTGCATT CTCTACTCCT CCTTCCAACG GCACAGCTGC CAGTTCCAGC TCCGAGGCTA AACCTTTGAC GGCCCAGACA ACAGGCTCGA AGAACCCCTT CGCCCCTGCA GGCGGCGCTG TTCCTCCTGT ACCTACTCTT CGATCCCAGC ATCAGCCGCC GCAGAAGAAG CCGACAATGA ATGAGATGAT GATGGGTCTT CATACCGGTA ATAGTGACGG AGCATGGGGT CAGCCTCAAG CGCAGCAACA GCAGACGCAA CCAGCGGATC AAGCGGGCCA GCAAGGGAGT GCTCAGGGTA CAGGGATGTC GAGCATTGCG AGCGAGTTTG CATCAAACAA GAACCAGACG AATGGTTCTG CCAATGCGAA CACTGGCGGT GGTGGGACGG ACTTCTTGTC TCAATTCGGA TCTTTGTCTG TTAATCGTCC TGGTGCTTCC TCTCCATCTA CACAAACAGC TTCCTCTTCG AACCCGCTGT CGTTCTTGTC TACAAACCCT ACAGGTAGTA CCAGCGCGAC TTCAGGGCTC ACTTCACAGA CCACAGGCGC AAACACCAAC AGCAGTGCGA ATGGTTTCAT TCAACCCCAA CCTACCGGGT ATGGCGGCTC TAACATTAAG CCATTCAAAC CTTCGAGCAG CTTTGGTAAT CAGTTGATGG AGAACTTACC ACCCCTACCT GAATCTGGTG CTGGATCAAA CCCCGGTTCT GCTGTAGCCT CACCGAGTGG CGCTCATGGA GTTGGAGCCG TTCAGCCGCA GAGTACAGGT TTCCCCGGCT TGGGATCCTT GTCATTCCAA AACACTGGTA ACCCTGCGGG ATCTTCAGCG GGAACTGGCA GCGGTCTGGT GCCCCAAATG ACAGGAGCTC CCAACCCGTT TAGGCAATCT ACCATGCTTG GAGGATCGTC ATCTAATGCT GGAAGGTTGA ACCCGCAGAT GACAGGGATG GGCGCCTTTA GCGGGTTATC GGCGTTTGGT GGACAAAACC ATGGACAGGG ACAAGGAATA TTTGGACAAC AGCAGCAGCA ACAGCAGCAG CCGTTCCAAC AGCAAGCCCA GCAGGGATCG TTGATCTGAT CTGATGTGAT GTCTGGATAA CTGGCGAAAG CGTCAGACGT CAATGAGAAT GGTGTGGGAG GGAGACGTTT GTAAAAGCTT GGTCTCTTCA AAAGTGTTGA AATATGATAC ATAGCATTGA GCGGTTTTTG TTTTGTAAGG GATCTTTTTT CTATATACCC CCAATTTTAC CCCTTAGTAT TTCGTCGTCC GCGATACATC AGATCTACTG TACGCCATAC GATTCTCGGT AATAGGTTCA TAGGGACTCG ATTTCAAAGG CTTTCCAGTT CTAATTTGAT TAGAATATGA TATG
|
Protein sequence | MTAQSFDKIV KLATKPKNAP PKAKYIDSLI AATYADDSSI NEIAIVLAQR LRDTNGVVVF KGLLTLHQMI RTGQTEALLD VLARNDVLRL RNIYSQRFQG YVPPASMGAY ADYLDNRIRV YRDLKRDLIR VQTESNRRSD GLGAASKARR LRHLPVEKGL LREVKVVQRL LDSLIKCKFY DDDLRDENTV LALRLLVKDL LVLFQAGNEG VCNILEHYFE MSKVDATDSF EIYKSFIKQT DKVVDYLSFA RKLHHVLNVP VPNLKHAPTG LVKALEEYLN DPNFEQNRMD YKRSLGVVEG GSRRPSDTEP TRKASPDKST STSTKAASPA PEVKPQAPAG ASKKIQDFFE SIQADQQPTM FGGAPQQINY AQMTVNQHQQ FNPFRQSMMM PQQTGFMQPQ MTGFSHPQQQ GFLQPQQTGA MAFGRQSMMP MSTGQPGAGG EFGFIQPPHA QAQQPQMQMQ MQPQQTGFLQ PQATGFNPFR QSMMLTGNGM GMGGLSGPMS QPSSPSPFAQ PSHQTQGQGQ IQRPGSTPAF STPPSNGTAA SSSSEAKPLT AQTTGSKNPF APAGGAVPPV PTLRSQHQPP QKKPTMNEMM MGLHTGNSDG AWGQPQAQQQ QTQPADQAGQ QGSAQGTGMS SIASEFASNK NQTNGSANAN TGGGGTDFLS QFGSLSVNRP GASSPSTQTA SSSNPLSFLS TNPTGSTSAT SGLTSQTTGA NTNSSANGFI QPQPTGYGGS NIKPFKPSSS FGNQLMENLP PLPESGAGSN PGSAVASPSG AHGVGAVQPQ STGFPGLGSL SFQNTGNPAG SSAGTGSGLV PQMTGAPNPF RQSTMLGGSS SNAGRLNPQM TGMGAFSGLS AFGGQNHGQG QGIFGQQQQQ QQQPFQQQAQ QGSLI