Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB00580 |
Symbol | |
ID | 3255628 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 166202 |
End bp | 168525 |
Gene Length | 2324 bp |
Protein Length | 732 aa |
Translation table | |
GC content | 53% |
IMG OID | 638254711 |
Product | conserved hypothetical protein |
Protein accession | XP_569068 |
Protein GI | 58263316 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.328391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCACC ACCCCGCGGT AGCAGCACAG CCGGGCCGCA CTATTGCCCC TATCCCGCAC CACCGCCCAC AGCAACCCCG GATCACTCCT TACACACCAA ACGTACGCGA CCTCAACCCA GGACCTAAGA ACAGACTCAT CCTCGCCCTC CGCTCCAACA TCCCCTTTGA AGTCGACTGG GCGCTACCGC AGCTTGTTGT CGCAAGTTTC GACCAGTCGG ACGGGTTCAA GCTCGAGGCA TGGCCAGACA GCATTTGCGC GTTGAAGGAA TGGCCGGCCA AGTGGCTTGA AGGACTAGAA AGGGAAGCTG CAGTGTTTGA GATGAAAGCT GGGCGATTGG ATTTTGAGGG GGACGAGAAT GATGAAGAGG GGAGGATGGC AAAGCGCAGA AAAAGGGATC TGGCGCTGGG GGCGGTGGTA GAGTGGGAGA ACGATCTCAA GGTGGAACAA CGGGCGACCA ACTCTTTGCT CGTCCTCAGA AACGCATCCT TCAACGCACC CAACGCAAAG ATCCTCTCAA GCTCAAGCTT CCTCGCTTTT CTAGCCGATT TCTTCTCTTT GCCTCTACCG TTTCTCCAGC ATCTTTGCCT GAGAACCCCA GAGCCTATAC ATCATATCCT CATCATTGTC CAGTCCATCT TCCCCCATTT GCGCGTGGAC ATGCCAGGTA TCGACCGCAT CAAGCACATC TTTGGCGTCG TCTTCCCTCA GCTTTTTGTT GATACCCGCG ATATCGCAAT GATGAACAAC CTTATCCCTC TCATGATGAT GGGCCAGACA ATCCCCAATA ACCACCCTCC TCCGCCTGAA CTCATCCCTC ATCTTCTCCA GCTTCTCGTT CTCCGTCCAG CAGGCCCACT TCTCGATTTG ACTCTTGACA TCCTCATCTC CCTCTCCACA AATCCCATCC ACTCCCGTGC CATACTTTCT CATACTTCTT TCCCGCATCA TCTCAAATCC ATCACAGCCT TACTCGAACA TCAAGCTCGT CCGGTGGTGA ATGCCCTTGA CCCACCGCCT TCTACGAGAG GGAAAATGGT GCGTAACCCA GCGGGACCGA GTTGCAGAGC AGAGGAACTT AATCAAAGGC GGACGAAGGA ACGAGAGGCC GCATTGGGAC ATATGGATCC CATGGCTGGA GGTAGACCGG TGTACAATGA GGTAGGGGAT AAGCCACCGA CATTTAGTCC GGCGACGAAG AAGAGGCTTT TCAGGATGAA AGAACCCGAA AGGTCTATCG AGTGGTGAGT CATAAATCAT TAACCACTAC TTGAAGATGT ACAACAAGTT GACTTTGGTA TGTAGGATGC ACCAGGCATT CGTCTACTCA TCGACAGCCC AAGTCCTTCA AGTGACATTC TGGCACGCCT ACCGAGATTT CTTCACCAAC CCAGCTTGCG TAGAACCAAT GTTGAGTGCA TCTGATGTGA TCAAGAATGT CACTGCAGCT TTCCCTGGAG CGAGCGCAAA AGTTTGGACC GATGCGAGTG GTGCGCAAAA GTTTGTGATT GCTGGTGTCG GGTTCAGGAA GCGATCAGGT ACGTGCAGTC AAACTTTTTG GATTTATAGT CGAATCGCTA ATGAGAACCA ATAAAAACAC AGATGACGAT GAAAGGTTTA CATGTTACTG GCATGCATGC ACCCAACGGT ACTCAGCTAC CAACCCCGTC CAACTGCTCG AACACATTAG CAACTACCAT CTCCAAACCT TTTCTGCACC CCAATGCCAA TGGGGCTCAT GCGATCACAA CCTCTGCACG TACTCTCATC TCCTCACCCA TATCCCCCTC GGCCAGCCTC CATCCTCCAT CTCCGTCCCT GACGCCATCT CTTGCCATAT CGCAGACCAT AGTAGCTCCG TCTTGCAGCG CAAGATCACC AATCGTACCG TCCCTCCTTT ATCCAGCGTT CGTCTAGCCG TTCAGGGGGC ATTTACCCCT GTCGACGCTC GTCGACAACC TACTGGCGCC GCCCTTCTCG CGGCGTTACT TATCCGTAAC CTCGCCCGTA CCCTCCGTGC CGAGATCTCG CTCGCCGTGC CCGAATTGTC TCATGCTCAA ACGCAAGAAA CGGCAGATGA AGCTCAAGCG AGAAAAAAAC ACCTTCTCGA AGAGAGGTAT GGATTGCCAA TCCCGGATTC GGTGTTGAAA GAAGAAGAAG AGGAGCAGGC GAATGTGCAG CAAGGCCAAG ATTTAGATAT GAGTGAGGAA GAGAGGGAGA GGGCGAAAAA GGCGTTTGAG AATGTGGAGG AGAGGATTAT GAAGGTCATG TTGGAGAATG TTAGTGGGAT AACGCAGTAT CTTGGTGATG CGCTTGGGCT GTAG
|
Protein sequence | MQHHPAVAAQ PGRTIAPIPH HRPQQPRITP YTPNVRDLNP GPKNRLILAL RSNIPFEVDW ALPQLVVASF DQSDGFKLEA WPDSICALKE WPAKWLEGLE REAAVFEMKA GRLDFEGDEN DEEGRMAKRR KRDLALGAVV EWENDLKVEQ RATNSLLVLR NASFNAPNAK ILSSSSFLAF LADFFSLPLP FLQHLCLRTP EPIHHILIIV QSIFPHLRVD MPGIDRIKHI FGVVFPQLFV DTRDIAMMNN LIPLMMMGQT IPNNHPPPPE LIPHLLQLLV LRPAGPLLDL TLDILISLST NPIHSRAILS HTSFPHHLKS ITALLEHQAR PVVNALDPPP STRGKMVRNP AGPSCRAEEL NQRRTKEREA ALGHMDPMAG GRPVYNEVGD KPPTFSPATK KRLFRMKEPE RSIEWMHQAF VYSSTAQVLQ VTFWHAYRDF FTNPACVEPM LSASDVIKNV TAAFPGASAK VWTDASGAQK FVIAGVGFRK RSDDDERFTC YWHACTQRYS ATNPVQLLEH ISNYHLQTFS APQCQWGSCD HNLCTYSHLL THIPLGQPPS SISVPDAISC HIADHSSSVL QRKITNRTVP PLSSVRLAVQ GAFTPVDARR QPTGAALLAA LLIRNLARTL RAEISLAVPE LSHAQTQETA DEAQARKKHL LEERYGLPIP DSVLKEEEEE QANVQQGQDL DMSEEERERA KKAFENVEER IMKVMLENVS GITQYLGDAL GL
|
| |