Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG04690 |
Symbol | |
ID | 3258601 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 1339805 |
End bp | 1342075 |
Gene Length | 2271 bp |
Protein Length | 720 aa |
Translation table | |
GC content | 53% |
IMG OID | 638258093 |
Product | hypothetical protein |
Protein accession | XP_572148 |
Protein GI | 58269984 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTATTT GTAAGGGAAG GAGCGAGGAG GTGACCGGCG GTGATGTGAC TCGCCACCGG TCCAACACGT TGCGCGGAAC GAATTTACGA TCGATAACTC AACTTCTTGT TTTTCCTTCC CGCCTTCGTT TCCCAGTCCT CATCGTCTTC TCTGTCCATT CAAACCCCAG TATTACCATC TTTGCTCCAC CCTCCGCTTC CACAATGACC TTTCCGGACA CAAATGAAGG AGATTGGGCG TCGAGACGAG CTGCGGCACT AGCCCAACTT ACCTCAGACT CGCGCCTGAC TTCATTATCC CCCGCCTCTG CTTCCGACTC ACCTTTATCT TCGTTCCCCT CATCACATCA AACACTCCCC GAACATTCAA AATCCCCTCG TGACGATCCC ATTGCATACG GCGGCCACCC TAAAAACAAA GGCCGACCAT ATCCTCTCTC TCTCATCCCC TTTGACCTCA GCGAACCTTA CGAGGTTTCC GAATACCTTA CTATCCGCTC CTACGATGAG ACATGCCAAG GCGACTACCT GGGTCAAGAT TGGGCCGATA CGTATAAGAA GGGGAAGATC GTTGATGAGC AAATGTTCGA CGACGGGCAT GTTCATCGCT ATCTTGATCG CTTCATCCAA GAAAAGTACA AGGCGATGGA TACAAAGTTC TCACATCAAG CGAATTTGAA ATCGGTCTTC TACGACAGAG ATGACAAAGA GTGGGCGAAG CTAGATTTGG TGGATGTGTA CAGACGGCGG ACGGGGAAAG TTCTGGGAGT CGTGAGTCAT GGCTTGTGCA GAAGATGACA CAAGGCTCAT GATGCGTCCA ACATAGCCGT CTCAATCGCA AGACACCTCC AAATCAAAAA AGGATGGTTC CACTACTCCG GACGCCTCTC TCATCGGCAA CGTCAACGAT CCTAGCTCTC CCTCGGGCTG GAAAGAAATG CTTTACGCAG TCATCGAAGT GAAGTGGATG CGATTAGCGG CTCTTCTTAC CGGCGAGGCT AAAAGTCAAA CCGACGAGAA AGCTCTTAGT TACCTCTGCC AGGAAGGCGT GTTTCAAACT ATGTGGTATG TCATCTTGGG CTACGCCATT TCGCGCTGCA TCTTCGGTCT CTCCATAGTC AACGAATACT TCTATAGAAT TGTGTATCTC TATCAAGACT CGGCCTCAGA CAGTCCCGTG CTTGCCCTGG AGGCAGGCAA CGAGTTCTTG GAGAAAGCCG GGCGACATTT TGGATATCCG CAGGATGGTT ACTCAGTCGA AGAACTTGCA GAGCTGCAAG ACTTTTGGTC GTCGCCTCCC AATTCTCTGA TCAGCGACCG TGCCAACGCC ACTTTGAATA AAGAGGCAAG GTACCACCTC GATGCGACCA TTCTCTTGTT CCTTGCTCGT GCAGCGGCAC TTCCAACACA ACGCTTCTTC AACGATCTGC CCCTCTCTTT TGCTCATCGC GTTCCTGTTG ATGCGACCGC TCATACATCC ACCGACATGA GGTTGAAAGG ATTCGAGGCT GGGCGCAGGC GACACAGTCT TCGTTCGACC AAGAGGAACA AGCGCACATT GGCGGATTTG TATGATGAAG AGAAAGATGA AGAGGACAAG CCAGGGGGCG ACAAGCCACC TGGCAAGGAT AATGATGGCT CGCACGGCGG AAACTCTGGC TCTGGAGGCG ATAACTCACG TGGCGGAGGG TCTGGTTCTG GCGGAGGGTC TCGTCCTGGC GGAGGGTCGC GTCTTGGTGG AGGATCTCGT CCTGGCGGCA GGGGTGCAGG CGGAGGCTCC TCTTCTCGTC GTGCTGAGGC GTTTGACAGT CGAACCTCCA CTGCACCGCA AGAGTTCAGG AGGGGCCTGG AGAGGCTATC CGCTCCTAAG GAAATGTTCC ACATGAAGAC GTCCATCATG GCCTCCCTCC TCTCCAATAA CCGTATGCAC CTTTTAATCT TCCATGCTGA AGTTACGCTT ACGAATCTAC TGTAGGTGCC AGATGCTCTA GGGCTCCCCC CTCCGTCGAC AGTGACTCCT CTGGAGAGTT GGACTCGTCG TTTGACACGT CCTTCGGCTC CAATAGGGCG GCTCTTATCC TTGACGATCT CCGCGACGAT CCCCCCCCAA TAGTCAACAA GCCCGACCCT GTCGATATCG ACCTTGAAGA TATCGACCCA GAGTCGGGCG AGCTTACGTT GGCGGCCTTT AAGGACCACC TAACGATGCT CGGGGTGCGG GTGAAGCTGG TCACTCGGGA CCAGATGGGC GTCTTGTTGG CCCGGGGATG A
|
Protein sequence | MVICKGRSEE VTGGDVTRHR SNTLRGTNLR SITQLLVFPS RLRFPVLIVF SVHSNPSITI FAPPSASTMT FPDTNEGDWA SRRAAALAQL TSDSRLTSLS PASASDSPLS SFPSSHQTLP EHSKSPRDDP IAYGGHPKNK GRPYPLSLIP FDLSEPYEVS EYLTIRSYDE TCQGDYLGQD WADTYKKGKI VDEQMFDDGH VHRYLDRFIQ EKYKAMDTKF SHQANLKSVF YDRDDKEWAK LDLVDVYRRR TGKVLGVPSQ SQDTSKSKKD GSTTPDASLI GNVNDPSSPS GWKEMLYAVI EVKWMRLAAL LTGEAKSQTD EKALSYLCQE GVFQTMWYVI LGYAISRCIF GLSIVNEYFY RIVYLYQDSA SDSPVLALEA GNEFLEKAGR HFGYPQDGYS VEELAELQDF WSSPPNSLIS DRANATLNKE ARYHLDATIL LFLARAAALP TQRFFNDLPL SFAHRVPVDA TAHTSTDMRL KGFEAGRRRH SLRSTKRNKR TLADLYDEEK DEEDKPGGDK PPGKDNDGSH GGNSGSGGDN SRGGGSGSGG GSRPGGGSRL GGGSRPGGRG AGGGSSSRRA EAFDSRTSTA PQEFRRGLER LSAPKEMFHM KTSIMASLLS NNRARCSRAP PSVDSDSSGE LDSSFDTSFG SNRAALILDD LRDDPPPIVN KPDPVDIDLE DIDPESGELT LAAFKDHLTM LGVRVKLVTR DQMGVLLARG
|
| |