Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC07020 |
Symbol | |
ID | 3256232 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 2048845 |
End bp | 2051651 |
Gene Length | 2807 bp |
Protein Length | 697 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255921 |
Product | hypothetical protein |
Protein accession | XP_569850 |
Protein GI | 58265388 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.22515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTTCGCATC CCGCTTATTA GAAGCTTCTA CTACCTGTCT TCCTACATCT TCCTTCTTGC TTCCCACCAG CGTAACTGGC CAGTCGCCAC CGCTACCCGT TCTTCCTTCC TTCCCGCTAG TCGCGTCAAC TGTCGCGTAA GATTTCTTAC CTCCGCGAGC AGGGCGAATT TCCTACCCTC CTCTTACGAC AGAGATAATG GGCCGCACCA CCAGACGCTC AGCAAGGCCC CAATCGGGTA CATCTACTCC CCCCAACATC TCCTCTCCGC CTTCGCGCCC TTTGTTTCCC AGCATCTCCC TTCGCTCTTC ATCGCCTACT CCACCTACCC ACGAAAACCT CTCAGCCCGG TCCTCTCTCC TCAGCGTGCT CACACGACCA GCCCATTCAG TCGCAGCGAC GCCAGATTCG CCAGAGGAAG AGCTCTCGGA TTTGTCGACA GTAAGCAGCT ATGCCTCGAG TGTTATCTCT GGTACACATT CAAAGGACAT GACCGAAGAG GAGATTGCAA AGGTATATGA GAATTATTTG GAGAAGCCAT TCATCCAAAT AAAGACTACA ATAAAGCATA GCGAGTTTGG ACATTGTAAC AATCCCAATT GGCGGTGGAC CAGTCAGGTA AGTACCGCCG AGCGCGTAGC ACGACTTGTA ATGAAAGATT TTGGGTTAAC GATATCAGTG GAATCCCAAC GAAGTGATTC ATGCCGACGA TGCCAGACCA TCCTACACTG TGTTACTAAG TACATACTTG AGTTACATCC TGCTTATCAT TATCGGTCAC ATTCGAGACT TCTTCGGTAA AAAGTTTACA CCAGCCTCAT ACGCTCATTT GATGCCCCAG AACGTAAGTC CTTGTCTCGT TTACGTGGCT AACATGTACA GGGCTACGCC GCGCTCAACT CCGACTTCGA CTCTTTCTAC ACCCGTCGAT TAAAGAAACG TCTCGACGAC TGTTTCGCGC GACCTACCAC CGGTGTCCCC GGTCGAACCA TTGTCTGCTA CGATCGTTCC TCCACCGACC AGAACAACAC TTTTCAATTG ACTGGTACTA CAACGCGCGC TTTGAACGTT TCCTCTTATA ACTATCTCGG TTTCGCCTCT TCTACAGGTG GCTGCGCCGA TGCTGTCGAA ATGGCCATTA AGCGATACGG TGTTGCCAGT GCCGGTGCCA GACATGAAGC TTCTACTACC GACCTTCACT TACAATGTGA GAAACTCGTC GCCAAGTTTC TCGGTGTTGA AGCGTCCATG GTCGTTTCGA TGGGTTATGC CACCAACTCG ACTACGATCC CTGCGTTGGT TGGTAAAGGC TGTCTTGTGA TTTCCGACGA ATTCAACCAC GCTTCTATCC GTGCCGGTGT GAGAATGAGT GGGGCTTCGA TGAGATGGTA CAAGCACAAC AATATGGATG TGCTTGAGAA CTTGTTGAGG GAAGTCATTT CACAAGGCCA ACCCAGGACT CACAGGCCGT GGAAGAAGAT ATTGGTTATT GTCGAAGGAT TATTCTCAAT GGAGGGTAGT TTGGTTGATC TTCCCAGATT GATTGAGCTC AAGAAGCGTT ACAAGGTGAG TCATTACTGT TCGTATGAGA TCATTCACGC TGACATATGT CAGTTCTATT TGTATGTCGA TGAAGCTCAC TCTATCGGTG CGATGGGCCC CAACGGTCGA GGTGTTTGTG ACTATTTCGG TATCGACCCC CGTGAAGTCG ACGTCTTGAT GGGTACCGTC ACAAAATCGT TCGGTGCTGC CGGTGGTTAC ATCGCGGGTA GCAAGGAACT CGTCGACCGT CTCCGTGTCC GATCACACGC TACCGCGTAC GCTGAATCCG TATCTCCTGC CGTTCTCACC CAGATCATCG CTTCTATGGG TTCTATCATG GGCATTGCCC CTCCCCTGGC TGCCCCTCCC ACAGAGGACG ACAAGTCCGA GACATGGTCC ATCGCATCCC GTCCAGCAGT GTACGGCCCC GCCCCATCTT CCCTCCTCCC CCCTTGGCTC ACCCTCCCTC CTCACTTACT CAACGGAACC GAGGGCCGCG AACGTCTCCG CCGTATCGCT TTCAACTCCC GGTACCTCGC TTCTGGTCTT CGCAAACTGG GCTTCATCGT CTACGGTAAT CGGGATTCAC CCATCATTCC TCTTCTTATC TTCCAGCCCG GTAAGATGGG TTACTTTTCT CGTATGATGC TCGAGCGTAT CGGCCCCGAC AAGACACCCA TCGTCGTCGT GGTCGTGGCC TATCCCGCGA CCCCACTCAT CACTTCCAGA GTCAGATTCT GTCTCTCGGC GAGTCACACG AAAAATGATA TGGACATGGT CCTCAGGGCA TGTGATGAAG TTGGAGATGT GTTGAATTTG AAATATAATA AGCAGGAAAT GAGTGTGGAG GAAGTCATTG CCAATGCTGA GGAGCTGGTT GCTGCTTCTC ATGTTTAAAC ATGGGGTGAA GAATGAGTGT GTTCATTTTG CATATGGAAG GGAACGATGA AGGACGAACT AAATCATACA GCGTTTAATT ATTTTCACAC CTCTTTTTTC GTCTTTTTCG TTTAATCCTT CTCCATTATT TTATTTTACA ATTTCTCAAC AAGCAATTTG AATCGTACAT TTTTCCTAAT TTGTTTTCTA TCACCTTTTT TCTCACCTTC TTGCAAAGCA ACAAGCAGCT TAGATTGCTT CAATTGCGAA ACAAAGCTAG AATCTGTAGG AGCTAGAACC TATATAGCCT GGACTGGATT CTAGGGAGTA TTCGTCTTTC TTTTTGGTCT GGTTTCTTTT GCCTATTATA TGTAAATAGG ATAAAGG
|
Protein sequence | MGRTTRRSAR PQSGTSTPPN ISSPPSRPLF PSISLRSSSP TPPTHENLSA RSSLLSVLTR PAHSVAATPD SPEEELSDLS TVSSYASSVI SGTHSKDMTE EEIAKVYENY LEKPFIQIKT TIKHSEFGHC NNPNWRWTSQ WNPNEVIHAD DARPSYTVLL STYLSYILLI IIGHIRDFFG KKFTPASYAH LMPQNGYAAL NSDFDSFYTR RLKKRLDDCF ARPTTGVPGR TIVCYDRSST DQNNTFQLTG TTTRALNVSS YNYLGFASST GGCADAVEMA IKRYGVASAG ARHEASTTDL HLQCEKLVAK FLGVEASMVV SMGYATNSTT IPALVGKGCL VISDEFNHAS IRAGVRMSGA SMRWYKHNNM DVLENLLREV ISQGQPRTHR PWKKILVIVE GLFSMEGSLV DLPRLIELKK RYKFYLYVDE AHSIGAMGPN GRGVCDYFGI DPREVDVLMG TVTKSFGAAG GYIAGSKELV DRLRVRSHAT AYAESVSPAV LTQIIASMGS IMGIAPPLAA PPTEDDKSET WSIASRPAVY GPAPSSLLPP WLTLPPHLLN GTEGRERLRR IAFNSRYLAS GLRKLGFIVY GNRDSPIIPL LIFQPGKMGY FSRMMLERIG PDKTPIVVVV VAYPATPLIT SRVRFCLSAS HTKNDMDMVL RACDEVGDVL NLKYNKQEMS VEEVIANAEE LVAASHV
|
| |