Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNJ02600 |
Symbol | |
ID | 3254101 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006679 |
Strand | + |
Start bp | 750752 |
End bp | 752014 |
Gene Length | 1263 bp |
Protein Length | 383 aa |
Translation table | |
GC content | 55% |
IMG OID | 638253417 |
Product | dihydrodipicolinate synthase DapA, putative |
Protein accession | XP_567550 |
Protein GI | 58260280 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.474557 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGAGGC AGGCCACAAA TGAACAGCGC CTGGCAAGAT ATAAAGGCTC GACCATCGAT GTTCAAGAGA CCATCAATCA CCAGTTCTCC CAATCTTACA CAGCTATTCC TAGCCAACAC TACCCCACTT CACCACACAT TCACACAACA CCAACCACTC AATTCAAAAT GACCGCCAAC GGTTTCTCCG ACAATGTTTC CAAGTTCTCC AACCCCCGAG GCCGAGTCGG CTTCGACATG AGCGGTATCA CTCCTGCTCC CGTGAGTTTT CCCTTTGACT CCCATCTAAT TGTACCTTAA GCAGAGTTCT GACTCCTTCC AGGTCACTCC ATTCAACGAG GACGGCTCGG TCGACTACGA GGCTATCCAG CGTATTGGAT CTTGGCTTGC CTCCGTCGAG GGTGTCAAGG GTCTCGTCGT TCTCGGTCAC GCCGGTGAAG GTACCTTCCT CACCTCGGAG GAGCAGGTCA AGGTTATCAA GGCCTTTGTT AAGTCGGTTA ACAACGAGAT CCCCATCATT GCCGGTATCA CCCGAGAGGG CAACTACGTT GCTGGTCTCG AGGCGAAACG CGCAAGAGAA GCCGGTGCCG CTGCTGGTCT GCTTTACCCT TCGCACGGAT GGCTCCGATT CGGCTACCAG ACAGGTGCTC CCCAGGTTCG TTACAAGGAG GTCTACGAGG CCTCGGGTCT CCCCCTCATC CTCTTCCAGT ACCCCGACAA CACCAAGGCA ACTTACGACC TGAAGACCCA GCTCGATATC CTTGCCCAAC CCGGTGTCTT TGCCAGTGAG TAACATCTGT TGTCTGTTCC AAAGAAGTCT CCTGACCCCT TGCAGTGAAG AACGGTGTCC GAAACATGCG ACGATGGGAC CGAGAGATCC CTGTCATCAG GAAGGCACGA CCCGACATCT ACATTCTCAC TTGCCACGAC GAGTACCTCC TCCACACTAC TTTCGACGTC GACGGCATGC TCGTCGGTTA CGGTAGTATT GCTCCCGAAC TTCTCTTTGA GCTCCTTAAG GCCGGTAAGG CTCATGACTA CAAGAAGGCT CGAGCCATCC ACGACCAGCT CCTTCCCGTC ACCGCCGCTG TCTACCACCG TGGCTCCCAC ATGGAGGGCA CCGTCGCTCT CAAGCATGCC CTGGTTGCCC GTGGGATCCT CAAACACGCC ACCATCCGAG GTACCCTTTT GCCCCTCCCC GAAGGTGCCG ACAAGGAAAT TTATGATGCG ATCTCTGCCG CAAAAATTGC CAAGGTCCAG TAA
|
Protein sequence | MWRQATNEQR LARYKGSTID VQETINHQFS QSYTAIPSQH YPTSPHIHTT PTTQFKMTAN GFSDNVSKFS NPRGRVGFDM SGITPAPVTP FNEDGSVDYE AIQRIGSWLA SVEGVKGLVV LGHAGEGTFL TSEEQVKVIK AFVKSVNNEI PIIAGITREG NYVAGLEAKR AREAGAAAGL LYPSHGWLRF GYQTGAPQVR YKEVYEASGL PLILFQYPDN TKATYDLKTQ LDILAQPGVF AMKNGVRNMR RWDREIPVIR KARPDIYILT CHDEYLLHTT FDVDGMLVGY GSIAPELLFE LLKAGKAHDY KKARAIHDQL LPVTAAVYHR GSHMEGTVAL KHALVARGIL KHATIRGTLL PLPEGADKEI YDAISAAKIA KVQ
|
| |