Gene Information |
Locus tag | CNC05890 |
Symbol | |
ID | 3256833 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 1741087 |
End bp | 1742871 |
Gene Length | 1785 bp |
Protein Length | 433 aa |
Translation table | |
GC content | 55% |
IMG OID | 638255810 |
Product | conserved hypothetical protein |
Protein accession | XP_569812 |
Protein GI | 58265312 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
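The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length. It also assumes the span runs from the start codon through the stop codon; the record does not state this, so the "non-coding" remainder is only an estimate of intron sequence, in line with "Is gene spliced | Yes". The function and variable names are illustrative and do not come from the database.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a span whose start and end coordinates are both included."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
start_bp, end_bp = 1741087, 1742871
protein_length_aa = 433

gene_length = inclusive_length(start_bp, end_bp)

# Assumption: one codon per residue plus one stop codon.
coding_nt = 3 * (protein_length_aa + 1)

# Assumption: whatever the coding codons do not account for is intron sequence.
non_coding_nt = gene_length - coding_nt

print(gene_length, coding_nt, non_coding_nt)
```

Under these assumptions the span length matches the "Gene Length" field, and the gap between the span and the coding nucleotides is what one would expect for a spliced gene.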
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.970035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAAC AAACCCCGCT GCTGTCCCCC TCCCCGTCCA CCGTCTCGGC CCCCCCCGCC GCCCGCAGAC GGACATCCCT CCTCCTCGCC CTCGCCACCC TCCTCCTCCT CGCGGCCGGC CTCTCCGTCG GCCTCGTCGT CCGGCATGCC CATAAGGAGC CCAGCGATGT CCTCGAAAGG GCAAAGCTTT ACCTCAAAGC GTGCACATCC CGCTCCATCT GCAGCCGGCC GGATGCTGAC TGCGCTGTAG CTCGCCGGTG ATCGATGGCC ATATCGACCT CCCCGAATTC GCCCGTGCGG TCTACGGCAA CAATATCGAA AAGTTTGATT TGCGTGGTGC CCTCGTACGT CGTTCGCCTC TTTCTTTGCT TTTCTAGACA GAAAACTGAC GAGATGGTGT CTAGCCCGGA CACTTTGATA TCCCTCGGGC CAGAGAGGGC CATCTGGGAG CATTCTTCTG GTCCATGTCA GTCCTCTCAA TCGTTGACTA CCATGGGCCA TGCAATTAAA CTCTTGACCT TGTTATAGCT TTACCGAATG CCGCGATACG AATGGCGACG ACTTTATGAA CCCGACCTTT GAAGTTCGAG GCGAGTCCGA AAAACATTCT CCTTCCCATA ACCGTCTTCC CCTTTCCACT ACTAACCTTT TTGCTCTAGA CGCCCTCGAA CAGCTCGACG TGTCAAACAA CCTCATCTCC AAATACAGCG ACACATTCGC CGTCGCCCGG ACCGCCGACC AAGTCGAGTG GGCTATAAAG CACGGCAAGA TTGCGAGTCT TTTCGGGCTG GAGGGTGCGC ATATGCTTGG CAATTCTCTT GCAGTGTTGA GAATGTACCA CCAGCTCGGG GTGAGGTATA TGACCCTCAC GCATAGCTGT AACAACGCGT TTGCCGATTC GGCCGGTATC TTTGGAGACG TCAAAGAACG TTGGGGCGGT CTGAGCCCCC TGGGCAAAGA ACTCGTTCCA GAGATGAACC GACTCGGAAT CTTCATCGAC CTCTCCCACG TTTCCGACCA AACTGCCCTC CAGGCGCTGG ACCTGACAGA AGCACCCGTC ATCCTCTCGC ATTCCTGTGC GAGGCATTTC AATAAGATGA ACAGGAATGT ACCGGACGAG GTGCTGGCTA GGTTGGGCAG CGGAAAGGGA AAAGTCGATG GGGTGGTGAT GGTCAAGTGA GTCTGTAACG CTTCTTTAAT TAACAGCGGA AAACTGATAG ACCCCGCTTC ATTACAGCTT CTTCCCCGTA TTCGCCTCTC CCAACCCGGA CCTCGTCGAC GTCGCATACA TCGCTGATGA AATCGAGTAT ATCGCCAATA AAACTAGCAG GGATCAGTGA GTCACCCCGT CACATTTAAA AACTTGATTC TCAAAGCGCT GACTGTTTTG GGGGTGGTGG CGCCTGACAG TGTCGGGATC GGATCAGATT ACGACGGGAT TGAATCAGTG CCCAAGGGTC TTGAAGACGT TTCCAAGTAT CCTTACCTCG TACGTCTCTG ACTTTTGTCG TTTGCCGTTT TACCCCTGCT AACGTCGCCC CCCGGCCGGC CGGACAGTTT GCCGAACTAA TCAAACGCGG TTGGTCCAAG AACGATCTCT CCAACCTCGC CGGCGGGAAC CTCCTCCGCG CCATGCGGGG GATGGAAGAC GTGAGCCGTC GGATGAGGGA CGAACAGGGT AAACAGCCAA GTATGGCGAA ATATGATAAG CGGAGGGATT TGGATGGGGG CGATTGGGAT TTCTAGCCGG GGGGGGGGTG AAAATTGGAT TTGCGAGAAG AAACG
|
Protein sequence | MAEQTPLLSP SPSTVSAPPA ARRRTSLLLA LATLLLLAAG LSVGLVVRHA HKEPSDVLER AKLYLKASPV IDGHIDLPEF ARAVYGNNIE KFDLRGALPG HFDIPRAREG HLGAFFWSIF TECRDTNGDD FMNPTFEVRD ALEQLDVSNN LISKYSDTFA VARTADQVEW AIKHGKIASL FGLEGAHMLG NSLAVLRMYH QLGVRYMTLT HSCNNAFADS AGIFGDVKER WGGLSPLGKE LVPEMNRLGI FIDLSHVSDQ TALQALDLTE APVILSHSCA RHFNKMNRNV PDEVLARLGS GKGKVDGVVM VNFFPVFASP NPDLVDVAYI ADEIEYIANK TSRDHVGIGS DYDGIESVPK GLEDVSKYPY LFAELIKRGW SKNDLSNLAG GNLLRAMRGM EDVSRRMRDE QGKQPSMAKY DKRRDLDGGD WDF
|
| |
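The sequence fields are displayed in space-separated groups of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-percentage calculation. The helper names `strip_groups` and `gc_percent` are hypothetical, and only the first few groups of each sequence are used here as sample input; the full field would need to be pasted in to compare against the listed "GC content | 55%".

```python
def strip_groups(seq: str) -> str:
    """Remove the display whitespace between ten-character groups."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    cleaned = strip_groups(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Sample input: leading groups copied from the Sequence table.
gene_prefix = "ATGGCAGAAC AAACCCCGCT GCTGTCCCCC"
protein_prefix = "MAEQTPLLSP SPSTVSAPPA"

print(len(strip_groups(gene_prefix)))     # nucleotides in the sample
print(len(strip_groups(protein_prefix)))  # residues in the sample
print(round(gc_percent(gene_prefix), 1))  # GC percentage of the sample only
```

Applying `len(strip_groups(...))` to the complete protein field gives a direct check against the "Protein Length" value in the Gene Information table.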