Gene Information |
Locus tag | Acid345_1522 |
Symbol | |
ID | 4073010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1858061 |
End bp | 1859167 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983531 |
Product | Zn-dependent dipeptidase |
Protein accession | YP_590598 |
Protein GI | 94968550 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
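The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported lengths, not something the record states), and that the final codon is a stop codon. Variable names are illustrative only.

```python
# Values copied from the record above; names are illustrative.
start_bp = 1858061
end_bp = 1859167

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in codons of 3 bases; the last codon is the stop
# codon and contributes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length, remainder)  # 1107 368 0
```

Under these assumptions the result agrees with the Gene Length (1107 bp) and Protein Length (368 aa) fields.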
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.329262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.479778 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCGCC GCCAGTTTGT TTCCTTCGCT GCGCAAGCGT CTGCGCTCGC CTTCATCGCT CCGCACTCAT TCGCACAAAC TGCGACCCCC GACATCGCCG CACTCTATAA GAACGCCCTT GTCATCGACA CCCTCTGCGC CCCCTTCGCC ACCGACGACT TCCCGCCCGC CGACAACGCC CTCCAGCAGG TTCGCGGCTC CGGCTTCACT GCGATCAACA CCACCATCTC AGACCGCACC TATGAAGGAA CGATCCAGAC GCTCGCCCGA ATTCACTCCT ACGTCGAGCG CTATCCTGAA CTTTTCTCGA TCGTCATCAA GCGCTCCGAC ATCGACCGTG CCAAGCGTGA GAATAAGGTC TGCATCATGC TGGGATTCCA ATACACCTCG TTTTTCGAAG AAGATGTTTC TCGCATCGAG GTTTTTCGCG ATCTCAGTGT GCGCATCATG CAGCTCACCT ACAACCTGCG CAGTACGTTC GGCGACGGCT GTCTCGAGTC CGAAAACTCA GGGCTGAGCA GGGCCGGACA CGACCTGGTC AAGAAGATGA ACGCGATCGG TATCGCCGTT GATGCCAGTC ACAGCGGCTA CCGCACTACT TCCGACGCCA TAGCCGGTTC CGCGAAGCCC ATCCTCATCT CGCATTCGGG ATGTGCCGCG GTCAGCGCGC ATCCACGCAA CAAGCCAGAT GAAATCCTTA AAGCACTCGC CGATCGCGGC GGCTATTTCG GTGTCTATCT CATGCCCTAC CTCGTCGCTT CCCCGACCGT CCCGACGCGC GAACACGTCA TGGCGCATCT GCTTCACGCC ATCAATGTCT GCGGCGCAGA CCACGTCGGC ATTGGCTCCG ATGGCAGCAT TGAGGCGGTC CATCTCACCG ACGAGCAGAA GAAAGCTTTC GATGAGGACA TCGCCCGGCG TAAGAAGCTC GGCATCGGCG CACCCGGCGA AGACCGCTAT CCCTACGTTC CCGACGTCAA CGGTCCCAAC CACATGGAAC TGATCGCCAG CGAATTACAG AAGCGCGGCC AGCCCAGTTC GGTCATCGAG AAGGTACTCG GCGCGAACTT CTATCGCGTT ATTGGCGATA TCTGGGGCAC AGCTTAA
|
Protein sequence | MDRRQFVSFA AQASALAFIA PHSFAQTATP DIAALYKNAL VIDTLCAPFA TDDFPPADNA LQQVRGSGFT AINTTISDRT YEGTIQTLAR IHSYVERYPE LFSIVIKRSD IDRAKRENKV CIMLGFQYTS FFEEDVSRIE VFRDLSVRIM QLTYNLRSTF GDGCLESENS GLSRAGHDLV KKMNAIGIAV DASHSGYRTT SDAIAGSAKP ILISHSGCAA VSAHPRNKPD EILKALADRG GYFGVYLMPY LVASPTVPTR EHVMAHLLHA INVCGADHVG IGSDGSIEAV HLTDEQKKAF DEDIARRKKL GIGAPGEDRY PYVPDVNGPN HMELIASELQ KRGQPSSVIE KVLGANFYRV IGDIWGTA
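The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up plus a GC fraction calculation; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool, and the example input is just the first two blocks of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence shown in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence from the record, for illustration.
example = clean_sequence("ATGGATCGCC GCCAGTTTGT")
print(len(example), gc_fraction(example))  # 20 0.55
```

Applied to the full gene sequence, the same two calls would give values to compare against the Gene Length and GC content fields.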