Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2579 |
Symbol | |
ID | 4070542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3047545 |
End bp | 3048834 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984596 |
Product | membrane dipeptidase |
Protein accession | YP_591654 |
Protein GI | 94969606 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGAT TCGCATTCCT ACTGTTGCTC TGCGGGCTAT GTGCCGCACA ATCTCCCACC CCTAAGCAGC CGGCCAAAGC AGCCCCCGTC GGCTGGAAGG CCATTCACGA CTCTGCCCTC GTTGTTGATA CCCATGCCGA CACTCCGCAA CCTTGGCTGG ACAAGAACAT CAACGCCGCC GACCCCGACT CGAAGCTGAT GGTCACGATT CCGGCGGCCA AAGCCGGCAA CCTCGGTGCC GAGTTTTTCT CGATCTGGGT GGATCCAGTA AAGTTCAAAG GCCATTACCC CGACCGCACT CTCGCGCTCA TCGATGCCGT CTATCAGCAG GTCCAGCGCA ATCCGAAAGA CATGATGTTC GCCACCAGCG TGAAGGACAT CTACGCCGCT CGCCGCGAAC ATAAGCTCGC CTCGTTGATG GGAATCGAGG GTGGCCATTC CATCGCCAAC GATCTCGGAC TACTGCGCGA TTACTACCGC CTCGGCGTGC GTTATATGAC GCTCACCTGG TCGAACACCA ACGACTGGGC TGACTCCTCC GGTGACGTGG ACGACAAGAA CATTCAGCAC CACGACGGTC TTACCGATTT CGGCCGCGAC GTGGTCCGTG AGATGAACCG CATCGGCATG ATCGTGGACA TCTCCCACAC CTCTGACCGC ACCTTTTACA AGACGCTGGT CGTCGCCCGC GCTCCCGTAA TCGCCTCGCA CTCTTCTTCT CGCGCGCTCA CCAACGTTCC CCGCAACATG ACCGACGACA TGCTCCGCGC CCTCAACCGT AACGGCGGTG TCGCCATGGT CAACTTCAAC TGCGGATTCA TCAGCAACGA ATACGCAGCC GCAGAGAAGA AACTCGAAGC GGAAGACCAC TCCATCGCCG ACCTTAAGAA GAAAGCTGCC GAACCGGGTT CAAATATCAC CGAAGCCGAC ATCCAGAAGG CGGAAGACGC GTTCTACGCC AGTATTCCCC GCCCTCCGCT CAGCAACTTG ATTGACCACA TTGATCACAT GGTGAAAATC GCCGGGATAG ATCACGTCGG ACTGGGTTCA GATTTCGACG GCGTTAGCTG CACCCCCGAG GGTATCGATT CGGCCGCCGA TCTTCCGAAA ATCACGCAGG CACTCCACGA CCGCGGCTAC AATGCAGAGC AGATTAAAAA GATTCTCGGC GGCAATATCC TCCATGTCTT TTCCGAAGTG GAGAAGACGG CCGCGCAGCT TCAGGCGGAG TCGCCCGAGA ACAAAGACAC GCGGCACGAG GTAAAGCTCG ACGCCCAGCC CAAGAAATAG
|
Protein sequence | MRRFAFLLLL CGLCAAQSPT PKQPAKAAPV GWKAIHDSAL VVDTHADTPQ PWLDKNINAA DPDSKLMVTI PAAKAGNLGA EFFSIWVDPV KFKGHYPDRT LALIDAVYQQ VQRNPKDMMF ATSVKDIYAA RREHKLASLM GIEGGHSIAN DLGLLRDYYR LGVRYMTLTW SNTNDWADSS GDVDDKNIQH HDGLTDFGRD VVREMNRIGM IVDISHTSDR TFYKTLVVAR APVIASHSSS RALTNVPRNM TDDMLRALNR NGGVAMVNFN CGFISNEYAA AEKKLEAEDH SIADLKKKAA EPGSNITEAD IQKAEDAFYA SIPRPPLSNL IDHIDHMVKI AGIDHVGLGS DFDGVSCTPE GIDSAADLPK ITQALHDRGY NAEQIKKILG GNILHVFSEV EKTAAQLQAE SPENKDTRHE VKLDAQPKK
|
| |