Gene Information |
Locus tag | Acid345_0995 |
Symbol | |
ID | 4069760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1259183 |
End bp | 1260346 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637983002 |
Product | aminopeptidase DmpA |
Protein accession | YP_590072 |
Protein GI | 94968024 |
COG category | [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
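The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1259183
END_BP = 1260346


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1164 387
```

Under these assumptions the result agrees with the listed Gene Length (1164 bp) and Protein Length (387 aa).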
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0120323 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAAAG TCGTGCTCGC GACTGCTCTC ACTCTCCTAA GCGCCCTCGG CATCGCCCAG AACCAAACCT CGAACGAACG GCCCCGCGCC GCTGACCTGG GAATCAACGT CGGCGTGCTT CCGCGTGGAC CGCTCAACGC CATTACCGAT GTGGATGGAG TGCTGGTTGG CCAGACGACC ATCATCCGCG GCGACAACAT TCGCACCGGC GTGACCGCGA TCCTGCCGCA CGCAGGCAAT CTTTATCGCG AGAAGGTGCC CGGCGCGGTG TTCGTGGGGA ACGGCTACGG CAAGATGACT GGCTCTACAC AGGTAGAGGA ACTCGGCGAG ATCGAAGGGC CGGTGCTGCT GACCAACACA ACGAGTGTTT CGCAGGTCGC CGATGCGCTG ACCACGTACA TGATCGGCTT GCCCGGGAAC GAAGACGTGC TCTCGTTCAA TCCGGTGGTC GGTGAAACCA ACGACGGCTA TTTGAACGAC ATCCGCGGAC GCCACGTTTC GCCGGACGAT GTATTTGCCG CGATCAAGAG CGCAAAGGGC GGCCCGGTCG CAGAAGGCGC CGTTGGAGCG GGCACCGGAA CCGTGGCGTT CGGATGGAAG GGCGGCATTG GCACGGCGTC GCGACGATTG CCGGCGAACC TTGGCGGCTA CACCGTCGGG GTGCTGGTGC AGACGAATTT CGGCGGCGTG CTGACCATCG CAGGCGCACC GGTAGGGCAG GAACTTGGCC AGTACTACCT GCGCGAAGAG CTGCAAAAGG CCGGTAATGG AAGAGACCGC GCTGATGGCT CGTGCATGAT CGTCGTAGCC ACCGACGCCC CGATGGACTC GCGCAATTTG AAGCGCCTCG CAGCACGAGC GATCCTGGGG CTGGCCCGTA CCGGTAGCGC GTCATCGAAC GGCAGCGGCG ATTATGCGAT CGCGTTCTCC ACCGCTTCGC AGAACCGGAT TCGTACTGCC GACAAAGCGC TAACGCGGCA CACTGAAACA GTGACCAACG ACACGATGTC ACCGCTGTTC GAAGCCACGA TTGAAGCGAC CGAGGAAGCG ATTTACAACT CGATGCTGAA GGCCACGACG ACCACGGGAA ACGGGCATAC GGTAAAGGCG TTGCCGATCG AGGAGACAAA GGGGGTTCTT AAGAAGTATG GGGTGATTCG ATAA
|
Protein sequence | MHKVVLATAL TLLSALGIAQ NQTSNERPRA ADLGINVGVL PRGPLNAITD VDGVLVGQTT IIRGDNIRTG VTAILPHAGN LYREKVPGAV FVGNGYGKMT GSTQVEELGE IEGPVLLTNT TSVSQVADAL TTYMIGLPGN EDVLSFNPVV GETNDGYLND IRGRHVSPDD VFAAIKSAKG GPVAEGAVGA GTGTVAFGWK GGIGTASRRL PANLGGYTVG VLVQTNFGGV LTIAGAPVGQ ELGQYYLREE LQKAGNGRDR ADGSCMIVVA TDAPMDSRNL KRLAARAILG LARTGSASSN GSGDYAIAFS TASQNRIRTA DKALTRHTET VTNDTMSPLF EATIEATEEA IYNSMLKATT TTGNGHTVKA LPIEETKGVL KKYGVIR
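The relationship between the two sequences above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the database's own pipeline: the gene sequence is taken as already being in coding orientation (it starts with ATG and ends with TAA, although the gene lies on the minus strand), and translation table 11 is assumed to assign the same amino acids to codons as the standard code. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "ATGCATAAAG TCGTGCTCGC GACTGCTCTC"
print(translate(prefix))  # prints: MHKVVLATAL
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.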