Gene Information |
Locus tag | Acid345_0237 |
Symbol | |
ID | 4073087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 249625 |
End bp | 250791 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637982238 |
Product | peptidase M20 |
Protein accession | YP_589316 |
Protein GI | 94967268 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | |
|
|
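The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 249625, 250791

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1167, matches "Gene Length"
print(protein_length(length_bp))  # 388, matches "Protein Length"
```

Under these assumptions the reported 1167 bp and 388 aa agree with the coordinates.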
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCAAA GACCTTCGTC TGACCCTAGA ACCTCGCATC GTGACCTGTT TGCCCACTTG AAAGACCAGC AATCGGCCAT GCAATCCATG CTCCGCCGCA TCGTGGAACA CGAGTCCCCG AGCGACGATA AACCTTCCTG CGACGCCATG GCCGAAATGC TAGCGCAGGA TTTCGGCGCC ATCGGCGGCC GCACGAAACT CCATCGCGAC AAGATCACCG GCAATCACCT GCAAGTGGAT TTCGGCGGAC CGCGCGGCGC GAAGCCGGTG CTGATCGTCG GACATTACGA CACGGTCTAT CCGCTCGGCA CTCTGAAGCA GATGCCGTAC CGCGAGAGCA AAGGTCGCAT CAGCGGTCCA GGTGTGCTCG ACATGAAGGG CGGCATCGTC CAAATTCACG GCGCCATCGC CGCACTGCAG ACCCAAGGCG GGTTGCCGCG ACCAGTGACG ATCATCCTCG TCTCCGACGA GGAGACCGGC AGCGAGAGTT CCCGGTCGAT CACCGAGAAG CTCGCGAAAC AATCCGCAGC AGCGTTTGTC TGTGAGCCGG CCGCTGGCAC CGACGGCGCG CTGAAGACCG CGCGCAAAGG CGTCGGCGAC TATTTGTTGC GTATCACTGG CGTGGCTTCC CACTCGGGAC TCGATTTCGA GAAGGGCCAG AGCGCGGTGC TGGAACTTGC GCGACAGATT GAGAAAATCG CCGGCTTCAC CGATCTCAAG CGGGGAACCA CGGTCAATCC CGGCGTCATT CGCGGTGGAA CCCGCAGCAA CGTGATCGCT GCGTCGGCGG AAGCGGAAAT CGACGTGCGC GTGAGCACCA AGCGCGATGT CGAACGCGTT TCGAAGCTGT TTGCACGGTT GAAACCGATC AATAGGCGCT GCTCGCTAGC TCTTTCCGGC GGAGTCAACC GTCCGCCGAT GGAGCGCACT CCGGGCACCG CAGCTTTATT TGCGCAGGCG CAGCTGATCG CTGCGGAACT CAGCTTTTCC CTAGCGGAGA GAATGGTGGG GGGCGGTTCC GACGGCAACT TCACCGGGGC TATCGTGCCG ACTTTAGACG GGTTAGGCGC GGTCGGCGAC GGAGCCCACG CGGTTCACGA ATATATTTTC GCCACCGAAA TGCCGAAGCG CGCAGCACTG CTGGCCGGAT TAATTCGGGC GTTATAG
|
Protein sequence | MPQRPSSDPR TSHRDLFAHL KDQQSAMQSM LRRIVEHESP SDDKPSCDAM AEMLAQDFGA IGGRTKLHRD KITGNHLQVD FGGPRGAKPV LIVGHYDTVY PLGTLKQMPY RESKGRISGP GVLDMKGGIV QIHGAIAALQ TQGGLPRPVT IILVSDEETG SESSRSITEK LAKQSAAAFV CEPAAGTDGA LKTARKGVGD YLLRITGVAS HSGLDFEKGQ SAVLELARQI EKIAGFTDLK RGTTVNPGVI RGGTRSNVIA ASAEAEIDVR VSTKRDVERV SKLFARLKPI NRRCSLALSG GVNRPPMERT PGTAALFAQA QLIAAELSFS LAERMVGGGS DGNFTGAIVP TLDGLGAVGD GAHAVHEYIF ATEMPKRAAL LAGLIRAL
|
| |
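The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch only, not the pipeline that produced the record: it assumes a forward-strand, unspliced CDS (as the record states) and uses the standard amino-acid assignments, which translation table 11 shares with table 1 for all 64 codons (the two tables differ only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built from the conventional NCBI TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon.

    Whitespace is removed first, because the record prints the sequence
    in space-separated blocks of ten bases.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGCCCCAAA GACCTTCGTC TGACCCTAGA"))  # MPQRPSSDPR
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence to the same function should reproduce the remaining residues up to the terminal TAG stop codon.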