Gene Information |
Locus tag | GWCH70_2756 |
Symbol | |
ID | 7977980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2792012 |
End bp | 2793424 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644799552 |
Product | dipeptidase |
Protein accession | YP_002950711 |
Protein GI | 239828087 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01887] dipeptidase, putative |
|
|
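The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2792012
end_bp = 2793424

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length, remainder)  # 1413 470 0
```

Under these assumptions the computed values agree with the reported gene length (1413 bp) and protein length (470 aa).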
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATCG ATTGGATGGA AGAGGTATTG AAGCGAAAAG AGGCGCTTAT TCAAGATACG CAGGCGCTAT TGCGCATTCC GAGTGTTCTC GATGAAGAGA ATGCAACAGA AGAAGCCCCG CTTGGACAAG GAGTATACGA GGCGTTGCAG TTTTTATTGC AGAGGGGCCA AGAAGAAGGG TTTGCGGTAA AAAACGTCGA TGGCCTCGCG GGACATCTTG AAATCGGACA AGGAGAGGAA CTCATCGGCG TATTATGCCA TGTGGATGTT GTACCACCGG GAGATGGCTG GTCAAGCGAC CCGTTTGCTG CAGAAATCCG TGATGGGAAA TTATATGCTC GCGGCGCGAT TGACGATAAA GGACCAACAA TGGCTGCATT TTATGCTATG AAAATTGTGA AAGAACTCGG GTTGCCATTG CAAAAACGAG TGCGTATGAT TGTTGGAACA GATGAAGAAA GCAAGTGGCG TTGTGTCGAA CATTATTTTA AGCATGAGGA AATGCCGACA ATGGGATTTG CTCCGGATGC GGATTTTCCG ATTATTTACG CGGAAAAAGG AATTGTCGAT ATGGATTTGC GCAAGCCATC AATAGGAACA CAGGAAGAAA GTGAAATTCA GCTGCAGTCG TTCCAGGCAG GACGACGTTA TAATATGGTT CCTGACTTTG CTAAAGCGGT ATTGCTCGTT CGTGCAGACC GGCAACAAGA AATCGAACAG CAGTATCGCC AGTTTCTTCA TGAAACGAGC ATGAATGGAA ATGCTGTTGT GGGAGGCAAT ACCGTTACTC TTCAATTAGA AGGAATTTCC GCTCATGCGA TGGAACCAGA AAACGGGAAA AATGCAGGTT TATGGCTGGC GAAGTGGCTA TCAGATGTTG CGTTGGATAC GCAGGCGCAA TCGTTTATCC GTTTTGTGAC GGACTATTTC TTTGCCGATT CTCGTGGAAA AGCGTTAGGC ATTGCTTACA ACGATGAGAT TACCGGCGAT TTAACAGTCA ATGTTGGAAT ATTATCATAT GATGCGCAAG CTGGAGGAAA GCTTGGCATT AACATACGCT ATCCGGTCAC CAATGATATA GAACAAACGA AGCAAAAACT ACAGAACATC GCAGCGCAGC ATGGTTTTGC GTTGGAACAT TTTAGCGATT CAAAACCGCA CTATGTTGAT CCAAATCATG TACTTATAAA GACGCTTCAA CGCGTATACG AAGAACAAAC AGGAGAACGC GCTTCCTTGC TTTCCATCGG CGGCGGTACA TATGCCCGTT CGCTGAAAGC GGGAGTAGCG TTTGGACCGC TGTTTCCGGG ACGGCCGGAT GTGGCGCATC AAAAGGATGA ATATATCATG ATCGATGATT TATTGAAAGC AACCGCGATT TACGCTCAGG CAATTTATGA ATTAGCAAAA TGA
|
Protein sequence | MNIDWMEEVL KRKEALIQDT QALLRIPSVL DEENATEEAP LGQGVYEALQ FLLQRGQEEG FAVKNVDGLA GHLEIGQGEE LIGVLCHVDV VPPGDGWSSD PFAAEIRDGK LYARGAIDDK GPTMAAFYAM KIVKELGLPL QKRVRMIVGT DEESKWRCVE HYFKHEEMPT MGFAPDADFP IIYAEKGIVD MDLRKPSIGT QEESEIQLQS FQAGRRYNMV PDFAKAVLLV RADRQQEIEQ QYRQFLHETS MNGNAVVGGN TVTLQLEGIS AHAMEPENGK NAGLWLAKWL SDVALDTQAQ SFIRFVTDYF FADSRGKALG IAYNDEITGD LTVNVGILSY DAQAGGKLGI NIRYPVTNDI EQTKQKLQNI AAQHGFALEH FSDSKPHYVD PNHVLIKTLQ RVYEEQTGER ASLLSIGGGT YARSLKAGVA FGPLFPGRPD VAHQKDEYIM IDDLLKATAI YAQAIYELAK
|
| |
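The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to join those blocks and check that the start of the gene sequence translates to the start of the protein sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the strand is listed as "-"). The amino acid assignments of translation table 11 are the same as those of the standard genetic code, so a standard codon table is used. The helper names (`join_blocks`, `translate`, `gc_fraction`) are hypothetical, and only the first three blocks of the gene are used to keep the example short.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(grouped: str) -> str:
    """Remove the whitespace between 10-character blocks."""
    return "".join(grouped.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence and first block of the protein sequence.
gene_start = join_blocks("ATGAATATCG ATTGGATGGA AGAGGTATTG")
protein_start = join_blocks("MNIDWMEEVL")

assert translate(gene_start) == protein_start
print(translate(gene_start), round(gc_fraction(gene_start), 3))
```

Passing the full gene sequence to the same helpers should reproduce the complete protein sequence. Note that the GC value printed here describes only the 30-base fragment, whereas the 46% in the record refers to the whole gene.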