Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1194 |
Symbol | |
ID | 7977666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1245977 |
End bp | 1246900 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644798147 |
Product | Membrane dipeptidase |
Protein accession | YP_002949320 |
Protein GI | 239826696 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000135749 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTCG ATGCGCATTG TGACGCACTG ATGAAATTAT GGATGGATCG GTCTTTATCG TTCCAAGATG GCAAAAGTCT TCATGTCACC TTTCCATCAC TTCTAGAGGC GAAAATAAAA GTTCAATGTT TTGCCATTTA CATACCGGAA ACTGTTCCGG AAGAAGCCCG TTTTACGGTT GCGCTAGAAA TGATAGACAT TTTTTTTGAA CAAATGATTG AGCGATTTCC AATGTTGAAA TTTGTTCGTT CGAAACGCGA TATTGACGCA TTGGAAGAAA ATGAAATTGG GGCGATGTTA ACGCTTGAAG GATGCGATGC GATTGGGACA AGCCTTGTAA AATTAAAAAC ATTGCTTCGC CTTGGTGTCT CGTCTGTTGG ACTGACATGG AATTGGTCCA ATGCTGTAGC AGATGGTTCT TGGGAAGTGC GCGGTGCCGG ATTAACGGAA TTCGGAAAAC AAGTGGTGCG GCATCTTAAC GAGGCAAAAC GTTGGGTCGA TGTTTCCCAC TTATCGGAAA AAGCATTTTG GGACGTGATG GAAATCGCGC AATTTCCGAT TGCTTCTCAT TCTAATACAT ATCGCTTTTG TCCACATCCG CGTAATTTGC GAGATGAGCA AATTCGCGCG TTGATTGAGA AAAATGGGAT GATTGGAATT ACGTTCGTTC CGTACTTTTT AACGAAAGAA AAAGAAAAAG CGGCGATTTC TGACGTTCTC CGCCATTTGG AACACATTTG TTCACTCGGG GGAGCAAGGC ACGTTGGGTT TGGCTCTGAT TTTGACGGAA TGGAAGAAAC GGCAAAAGGA CTCGAAAATG CGCGGTGCTA TACAAATTTA GTCAACGAGC TGCAAAAACA CTACTCAGAA GATGAGGTAG AACGGTTTTT ATTTCGCAAT TTTTATGACC ATTTGCCGCA GTAA
|
Protein sequence | MIFDAHCDAL MKLWMDRSLS FQDGKSLHVT FPSLLEAKIK VQCFAIYIPE TVPEEARFTV ALEMIDIFFE QMIERFPMLK FVRSKRDIDA LEENEIGAML TLEGCDAIGT SLVKLKTLLR LGVSSVGLTW NWSNAVADGS WEVRGAGLTE FGKQVVRHLN EAKRWVDVSH LSEKAFWDVM EIAQFPIASH SNTYRFCPHP RNLRDEQIRA LIEKNGMIGI TFVPYFLTKE KEKAAISDVL RHLEHICSLG GARHVGFGSD FDGMEETAKG LENARCYTNL VNELQKHYSE DEVERFLFRN FYDHLPQ
|
| |