Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1595 |
Symbol | |
ID | 7976244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1665104 |
End bp | 1666336 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644798483 |
Product | peptidase T |
Protein accession | YP_002949655 |
Protein GI | 239827031 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01882] peptidase T |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.782792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACG AAATCATTGA ACGTTTTACG AAGTATGTTC AAGTTGACAC CCAGTCTGAC CCGAACAGTG ATACTTGTCC CTCCACTCCG GGTCAATGGA CGCTAGCGAA AATGCTAGTA GAAGAATTAA AAGCAATCGG CATGGAAGAA GTAACAATAG ACGAAAACGG CTATATCATG GCAACATTGC CCGCAAATAC CGATAAAGAT GTGCCAACCA TTGGCTTTTT AGCCCATATG GATACGGCTC CCGAATTTAC CGGCGCCAAT GTAAAACCGC AAATTGTCGA AAACTACGAT GGCAATGATA TCATATTAAA TGAGGCGCTA CATATTGTGC TTTCGCCGAA AGATTTTCCG GAACTTGCAA ACTACAAAGG CCATACATTA ATCACCACCG ATGGCACGAC GCTGCTTGGC GCGGACAATA AAGCGGGAAT CGCGGAAATT ATGACGGCGA TGGCCTACTT AATCCAACAT CCGGAAATTA AACACGGAAA AGTGCGCGTC GCGTTTACGC CAGATGAAGA AATCGGCAGA GGGCCGCATA AGTTTGACGT CGCTAAGTTC GGTGCTAAAT ATGCGTACAC AGTCGATGGC GGTCCGCTTG GCGAACTAGA GTATGAAAGC TTTAACGCCG CGGAGGCAAA AATCAAATTT AAAGGAAAAA ATGTTCATCC AGGCACCGCC AAAGGCAAAA TGATTAACTC GATGAAAATC GCGATGGAAT TTCACGCACA GCTTCCTGCC AACGAAGCGC CGGAACATAC GGAAGGCTAC GAAGGATTTT ATCATTTGCT TTCCTTCCAA GGAAATGTCG AAGAAACAGC GCTTCATTAC ATTATCCGCG ACTTTGACCG CGAACAATTT GAAGCGCGCA AAGCAAAAAT GCGGGAAATC GCGGCAAAAC TGCAAGAAAA ATACGGAAAA GAGCGAATCG CGATCGAAAT AAAAGACCAA TATTATAACA TGAGAGAAAA AATCGAACCG GTCCGCGAAG TTGTCGATAT CGCCTATGAA GCGATGAAAA ACTTAAATAT TGAACCGAAA ATTTCGCCGA TCCGCGGCGG CACGGACGGG TCGCAGCTTT CGTACATGGG GCTACCGACT CCAAATATTT TCACGGGCGG CGAAAACTTC CACGGCCGCT ATGAATACAT TTCCGTCGAC AACATGATCA AAGCAACAAA CGTCATTATT GAAATTATTA AGCTGTTCGA GCAAAAAGCG TAA
|
Protein sequence | MKNEIIERFT KYVQVDTQSD PNSDTCPSTP GQWTLAKMLV EELKAIGMEE VTIDENGYIM ATLPANTDKD VPTIGFLAHM DTAPEFTGAN VKPQIVENYD GNDIILNEAL HIVLSPKDFP ELANYKGHTL ITTDGTTLLG ADNKAGIAEI MTAMAYLIQH PEIKHGKVRV AFTPDEEIGR GPHKFDVAKF GAKYAYTVDG GPLGELEYES FNAAEAKIKF KGKNVHPGTA KGKMINSMKI AMEFHAQLPA NEAPEHTEGY EGFYHLLSFQ GNVEETALHY IIRDFDREQF EARKAKMREI AAKLQEKYGK ERIAIEIKDQ YYNMREKIEP VREVVDIAYE AMKNLNIEPK ISPIRGGTDG SQLSYMGLPT PNIFTGGENF HGRYEYISVD NMIKATNVII EIIKLFEQKA
|
| |