Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2994 |
Symbol | |
ID | 7977364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3014940 |
End bp | 3016127 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644799794 |
Product | PDZ/DHR/GLGF domain protein |
Protein accession | YP_002950933 |
Protein GI | 239828309 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000000815912 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATAT GGGTGTTGGA GGCGCTCAAA GGCGTTGGGC GGCTGCTTGT GCAACCGTTG TTTTATTATG GGATCGCACT GGCGCTTGTT ATCGGATGGC GGCGGGTGAA GCGGGAAAGA AGCTATTTTT CTATTCGCGT ATACAACATG TTTCATGAAA GCAAGCTTTT TTGGCGAAGC GGCCTCGTTG CCGGCGGTAT TTTGTCGTTA GCAGCGGTTG CTATCGGCAT CGTGCTTCCG CGCGATGCTA TTTCAATGAT TGCGCTTGTG ACGATCGCAA TCGGATTAAC GATGCAAATG CGGCTGCTTT CCCCGGCGTA TACGATGGGG CTTGTGTTTT TTATCGTTAG TATTCTTGCC AATGATAAAG AAACGGCACC AGCTTTAAAG CGTTTTTTCC CGGAACTTAG CGAGACGAAT ATGGCTGCGC TTGCTATTTT GCTTGCTTTA TTGCTTTTTG TGGAAGCATG GCTTATTTTA CAAAACGGCG CTATCGAAAC ATCTCCACAG TTAGCGAAAA GCAAGCGTGG TTTTGTTGTC GGAGAACATT GGTCGCAGCG GCTCTGGTTC GTTCCTGTGC TGCTTCCGGT AACAGGAGAA CTTCCGTCTC CATTTTCATG GTGGCCATTA TTTCCTGTTG CTGGCGATTC TTATTCACTG ATGCTTGTGC CGTTTTTGAT CGGATTTTCT GAGCGTGTAC AAGGGATGCA TCCAAAAGCT TCGATTCGTT TAACTGGAAA AAGAGTGATG TTGCTCGCAT GGATTGTCAG CTTGTTTGCT ATAGGAGGCT ATTGGTATGC GCCGCTTTCG ATGATTGCGG CAGCGCTTGC CTTGATTGGA AGGGAGTGGC TTGCTTTCTT CCAGCATCGT CAAGATCGCT TGAAACCTCC GTATTTTTCG AAGCGTGAAC AAGGGCTTGT CATTTTAGGG ATTCTTCCGA ATTCCAAGGC GGAAAAAATG GAGCTAGAAA TTGGTGAAGT CATCACAAAA GTGAACGGAA TGACAGTAAA AACGGAAACA GAGTTTTATG AAGCGTTGCA ACGAAACCGC GCATTTTGTA AATTAGAGGT AGTCAATGAA CATGGAGAAG TTCGTTTCGT TCAAGGGGCG TTATATGAAG ATGAACATCA TGAGCTTGGG CTGCTGTTTG TAAAAGAACG GGAAAAATGG GCGCTAGAAG CCGTCTAA
|
Protein sequence | MSIWVLEALK GVGRLLVQPL FYYGIALALV IGWRRVKRER SYFSIRVYNM FHESKLFWRS GLVAGGILSL AAVAIGIVLP RDAISMIALV TIAIGLTMQM RLLSPAYTMG LVFFIVSILA NDKETAPALK RFFPELSETN MAALAILLAL LLFVEAWLIL QNGAIETSPQ LAKSKRGFVV GEHWSQRLWF VPVLLPVTGE LPSPFSWWPL FPVAGDSYSL MLVPFLIGFS ERVQGMHPKA SIRLTGKRVM LLAWIVSLFA IGGYWYAPLS MIAAALALIG REWLAFFQHR QDRLKPPYFS KREQGLVILG ILPNSKAEKM ELEIGEVITK VNGMTVKTET EFYEALQRNR AFCKLEVVNE HGEVRFVQGA LYEDEHHELG LLFVKEREKW ALEAV
|
| |