Gene Information |
Locus tag | GWCH70_1635 |
Symbol | |
ID | 7976281 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1711139 |
End bp | 1712377 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644798518 |
Product | hypothetical protein |
Protein accession | YP_002949690 |
Protein GI | 239827066 |
COG category | [R] General function prediction only |
COG ID | [COG2404] Predicted phosphohydrolase (DHH superfamily) |
TIGRFAM ID | |
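The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 1711139
end_bp = 1712377
gene_length_bp = 1239
protein_length_aa = 412


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 1712377 - 1711139 + 1 = 1239 bp; 1239 / 3 - 1 = 412 aa
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Both assertions pass for this record, so the reported gene and protein lengths agree with the coordinates.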
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.805744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAAAT TGTTTACGGA CAGCGATTTA GACGGAATTG GGTGCGGACT GTTGGCGAAA ATTGCGTTTA AAGAAGTAAA CATATCATTT TGTTCCTACC GTAAGTTAGA TGAACGAGTG AAACAATTTA TTGAAGATGA ACAGCATAAC GAAGCAAGTG TATTTATTAC CGATCTGGCG GTCAGTGAAG AGGTGGAGAA AAAACTGGCG GAACGATTTG AAGCAGGAAA ACACGTTCAA GTTATCGACC ACCATGTCAC TGCCCTCCAT TTTAATAAGT ATCCGTGGGG GTGGGTGCAG CCAACGGATG AACAAGGCAA AAAAACGTGC GCAACCTCGC TATTTTATGA ATATTTAATC CGTGAACAAA AACTGGAACG AAACGAAACG CTCGATGAAT TTGTAGAGCT TGTCCGCCAA TACGATACAT GGGAATGGGA AGAAACAGAC AACACGCGGG CAAAGCGGCT AAATGATTTG CTGACGATTT TAGGCCTGGA CGAATTTTGG GATCGCATGA GTGAACGGTT AACAGAAGGA GGTCCATTTG CTTTAACCGA AACGGAAGAA CTTATTTTGG ATATGGAAGA AAAGAAAATT CAGCGCTACA TCCGAATGAA ACAAAAACAA CTTGTCCAGC GCTGGTTTGA TGATTATTGT GTCGGTATTG TTTTTGCCGA ACAACATATG TCTGAACTAG GTAATGCATT ATCCAAACGT TGTCCTCATT TAGATTTAAT CGCCATGGTT AATCTCGGCA CGAAACATAT TGGGTTTCGA ACGATTCACG ACAACGTGAA TGTTGCGGAG TTTGCGAAAC AGTTTGGAGG AGGAGGCCAT CCGAAAGCAT CCGGATGCTT TGTTAATGAA ACAACATTCC CGCTTTTTGT CGTCGATGTG TTCTCGCTTC CGCCGGTATA TCATGATGTG GAACAAAACC AGCTCAATAC AAAAGATCAA ACAGAAGGGT TCTTTTTTAC CAATCATCAA GGACAATGGT TTTTCTTTCA TCCAAGCGAT GACAAATGGG GCGTTTATCA TCAAGAACAG GAAGTACAAT CGTTTTCGAG CCAAGAAGAG GCGGAACGGT TTATAAAACG TCAATTCGCA GCGGGACTGG CGGACGATCA AGCGGTGATT GACTTTTTGC AGCAACAGCT ATGTATAGAA AAAGAAAAGA TCAAAGATGA GTATATCAAT GCCCTGCAAC AATATAAACA AAAAGCCATC ACCAAATAA
Protein sequence | MIKLFTDSDL DGIGCGLLAK IAFKEVNISF CSYRKLDERV KQFIEDEQHN EASVFITDLA VSEEVEKKLA ERFEAGKHVQ VIDHHVTALH FNKYPWGWVQ PTDEQGKKTC ATSLFYEYLI REQKLERNET LDEFVELVRQ YDTWEWEETD NTRAKRLNDL LTILGLDEFW DRMSERLTEG GPFALTETEE LILDMEEKKI QRYIRMKQKQ LVQRWFDDYC VGIVFAEQHM SELGNALSKR CPHLDLIAMV NLGTKHIGFR TIHDNVNVAE FAKQFGGGGH PKASGCFVNE TTFPLFVVDV FSLPPVYHDV EQNQLNTKDQ TEGFFFTNHQ GQWFFFHPSD DKWGVYHQEQ EVQSFSSQEE AERFIKRQFA AGLADDQAVI DFLQQQLCIE KEKIKDEYIN ALQQYKQKAI TK
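The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it joins the blocks, computes GC content, and checks that a nucleotide string looks like a complete CDS under translation table 11 (stop codons TAA, TAG, TGA). The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between printed blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence shown above.
excerpt = clean_sequence("ATGATCAAAT TGTTTACGGA")
print(len(excerpt))         # 20
print(gc_percent(excerpt))  # 30.0 (6 of 20 bases are G or C)
```

Passing the full gene sequence to `gc_percent` should give a value comparable to the reported GC content, and `ends_with_stop` can confirm that the sequence terminates in a stop codon.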