Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1471 |
Symbol | |
ID | 7976917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1546045 |
End bp | 1547115 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644798375 |
Product | protein of unknown function DUF871 |
Protein accession | YP_002949548 |
Protein GI | 239826924 |
COG category | [S] Function unknown |
COG ID | [COG3589] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTATC TATCTTTCTA CTTATCAGAA CAAACGGATC AAATCGAAAA AAGATTTGCA CAGGCAAATT TGTTCGGATG TCGGGAACTA TTTACATCGC TTCATATTCC CGAAGACGAT CTTACCTTTT ATCGCAAGCG ATTACAGGAG ATCGGGCAGT TAGCGAGAAA ATACGGAGTC GGGATCATCG CTGACGTTAC CCCGGCTTCT TTATCCAAAA TTGGGGTGAA TGGAGACAAT CTTGATTTGC TTTTTGATGG AGGAATTATT GGACTGCGGC TTGATGATGG ATTTTCTATG AAAGAAGCAG CAACGTTTTC TCACCGGATG AAAGTTGTTT TCAATGCGAG CACGATGACG GAAGAAGAAT GCGACGATCT TGCTTTTTGT GACGTGAACT GGAATCAAAT TGAAGCGTGG CATAATTTTT ATCCACGTCC AGAAACAGGA TTATCAAAAG AATTAGTCAT TCAAAAAAAC AAGATTTTAC GCCGAAAAGG AATTCGAACG CTTGCCGCGT TTATTCCGGG AAATAAAGAA AAACGAGGTC CGCTCCATCA AGGGCTTCCT ACGCTAGAGG CTCATCGGTA TATGGATCCT CTTTGCGCGT ACGTGGAATT AGTGCGGGAT TGTGAGGTAG ATAAAGTATT TGTGGGCGAT GGCGGGATGA CAGATAATGT GCTTGTGCGA ATGAAGGAGT TTCGGGATGG GGTTATTCCG CTGCGCTACC GGCCGCTTGT GCAACAACAT GAACTGCTTT CCATGGTCGA AACGGTTCAA ACGAATCGGC GTGACGCGGC AAGGGATGTC ATTCGATCAC TGGAATCCCG CTTATCTTTC TCATGGCCAA AGCATTTATT AGCACCAGCA TGTACAATCG AAAGGCAAAA AGGCAGTGTT ACGATGGATA ATATTCGATA TGGACGATAT GCCGGTGAAC TGCAAATCAC ATTAACCGAT TTGCCAGCAG ATGAGAAAGT CAATGTGATT GGACGGATTA TCAAAGACGA TCTTCCTCTC CTTGCGTATG TCAAAGGAGG ACAACAGTTT CGCCTTGTGC GGATCACATA A
|
Protein sequence | MFYLSFYLSE QTDQIEKRFA QANLFGCREL FTSLHIPEDD LTFYRKRLQE IGQLARKYGV GIIADVTPAS LSKIGVNGDN LDLLFDGGII GLRLDDGFSM KEAATFSHRM KVVFNASTMT EEECDDLAFC DVNWNQIEAW HNFYPRPETG LSKELVIQKN KILRRKGIRT LAAFIPGNKE KRGPLHQGLP TLEAHRYMDP LCAYVELVRD CEVDKVFVGD GGMTDNVLVR MKEFRDGVIP LRYRPLVQQH ELLSMVETVQ TNRRDAARDV IRSLESRLSF SWPKHLLAPA CTIERQKGSV TMDNIRYGRY AGELQITLTD LPADEKVNVI GRIIKDDLPL LAYVKGGQQF RLVRIT
|
| |