Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2048 |
Symbol | |
ID | 7977284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 2108968 |
End bp | 2110053 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644798866 |
Product | Spore coat protein CotH |
Protein accession | YP_002950036 |
Protein GI | 239827412 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5337] Spore coat assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000159417 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACTA GTCAGCTGCC TCTGTATGCG ATTTATATCC ATCCTAATGA TTTACAGGAG CTGCGCCGCG ACATCTGGAA CGATGACCCT GTTCCTGCCA CATTTACGTT TCAAAAAAAA TTTCATATCG ATATAAGCTA TCGAGGCTCA CACATTCGCA AATTTAAGAA AAAATCCTAT TTTATCTCGT TTTATAAACC TTACTTCTTT CATGGAGTTC ATGAACTTCA CTTGAACGCG GAATATAAAG ACCCTTCTCT CATAAGAAAT AAACTTTCCC TCGATTTTTT CTCGGCACTT GGCGTCCTTT CTCCTTCTTC CCGGCACGTT CTTTTATCGA TCAATGGAAA GCATGAGGGA ATTTATCTTC AATTAGAATC TGTGGATGAA TTTTTCTTAA AAAAACGTCA ATTGCCGGTA GGACCTATTT TTTACGCGAT TGATGATGAC GCTAATTTTT CGCTGATTGG TTCGTTTGAT AAGACCCCTA AAACGTCTTT GGATGCGGGC TATGAAAGGA AACTGGGAAC TCGAGAGGAT CACCGATATC TGGAGGAATT CATTTTCAAG TTAAACACAA CGCCAAAATA TGAGTACGAA TCTGTCATGT CAAAGCTTTT GAACGTCAAT AAATATTTGC GCTGGCTTGC AGGAGTTGTT TGCACGCAAA ATTTCGATGG CTTCGTTCAC AATTACGCGC TATATCGCAA TCCAGATTCC GGTTTATTCG AAATCATTCC GTGGGATTTT GACGCTACTT GGGGACGGGA TATTAACGGC AAACAAATGG ATTATGATTA CGTACGTATC GAAGGATTTA ATACGTTAAC CGCTCGATTA TTAGATATAA AGCGCTTCCG AAAAATGTAT TATGACATCA TGAAACATAC ATTGGATCAC GAATTCACTG TAGAGTTTAT GAAGCCAAAA GTAGAGCAAC TTTACCAGCA ATTGCGGCCC CATGTCGTCA ATGATCCGTA TATAAAAGAC CGTATTGAAC AGTTTGATGG TGAACCGAAG CGGATTTGCG ATTTTCTCGA AAAACGAAAC ACTTATTTAA AAAATCAGCT ATCTACCCTT CTATAA
|
Protein sequence | MSTSQLPLYA IYIHPNDLQE LRRDIWNDDP VPATFTFQKK FHIDISYRGS HIRKFKKKSY FISFYKPYFF HGVHELHLNA EYKDPSLIRN KLSLDFFSAL GVLSPSSRHV LLSINGKHEG IYLQLESVDE FFLKKRQLPV GPIFYAIDDD ANFSLIGSFD KTPKTSLDAG YERKLGTRED HRYLEEFIFK LNTTPKYEYE SVMSKLLNVN KYLRWLAGVV CTQNFDGFVH NYALYRNPDS GLFEIIPWDF DATWGRDING KQMDYDYVRI EGFNTLTARL LDIKRFRKMY YDIMKHTLDH EFTVEFMKPK VEQLYQQLRP HVVNDPYIKD RIEQFDGEPK RICDFLEKRN TYLKNQLSTL L
|
| |