Gene GWCH70_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2048 
Symbol 
ID7977284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2108968 
End bp2110053 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content39% 
IMG OID644798866 
ProductSpore coat protein CotH 
Protein accessionYP_002950036 
Protein GI239827412 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5337] Spore coat assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000159417 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCACTA GTCAGCTGCC TCTGTATGCG ATTTATATCC ATCCTAATGA TTTACAGGAG 
CTGCGCCGCG ACATCTGGAA CGATGACCCT GTTCCTGCCA CATTTACGTT TCAAAAAAAA
TTTCATATCG ATATAAGCTA TCGAGGCTCA CACATTCGCA AATTTAAGAA AAAATCCTAT
TTTATCTCGT TTTATAAACC TTACTTCTTT CATGGAGTTC ATGAACTTCA CTTGAACGCG
GAATATAAAG ACCCTTCTCT CATAAGAAAT AAACTTTCCC TCGATTTTTT CTCGGCACTT
GGCGTCCTTT CTCCTTCTTC CCGGCACGTT CTTTTATCGA TCAATGGAAA GCATGAGGGA
ATTTATCTTC AATTAGAATC TGTGGATGAA TTTTTCTTAA AAAAACGTCA ATTGCCGGTA
GGACCTATTT TTTACGCGAT TGATGATGAC GCTAATTTTT CGCTGATTGG TTCGTTTGAT
AAGACCCCTA AAACGTCTTT GGATGCGGGC TATGAAAGGA AACTGGGAAC TCGAGAGGAT
CACCGATATC TGGAGGAATT CATTTTCAAG TTAAACACAA CGCCAAAATA TGAGTACGAA
TCTGTCATGT CAAAGCTTTT GAACGTCAAT AAATATTTGC GCTGGCTTGC AGGAGTTGTT
TGCACGCAAA ATTTCGATGG CTTCGTTCAC AATTACGCGC TATATCGCAA TCCAGATTCC
GGTTTATTCG AAATCATTCC GTGGGATTTT GACGCTACTT GGGGACGGGA TATTAACGGC
AAACAAATGG ATTATGATTA CGTACGTATC GAAGGATTTA ATACGTTAAC CGCTCGATTA
TTAGATATAA AGCGCTTCCG AAAAATGTAT TATGACATCA TGAAACATAC ATTGGATCAC
GAATTCACTG TAGAGTTTAT GAAGCCAAAA GTAGAGCAAC TTTACCAGCA ATTGCGGCCC
CATGTCGTCA ATGATCCGTA TATAAAAGAC CGTATTGAAC AGTTTGATGG TGAACCGAAG
CGGATTTGCG ATTTTCTCGA AAAACGAAAC ACTTATTTAA AAAATCAGCT ATCTACCCTT
CTATAA
 
Protein sequence
MSTSQLPLYA IYIHPNDLQE LRRDIWNDDP VPATFTFQKK FHIDISYRGS HIRKFKKKSY 
FISFYKPYFF HGVHELHLNA EYKDPSLIRN KLSLDFFSAL GVLSPSSRHV LLSINGKHEG
IYLQLESVDE FFLKKRQLPV GPIFYAIDDD ANFSLIGSFD KTPKTSLDAG YERKLGTRED
HRYLEEFIFK LNTTPKYEYE SVMSKLLNVN KYLRWLAGVV CTQNFDGFVH NYALYRNPDS
GLFEIIPWDF DATWGRDING KQMDYDYVRI EGFNTLTARL LDIKRFRKMY YDIMKHTLDH
EFTVEFMKPK VEQLYQQLRP HVVNDPYIKD RIEQFDGEPK RICDFLEKRN TYLKNQLSTL
L