Gene Information |
Locus tag | GWCH70_2049 |
Symbol | |
ID | 7977285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 2110175 |
End bp | 2111383 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644798867 |
Product | hypothetical protein |
Protein accession | YP_002950037 |
Protein GI | 239827413 |
COG category | |
COG ID | |
TIGRFAM ID | |
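The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (this assumption reproduces the listed 1209 bp) and that the final codon of the CDS is a stop codon; the function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2110175, 2111383)
print(length)                     # 1209
print(protein_length_aa(length))  # 402
```

Both values agree with the Gene Length and Protein Length fields of this record.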
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 4.26052e-08 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATAAGAC AACAAAAAGA TAGTCAATCA GTGGAGTCAA ATGCTTATCT AAAATCGTTA GTGGGTAAAA GAATAAAAGT ATACAGAGGG GGACCAGAAT CTTGCGAAGG TAGGCTTTTA GATGTTCAAT CCGATTATGT TGCACTAATG CCAGAGCAAT CAAACGACAA TAAACAAAAT AACAACGCAA ACAATAACAA CAACGCAAAA AAGGACAACG CAATTATTTA CTATAACTTA AGACATGTCC AAAGCATTAG TGAAAATTTA AAAGCTAACT CTGTTGAATC TAACTTAAAA CATGTCCAAA ACATCGTCCA AAGCATCAGT GAAAATATTT ATAGCAACTT AAGACAGGTC CAAAGCATTA GTGAAAATTT AATCGCTAAC TTAAAACATG TTCAAAGCAT CGATAAAAAT TCAACAGCTA ACTCTGTTGA ATCTAACTTA AAACTTGCCC AAAACGTCAG TGAAAATTCA AAAGCGAACT CTATTGAATG GTCGATCGAT TTTGATAATC ATCCAGAATT GGTGTCAGTG AATAATTTCA CAGAATTATT AAAAAATTTA ACTGGCAGCA TGGTCAAGGT GAATAAAGGC GGTCCTGAAT CTAAAAAAGG AATAGTGCTC CTCGTTGCTG GTGATTATAT GGGCCTCTTA ACGGAAGACG ATGGCATTGT ATTTTATAAT ACAACTCACA TCAAAAGTAT AAGCGTGCAA AATAGAAGCC AAAATATCGA TCAAAATATC GATCAAAGCA CTCCTCTCAG CAATTCCCCT ATCCATTATG ATAATTATTT TGATGACATC CATGCACAAA ACTTCCTTGA ATTATTTGAT TATTTTGCTT ATAAATGGGT CTCGATTAAC CGCGGCGGTC CTGAAGCAGC AGAAGGAATT CTTGTGCAAG AAGAAGGAGA ACATTATACG TTAGTGAACA ATGATGAAGT CATTCGAATT TACCCTTATC ACATTAAAAG CATCAGTATT GGTACAAAAG GCTTTCTCAA ACAACAACAG CAGCAACAAA ATAATAATGA AGCTGCTGAG AATGAAAACT CCCAAGATGT GAATATGACC AACAAAGTAG AAGATAACAG AACAGCGGGC AGAGAACAGC GTTCAAGTCG AAGAAGCTCT CCACAGGAAA CCATCGTTAA GGAAACAATC GTTAAAACGA TTGATTATAT TTGGGATCCG AAACGATAA
Protein sequence | MIRQQKDSQS VESNAYLKSL VGKRIKVYRG GPESCEGRLL DVQSDYVALM PEQSNDNKQN NNANNNNNAK KDNAIIYYNL RHVQSISENL KANSVESNLK HVQNIVQSIS ENIYSNLRQV QSISENLIAN LKHVQSIDKN STANSVESNL KLAQNVSENS KANSIEWSID FDNHPELVSV NNFTELLKNL TGSMVKVNKG GPESKKGIVL LVAGDYMGLL TEDDGIVFYN TTHIKSISVQ NRSQNIDQNI DQSTPLSNSP IHYDNYFDDI HAQNFLELFD YFAYKWVSIN RGGPEAAEGI LVQEEGEHYT LVNNDEVIRI YPYHIKSISI GTKGFLKQQQ QQQNNNEAAE NENSQDVNMT NKVEDNRTAG REQRSSRRSS PQETIVKETI VKTIDYIWDP KR
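The relationship between the two sequence fields can be illustrated with a minimal translation sketch. It assumes the sequence is written in space-separated blocks, as above, and uses the standard codon assignments; translation table 11 shares those assignments and differs from the standard table only in which codons may act as start codons, which does not matter here because this gene begins with ATG. The helper names `translate` and `gc_percent` are hypothetical and written only for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from this record.
print(translate("ATGATAAGAC AACAAAAAGA TAGTCAATCA"))  # MIRQQKDSQS
```

The output matches the first ten residues of the protein sequence field. Passing the full gene sequence to `translate` and `gc_percent` in the same way allows the Protein sequence and GC content fields to be cross-checked.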