Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2969 |
Symbol | |
ID | 7977269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2991186 |
End bp | 2992157 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644799769 |
Product | protein of unknown function UPF0052 and CofD |
Protein accession | YP_002950908 |
Protein GI | 239828284 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000778436 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTGA AGCATCAGCC TAAAATTGTC ATTATCGGCG GCGGAACAGG GCTCCCTGTA TTGTTGCGCG GCTTGAAGCA GTATGCCATT GATATTACGG CAATCGTTAC TGTCGCTGAT GATGGCGGCA GCTCAGGAAG ATTGCGCGAT GAATTAGATA TTCCGCCACC GGGAGATGTG CGTAATGTAT TGGCAGCGCT GTCAGATGTC GAGCCGCTTA TTGTTGAATT GTTCCAACAT CGATTTAAAA ACGGCAACGG TCTATCAGGA CATTCGTTAG GTAATTTAAT TTTGGCAGCG TTGACGTCCA TTACCGGCGA TTTTGTCAAA GCCATCCGCG AGATGAGCAA AGTGTTGAAA GTGCACGGAC AAGTATTGCC AGCGGCGAAT AAAAGCGTCG TTCTTCATGC GGAAATGGAA GATGGCGTCA TTGTTTCTGG AGAATCGAAA ATTCCTTATT CCGGCAAAAG GATTAAAAAA GTATTTTTGA CACCGGAAAA TATTGAACCG CTTCCAGAAA CGATTGAAGC GATTCGTTCT GCGGATTTGA TCGTCATCGG TCCGGGAAGT TTATATACGA GTATTTTGCC AAATTTACTT GTGCCGAAAA TCGGGCAAGA AGTTTGTCAA GCAAAAGCAA AAAAAGTGTA TATATGTAAT GTGATGACCC AAGCGGGTGA GACGCTGCAC TACACGGTAA GCGACCATGT CAAAGCGCTC CATGACCATA TGGGATGTTT ATTTTTGGAT GTTGTTGTTG TGAATAATGG TCATATTCCG GAAGAAATTC AAAAACGTTA TGCAGAAGAG CTGGCGGAAC CGGTGAAGGA TGATAGCGAT CGGCTCATCG ATTTAGGAAT TCAAGTGATT CGCGATAATA TCGTCAGCTA TGAAGATCAC GTCATTCGTC ACGATACGAA AAAAGTGGCA TCGTTACTTA TTTCGCTGAT TACAGCACCT CCTTCCTCCT AA
|
Protein sequence | MSVKHQPKIV IIGGGTGLPV LLRGLKQYAI DITAIVTVAD DGGSSGRLRD ELDIPPPGDV RNVLAALSDV EPLIVELFQH RFKNGNGLSG HSLGNLILAA LTSITGDFVK AIREMSKVLK VHGQVLPAAN KSVVLHAEME DGVIVSGESK IPYSGKRIKK VFLTPENIEP LPETIEAIRS ADLIVIGPGS LYTSILPNLL VPKIGQEVCQ AKAKKVYICN VMTQAGETLH YTVSDHVKAL HDHMGCLFLD VVVVNNGHIP EEIQKRYAEE LAEPVKDDSD RLIDLGIQVI RDNIVSYEDH VIRHDTKKVA SLLISLITAP PSS
|
| |