Gene Information |
Locus tag | GWCH70_0035 |
Symbol | |
ID | 7979424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 47068 |
End bp | 48288 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644796995 |
Product | 3D domain protein |
Protein accession | YP_002948243 |
Protein GI | 239825619 |
COG category | [S] Function unknown |
COG ID | [COG3583] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
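The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes inclusive, 1-based coordinates (a common convention for such records, not stated in the record itself) and a coding sequence that ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(47068, 48288)
print(length, protein_length(length))  # 1221 406
```

Under those assumptions the results agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.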
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.686039 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACCCA ATATGAAGAA GATTTCCGGT TCATTGAGGA AAAATTTCAC TGTTACTGCT AGTAGTTTTA TAGCTTTGTC AGCAACAACG GGATTTGCTG GGTACGAAAT AGCCAAAGAT GATGTGATAC TAACGGCAAA CGGAAAAAAA CAAGAGATTC GTACTCATGC AAAAACGGTA AAAGAAGTAT TACAGGAGCA AAATATTAAG CCAAGGAAAG AAGATCGCGT CTATCCATCG TTAGATACGC CGATTACAGA TGATTTAAAC ATCGTTTGGG AAGCGAGCAA GAAAGTCACT TTGACAGTAG ATGGAAAAAA ACAAGAAATA TGGACGACTG CAAAGAACGT TGCCGAGCTA TTAAACTCAC AACATATTAA AATCGAAAAG CACGACAAAA TTGCCCCAGC ACCAAATACA AAAATTAAAA AGGGAATGAA AGTCAACATA GAAAAGGCAT TCCCAGTTCA ATTAAATGTC GGCGGTAAAC AGCAACAAGT GTGGGCAACT TCGACTACTG TCGCTGACTT TTTAAAACAA CAAAATGTAA AATTAGATGA ACTCGACCGT GTAGAGCCAT CTTTACAGGA CAAACTAAAA GAAAATATGG TCGTAAAAGT GATTAAAGTT GAAAAAGTCA CCGATGTAGT GGAAGAACCA GTTGACTTTG CAGTCGTCAC TAGACAAGAT GCACAATTGC CAAAAGGAGA ACAACGTATT ATTAGCCCTG GAGAAAAAGG GCGAGTTTCG AAAAAGTATG AAGTAGTGCT CGAAAATGGA AAAGAAGTAT CGCGGAAATT AATTGAGACA AAGATGATAA AAGAAAGTAA AAATCGGATT GTTGCTATCG GTACGAAAGT GGCTAAAAGC CGACCTGCTC ATACGCAAAG CCGCTCTGTT CAGACCGTAT CGCGCGGCCA AAAGCATGCA GCGCGAGAAA TTTATGTTGT TGCTAGCGCC TATACTGCTT ATTGTCAAGG ATGTTCAGGA ACAACGAGAA TGGGAATTAA CTTGCGTGCA AATCCTTCTG CAAAAGTAAT CGCAGTGGAT CCAAACGTTA TTCCGCTTGG ATCAAAAGTG TACGTAGAAG GATACGGATA TGCTATAGCT GCTGATACAG GATCAGGTAT TAATGGTTAT GAAATTGACG TGTTTATTCC AAAGCAATCG GATGCACTTC GTTGGGGTAG AAAGCGTGTG AAGGTGAGAA TTCTTCAATA A
|
Protein sequence | MLPNMKKISG SLRKNFTVTA SSFIALSATT GFAGYEIAKD DVILTANGKK QEIRTHAKTV KEVLQEQNIK PRKEDRVYPS LDTPITDDLN IVWEASKKVT LTVDGKKQEI WTTAKNVAEL LNSQHIKIEK HDKIAPAPNT KIKKGMKVNI EKAFPVQLNV GGKQQQVWAT STTVADFLKQ QNVKLDELDR VEPSLQDKLK ENMVVKVIKV EKVTDVVEEP VDFAVVTRQD AQLPKGEQRI ISPGEKGRVS KKYEVVLENG KEVSRKLIET KMIKESKNRI VAIGTKVAKS RPAHTQSRSV QTVSRGQKHA AREIYVVASA YTAYCQGCSG TTRMGINLRA NPSAKVIAVD PNVIPLGSKV YVEGYGYAIA ADTGSGINGY EIDVFIPKQS DALRWGRKRV KVRILQ
|
| |
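The sequence fields are printed in blocks of ten characters, so any check on them has to strip the spacing first. The following is a minimal sketch, assuming the gene sequence is pasted in as a plain string; the helper names are hypothetical, and the stop codon set is the one used by translation table 11.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def ends_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[-3:] in STOP_CODONS


# Toy input only; substitute the full Gene sequence field to check this record.
toy = "ATGAAAGGCT AA"
print(len(clean(toy)), ends_like_cds(toy))
```

Run on the full Gene sequence field, `gc_fraction` gives a value that can be compared against the reported GC content of 40%, and `len(clean(...))` one that can be compared against the 1221 bp gene length.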