Gene Information |
Locus tag | GYMC61_3304 |
Symbol | |
ID | 8527192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | - |
Start bp | 3365049 |
End bp | 3366344 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003254339 |
Protein GI | 261420657 |
COG category | |
COG ID | |
TIGRFAM ID | |
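The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TAA). The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3365049
end_bp = 3366344

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1296
print(protein_length)  # 431
```

Both values match the "Gene Length" and "Protein Length" rows of the table.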
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAGA AAGCATCCGC ATGGCTGGCG CTGGCGCTTG GAATTGGGAT GGCGCTGTCG GGCTGCAGCA GCTCGAACTC TTCATCCGAT AACGCCAAAC AAGGCAGCCA AAAAGCGGGA GAGAAGCTCG AGATTTTCAG CTGGTGGACC GGAGCCGGTG AGGAAGACGG CTTAAAAGCG CTCATCAAAC TGTTCCAAGA AAAATATCCG GATATTGAAG TCGAAAACGC CGCGGTCGCC GGCGGGGCCG GAACGAACGC CAAGGCGGTG TTGGCGAGCC GCATGCAAGG GAACGACCCG CCATCAACGT TCCAAGTGCA CGGCGGAGCG GAGCTGAATG AAGGCTGGGT GGCGGCTGGC AAAATGGAGC CGCTCAACGA TTTGTATGAA AAAGAAGGCT GGATGGACAA ATTCCCGAAA GCGCTTATCG ATATGGTAAG CAAAGACGGC AACATTTACT CCGTGCCTGT CAACATTCAC CGCGGCAACG TGCTGTGGTA CAACAAAAAA ATCTTCGCTG ACAACGGACT CCAGCCGCCG AAAACGTTCG ACGAGTTTTT CCAAGTGGCT GACAAGCTGA AAGCAAAAGG CATTACGCCG CTCGCCTTGG GCGACAAAGA GCCGTGGGCT GCGACGCACC TGTTTGAAAA TGTGCTGCTT GGCACGCTCG GAACGGAAAA CTATAAGAAG CTTTGGACGG GCGAATTGTC GTTTAACGAC CCACAAGTGA AACAAGCCGT CGAGACGTTT AAGAAAATGC TTGGCTATAT TAACGAAGAC CATAGCTCGC GCAACTGGCA AGACGCCGCC CAGCTCGTCG CTGAGGGGAA AGCGGCCATG TACGTGATGG GCGATTGGGT GAAAGGCTAT TTTGTCAACG ATTTGAAATT GAAGGTGAAC CAAGACTTCG GCTATGTGCC AGTGCCGAAT ACGGAAGGCA AGTTTATGGT CATTACTGAT ACGTTCGGCC TGCCGAAAGG CGTGAAAAAC CCGGATGATG TGAAGAAATT TTTAGCGGTG CTTGGTTCGG TGGAAGGGCA AGATGCGTTT AACCCGCTGA AAGGCTCCAT CCCGGCCCGC ATCGACGCGG ATCCGTCCAA GTACGATGAA TACGGCAAAC AAACGATGCA AGACTTCAAA ACGGCGGAGC TGGCGCCGAG CTTAGCGCAC GGTTCAGCAG CGCCGGAAGG GTTTGTCACG AAGGTGAATC AGGCCGTCAA CATTTTCGTG ACGCAAAAAG ATGTGAAGAC GTTCATCGAC ACGTTGGCAT CGGCCGCCGC AGAACTGAAG AAGTAA
|
Protein sequence | MRKKASAWLA LALGIGMALS GCSSSNSSSD NAKQGSQKAG EKLEIFSWWT GAGEEDGLKA LIKLFQEKYP DIEVENAAVA GGAGTNAKAV LASRMQGNDP PSTFQVHGGA ELNEGWVAAG KMEPLNDLYE KEGWMDKFPK ALIDMVSKDG NIYSVPVNIH RGNVLWYNKK IFADNGLQPP KTFDEFFQVA DKLKAKGITP LALGDKEPWA ATHLFENVLL GTLGTENYKK LWTGELSFND PQVKQAVETF KKMLGYINED HSSRNWQDAA QLVAEGKAAM YVMGDWVKGY FVNDLKLKVN QDFGYVPVPN TEGKFMVITD TFGLPKGVKN PDDVKKFLAV LGSVEGQDAF NPLKGSIPAR IDADPSKYDE YGKQTMQDFK TAELAPSLAH GSAAPEGFVT KVNQAVNIFV TQKDVKTFID TLASAAAELK K
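The protein sequence can be derived from the gene sequence with translation table 11 (listed above). The following is a rough sketch, not the database's own pipeline: it assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TAA, even though the gene lies on the minus strand) and that table 11 assigns the same amino acids to internal codons as the standard code. The function name `translate` and the table layout are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: translation table 11 shares these amino acid assignments with
# the standard code; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record prints the sequence in blocks of 10 separated by spaces.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from this record.
print(translate("ATGAGAAAGA AAGCATCCGC ATGGCTGGCG"))  # MRKKASAWLA
```

Passing the full "Gene sequence" field to `translate` should reproduce the "Protein sequence" field once the spaces in the latter are removed.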