Gene Information |
Locus tag | GYMC61_3366 |
Symbol | |
ID | 8527254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | - |
Start bp | 3429385 |
End bp | 3430710 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003254397 |
Protein GI | 261420715 |
COG category | |
COG ID | |
TIGRFAM ID | |
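The coordinates and lengths listed above are mutually consistent, and a short calculation can confirm this. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Both function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(3429385, 3430710)
print(length)                     # 1326
print(protein_length_aa(length))  # 441
```

The strand ("-") does not affect the length calculation; it only determines which strand is read when extracting the sequence from the replicon.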
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC GCAAGTGGTT CAAACTGGCT TCGTTGTTGC TTGCCGCTGC GGTGGCGGGA ACAGGCTGTC AAGGACAGAC CGGGAATGAA CAAGGGCAGA AGAAGGAAGG ATCAACCGTA CAAATTGAGT TTTGGGCGGC GCCGAATCCG ACGCAGCAGG CATTCTGGAA GAAGATGGCG GATCGCTATA TGGAAGAGCA TCAAAACGTG AAAATCAAAG TATCGCCGAT GCCGGAAAGC CCTTCTTCGG AGGCGGGCAT TCAGTCGGCG ATTGCCGCGG GAAATGCGCC GGTCATTTCC GAAAACATCT CCCGCGGGTT TGCCGCACAG CTGGCGGACA GCCGGGCGAT TGTGCCGCTG GATCAATTTG AAGGGTTTGA CGACTTAATC GCCAAGCGGC AGATGAAAAG CACGATTTCA ACATGGAAAT TTGCCGACGG CCATCAATAT GTGCTGCCGA TTTACTCCAA CGCCATGCTG TTCGGTTGGC GAATCGATAT TCTGAAAGAG TTGGGGTATA ACGCTCCGCC GAAGACGTAC AGCGAAGTCA TGGAAGTCGG CAAAAAGCTG AAAGAAAAAT ATCCGAACAA GTTTCTATGG GCGAGAGCCG ATTTGGTTAA ACCGACGTGG TGGGCGAGAT GGTTTGACTT CTTCATGATT TACAATGCGG CGTCGAATGG GACGCATTTC ATCGATGGGA ACAAGTTGAG TGCGGATCGT GACGCGGGTG TGAAGACGCT CCAGTTCTTC GCTGATTTAA GCAAAAATCA GCTCGTGTTG ACGAAAGAGA CGAAAGACCC GTTTGAAAGC GGTACATCCG TGATGTCTGA TTTAGGGCCG TGGACGTTCC CGTATTGGGC TGAAAAGTTT CCGGAAATGA AGTTTAACGA GAAATACGTG TTGTCGATGC CGCCTGTGCC TGATGGCATG GATCCGTCGC AGGCGAAAAC GTTCGCCGAT ACAAAAGGGC TGGTTATTTA CGCTTCGGCG ACGAAAGAGC AGCAACAAGC CGCGTTCGAT TTCGTCAAAT GGGTGTATTC CGATCCGCAA AACGATTTGG AATGGCTGAA AGAAACGAAC TTGCCGCCGG CGCGGGACGA CTTGTCGACG AACGAGGCGT TTGTCTCCTA TTTTGAACAA AATCCGCAGC TGAAACTGTA TGCGGAAAAC ATTCCGAACG CCATACCGCC TATGGATAAC GCCAAAATGG TTGAGCTTCA AGAGCTGATC GGCAAAGAGG CGTTGAATCC GGTCGTCAAA GGCGAGAAAA CCCCAGAAAA AGCGTGGGAG GACATGGAAA AGGCGATCCA TGGGGTGTTA AAATAA
|
Protein sequence | MKKRKWFKLA SLLLAAAVAG TGCQGQTGNE QGQKKEGSTV QIEFWAAPNP TQQAFWKKMA DRYMEEHQNV KIKVSPMPES PSSEAGIQSA IAAGNAPVIS ENISRGFAAQ LADSRAIVPL DQFEGFDDLI AKRQMKSTIS TWKFADGHQY VLPIYSNAML FGWRIDILKE LGYNAPPKTY SEVMEVGKKL KEKYPNKFLW ARADLVKPTW WARWFDFFMI YNAASNGTHF IDGNKLSADR DAGVKTLQFF ADLSKNQLVL TKETKDPFES GTSVMSDLGP WTFPYWAEKF PEMKFNEKYV LSMPPVPDGM DPSQAKTFAD TKGLVIYASA TKEQQQAAFD FVKWVYSDPQ NDLEWLKETN LPPARDDLST NEAFVSYFEQ NPQLKLYAEN IPNAIPPMDN AKMVELQELI GKEALNPVVK GEKTPEKAWE DMEKAIHGVL K
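To illustrate how the protein sequence relates to the gene sequence, a minimal translation sketch follows. It assumes the gene sequence is written 5' to 3' on the coding strand (as the leading ATG suggests) and that the spaces every ten bases are display formatting only. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts, so the standard mapping is used here. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares them
# (it differs from table 1 only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Whitespace is removed first; an incomplete trailing codon is ignored.
    Alternative start codons (GTG, TTG) are NOT remapped to M in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence shown above.
print(translate_cds("ATGAAAAAGC GCAAGTGG"))  # MKKRKW
```

Applied to the full gene sequence, the same function should reproduce the 441-residue protein sequence once the display spaces are removed from both strings.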