Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_0014 |
Symbol | |
ID | 8523790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | - |
Start bp | 23691 |
End bp | 24977 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | glycoside hydrolase family 18 |
Protein accession | YP_003251196 |
Protein GI | 261417514 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATCC ACGTAGTGCA GAGTGGACAA ACGTTAAGTG GAATTGCTGA GGCATACGGG ACCACGGCGG AAGAAATTGT CCGGGCCAAC AAGCTTCCAA ACCCTGATAA ACTCGTTGTC GGCCAGGCGC TCGTGATCCC GATCGTCGGT CGTTTTTACT GGGTGCAGCG CGGCGACACG TTATGGTCGA TTGCACGCCG ATTTTCGATT CCGATGCAGC GGCTTGCCGA AGTGAATCGC CTCTCCTTAA ACGCTCCGCT TAAGGTCGGG CAGCGGCTTT ATATACCGCC CGGCGCCAAG CGAAGAGCGG AGTTTAACGC CTATATTGAA CCGCGCGGCG CGACTGTCAG CCCAGCGCTG GAGGCGAGCG CTCGCGAAGC CGCTCCGTAT TTGACTTATT TGAGTCCTTT TTATTTTGCG ATCCGACGCG ACGCGACATT GCAAGAGCCG CCGCTTGACG ACTTTCCGGA TATTGCCCGC GCCAACCGCG TCACGCTCGT TATGGTTGTC GCCAACATTG AAAACGGGCA GTTCAGCGAC GAGCTCGGCG CGCTTATGTT AACAAACGAA ACGCTGCAAA ATCGTCTGCT CGACAACATT GTCGCGACCG CTAGACGGTA TGGCTTCCGC GACATCCATT TTGATTTTGA ATATTTGCGC CCGGAAGACC GTGAGGCGTA TAATGCGTTT TTGCGCAAAG CGAAACGGCG GTTTGCACGA GAAGGGTGGA TGATGTCGAC CGCCTTGGCG CCGAAAACGA GCGCGACCCA GCGCGGACGT TGGTACGAAG CACACGACTA CCGCGCCCAT GGACAAATTG TCGACTTTGT CGTGATCATG ACGTATGAAT GGGGCTACAG CGGCGGGCCG CCGATGCCGG TGTCCCCGAT CGGTCCGGTC CGCCGCGTTC TCGAGTACGC CATCTCTGAA ATGCCAGCCG GAAAAATTTT GATGGGGCAA AACTTGTATG GCTACGACTG GACGCTGCCA TATGTACCCG GCGGCCCGTA CGCCCAGGCC ATCAGCCCGC AGCAAGCCAT CGCCCTCGCC GCAAAGTATA ACGTTGCCAT CGAATACGAT ATGGAGGCGC AGGCACCGCA TTTTCGCTAT CGCGACGAAA ACGGACGCGA GCATGAAGTA TGGTTTGAGG ACGCCCGCTC CATTCAAGCA AAATTTAATC TCGTGAAGGA ACTTGGTTTG CGCGGGGTCA GCTATTGGAA ACTTGGCATT GATTTTCCAC AAAACTGGCG GCTGATCGCT GATCAATTTA CTGTTGTAAA AAAATAA
|
Protein sequence | MQIHVVQSGQ TLSGIAEAYG TTAEEIVRAN KLPNPDKLVV GQALVIPIVG RFYWVQRGDT LWSIARRFSI PMQRLAEVNR LSLNAPLKVG QRLYIPPGAK RRAEFNAYIE PRGATVSPAL EASAREAAPY LTYLSPFYFA IRRDATLQEP PLDDFPDIAR ANRVTLVMVV ANIENGQFSD ELGALMLTNE TLQNRLLDNI VATARRYGFR DIHFDFEYLR PEDREAYNAF LRKAKRRFAR EGWMMSTALA PKTSATQRGR WYEAHDYRAH GQIVDFVVIM TYEWGYSGGP PMPVSPIGPV RRVLEYAISE MPAGKILMGQ NLYGYDWTLP YVPGGPYAQA ISPQQAIALA AKYNVAIEYD MEAQAPHFRY RDENGREHEV WFEDARSIQA KFNLVKELGL RGVSYWKLGI DFPQNWRLIA DQFTVVKK
|
| |