Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_3481 |
Symbol | |
ID | 8527369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 3539654 |
End bp | 3541066 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | glycoside hydrolase family 1 |
Protein accession | YP_003254511 |
Protein GI | 261420829 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTATA CTCAATTAAA ACCGTTTCCA ACGGGGTTTT TATGGGGCGG TTCGACGTCT GCTTACCAAG TCGAAGGCGC ATGGAACGAA GACGGAAAAG GGCCGTCGGT CATCGATATG GCCAAACATC CGGAAGGAAC GACCGATTTC AAAGTCGCCA GCGACCATTA TCACCGGTAT CAAGAAGATA TCGCTTTGCT CGCAGAAATG GGGTTTAAAG CGTATCGCTT TTCCATCGCT TGGACGCGCA TTTATCCGAA CGGCGAAGGG GAAGTGAACC CAAAAGGATT GGAATTTTAC AACAACTTGA TTAATGAGAT TGTCCGCCAT GGCATCGAAC CGATCGTGAC GATCTATCAT TTCGATTTGC CGTACGCCTT GCAAACGAAA GGCGGATGGT CGAACCGTGC GACTATCGAT GCGTTTGTCA ACTACTGCCG GACGCTGTTT GAACATTTTG GCGACCGTGT AAAGTATTGG TTGACCATTA ATGAGCAAAA TATGATGATC CTTCACGGGG AAGCCATTGG CATTGTCGAT CCCGACAGCG AAAACCCGAA AAAAGAGCTA TACCAGCAAA ACCACCATAT GTTTGTCGCC CAAGCCAAAG CGATGGCGCT TTGCCACGAA ATGCTTCCTG ATGCAAAAAT CGGGCCGGCG CCGAATATTG CGACGATTTA TCCGGCGAGC TCCAAGCCGG AAGATGTGCT CGCCGCCAAC ACGTATTCAG CGATTCGCAA CTGGTTGTAC TTAGATATGG CCGTCTACGG CCGCTACAAT CCGACAGCGT GGGCGTATTT AGAAGAAAAA GGCTATACCC CAACGATTGC AGACGGAGAT ATGGACATCT TGCAAAACGC GAAACCGGAT TTCATCGCTT TTAACTACTA TACGTCACAA ACAGTAGCCG CCAGCGTGGG GAATGAGAGC GATATCGGCC ATACGGGAGA CCAACATATT ACAATTGGCG AACCGGGCGT ATACAAAGGC GCATCCAACC CGAACTTGCC GAAAAACGAC TTCGGCTGGG AAATTGACCC GATCGGCTTC CGAACAACGC TTCGGGAAAT TTATGAGCGC TACCGGTTGC CGCTCATCGT AACCGAAAAC GGGTTAGGAG CTTACGATCG ATTAGAAGAA GGGGATATCG TGAACGACAC ATACCGGATC GACTTTTTGC GCAACCATAT TGAACAAATG CGCCTCGCCA TCACGGACGG CGTCGACGTG TTCGGCTACT GCCCGTGGTC GGCGATCGAC TTAGTCAGCA CCCACCAAGG CATCAGCAAA CGATACGGGT TCATTTACGT CAACCGCGAC GAATTTGATT TGAAAGATTT GCGCCGTATC CGCAAACAAA GCTTTTATTG GTACCAACGG GTCATCTCCT CGAACGGCGA ACAGCTCGAC TAA
|
Protein sequence | MKYTQLKPFP TGFLWGGSTS AYQVEGAWNE DGKGPSVIDM AKHPEGTTDF KVASDHYHRY QEDIALLAEM GFKAYRFSIA WTRIYPNGEG EVNPKGLEFY NNLINEIVRH GIEPIVTIYH FDLPYALQTK GGWSNRATID AFVNYCRTLF EHFGDRVKYW LTINEQNMMI LHGEAIGIVD PDSENPKKEL YQQNHHMFVA QAKAMALCHE MLPDAKIGPA PNIATIYPAS SKPEDVLAAN TYSAIRNWLY LDMAVYGRYN PTAWAYLEEK GYTPTIADGD MDILQNAKPD FIAFNYYTSQ TVAASVGNES DIGHTGDQHI TIGEPGVYKG ASNPNLPKND FGWEIDPIGF RTTLREIYER YRLPLIVTEN GLGAYDRLEE GDIVNDTYRI DFLRNHIEQM RLAITDGVDV FGYCPWSAID LVSTHQGISK RYGFIYVNRD EFDLKDLRRI RKQSFYWYQR VISSNGEQLD
|
| |