Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0097 |
Symbol | |
ID | 7408459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 119197 |
End bp | 120537 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643714507 |
Product | Glucosylceramidase |
Protein accession | YP_002572030 |
Protein GI | 222528148 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5520] O-Glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACAAAA CAATTGCATG TTACGTAACA GCAAAAGATC AAGACATTCC GATGCAGCAA GTAGACAATA TAAAAGAAAG CACAAAAATA GAAAGTAGTA TTGTTATAAC AATTGAGCCA TCCACTACAT TTCAAGAGGT AATAGGTTTT GGTGGGGCAC TGACTGAAGC TGCTGCGGTA AATATACTGT CGCTTTTGCC ACACCAGCAA GAAGAGATTT TAAGAGGGTA CTTTGACCCA AAAGAGGGGC TTGGCTATAA GCTTTGTAGA ATCCACATGA ACAGCTGTGA TTTTTGTGTT GATAGCTACA GCTGTGATGA TGTTGAAGGT GACATAGAGC TAAAACACTT TAACATTGAA CGAGACAAAA AGATGGTAAT TCCTCTTTTA AAAAGGATAA AGGAGTATTG CAAAGACCTA AAAATTCTTG TTTCGCCATG GAGTCCGCCT GCATGGATGA AGACAAACGG TGATATGTGC CATGGTGGAA AGCTAAAGGA TGAGTATAAA AAAACATGGG CAAGATTTTT CTGCAAATTC ATAAAAGCAT ATAAAGAAGA AGGAATTGAG ATATGGGCTG TGACAGTTCA AAATGAGCCT ATGGCAACTC AAGTGTGGGA GTCGTGCATA TACACAGCTG AAGAAGAAAG AGATTTTGTG AAGTATTATT TGGGGCCGAC TCTTGCGGAA GAAGGACTGG GGGATGTAAA GATACTCATT TGGGATCACA ACAAAGACAT CATATATGAC AGGGTAAAAA CAATTTTGAG TGACAAAGAA GCTGCAAAAT TTGTATGGGG AGTTGCATTC CACTGGTATG GAGGAGACCA TTTTGACCAG CTCAAAAAAA TAAAAGAAGA ATTTCCTGAT GTCAATTTGG TGTTTACCGA AGGTTGTCAG GAAGGTGGAG TGAAGCTTGG TTCTTGGGAG CTTGGAGAAA GGTATGCTCA TGAGATAATT GGTGATTTTA ATAACTACAC AATTGGATTT ATGGATTGGA ATATTGTTCT TGACACAGTT GGAGGCCCCA ATCATGTAGG AAACTTTTGC GACGCTCCAA TAATAGTTGA TAAAGACCAG AAAAAGATTT ACTATCAAAA TGCATGTTAT TATATAGGGC ATTTTTCTAA ATTCATAAGG CCGGGAGCTA AAGTAGTCAA AAGTAGCTGT AGTAGTTCAA AACTTGAAGT TTTGGCAGCG AAGAGCCAGG ACGATACTTT AGCAGTGGTT GTGTTTAATA AAAACCCAGA GGAAATAGAG TTTAGTATTG TCATTGGAGA TAAAATATTC AGCGGAAAGT CTCCAGCAAG GTCTATATTG ACCATTGTTC TGGAAAAGTA A
|
Protein sequence | MHKTIACYVT AKDQDIPMQQ VDNIKESTKI ESSIVITIEP STTFQEVIGF GGALTEAAAV NILSLLPHQQ EEILRGYFDP KEGLGYKLCR IHMNSCDFCV DSYSCDDVEG DIELKHFNIE RDKKMVIPLL KRIKEYCKDL KILVSPWSPP AWMKTNGDMC HGGKLKDEYK KTWARFFCKF IKAYKEEGIE IWAVTVQNEP MATQVWESCI YTAEEERDFV KYYLGPTLAE EGLGDVKILI WDHNKDIIYD RVKTILSDKE AAKFVWGVAF HWYGGDHFDQ LKKIKEEFPD VNLVFTEGCQ EGGVKLGSWE LGERYAHEII GDFNNYTIGF MDWNIVLDTV GGPNHVGNFC DAPIIVDKDQ KKIYYQNACY YIGHFSKFIR PGAKVVKSSC SSSKLEVLAA KSQDDTLAVV VFNKNPEEIE FSIVIGDKIF SGKSPARSIL TIVLEK
|
| |