Gene Information |
Locus tag | Teth514_0266 |
Symbol | |
ID | 5876518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 281032 |
End bp | 282342 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641540607 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001661919 |
Protein GI | 167038934 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
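The coordinate and length fields in the table above can be checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values shown, but the record itself does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 281032
END_BP = 282342
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```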
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAAAA AAGATTTGAA AATTGCTGTA ATTGGAGGGG GTTCTAGTTA TACCCCCGAA CTTATTGAGG GCTTTATCAA GAGGTATAAT GAACTACCAG TTAAAGACTT ATATTTAGTA GATATAGAAG AAGGCCAAGA AAAACTTGAG ATTGTTGGCG GTCTTGCAAA AAGAATGGTA GAAAAAGCTG GTGTAGGTAT AAACATTCAT TTGACTTTGG ACAGGCGAAA AGCCATAAAA GATGCAGATT TTGTTGTTAC CCAGTTTAGA GTAGGGCTGA TAGATGCAAG AATTAGAGAT GAAAAAATTC CTCTAAAGTA TGATGTTATA GGGCAAGAAA CAACTGGACC CGGTGGTTTT GCAAAAGCAC AGAGAACAAT ACCTGTTATT TTAGACATAT GTAAAGACGT AGAGGAACTT GCTCCAAATG CATGGCTTAT TAATTTTACA AATCCTTCAG GAGTGATAAC AGAAACGATT TTAAAGCATA CAAATGTAAA AGCGATCGGA TTATGTAATG TACCTATAGG TATGGTATAC GGTGTTGCAG AAGTACTTGG TGTTGATCCA AAAAGAGTGT ATATAGATTT TACAGGGCTT AATCATTTAG TATGGGGTAC TCATATTTAC TTAGATGGCG AAGATATAAC CGAAAAACTA ATAGACAGTT TCGCAGGTGG TAAATCTTTA TCAATGAAAA ATATACCTGA GTTGCCATGG GAACCTGAAT TTATAAAATC TCTTGGTATG TATCCTTGTC CATACCACAG ATACTATTAT TTAACAGATA AAATGCTTGA AGGACAGAAA AAAGAAGCTG CTACAGTAGG CACAAGAGGA GAAGTCGTTA AAAAGGTAGA GCAAGAATTA TTTGAATTAT ATAAAGACCC AAATTTGAAT ATAAAACCGC CGCAATTAGA AAAAAGGGGA GGAGCTCATT ATTCTGATGC TGCTTGCTCC CTGATAAGTT CAATATATAA TGACAAAAAA GACATACATG TGGTCAATGT GAGAAACAAT GGTACAATCG CAGATTTGCC AGATGATGTG GTGATAGAAA CAAATGCAAT AATAGATAGA AATGGGGCTC ATCCGATAAA TATTGGACAT GTGCCAGCGA AAATAAGGGG TTTAATGCAA GCAGTAAAAG CCTATGAAGA ACTTACTATA GAAGCAGGGG TAAAGGGGAA CTATTATACA GCTTTACAGG CGTTGACAAT TCATCCATTA GTACCTTCTG CGACTGTTGC TAAAAAAATT CTTGATGATA TACTTGAGCA AAATAAAGAG TATTTGCCAC AGTATAAATA G
|
Protein sequence | MSKKDLKIAV IGGGSSYTPE LIEGFIKRYN ELPVKDLYLV DIEEGQEKLE IVGGLAKRMV EKAGVGINIH LTLDRRKAIK DADFVVTQFR VGLIDARIRD EKIPLKYDVI GQETTGPGGF AKAQRTIPVI LDICKDVEEL APNAWLINFT NPSGVITETI LKHTNVKAIG LCNVPIGMVY GVAEVLGVDP KRVYIDFTGL NHLVWGTHIY LDGEDITEKL IDSFAGGKSL SMKNIPELPW EPEFIKSLGM YPCPYHRYYY LTDKMLEGQK KEAATVGTRG EVVKKVEQEL FELYKDPNLN IKPPQLEKRG GAHYSDAACS LISSIYNDKK DIHVVNVRNN GTIADLPDDV VIETNAIIDR NGAHPINIGH VPAKIRGLMQ AVKAYEELTI EAGVKGNYYT ALQALTIHPL VPSATVAKKI LDDILEQNKE YLPQYK
|
| |
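The GC content field could be recomputed from the gene sequence with a helper along the following lines. This is a minimal sketch, assuming the sequence is supplied as plain text in the 10-base blocks used above. `gc_percent` is a hypothetical name, and only the first three blocks of the sequence are used here for brevity, so the printed figure is not the whole-gene value.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First three 10-base blocks of the gene sequence, as displayed in the record.
prefix = "ATGTCTAAAA AAGATTTGAA AATTGCTGTA"
n_bases = len("".join(prefix.split()))
print(f"{gc_percent(prefix):.1f}% GC over the first {n_bases} bases")
```

Passing the full gene sequence from the Sequence table in place of `prefix` would give a value to compare with the reported 36%.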