Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0227 |
Symbol | |
ID | 7407218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 272944 |
End bp | 275385 |
Gene Length | 2442 bp |
Protein Length | 813 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643714627 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_002572150 |
Protein GI | 222528268 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGAATAA ATCTTGACGG AAAATGGAAG TTCAGAGAAG TTGGTCAGAA TGAGTACTAC GAGGCAAGTG TGCCAGGATG TGTCCAGCTT GATTTGATTA ACCTTGGAAA GCTTCCCGAC CCTTTTTATG CCACAAACGA GGTTTTGTTT TATGACTTAG AAGAAAAAGA TTTTGAGTAT GTAAAAGAGT TTGATGTTGA TAATGTCGAT TTTCAGGTCA AAAAGCTTGT GTTCGAAGGA ATTGATACGG TATCTGAGAT TTATTTAAAT GACCATTATT TGGGTAAGAC CGACAACATG TTTTTGAAGT ATGAATTTGA TGTGAGTCTT GCACTTAAAA AAGGGAAAAA CATTTTAAAG GTGGTCCTTC TTTCACCAAT AAAAGAAGCA GAAAGGCTCA AAAGCATTTA CCAGTCAAGC TACAGCTATC CGCAAAGAAG CTGGATAAGA AAGTCGCAGT ATTCATACGG CTGGGACTGG GGGCCAAGAA TTCTCCAGAT AGGAATCTGG AAGAGCGTGT ATTTAGAGCT TCACAATGGG CTTGAAATTC AAGATGAGTT TGTAAAAGTG GAAAGTATCT CAGATGAGCT TGCCATTGTA AGAGTGTTTG CAAAGATAAA CTGTTTTGAA AAACCATCAG AAGTAGAGAT AGAGTTATTT GATGGTAGCT TTTCTGTGAA AGTTTTTCCA GAGGTTTATA AATCAAAAGA TGGGTATTTT ATTGATGAGA GGATTGAGAT AGAAAACCCC AAACTTTGGT GGCCAAACGG GTATGGGGAG CCTTCTTTAT ATGAGTTTAA AATAACTGCA AAGACTTCAA ATGAAGCTCA AGAAAAGAAG GTCACAACAG GCCTCAGGAC TGTAAGAGTA ATAAAGGAAA AGGATGAATA TGGCGAAAGT TTTATCTTTG AGATAAATGG CAAGAAGATT TTTGCAAAAG GAGCAAACTG GATACCTGCA GATTCCATCC TGCCAAGGCT GAAAGAAGAT GACTATAAAG AATTAATTAA AATGGCAAAA GATGCGAACA TGAACATGCT CAGGGTCTGG GGCGGCGGTA TTTATGAGTA TGACTGGTTT TACGACGAAT GTGACAAAAA CGGTATAATG GTGTGGCAGG ATTTCATGTT TGCATGCGCA ATCTACCCTG ATGAGTTTGA CTTTTTCGTT GAGAATTTCA TAAAGGAGGC AGAGTACCAG ATAAAGAGGC TCAGAAACCA TCCGTGCATT GTGCTGTGGT GTGGGAATAA CGAAAACAAC TGGGGCTTTA GAGATTGGTG GCACATCGGC GATCCAGAGT TTTTGGGCAA CAGGATATAC AAAAAGGTGC TTCCTGAAAT TCTGGCAAAA CTTGACCCAA CAAGACCGTA CCACATCTCA AGCCCGTATG GAGGTGAGCA TCCAAACAGC GAAAAAGCTG GCGACAAACA TACATGGGAT ATCTGGGCTG GCTGGAAAGA CTACATCTAT TATAAGCACG ACAATGCAAG GTTTGTCTCT GAGTTTGGTT TTCAGGCGGC AGCACACCTT GATACAATGA AAAAGTACAT TCCTCTTAAA GACCAAACAA TCTTCTCAAA AACTTTGAGA ATGCACGAAA AGCAGGAAGA AGGTTTGGAA AGGCTTATAA GGTATATGGC AGGCTCAATT GGTCTACCAA AAGACTTTGA CTCTTTTGTG TATTTGTCAC AGTTTGTTCA AAAAGAGGCT ATTAAGCTTG CGGTTGAGCA TTACAGGAAG AATAAGTTCG CAACAGCAGG GGCTCTTTAC TGGCAGCTTA ATGACTGCTG GCCTGTCATC TCATGGTCGT CAATTGACTA TCTGAAAAGA AGAAAAGCAC TTTACTATGA GTCAAAAAGG ATATTTGCAA AGTTTTTGCC AGTGGTTGAG TATGAGGATG GAAAACTAAA GGTATATATT GTTAGTGATG AGCTAGAGCC AAAACAAGGT AAGCTCAATA TTACAATCTG GAACTTTGAT GGGCAAAAGT TATACGAGAA AAACCTAACT GTGGAAATTC CTGAAAATAG TGTGGTTGAG GCATTTTCTG AAAAAGTAGA AAACTTGAAT ATTTTAAAGG GCGAGTGGTT GTATATACCC AAACATGTTG AAACGGCTGT AATTGGGGAT AAGATAGACA GAGGACTTTT GGAAAGCATA GTTTTTGTGA GCCTTTTTGT CGATAGAGTA GAGTACGAAA ATTACTTTGT ATTTGAAAAG CCAATAAACC TTGAACTAAA ACCCAGTCAG TTTGAGTACA GAATAGAAGA TGACCATATT ATAATAAAAC CCAAAACTCC TGCAATTTGC CTTATAATTG AAGCTGACAG GGATGTAGAG AACAACTTCA TTTTTGCGAG ACCTGAAAAA GAGTATAAGA TTAATCTAAA TGGAGGACAG GTCAGAAAGG TTTGTGATTT GTTAGATTTG ATTGAGCGAT GA
|
Protein sequence | MRINLDGKWK FREVGQNEYY EASVPGCVQL DLINLGKLPD PFYATNEVLF YDLEEKDFEY VKEFDVDNVD FQVKKLVFEG IDTVSEIYLN DHYLGKTDNM FLKYEFDVSL ALKKGKNILK VVLLSPIKEA ERLKSIYQSS YSYPQRSWIR KSQYSYGWDW GPRILQIGIW KSVYLELHNG LEIQDEFVKV ESISDELAIV RVFAKINCFE KPSEVEIELF DGSFSVKVFP EVYKSKDGYF IDERIEIENP KLWWPNGYGE PSLYEFKITA KTSNEAQEKK VTTGLRTVRV IKEKDEYGES FIFEINGKKI FAKGANWIPA DSILPRLKED DYKELIKMAK DANMNMLRVW GGGIYEYDWF YDECDKNGIM VWQDFMFACA IYPDEFDFFV ENFIKEAEYQ IKRLRNHPCI VLWCGNNENN WGFRDWWHIG DPEFLGNRIY KKVLPEILAK LDPTRPYHIS SPYGGEHPNS EKAGDKHTWD IWAGWKDYIY YKHDNARFVS EFGFQAAAHL DTMKKYIPLK DQTIFSKTLR MHEKQEEGLE RLIRYMAGSI GLPKDFDSFV YLSQFVQKEA IKLAVEHYRK NKFATAGALY WQLNDCWPVI SWSSIDYLKR RKALYYESKR IFAKFLPVVE YEDGKLKVYI VSDELEPKQG KLNITIWNFD GQKLYEKNLT VEIPENSVVE AFSEKVENLN ILKGEWLYIP KHVETAVIGD KIDRGLLESI VFVSLFVDRV EYENYFVFEK PINLELKPSQ FEYRIEDDHI IIKPKTPAIC LIIEADRDVE NNFIFARPEK EYKINLNGGQ VRKVCDLLDL IER
|
| |