Gene Information |
Locus tag | Athe_0458 |
Symbol | |
ID | 7407536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 520269 |
End bp | 521627 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643714846 |
Product | beta-galactosidase |
Protein accession | YP_002572363 |
Protein GI | 222528481 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
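
The coordinate and length fields above are mutually consistent if the start and end positions are read as 1-based and inclusive, and if the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, not something the record states. A minimal sketch of that check, with illustrative variable names that do not come from the database:

```python
# Values copied from the Gene Information fields above.
start_bp = 520269
end_bp = 521627

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1359
print(protein_length)   # 452
```

The two printed values match the "Gene Length" and "Protein Length" fields.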
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000753308 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTAC CAAAAGGATT TCTGTGGGGT GCTGCAACTG CATCATATCA GATTGAGGGT GCTTGGAATG AAGATGGAAA AGGTGAATCT ATATGGGACA GGTTTACACA TCAAAAAGGA AATATTTTAT ATGGTCATAA TGGCGACGTT GCCTGTGACC ACTATCATAG GTTCGAAGAA GATGTCTCTC TTATGAAAGA ACTTGGACTA AAAGCCTACA GGTTTTCTAT TGCATGGGCG AGAATTTTTC CAGATGGTTT CGGTACTGTG AATCAAAAAG GTCTTGAGTT TTATGATAGA CTCATCAACA AGCTTGTTGA AAACGGTATT GAACCGGTTG TCACCATTTA TCACTGGGAT CTTCCTCAAA AGCTACAAGA CATTGGCGGT TGGGCAAACC CAGAAATTGT AAATTATTAT TTTGAATATG CAATGCTTAT CGTAAACCGT TATAAAGACA AAGTAAAAAA ATGGATAACA TTTAATGAAC CTTATTGTAT TGCCTTTTTG GGACACTTTT ATGGAGTTCA TGCACCAGGA ATAAAAGACT TTAAAGTTGC AATGGATGTT GTGCACAACA TTATGCTTTC TCATTTTAAG GTTGTAAAAG CTGTAAAGGA AAACAATATT GATGTTGAGG TAGGAATTAC ACTAAATTTA ACTCCAGTTT ACTTTCAAAC AGAGCGTCTT GGATATAAGG TAAGCGAAAT TGAAAGAGAA ATGGTAAACC TCAGCAGCCA GCTTGACAAT GAACTTTTCC TTGATCCAGT ACTCAAAGGA AGCTATCCAC AAAAGCTGTT TGATTACCTT GTTCAAAAAG ATTTGTTGGA AACTCAAAAA GTATTGAGTA TGCAGCAGGA AGTAAAAGAA AATTTCGTTT TTCCTGATTT TCTTGGTATC AACTACTATA CACGTGCTGT CAGGCTTTAC GATGAAAATT CTAACTGGAT ATTTCCAATA AGATGGGAAC ATCCTGCAGG AGAGTACACC GAGATGGGCT GGGAAGTGTT CCCACAAGGA CTTTATGATC TTTTGATTTG GATTAAAGAA AGTTACCCAC AAATTCCAAT TTATATAACA GAAAACGGTG CTGCTTATAA CGACAAGGTA GAAGATGGAA GAGTTCATGA CCAAAAGAGA GTGGAGTATT TAAAACAGCA CTTTGAAGCA GCAAGAAAGG CAATTGAAAA TGGAGTGGAT TTGCGAGGTT ATTTTGTGTG GTCTTTGTTG GACAATCTTG AATGGGCAAT GGGTTATACA AAAAGGTTTG GAGTTATATA TGTGGACTAT GAAACCCAAA AAAGGATTAA AAAAGACAGC TTCTATTTTT ATCAGCAGTA TATAAAGGAA AACTCATAA
|
Protein sequence | MSLPKGFLWG AATASYQIEG AWNEDGKGES IWDRFTHQKG NILYGHNGDV ACDHYHRFEE DVSLMKELGL KAYRFSIAWA RIFPDGFGTV NQKGLEFYDR LINKLVENGI EPVVTIYHWD LPQKLQDIGG WANPEIVNYY FEYAMLIVNR YKDKVKKWIT FNEPYCIAFL GHFYGVHAPG IKDFKVAMDV VHNIMLSHFK VVKAVKENNI DVEVGITLNL TPVYFQTERL GYKVSEIERE MVNLSSQLDN ELFLDPVLKG SYPQKLFDYL VQKDLLETQK VLSMQQEVKE NFVFPDFLGI NYYTRAVRLY DENSNWIFPI RWEHPAGEYT EMGWEVFPQG LYDLLIWIKE SYPQIPIYIT ENGAAYNDKV EDGRVHDQKR VEYLKQHFEA ARKAIENGVD LRGYFVWSLL DNLEWAMGYT KRFGVIYVDY ETQKRIKKDS FYFYQQYIKE NS
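
The sequences above are printed in blocks of ten characters, so the spaces have to be stripped before any further processing. The sketch below shows one way to do that and to translate the start of the gene sequence. It assumes the listed gene sequence is already given in coding orientation (the record lists the strand as "-", but the sequence begins with ATG). The codon table uses the standard amino-acid assignments, which NCBI translation table 11 shares. The function names `clean` and `translate` are illustrative.

```python
# Standard amino-acid assignments, with codons ordered by base in T, C, A, G.
# Translation table 11 uses these same assignments; it differs from the
# standard table only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGAGTTTAC CAAAAGGATT TCTGTGGGGT"
print(translate(prefix))  # MSLPKGFLWG
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` in the same way would allow a comparison against the whole protein entry.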