Gene Athe_0458 details


Gene Information

Locus tag: Athe_0458
Symbol:
ID: 7407536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 520269
End bp: 521627
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 36%
IMG OID: 643714846
Product: beta-galactosidase
Protein accession: YP_002572363
Protein GI: 222528481
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
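
The coordinate and length fields in this record agree with each other, which can be checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; both are assumptions inferred from the values, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(520269, 521627)
print(length)                     # 1359
print(protein_length_aa(length))  # 452
```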


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000753308
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTAC CAAAAGGATT TCTGTGGGGT GCTGCAACTG CATCATATCA GATTGAGGGT 
GCTTGGAATG AAGATGGAAA AGGTGAATCT ATATGGGACA GGTTTACACA TCAAAAAGGA
AATATTTTAT ATGGTCATAA TGGCGACGTT GCCTGTGACC ACTATCATAG GTTCGAAGAA
GATGTCTCTC TTATGAAAGA ACTTGGACTA AAAGCCTACA GGTTTTCTAT TGCATGGGCG
AGAATTTTTC CAGATGGTTT CGGTACTGTG AATCAAAAAG GTCTTGAGTT TTATGATAGA
CTCATCAACA AGCTTGTTGA AAACGGTATT GAACCGGTTG TCACCATTTA TCACTGGGAT
CTTCCTCAAA AGCTACAAGA CATTGGCGGT TGGGCAAACC CAGAAATTGT AAATTATTAT
TTTGAATATG CAATGCTTAT CGTAAACCGT TATAAAGACA AAGTAAAAAA ATGGATAACA
TTTAATGAAC CTTATTGTAT TGCCTTTTTG GGACACTTTT ATGGAGTTCA TGCACCAGGA
ATAAAAGACT TTAAAGTTGC AATGGATGTT GTGCACAACA TTATGCTTTC TCATTTTAAG
GTTGTAAAAG CTGTAAAGGA AAACAATATT GATGTTGAGG TAGGAATTAC ACTAAATTTA
ACTCCAGTTT ACTTTCAAAC AGAGCGTCTT GGATATAAGG TAAGCGAAAT TGAAAGAGAA
ATGGTAAACC TCAGCAGCCA GCTTGACAAT GAACTTTTCC TTGATCCAGT ACTCAAAGGA
AGCTATCCAC AAAAGCTGTT TGATTACCTT GTTCAAAAAG ATTTGTTGGA AACTCAAAAA
GTATTGAGTA TGCAGCAGGA AGTAAAAGAA AATTTCGTTT TTCCTGATTT TCTTGGTATC
AACTACTATA CACGTGCTGT CAGGCTTTAC GATGAAAATT CTAACTGGAT ATTTCCAATA
AGATGGGAAC ATCCTGCAGG AGAGTACACC GAGATGGGCT GGGAAGTGTT CCCACAAGGA
CTTTATGATC TTTTGATTTG GATTAAAGAA AGTTACCCAC AAATTCCAAT TTATATAACA
GAAAACGGTG CTGCTTATAA CGACAAGGTA GAAGATGGAA GAGTTCATGA CCAAAAGAGA
GTGGAGTATT TAAAACAGCA CTTTGAAGCA GCAAGAAAGG CAATTGAAAA TGGAGTGGAT
TTGCGAGGTT ATTTTGTGTG GTCTTTGTTG GACAATCTTG AATGGGCAAT GGGTTATACA
AAAAGGTTTG GAGTTATATA TGTGGACTAT GAAACCCAAA AAAGGATTAA AAAAGACAGC
TTCTATTTTT ATCAGCAGTA TATAAAGGAA AACTCATAA
 
Protein sequence
MSLPKGFLWG AATASYQIEG AWNEDGKGES IWDRFTHQKG NILYGHNGDV ACDHYHRFEE 
DVSLMKELGL KAYRFSIAWA RIFPDGFGTV NQKGLEFYDR LINKLVENGI EPVVTIYHWD
LPQKLQDIGG WANPEIVNYY FEYAMLIVNR YKDKVKKWIT FNEPYCIAFL GHFYGVHAPG
IKDFKVAMDV VHNIMLSHFK VVKAVKENNI DVEVGITLNL TPVYFQTERL GYKVSEIERE
MVNLSSQLDN ELFLDPVLKG SYPQKLFDYL VQKDLLETQK VLSMQQEVKE NFVFPDFLGI
NYYTRAVRLY DENSNWIFPI RWEHPAGEYT EMGWEVFPQG LYDLLIWIKE SYPQIPIYIT
ENGAAYNDKV EDGRVHDQKR VEYLKQHFEA ARKAIENGVD LRGYFVWSLL DNLEWAMGYT
KRFGVIYVDY ETQKRIKKDS FYFYQQYIKE NS
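
The sequences above are printed in space-separated groups of ten, so they need cleaning before reuse. The following is a minimal sketch, not the database's own tooling, that strips the grouping, computes GC content, and translates a CDS. It assumes the standard codon assignments, which translation table 11 shares for elongation codons; the helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI TCAG order (also used by table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three groups of the gene sequence shown above.
head = clean("ATGAGTTTAC CAAAAGGATT TCTGTGGGGT")
print(translate(head))  # MSLPKGFLWG
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence followed by a trailing `*` for the stop codon, and `gc_percent` can be compared against the GC content reported in the record.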