Gene Athe_0468 details

Gene Information

Locus tag: Athe_0468
Symbol:
ID: 7407547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 538102
End bp: 540291
Gene Length: 2190 bp
Protein Length: 729 aa
Translation table: 11
GC content: 39%
IMG OID: 643714856
Product: glycoside hydrolase clan GH-D
Protein accession: YP_002572373
Protein GI: 222528491
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
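
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts one untranslated stop codon. Both are assumptions, although they agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 538102
end_bp = 540291
listed_gene_length = 2190      # bp
listed_protein_length = 729    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2190 0 729
```

Under these assumptions the span between start and end matches the listed gene length, the length is a whole number of codons, and the codon count minus the stop codon matches the listed protein length.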


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000274987
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAATAA CCTTTAACCC ACAAACAAAT ATGTTTTTCA TAGAAGCAAA GAACACAAGC 
TATGTGATAA AGCTTTTCAA AGGAAAGTTT TTGTCCCATG TTTATTGGGG GAAAAAAATT
AAAGAATTTG AGTGGACAGA TTTTGATGTG ACAGGAGGAA GAGCATTTGG TGCAACACCT
GACCCAAATG ACAAAACATA CTCATTTGAT ACAATGCTTT TAGAATACCC TGCATATGGA
AATTCAGATT TCAGACACCC TGCATATCAG ATAGAACAGG AAGACGGCTC TCGCATTACA
AACTTAGTTT ACAAAACTCA CAGAATCTAT GATGGAAAGC CCAAACTTGA AGGTCTTCCA
ACAACATATG TTGAGTCACC TGATGAAGCC CAGACACTGG AGATAGAGCT TTATGATGAT
TTGATTGATT TGAAAGTCAC ATTGATTTAT ACAGCCTACA AAGATTATGA TGCAATAACA
AGAAGCGTAA GGTTTGAGAA CTTAGGAAAA CAAACTCTCA AAATCCTTCG TGCAATGAGC
GCGTGTGTTG ACTTTCCAGA AGGGGATTTT GAACTTTTGC ATCTTTGGGG TTCATGGGCA
AGAGAAAGAT ACATCGAGAG AACTCCACTT ATTCACGGAA CCCAGGTAAT TGAAAGTGCA
AGAGGCGAAA GCTCACATCA GCACAACCCA TTTATAGCAC TTTTGTCAAA GGATGCAACC
GAAAAACATG GCGATGTGTA TGGCTTTTCT CTTGTCTATA GTGGAAACTT TGCTGCAATT
GTGGAAAAAG ACCAGTACAA TCTTGTAAGA GTCACTATGG GAATAAATCC ATTTGAGTTT
ACATGGGTTT TAGAGCCGCA AAGCAGTTTT CAGACACCAG AGGTTGTGAT GGTTTACTCT
AATGAGGGCT TAGGAGGAAT GTCTCGCACA TACCACAAGC TTTACAGAAA AAGACTTTGC
AGAGGAGCAT ATCGGGATAA AAGAAGACCA ATTCTGATTA ACAACTGGGA GGCTACATAT
TTCAATTTCA ATGAAGAAAA ACTTCTTTCT TTGGCAAAAG AGGCAAAAGA TCTTGGGATT
GAGCTGTTTG TTTTAGATGA TGGTTGGTTT GGTAAAAGAG ACGATGATAC AAGCTCACTT
GGAGACTGGT TTGTTGACAG AAGAAAGCTT CCAAACGGTT TGGACGGGCT TGGGAAAAAG
TTAAATGAAA TGGGGCTCAA ATTTGGACTG TGGTTTGAGC CTGAGATGGT TTCGCCTGAT
AGCGAACTTT ACAGAAAGCA TCCTGATTGG TGCATACAGG TACGAGGAAG AACGTTGACA
CAATGCAGAA ACCAGTACGT TTTGGACATC ACAAGAGAAG ATGTTAGAAA AGAAATTTTA
AGGATGATGA AAGAGATTCT AAAAGCAGCT CCAATTGAAT ATATCAAGTG GGACATGAAC
AGGCCCTTAA CAGAGATAGG TTCGCTTGAG CTCCCACCAG AGAGACAAAA AGAGGTCTTC
CACAGATATG TTCTGGGACT TTATCAAATG ATGGAAGAGC TGACAATGGA GTTTCCACAT
ATTTTGTTTG AAGGATGTTC TGGCGGTGGT GGAAGGTTTG ATCCGGGAAT TTTGTATTAC
ATGCCTCAAA TTTGGACGAG TGATGACACA GACGCAATCG AAAGGCTTAA AATCCAGTTT
GGAACAAGCA TAGTTTATCC TGCATCAACT ATGGGTGCGC ATGTATCAAT TGTGCCAAAC
CATCAGGTTG GCAGGATAAC ACCAATGAAG ACAAGAGGGG TTGTAGCGCT TTCAGGCTGT
TTTGGATATG AACTTGATTT AACAAAGCTA TCTCAAGAGG ACAAAGAAGA GATTAAGAGA
CAAATTGAGC TTTATAAGAG AATATGGCAT ATAGTATTTG AAGGAGATTT GTACAGATTA
ATTTCTCCAT TTGAGGGAAA TAGCGCTGCA TGGATGTATG TGACAGAGGA TAAGAAAGAG
GCAGTTGTAT TCTATGTTGA AATTTTAAGG CAGCCAAACC CACCAATCAA AAGGTTAAAA
TTAGATGGTC TTGACCCCAG CAAGAGCTAT TTAATTGAAG GTGAGCAAAA AACAAGGTTT
GGCGATGAGC TTATGAACAT AGGGCTTATG ATTCCTCAGA TGTGGGGTGA TTTTAATTCT
CATATGTGGA TTTTAAAAGC AGTTGATTAG
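
The gene sequence above is printed in blocks of ten bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to recompute a GC percentage for comparison with the listed GC content. The helper names `clean_sequence` and `gc_content` are hypothetical and not part of this record or of any library, and only the first line of the sequence is pasted in for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and uppercase the bases (hypothetical helper)."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq (hypothetical helper)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed; paste the full block in practice.
raw = "ATGCCAATAA CCTTTAACCC ACAAACAAAT ATGTTTTTCA TAGAAGCAAA GAACACAAGC"
seq = clean_sequence(raw)
print(len(seq), round(100 * gc_content(seq)))
```

Run on the complete sequence, the cleaned length can be compared with the listed gene length and the rounded percentage with the listed GC content.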
 
Protein sequence
MPITFNPQTN MFFIEAKNTS YVIKLFKGKF LSHVYWGKKI KEFEWTDFDV TGGRAFGATP 
DPNDKTYSFD TMLLEYPAYG NSDFRHPAYQ IEQEDGSRIT NLVYKTHRIY DGKPKLEGLP
TTYVESPDEA QTLEIELYDD LIDLKVTLIY TAYKDYDAIT RSVRFENLGK QTLKILRAMS
ACVDFPEGDF ELLHLWGSWA RERYIERTPL IHGTQVIESA RGESSHQHNP FIALLSKDAT
EKHGDVYGFS LVYSGNFAAI VEKDQYNLVR VTMGINPFEF TWVLEPQSSF QTPEVVMVYS
NEGLGGMSRT YHKLYRKRLC RGAYRDKRRP ILINNWEATY FNFNEEKLLS LAKEAKDLGI
ELFVLDDGWF GKRDDDTSSL GDWFVDRRKL PNGLDGLGKK LNEMGLKFGL WFEPEMVSPD
SELYRKHPDW CIQVRGRTLT QCRNQYVLDI TREDVRKEIL RMMKEILKAA PIEYIKWDMN
RPLTEIGSLE LPPERQKEVF HRYVLGLYQM MEELTMEFPH ILFEGCSGGG GRFDPGILYY
MPQIWTSDDT DAIERLKIQF GTSIVYPAST MGAHVSIVPN HQVGRITPMK TRGVVALSGC
FGYELDLTKL SQEDKEEIKR QIELYKRIWH IVFEGDLYRL ISPFEGNSAA WMYVTEDKKE
AVVFYVEILR QPNPPIKRLK LDGLDPSKSY LIEGEQKTRF GDELMNIGLM IPQMWGDFNS
HMWILKAVD
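
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below assumes the listed gene sequence is already in coding orientation and uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the ten-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence shown above.
print(translate("ATGCCAATAA CCTTTAACCC A"))  # MPITFNP
```

The seven residues printed match the start of the protein sequence listed above. Passing the full gene sequence to `translate` gives a string that can be compared with the listed protein once the spaces are removed from both.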