Gene Athe_0847 details


Gene Information

Locus tag: Athe_0847
Symbol:
ID: 7407422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 939000
End bp: 940670
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 36%
IMG OID: 643715225
Product: extracellular solute-binding protein family 1
Protein accession: YP_002572735
Protein GI: 222528853
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
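
The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(939000, 940670)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the Gene Length and Protein Length fields.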


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAAAA AATTAAAAGT ATTTGCTTGG TTTATTTGTT TTGTGTTCAT TTTTTCAACT 
CTTATTACAT TCCCAAGTTT AAAATCTGAT TTTGTGAAAG CGGCTTCAAG CAATCAACCC
GTAAAGACAT TGACATTTTT CTATGGAGAT TCAAATGCAG ATCCTCATCC AGACCTGTTT
AGCACTCCTA TTGGTAAAGA AATTACAAAA CTTACAGGTG TAAAACTCAA AATCGAATAC
TTAGCAGGAC AGGATGAAGC AACAAAAATT GGTCTTATGT TAGCATCTGG TGATTTACCA
GATTTGATTC ATGGTCATCA GGAGCATGGA AAGTTAATAG AAGCTGGTGT ATTGGTACCA
CTTGATAACT ATATTCAAAA ATACGGGAAA TATTGTAAAC AGATTTACAC TGATAAAGAC
CTCAAAAGAC TCAGACAGAA AGATGGAAAA ATTTATTTCT TGTCTCCTTA CAGAAACGAA
ATAACTCCAG ACTTAAAACC AGATGGCTTC TGGCTACCAA TTGATCTTCT TGAAAAAGCA
AAATGGCCGA AGGTAAGGTA TTGGGAGGAC TATCAGCAGC TCATTAGAGA TTATGTAAAG
AAAAATCCTA CTATTGAGGG GAAACCAACC ATTGGATTTA CATTTATTAC AGAAAGTTGG
AGATTCTTCA CTTTAGAAAA TCCGCCTTCA TATCTTATGG GATATCAAAA TGATGGTGAT
GTAATTGTTG ACCCGAAGAC ATATGAAGCA AAAGTTTATT CTACAATGGG AGGGTCAAAG
AGATACTATA AAGATTTGAA TAAGATGTGG AAAGAGGGAC TTATTGACAA AGAAGTGTTC
GTTCAAAACT ACGACACATA TCTTTCTAAG ATTGCTCAGG GTAGAGTTGT AGGATTCTAT
GATCAGTGGT GGCAATTTGG ATATGATGCA GAGGCTTCAC TGAAGAATGC AAAGAAATAC
AATAGAATGC ATATTTCATT CCCAGTTGTT TACAAAGGTG TTCAAAGAGC AAGATATCTT
ATGATTCAGC CGATTGGCGC AAGAGATGGT ATTAGCATAA CTAAGAAGTG TAAAGATCCT
GTTACAGCAT TTAAGTTCTT GGACAAACTG TGCTCTTTAG AAGCTCAAAA ACTTATGTAT
TGGGGAATAA AAGGTGTTGA TTACAGTGTA GACAAAAACG GAAAAATGTA CCTAACAGAT
AAACAGAAAA AACAGAGAGA AGACCCTGTT TACAGGAAGA AACAAGGTCT TGGATACTGG
TGGGTATTTC CACATGCATA TTTGAAGCTG CAGGATGGAA ATTACAGAGA GCCCGGATTT
GACCCAGAGT ATGTATACAA GAACTTCTCA CCAGCTGAAA AGAAAGTTCT CGATGCATAT
AAAGCTAAAT ACTTCATGCA ACCTCCATTT ACAGATCCTC CACTTGAAAC ACCTTATGGA
TTTGCATGGG AAATCAATAT TCCTGCTGAT AAGCCTCAAG TTACAATTGC TCAACAGAAG
ATGAGCGAAG TGAGAAGAAA ATATCTACCA CAACTTGTAA TGGCAAAGAC AGATGCAGAT
TTTGATAGAA TATGGAAAGA ATTTGTTCAG GCATTTGAAA AAACAAATTA CAAAGTTTAT
GAGCAGTTCA AGACAGAAAT GATTCGCTGG AGAGTAAAGA ATTGGAACTA A
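
The gene sequence is listed in space-separated blocks of 10 bases, so it has to be joined before any calculation. The following is a minimal sketch of that clean-up and of a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above, used here as sample input only.
first_line = "ATGTCAAAAA AATTAAAAGT ATTTGCTTGG TTTATTTGTT TTGTGTTCAT TTTTTCAACT"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 3))
```

Passing the full listing to the same two functions gives a value that can be compared with the GC content field of the record.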
 
Protein sequence
MSKKLKVFAW FICFVFIFST LITFPSLKSD FVKAASSNQP VKTLTFFYGD SNADPHPDLF 
STPIGKEITK LTGVKLKIEY LAGQDEATKI GLMLASGDLP DLIHGHQEHG KLIEAGVLVP
LDNYIQKYGK YCKQIYTDKD LKRLRQKDGK IYFLSPYRNE ITPDLKPDGF WLPIDLLEKA
KWPKVRYWED YQQLIRDYVK KNPTIEGKPT IGFTFITESW RFFTLENPPS YLMGYQNDGD
VIVDPKTYEA KVYSTMGGSK RYYKDLNKMW KEGLIDKEVF VQNYDTYLSK IAQGRVVGFY
DQWWQFGYDA EASLKNAKKY NRMHISFPVV YKGVQRARYL MIQPIGARDG ISITKKCKDP
VTAFKFLDKL CSLEAQKLMY WGIKGVDYSV DKNGKMYLTD KQKKQREDPV YRKKQGLGYW
WVFPHAYLKL QDGNYREPGF DPEYVYKNFS PAEKKVLDAY KAKYFMQPPF TDPPLETPYG
FAWEINIPAD KPQVTIAQQK MSEVRRKYLP QLVMAKTDAD FDRIWKEFVQ AFEKTNYKVY
EQFKTEMIRW RVKNWN
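
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in the permitted start codons, not in the amino acid assignments). The names `CODON_TABLE` and `translate` are hypothetical, and only the first line of each listing is used.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA string in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene and protein listings, with the block spacing removed.
gene_start = "".join(
    "ATGTCAAAAA AATTAAAAGT ATTTGCTTGG TTTATTTGTT TTGTGTTCAT TTTTTCAACT".split()
)
protein_start = "".join("MSKKLKVFAW FICFVFIFST".split())
print(translate(gene_start) == protein_start)
```

The first 60 bases translate to the first 20 residues of the listed protein. The same function can be applied to the full gene sequence once all lines are joined.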