Gene Athe_0181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0181 
Symbol 
ID7407172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp223549 
End bp225336 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content38% 
IMG OID643714583 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002572106 
Protein GI222528224 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000497192 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAGAC CAAAGATTGT TAAAAGAATT ATTTCGGTTT TAGTAGCTAT TTGTATGCTT 
GCTTCAGTTG CTTTGATTGT GGGCAAGTCT CCACAAAGAG TGCAAGCGTC ATCAAAACTT
GGTGACATTT CATTCTTAAG ACCTGGTTTT TCTAAAGAGA GTTTAAAAAG TACTGACATT
TTCAACAAAG CAGTTGCAAA GGCAATTGAA GACTATCAGA AAAAATATGG TGGAAAAGTA
AACATAGTAT ATTCTGACTG GAACAACTGG CAAACAAAGA TAATTGCAAG GATGGCTGCA
GGTGATCCAA TCGATGTTAT TTTTGGAGGA ACCGGGACAT TTCCGGCATT TTACAACAGA
GGGCTTGTAC AACCACTTGA TAAGTATGTT GATTTGAAAG CTCCATATAT AAACAAAAGA
GCAATGGACT ATGCATTCAA GTACAATGGT CACTACTATT TAGCAAGCCA GAAAGGTTCG
AATGTTCCAT GGCTGGTTAT ATATAACAAA GACTTAATGT TAGAAGAAGG TATTGATGAA
GAAGAAATGC CGCTTGCTCT TTACAAGAAG GGAAGATGGA ACTGGGATAC ATTTGCCGCA
CTTGCTAAGA AGCTGACAGC TGATACAAAT AAAGATGGAA AGATTGATAG GTATGGTGTA
AACTTCTGGG CAGCAACAGC TATTGTATAC GCAAACGGCA CACAGTTTGT TAAGGTTGAT
TCATCTGGTA AAGGCAAGGT AAACTTTGAT AATCCAGCGC TTCAGAGAGC ATTGAACTTC
TACAAAAAGG GTGCAAAAGA AGGCTGGCTT GCTAGAGACT GGGATATAAC AGTTTCTGGT
TTGAAGAAGA GACAAACTGT AATGCTTGTT GCACCACAAT ATAAATTTGA TCAGGACAAG
AGAGAGGTTG AAGATGAGCT TGAAGCTGCT CCATTGCCAC TTGGACCAGA CAACAAGTCA
GGGCTTTATC CATTCGATGC AGACGGGTAT GGCATTATGA AGGGTTCAAA GAATCCAGTT
GGTGCTGGAA AGTTTATTAA CCTGTTATTA GAAAGCGTCC AAAAGAATCA TGATGATGTC
AATGCAAAAA ATAGACCAAA ATACTTAGTA GATTTTGTTA ATAAGCTTGC AGAAAAATCT
TTCTATCCAG GCTTAGGAGA GTCAATGCTT GGTATGCCAC ACTGGGATAT ATTCGGAAGA
GTTGACAGTT CTGACTCTGT TGCAGCTGCA TTGTCAAGTT TGAGACCTCA AGTTGAAAAG
AACGTCAAAG AAGCTTCAGC TGGTGCTATT AATGCAGTTT ACAAACCATT CAAGCCATTT
ACAATTAACT TTGAGGATGG AAAATTAGAT ACATTCAAAG TTTTAGATAC ATCAAAGAAG
ACAGTTAAAC TTTCAATTGC TTCAGGTAAA GAAGCTATAA AAGGAAAATC CTTGAAGGTA
ACTTGGGACC AAGGAAAAGA CGGTGGCGAG ATTTATGTAG TAACAGCACC AGAAAAGGTT
AAGATATACG GGTGGCATGA CTATACTGTA AGCTTTGATG TCAAAGTCTT GAAAGCACCA
AAAGCTGGCA AGACGACAGT TGTATGTTCA ATCCTCAATG ATACAAAACC AAATGCAACA
TCTTATGGCA GCATTACAAA GACAATTGAT AAAGGTCAGA CTGTCTACCA TGTAGAAGGT
AATATTACAA ATATTCCAGA TAACTCTGAC AAGATGTGCT TGAGAATTGG CGTTCAAGAA
GGAGTAGACT TTGTAATTGA TAATATTAAG GTTGTAGAAC TTGAATAA
 
Protein sequence
MVRPKIVKRI ISVLVAICML ASVALIVGKS PQRVQASSKL GDISFLRPGF SKESLKSTDI 
FNKAVAKAIE DYQKKYGGKV NIVYSDWNNW QTKIIARMAA GDPIDVIFGG TGTFPAFYNR
GLVQPLDKYV DLKAPYINKR AMDYAFKYNG HYYLASQKGS NVPWLVIYNK DLMLEEGIDE
EEMPLALYKK GRWNWDTFAA LAKKLTADTN KDGKIDRYGV NFWAATAIVY ANGTQFVKVD
SSGKGKVNFD NPALQRALNF YKKGAKEGWL ARDWDITVSG LKKRQTVMLV APQYKFDQDK
REVEDELEAA PLPLGPDNKS GLYPFDADGY GIMKGSKNPV GAGKFINLLL ESVQKNHDDV
NAKNRPKYLV DFVNKLAEKS FYPGLGESML GMPHWDIFGR VDSSDSVAAA LSSLRPQVEK
NVKEASAGAI NAVYKPFKPF TINFEDGKLD TFKVLDTSKK TVKLSIASGK EAIKGKSLKV
TWDQGKDGGE IYVVTAPEKV KIYGWHDYTV SFDVKVLKAP KAGKTTVVCS ILNDTKPNAT
SYGSITKTID KGQTVYHVEG NITNIPDNSD KMCLRIGVQE GVDFVIDNIK VVELE