Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0181 |
Symbol | |
ID | 7407172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 223549 |
End bp | 225336 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643714583 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002572106 |
Protein GI | 222528224 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000497192 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAGAC CAAAGATTGT TAAAAGAATT ATTTCGGTTT TAGTAGCTAT TTGTATGCTT GCTTCAGTTG CTTTGATTGT GGGCAAGTCT CCACAAAGAG TGCAAGCGTC ATCAAAACTT GGTGACATTT CATTCTTAAG ACCTGGTTTT TCTAAAGAGA GTTTAAAAAG TACTGACATT TTCAACAAAG CAGTTGCAAA GGCAATTGAA GACTATCAGA AAAAATATGG TGGAAAAGTA AACATAGTAT ATTCTGACTG GAACAACTGG CAAACAAAGA TAATTGCAAG GATGGCTGCA GGTGATCCAA TCGATGTTAT TTTTGGAGGA ACCGGGACAT TTCCGGCATT TTACAACAGA GGGCTTGTAC AACCACTTGA TAAGTATGTT GATTTGAAAG CTCCATATAT AAACAAAAGA GCAATGGACT ATGCATTCAA GTACAATGGT CACTACTATT TAGCAAGCCA GAAAGGTTCG AATGTTCCAT GGCTGGTTAT ATATAACAAA GACTTAATGT TAGAAGAAGG TATTGATGAA GAAGAAATGC CGCTTGCTCT TTACAAGAAG GGAAGATGGA ACTGGGATAC ATTTGCCGCA CTTGCTAAGA AGCTGACAGC TGATACAAAT AAAGATGGAA AGATTGATAG GTATGGTGTA AACTTCTGGG CAGCAACAGC TATTGTATAC GCAAACGGCA CACAGTTTGT TAAGGTTGAT TCATCTGGTA AAGGCAAGGT AAACTTTGAT AATCCAGCGC TTCAGAGAGC ATTGAACTTC TACAAAAAGG GTGCAAAAGA AGGCTGGCTT GCTAGAGACT GGGATATAAC AGTTTCTGGT TTGAAGAAGA GACAAACTGT AATGCTTGTT GCACCACAAT ATAAATTTGA TCAGGACAAG AGAGAGGTTG AAGATGAGCT TGAAGCTGCT CCATTGCCAC TTGGACCAGA CAACAAGTCA GGGCTTTATC CATTCGATGC AGACGGGTAT GGCATTATGA AGGGTTCAAA GAATCCAGTT GGTGCTGGAA AGTTTATTAA CCTGTTATTA GAAAGCGTCC AAAAGAATCA TGATGATGTC AATGCAAAAA ATAGACCAAA ATACTTAGTA GATTTTGTTA ATAAGCTTGC AGAAAAATCT TTCTATCCAG GCTTAGGAGA GTCAATGCTT GGTATGCCAC ACTGGGATAT ATTCGGAAGA GTTGACAGTT CTGACTCTGT TGCAGCTGCA TTGTCAAGTT TGAGACCTCA AGTTGAAAAG AACGTCAAAG AAGCTTCAGC TGGTGCTATT AATGCAGTTT ACAAACCATT CAAGCCATTT ACAATTAACT TTGAGGATGG AAAATTAGAT ACATTCAAAG TTTTAGATAC ATCAAAGAAG ACAGTTAAAC TTTCAATTGC TTCAGGTAAA GAAGCTATAA AAGGAAAATC CTTGAAGGTA ACTTGGGACC AAGGAAAAGA CGGTGGCGAG ATTTATGTAG TAACAGCACC AGAAAAGGTT AAGATATACG GGTGGCATGA CTATACTGTA AGCTTTGATG TCAAAGTCTT GAAAGCACCA AAAGCTGGCA AGACGACAGT TGTATGTTCA ATCCTCAATG ATACAAAACC AAATGCAACA TCTTATGGCA GCATTACAAA GACAATTGAT AAAGGTCAGA CTGTCTACCA TGTAGAAGGT AATATTACAA ATATTCCAGA TAACTCTGAC AAGATGTGCT TGAGAATTGG CGTTCAAGAA GGAGTAGACT TTGTAATTGA TAATATTAAG GTTGTAGAAC TTGAATAA
|
Protein sequence | MVRPKIVKRI ISVLVAICML ASVALIVGKS PQRVQASSKL GDISFLRPGF SKESLKSTDI FNKAVAKAIE DYQKKYGGKV NIVYSDWNNW QTKIIARMAA GDPIDVIFGG TGTFPAFYNR GLVQPLDKYV DLKAPYINKR AMDYAFKYNG HYYLASQKGS NVPWLVIYNK DLMLEEGIDE EEMPLALYKK GRWNWDTFAA LAKKLTADTN KDGKIDRYGV NFWAATAIVY ANGTQFVKVD SSGKGKVNFD NPALQRALNF YKKGAKEGWL ARDWDITVSG LKKRQTVMLV APQYKFDQDK REVEDELEAA PLPLGPDNKS GLYPFDADGY GIMKGSKNPV GAGKFINLLL ESVQKNHDDV NAKNRPKYLV DFVNKLAEKS FYPGLGESML GMPHWDIFGR VDSSDSVAAA LSSLRPQVEK NVKEASAGAI NAVYKPFKPF TINFEDGKLD TFKVLDTSKK TVKLSIASGK EAIKGKSLKV TWDQGKDGGE IYVVTAPEKV KIYGWHDYTV SFDVKVLKAP KAGKTTVVCS ILNDTKPNAT SYGSITKTID KGQTVYHVEG NITNIPDNSD KMCLRIGVQE GVDFVIDNIK VVELE
|
| |