Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0847 |
Symbol | |
ID | 7407422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 939000 |
End bp | 940670 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643715225 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002572735 |
Protein GI | 222528853 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAA AATTAAAAGT ATTTGCTTGG TTTATTTGTT TTGTGTTCAT TTTTTCAACT CTTATTACAT TCCCAAGTTT AAAATCTGAT TTTGTGAAAG CGGCTTCAAG CAATCAACCC GTAAAGACAT TGACATTTTT CTATGGAGAT TCAAATGCAG ATCCTCATCC AGACCTGTTT AGCACTCCTA TTGGTAAAGA AATTACAAAA CTTACAGGTG TAAAACTCAA AATCGAATAC TTAGCAGGAC AGGATGAAGC AACAAAAATT GGTCTTATGT TAGCATCTGG TGATTTACCA GATTTGATTC ATGGTCATCA GGAGCATGGA AAGTTAATAG AAGCTGGTGT ATTGGTACCA CTTGATAACT ATATTCAAAA ATACGGGAAA TATTGTAAAC AGATTTACAC TGATAAAGAC CTCAAAAGAC TCAGACAGAA AGATGGAAAA ATTTATTTCT TGTCTCCTTA CAGAAACGAA ATAACTCCAG ACTTAAAACC AGATGGCTTC TGGCTACCAA TTGATCTTCT TGAAAAAGCA AAATGGCCGA AGGTAAGGTA TTGGGAGGAC TATCAGCAGC TCATTAGAGA TTATGTAAAG AAAAATCCTA CTATTGAGGG GAAACCAACC ATTGGATTTA CATTTATTAC AGAAAGTTGG AGATTCTTCA CTTTAGAAAA TCCGCCTTCA TATCTTATGG GATATCAAAA TGATGGTGAT GTAATTGTTG ACCCGAAGAC ATATGAAGCA AAAGTTTATT CTACAATGGG AGGGTCAAAG AGATACTATA AAGATTTGAA TAAGATGTGG AAAGAGGGAC TTATTGACAA AGAAGTGTTC GTTCAAAACT ACGACACATA TCTTTCTAAG ATTGCTCAGG GTAGAGTTGT AGGATTCTAT GATCAGTGGT GGCAATTTGG ATATGATGCA GAGGCTTCAC TGAAGAATGC AAAGAAATAC AATAGAATGC ATATTTCATT CCCAGTTGTT TACAAAGGTG TTCAAAGAGC AAGATATCTT ATGATTCAGC CGATTGGCGC AAGAGATGGT ATTAGCATAA CTAAGAAGTG TAAAGATCCT GTTACAGCAT TTAAGTTCTT GGACAAACTG TGCTCTTTAG AAGCTCAAAA ACTTATGTAT TGGGGAATAA AAGGTGTTGA TTACAGTGTA GACAAAAACG GAAAAATGTA CCTAACAGAT AAACAGAAAA AACAGAGAGA AGACCCTGTT TACAGGAAGA AACAAGGTCT TGGATACTGG TGGGTATTTC CACATGCATA TTTGAAGCTG CAGGATGGAA ATTACAGAGA GCCCGGATTT GACCCAGAGT ATGTATACAA GAACTTCTCA CCAGCTGAAA AGAAAGTTCT CGATGCATAT AAAGCTAAAT ACTTCATGCA ACCTCCATTT ACAGATCCTC CACTTGAAAC ACCTTATGGA TTTGCATGGG AAATCAATAT TCCTGCTGAT AAGCCTCAAG TTACAATTGC TCAACAGAAG ATGAGCGAAG TGAGAAGAAA ATATCTACCA CAACTTGTAA TGGCAAAGAC AGATGCAGAT TTTGATAGAA TATGGAAAGA ATTTGTTCAG GCATTTGAAA AAACAAATTA CAAAGTTTAT GAGCAGTTCA AGACAGAAAT GATTCGCTGG AGAGTAAAGA ATTGGAACTA A
|
Protein sequence | MSKKLKVFAW FICFVFIFST LITFPSLKSD FVKAASSNQP VKTLTFFYGD SNADPHPDLF STPIGKEITK LTGVKLKIEY LAGQDEATKI GLMLASGDLP DLIHGHQEHG KLIEAGVLVP LDNYIQKYGK YCKQIYTDKD LKRLRQKDGK IYFLSPYRNE ITPDLKPDGF WLPIDLLEKA KWPKVRYWED YQQLIRDYVK KNPTIEGKPT IGFTFITESW RFFTLENPPS YLMGYQNDGD VIVDPKTYEA KVYSTMGGSK RYYKDLNKMW KEGLIDKEVF VQNYDTYLSK IAQGRVVGFY DQWWQFGYDA EASLKNAKKY NRMHISFPVV YKGVQRARYL MIQPIGARDG ISITKKCKDP VTAFKFLDKL CSLEAQKLMY WGIKGVDYSV DKNGKMYLTD KQKKQREDPV YRKKQGLGYW WVFPHAYLKL QDGNYREPGF DPEYVYKNFS PAEKKVLDAY KAKYFMQPPF TDPPLETPYG FAWEINIPAD KPQVTIAQQK MSEVRRKYLP QLVMAKTDAD FDRIWKEFVQ AFEKTNYKVY EQFKTEMIRW RVKNWN
|
| |