Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0105 |
Symbol | |
ID | 7408467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 128013 |
End bp | 129068 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643714513 |
Product | D-xylose ABC transporter, periplasmic substrate-binding protein |
Protein accession | YP_002572036 |
Protein GI | 222528154 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | [TIGR02634] D-xylose ABC transporter, substrate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT CACTATTAAG AATTGTTGCC ATTTTTGTTG CGGTTGTCTT CATAGTTGGT ATTGGCTATG CAGTTGTTCC AAAATATGCA AGCGCAAAAT CTTCTAAAAA GCAAATCAAA ATTGGGTTGT CCTTAGCTAC TTTGCAGGAG GAAAGATGGC ACAAAGACAG AGATGAATTT GTAAAAGCAG CTCAAAAGCT TGGTGCAAAA GTTTTAGTTC AGGCTGCTAA TATGGACGAT GTAAAGCAAA AAGAACAGTG TGAGAATTTA ATTAGCCAGG GTGTAGATGT TCTTGTTATA GTTCCAAACA ATGCAGAAGT TTTCACATCC ATAATTGAAG AAGCTCACAA GGCAAAAATA CCAGTAATTT CATACGACAG ATTAATTAAA AACGCAAATG TTGATCTTTA CATCTCATTT GACAATATAA AAGTTGGTGA ACTTCAGGGT AAATACTTAA CATCAAAGGT TCCAAAGGGC AACTACTTTG TATTCAGAGG CGCTCCAACA GACAACAACG CAACACTCTT CTATCAAGGT GCTATGAAGT ATATCCAGCC ACTTGTAAAG AGCGGAAAAG TAAAAGTTCT CTTTGACCAG CCAGTAAAAG ACTGGAAACC AGAAGAGGCT TTGAGACTTT GTGAAAATGC TCTTACTGCA GCAAAGAACA ACGTTCAAGG AATCTTGGCA CCAAACGATG GAACAGCTGG CGGAATCATT CAGGCTCTCA AAGCACAGGG GCTTGCTGGT AAGGTTGTTG TAACAGGTCA GGATGCAGAC CTTGCAGCTG TTAAGAGAAT TGTTGAGGGT ACACAGACAA TGACAGTGTT CAAAGATGTA AGACTTTTAG CTAAAAAAGC TGCTGAGGTT GCAGTTGAGC TTGCAAAAGG CAAGAAGGTT TCTCAGCTCA AAGATGTAAA CGGCAAGGTT TACAATGGTA AGATAAATGT ACCATCAATA CTTTTGACAC CAGTTGCAGT TGATAAGTCT AACATTGACA AGGTACTTAT CCAGAGCGGT TGGTTCACAA AAGAACAGGT TTATGGCAAG AAGTAA
|
Protein sequence | MKKSLLRIVA IFVAVVFIVG IGYAVVPKYA SAKSSKKQIK IGLSLATLQE ERWHKDRDEF VKAAQKLGAK VLVQAANMDD VKQKEQCENL ISQGVDVLVI VPNNAEVFTS IIEEAHKAKI PVISYDRLIK NANVDLYISF DNIKVGELQG KYLTSKVPKG NYFVFRGAPT DNNATLFYQG AMKYIQPLVK SGKVKVLFDQ PVKDWKPEEA LRLCENALTA AKNNVQGILA PNDGTAGGII QALKAQGLAG KVVVTGQDAD LAAVKRIVEG TQTMTVFKDV RLLAKKAAEV AVELAKGKKV SQLKDVNGKV YNGKINVPSI LLTPVAVDKS NIDKVLIQSG WFTKEQVYGK K
|
| |