Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2574 |
Symbol | |
ID | 7409528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2709913 |
End bp | 2711118 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643716938 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002574412 |
Protein GI | 222530530 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000602333 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATC TAAAAAGAAT CTTGACAGTA GCGCTAATAA TTACTTTTGC GGTTGTGGCG CTGATACCAC TGAGCGGAGT ATTTGCGACA TCCAAAAAAC AGCTTGTTGT CTGGTCACAT CTTACTCAGG ATGAAGTAAA AGCTTTGCAA CCAATTGCAG ATAAGTGGGG AAAAGAAAAT GGATACACTG TAAAAGTCAT CACAGACCAA GGTTCGTTCC AGAGCTTCCA GACAGCTGCA ATGAGCGGAA AAGGTCCAGA TATCATGTTT GGTATTCCAC ATGATAACCT TGGTGCTTTC TGGAAAGCAA AGCTTTTAGA AGCAGTTCCA GCAAATCTCA TTGACAAGAA AAACTTTGTA TCAACTGCGC TTGATGCATG TTCATTTGAA GGGAAGCTCT ATGCTCTTCC AATTGCTATG GAGACATATG CGCTTTTCTA TAATACATCA AAAGTAAAAG AGGCTCCAAA GACAATGAGC CAGCTCATCA CATTGGCTAA GAAATACGGG TTTATGTACG ATGTTAACAA CTTCTATTTC AGCTTTGCGT TCATTGCTCA AAATGGTGGA TATGTGTTCA AGAACAAGGG TGGTTCGCTT GATCCAAACG ATATCGGACT TGCAACAAAT GGTGCGATAA AAGGACTTTC ATTGATAAGA GATTTTGTTC AGACATATAA ATTCATGCCA AAGGACATCA AGGGTGATAT TGCAAAGGGT AACTTCCAGA ACCAGAAGAT TGCATTTTAC ATCAGCGGTC CATGGGATGT TCAGGACTTT ATAAAGGCAA AAGTACCATT TGCAGTAGCA CCATTGCCAA AGACAGATGA TGGAAAACCA ACACCATCGT TTGTTGGTGT TCAGGCAGCA TTCGTTTCAG CAAAGTCCAA AAACAAAGAT GCAGCTTTCA AACTTATGAA GTATCTTGTT GAAAACTCTG CTCTGACACT GTTCAAAGTT GGTCACAGAA TACCTGTTTT GAACAAGGTG CTAACAAGTA GCGAGGTCAA GGCAGACAAA ATCATGAGCG CATTTGCTGA ACAGGCAAAG GTTGGAATAC CTATGCCAAA CATCCCAGAG ATGTCTGCTG TTTGGCCAGT AGCTAACAAT GCGCTTTCGC TTATCACCAC AGGTAAAGCA ACACCAAAGC AGGCAGCAGA TGCTATGGTA AAACAGATCA AGCAAGGTAT TGCTCAGATG CAATAA
|
Protein sequence | MKNLKRILTV ALIITFAVVA LIPLSGVFAT SKKQLVVWSH LTQDEVKALQ PIADKWGKEN GYTVKVITDQ GSFQSFQTAA MSGKGPDIMF GIPHDNLGAF WKAKLLEAVP ANLIDKKNFV STALDACSFE GKLYALPIAM ETYALFYNTS KVKEAPKTMS QLITLAKKYG FMYDVNNFYF SFAFIAQNGG YVFKNKGGSL DPNDIGLATN GAIKGLSLIR DFVQTYKFMP KDIKGDIAKG NFQNQKIAFY ISGPWDVQDF IKAKVPFAVA PLPKTDDGKP TPSFVGVQAA FVSAKSKNKD AAFKLMKYLV ENSALTLFKV GHRIPVLNKV LTSSEVKADK IMSAFAEQAK VGIPMPNIPE MSAVWPVANN ALSLITTGKA TPKQAADAMV KQIKQGIAQM Q
|
| |