Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2376 |
Symbol | |
ID | 7407795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2526090 |
End bp | 2527406 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716740 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002574219 |
Protein GI | 222530337 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATAAAAA GTAAAAGGTT AATTGCAACT CTTGTGTTAG TAGTTTTTAC TATGTCGATC TTCTTTGCTT TTTCGACCGC TGGGTCTGAG AAAGCTAAGG CAGCATCGAA AAAGGTTACA CTCAGGTTTA TGTGGTGGGG CGGAGAGGCA AGACACAAAG CCACTTTGGC AGCAATTCAG GCGTATATGA AGAAATACCC TAATGTAAGA ATTAATGCAG AGTATGGCGG TATTGAAGGT TACATGCAGA AGCTCATTAC CCAGCTTGTG GGAAGAACCG CTCCAGATAT AATCCAGATT GACGTTACAT GGATTGGTGA GCTGAGCAGC CAGGGAGATT TCTTTGCAGA CCTTAAAACT TTCAAAGAGG TCAACTTAAA GCCATTTGAA GAGAAGTTTT TAAAAGACTG GTGCTATTCA AACGGAAAAC TTATTGGACT TCCAACAGGT GTTAATGCTT CGGTACTTCA ATATAATAAA GAGTTTTTTA AGAAGTTTAA TATCGACGAA AATACAGTTT GGACGTGGGA TAACTTACTA TCAATAGCTG AAAAAGTACA CAAAAAAGAT AAAAATAGTT ATTTGCTTAA TTTTGATCAA ATCCTCTGTT ACTATGTTTT GACATCGTAT ATTGGTCAAA AAACAGGAAA GGATTGGATT TTAGATGATT ATACATTAGG ATTTAATAGG AATCAGTTGA TAGAAGCTTT TACTTATTTG AAAAAATTAT TTGATGTAGG AGCTATTCAA CCTTTTGCAG AAAGTGCGCC ATTTCAAGGT AAGCCAGAAC AAAATCCAAA ATGGCTAAAA GGAGAATTAG GGATTTTATG GAATTGGACT TCAACTTATG CTGCAAATAA AGCTATGATT CCAAGTTTGG CGATGACATT ACCACCCAGG GGCAACAACT TGAAAAACTA TGCAGTAACT GTCAGACCGT CGCAATTGTT ATCTGTAAAT AAACTTTCGA AGAATGCTAA AGAAGCTGCA AAATTTATTA ACTGGTTTTT AAACGATAAA CAAGCTGCTC TAATACTCAC TGATGTAAGA GGAGTTCCTG CAAGTTCAAG TGCCAGAGAT GCATTGTTAA AGGCAAATAA ATTAGATCCA GAAATATTGA GGGTTACAAA CGAAGCGGTA AAGTATGCAG CAAAACCACA GAATGCACTA TCACAGAATC AAGAAATAGC AAATATAGCA TATGATATCA TTCAGCAGCT TGCATACAAA CAGCTAACAC CGACACAGGC TGCAGATAAA TTGATAGCAT TATATAAACA AAAACTTTCT GAGCTTAAAA GAATGCAGTC TCGATAG
|
Protein sequence | MIKSKRLIAT LVLVVFTMSI FFAFSTAGSE KAKAASKKVT LRFMWWGGEA RHKATLAAIQ AYMKKYPNVR INAEYGGIEG YMQKLITQLV GRTAPDIIQI DVTWIGELSS QGDFFADLKT FKEVNLKPFE EKFLKDWCYS NGKLIGLPTG VNASVLQYNK EFFKKFNIDE NTVWTWDNLL SIAEKVHKKD KNSYLLNFDQ ILCYYVLTSY IGQKTGKDWI LDDYTLGFNR NQLIEAFTYL KKLFDVGAIQ PFAESAPFQG KPEQNPKWLK GELGILWNWT STYAANKAMI PSLAMTLPPR GNNLKNYAVT VRPSQLLSVN KLSKNAKEAA KFINWFLNDK QAALILTDVR GVPASSSARD ALLKANKLDP EILRVTNEAV KYAAKPQNAL SQNQEIANIA YDIIQQLAYK QLTPTQAADK LIALYKQKLS ELKRMQSR
|
| |