Gene Information |
Locus tag | Athe_2310 |
Symbol | |
ID | 7407729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2447720 |
End bp | 2448997 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643716674 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002574153 |
Protein GI | 222530271 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
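The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the last codon of the CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2447720, 2448997)
print(length)                  # 1278
print(protein_length(length))  # 425
```

Both values agree with the Gene Length (1278 bp) and Protein Length (425 aa) fields of this record.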
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000523246 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAT TTATAGCTGT GATGGTTTTA ATTGCTTTTA GTGTAGGTTT GTTTTTAGCC TTTGGTCCTG CCAATTCTAA TGCCGCTTCT AAAAAACAGG TCACAATTAC TTATGTTCGA GGCAAGGACG AAACACACGC AACAGAAAAG ATTATCAAAG AGTTCATGAA GAAAAACCCA GACATCAATG TAATCTACAA AGAAAATCCA TCTGACACAG GCCAAAATCA CGATCAGCTT GTAACAGTTC TGAGTGCTGG TGGTTCTGAC ATTGATGTGT TTGACATGGA CGTTATCTGG CCAGCTGAGT TTGCTCAAGC AGGTTACACA CTTCCTCTTG ACAGATTCAT AAAGCGCGAC AAGACTAATC TCAATGACTA CATTAAAGGA ACAATTGATG CTGCAAGATT TAAAGGACAG ATGTGGGCAT TTCCAAGATT TATTGATGCA GGACTTCTTT ATTACAGAAA AGACATTGTT CCTCAAAATG AACTTCCAAA GACATGGGAT GATTTGATTA AAGTTGCTAA GAAATATAAA GGTAAAAACG GAACAAAGTA CGGATTTTTA ATGCAGGCAA AGCAATATGA AGGCCTTGTT TGTGATGCAA TTGAATATAT AGCATCTTAT GGTGGCAAGG TTGTTGATGA GAGTGGAAAC ATTGTTGTCA ACAATCAAGG AACAATTGAT GGACTAAACA TGATGAGAAA AGTTATAACA TCCGGGATTG TTCCGCCAAA CATCAATACA TTCACAGAGG TTGAAACGCA TACAGCTTTC ATAAATGGTC TTTCAGTCTT TGCAAGAAAC TGGCCTTACA TGTGGGCAAT GATAAATAGT CCACAGTCAA AAGTGAGAGG AAAAGTTGGA ATTTTACCTC TTCCAAAAGG TTCAAAAGGG TCTGCAGCTT GTCTTGGTGG ATGGATGGTA GGTATAAACA AATTCTCGAA AAATCCGGAA GCTTCCTGGA GACTTTTAAA GTTCCTTGTA CAAAAAGAAG GACAGAAACT TATGGCTATT TACAATGGAA ATGTTCCTGT TTACAAACCA CTTTTCAATG ATAAAGATGT TATAAAAGCT AACCCACTAA TAGGTGATAA GAAATTTATT GAAGCTATAT TAGCTGCTGT TCCAAGACCT GTGTCGCCAA TTTACCCAAA GATTTCAGAT GTTATGCAAA TTGAACTTTC AAATATTGTA AACGGTAAGA AGGATGTAAA AACAGCTGTT GCTGATATGG ACAAGAAATT AAAAGAAGTT GTAAAAACTT CAAAGTAA
|
Protein sequence | MKRFIAVMVL IAFSVGLFLA FGPANSNAAS KKQVTITYVR GKDETHATEK IIKEFMKKNP DINVIYKENP SDTGQNHDQL VTVLSAGGSD IDVFDMDVIW PAEFAQAGYT LPLDRFIKRD KTNLNDYIKG TIDAARFKGQ MWAFPRFIDA GLLYYRKDIV PQNELPKTWD DLIKVAKKYK GKNGTKYGFL MQAKQYEGLV CDAIEYIASY GGKVVDESGN IVVNNQGTID GLNMMRKVIT SGIVPPNINT FTEVETHTAF INGLSVFARN WPYMWAMINS PQSKVRGKVG ILPLPKGSKG SAACLGGWMV GINKFSKNPE ASWRLLKFLV QKEGQKLMAI YNGNVPVYKP LFNDKDVIKA NPLIGDKKFI EAILAAVPRP VSPIYPKISD VMQIELSNIV NGKKDVKTAV ADMDKKLKEV VKTSK
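The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how the record's fields relate, the code below strips that spacing, computes a GC fraction, and translates a coding sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the strand is "-") and uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code. Alternative start codons are not handled, and all function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon until the first stop codon.

    Trailing bases that do not form a full codon are ignored; codons with
    characters other than A, C, G, T raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
prefix = "ATGAAAAGAT TTATAGCTGT"
print(translate(prefix))  # MKRFIA
```

The translated prefix matches the first residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be compared in the same way.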