Gene Information |
Locus tag | Tter_2042 |
Symbol | |
ID | 8640071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobaculum terrenum ATCC BAA-798 |
Kingdom | Bacteria |
Replicon accession | NC_013526 |
Strand | - |
Start bp | 171311 |
End bp | 172762 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003323768 |
Protein GI | 269839076 |
COG category | |
COG ID | |
TIGRFAM ID | |
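The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Tter_2042
length_bp = gene_length(171311, 172762)
print(length_bp)                  # 1452, matching "Gene Length"
print(protein_length(length_bp))  # 483, matching "Protein Length"
```

Under these assumptions the 1452 bp span gives 484 codons, and removing the stop codon leaves the 483 aa reported above.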
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000305417 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGAT ACCCAGATAG GTATCCATAC GGCAGGAAGA AACTATCTAG AAGGCAGGCC CTGAAAGCAA TAGCCTTGGG CGCAGCCGGG GCCACCCTCG CAGCCTGTGG TGTTACTACT GGTTCTCCAG GCTCCGCGAA CAAGACGCCG GCAACTACCC CGGCGGCAAG CCAAGCTCCG CCCCCGGCTG CAAACGCCAC CCCGGCGGTA CGCGAGGCTC CCAAGGGACG CAAGATCGTG CTCGAGCTGT ACAGTGTGTT CGCCGGCAAC GAGCATCAGG GATGGATCAA GCTGGCTGAG CGCTACGAGC AGGTGCAGTC GGACGTAGGT ATCAAGATCA CCTACTCACC GGGGCACCAG GACAACCCCA AACTGCTGAC GTCGATCGCG GGCGGTGCCG CTCCCGACCT GGCTCACCTG ACCCCCTTCA GCACACCCCA GTGGGCTCTG CTGGGCGTGA TGACGGACCT CACAGAATAC GTCAAGCAGG AGGGATGGAC CGAGGACGAC TGGTTTCCAC CTGCTTGGAA CGACATGCAG TGGGAGGGCA AGTTCTACCA GGTGCAGTGG GATGCGGACC CTAACTTCCC ATTCTTCTGG AATCAGAACC TATTCGAGGA GGTGGGGCTC GATCCCGATA AGGGACCGCA GACCATCGAC GAGGTGGATG AGTACTCCAA GAAGATCAAC ACCATCCGGA ACGGCAAGGT CGTGCGCATA GGGATGATCC CCTGGGACGT GTACGGGCCG GAGAACTCCC TGTTCACCTG GGGATGGTCC TTCGGCGGCG AGTTCTGGGA CCCAGACAAA CAGGAGGTTA CCCCGGACAA CGACTACGTC GTCAAGGCCC TGGAGTGGAT GGTGAAGTAC GCCAAGTCGG TGGGTGGCCC GGACAAGCTG TCTGTGACGC CTCCCAACCT GCAGCTGCAT CCCTTCGGGT CGGGCAACAT AGGCATGTAC CCGCTCGTAG TCCCGAACTA CCGAGACATC AGGAACGCCA AGCCCGACAT GAAGATCGGA CACGGGCTGC TCCCGTACGA ACCCCCTGGG GCCAAGCAAC CCGGTGCTGG GGCGTGGTTC GGCGGTTGGG GGCTCTTCAT ACCCAAGGGT GCCAAGAACC CCGATGCTGC CTGGGAGTTC ATCAAGTGGG TCTCCGCGAC GCCAGAGGGC ACCAAGGCCC AGTGGGAGAC GGTGGGATTC CCGCCCGCCT GGAAGAAGGC TCCGGTGCTC GAGGAGATCA AGAACGACCC GATCATGGGC CCCTACTACA ACGTGCTGAT ATCCGCCAAG CACTCGCGCC CGGCCATCCC GGTAGGAGCC TACTACTTCG GGCAGATAGG CGAGCAGGTG GGAGAGGCGA TCAACGGCAA GAAGACGCCT CTGGCCGCGA TGCGCACCGC CAAGGAAAAC ACCATGAAAG AGATGCAGCG CTTCATGAAG CAGAACGCGT AA
|
Protein sequence | MSRYPDRYPY GRKKLSRRQA LKAIALGAAG ATLAACGVTT GSPGSANKTP ATTPAASQAP PPAANATPAV REAPKGRKIV LELYSVFAGN EHQGWIKLAE RYEQVQSDVG IKITYSPGHQ DNPKLLTSIA GGAAPDLAHL TPFSTPQWAL LGVMTDLTEY VKQEGWTEDD WFPPAWNDMQ WEGKFYQVQW DADPNFPFFW NQNLFEEVGL DPDKGPQTID EVDEYSKKIN TIRNGKVVRI GMIPWDVYGP ENSLFTWGWS FGGEFWDPDK QEVTPDNDYV VKALEWMVKY AKSVGGPDKL SVTPPNLQLH PFGSGNIGMY PLVVPNYRDI RNAKPDMKIG HGLLPYEPPG AKQPGAGAWF GGWGLFIPKG AKNPDAAWEF IKWVSATPEG TKAQWETVGF PPAWKKAPVL EEIKNDPIMG PYYNVLISAK HSRPAIPVGA YYFGQIGEQV GEAINGKKTP LAAMRTAKEN TMKEMQRFMK QNA