Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1679 |
Symbol | |
ID | 5170106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 1676917 |
End bp | 1678197 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640564205 |
Product | extracellular solute-binding protein |
Protein accession | YP_001245260 |
Protein GI | 148270800 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000895921 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCTCTA GGTTGATGGT CCTGGGATTT TTACTACTTT TGTTCGCTGG CATTCTTCTA GGAGTGAAAC TTACGATCTT CAGTGCCGGT GCCAGTGAGG GTCAGGCACT GGATGCGGCA ATCGCTGAGT ACAAAAAACT ACATCCAGAG GTGGAATTCG AGCACGTCAA TATCACATCT GGTTGGCAGG AGAAGTTCTC CCTTGCGTTG ATGAGTGGTG ATGCTCCCGA TCTAATAGCG ATCACTGTTC CATACGCGGA TTATTTCAGA TCTTACCTTA TTGATCTCGC ACCTTATGTA GAGAAACACC TAGGCATTTC TCTCAAAGAG TACAAAGATT CCATGTACGA TGTGGTCAGA GCCTATGTGG GGAAAACGGA GGATGAGTTA ACTTACGTTC CCCTCTATCT CACTGTCCAC AGTCTTTGGG TGAACGTTGA TTATTTTGAG AAAGCGGGTA TTCCTTATCC TCCACTTGGA GGAAGGGATG AACCCTGGAC ATGGGAAGAG TTCGTAGATG TTCTCAGAAC AGTCAAAAAA GTCAACAAAC TACCAGCTGC CATGTCATTT TCCTATTCCA CGGAGAGATT ATTCAATTAC CTTGCCGTGA GGGGAGTTAA AGTTCTGGAC GAGAACCTGG ATCTTGTTCT CGATAAGGAT CCCAGAGCAA AAAAGGTGCT GCAAGATTTT GTAGATCTTT TCAAAGAAGA ACTAGTGCCG GCACCGGAGT GGATAGCACA GCAGTCCGAT ATAAACGATT TCCTGGGGGG TATCACGGCG GTTCACTGGT CCGGTAGCTG GATGTGCAGA TCCATCATCG ACATCATGAA ACAGACAGGA AAACGTTTTG CTCCGGCTTA CGTTCCAAAA GATGTCGACT GGTTTGGCAT CAACGGAGGC CATATCTTCG GGGTGGTAAG AACAGGCGAC AAGAAGCGAG AGGAAGAAGC TATAAAATTC GCTCTCTGGA TAGGACAGAA GGGACTTGGA AACGATGTGT TCAACAAGGC GCTTCTCGGA ATTTCACCGT TCAAAGGCCA TGAAATAGAT TACGGTGTAC CGGAGATGAA CGAATGGATA CCGGTCTTTC AGACTTTGAT CGAAAGGGCA CCTTCTTGGA TAGTTCCGGT CAGAACCTGC GAACTCTGGG CAAGACTCTA CGATCCTTTG AGAACACAGA TCGCCATGGT AATAGGTGAC CAGCAGAATC TTGATGATGC ATTGAAAAAC ATCCGAAAAG AGTACGAAAC CATCCTAGAA GAACTTGGAG GAAAGAGATA A
|
Protein sequence | MRSRLMVLGF LLLLFAGILL GVKLTIFSAG ASEGQALDAA IAEYKKLHPE VEFEHVNITS GWQEKFSLAL MSGDAPDLIA ITVPYADYFR SYLIDLAPYV EKHLGISLKE YKDSMYDVVR AYVGKTEDEL TYVPLYLTVH SLWVNVDYFE KAGIPYPPLG GRDEPWTWEE FVDVLRTVKK VNKLPAAMSF SYSTERLFNY LAVRGVKVLD ENLDLVLDKD PRAKKVLQDF VDLFKEELVP APEWIAQQSD INDFLGGITA VHWSGSWMCR SIIDIMKQTG KRFAPAYVPK DVDWFGINGG HIFGVVRTGD KKREEEAIKF ALWIGQKGLG NDVFNKALLG ISPFKGHEID YGVPEMNEWI PVFQTLIERA PSWIVPVRTC ELWARLYDPL RTQIAMVIGD QQNLDDALKN IRKEYETILE ELGGKR
|
| |