Gene Tpet_1679 details


Gene Information

Locus tag: Tpet_1679
Symbol:
ID: 5170106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 1676917
End bp: 1678197
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 46%
IMG OID: 640564205
Product: extracellular solute-binding protein
Protein accession: YP_001245260
Protein GI: 148270800
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
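
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1676917
end_bp = 1678197

# Assumption: coordinates are 1-based and inclusive, so the span
# contains (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted
# as an amino acid in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

With the values from this record, the script reports 1281 bp and 426 aa, matching the Gene Length and Protein Length fields.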


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000895921
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGCTCTA GGTTGATGGT CCTGGGATTT TTACTACTTT TGTTCGCTGG CATTCTTCTA 
GGAGTGAAAC TTACGATCTT CAGTGCCGGT GCCAGTGAGG GTCAGGCACT GGATGCGGCA
ATCGCTGAGT ACAAAAAACT ACATCCAGAG GTGGAATTCG AGCACGTCAA TATCACATCT
GGTTGGCAGG AGAAGTTCTC CCTTGCGTTG ATGAGTGGTG ATGCTCCCGA TCTAATAGCG
ATCACTGTTC CATACGCGGA TTATTTCAGA TCTTACCTTA TTGATCTCGC ACCTTATGTA
GAGAAACACC TAGGCATTTC TCTCAAAGAG TACAAAGATT CCATGTACGA TGTGGTCAGA
GCCTATGTGG GGAAAACGGA GGATGAGTTA ACTTACGTTC CCCTCTATCT CACTGTCCAC
AGTCTTTGGG TGAACGTTGA TTATTTTGAG AAAGCGGGTA TTCCTTATCC TCCACTTGGA
GGAAGGGATG AACCCTGGAC ATGGGAAGAG TTCGTAGATG TTCTCAGAAC AGTCAAAAAA
GTCAACAAAC TACCAGCTGC CATGTCATTT TCCTATTCCA CGGAGAGATT ATTCAATTAC
CTTGCCGTGA GGGGAGTTAA AGTTCTGGAC GAGAACCTGG ATCTTGTTCT CGATAAGGAT
CCCAGAGCAA AAAAGGTGCT GCAAGATTTT GTAGATCTTT TCAAAGAAGA ACTAGTGCCG
GCACCGGAGT GGATAGCACA GCAGTCCGAT ATAAACGATT TCCTGGGGGG TATCACGGCG
GTTCACTGGT CCGGTAGCTG GATGTGCAGA TCCATCATCG ACATCATGAA ACAGACAGGA
AAACGTTTTG CTCCGGCTTA CGTTCCAAAA GATGTCGACT GGTTTGGCAT CAACGGAGGC
CATATCTTCG GGGTGGTAAG AACAGGCGAC AAGAAGCGAG AGGAAGAAGC TATAAAATTC
GCTCTCTGGA TAGGACAGAA GGGACTTGGA AACGATGTGT TCAACAAGGC GCTTCTCGGA
ATTTCACCGT TCAAAGGCCA TGAAATAGAT TACGGTGTAC CGGAGATGAA CGAATGGATA
CCGGTCTTTC AGACTTTGAT CGAAAGGGCA CCTTCTTGGA TAGTTCCGGT CAGAACCTGC
GAACTCTGGG CAAGACTCTA CGATCCTTTG AGAACACAGA TCGCCATGGT AATAGGTGAC
CAGCAGAATC TTGATGATGC ATTGAAAAAC ATCCGAAAAG AGTACGAAAC CATCCTAGAA
GAACTTGGAG GAAAGAGATA A
 
Protein sequence
MRSRLMVLGF LLLLFAGILL GVKLTIFSAG ASEGQALDAA IAEYKKLHPE VEFEHVNITS 
GWQEKFSLAL MSGDAPDLIA ITVPYADYFR SYLIDLAPYV EKHLGISLKE YKDSMYDVVR
AYVGKTEDEL TYVPLYLTVH SLWVNVDYFE KAGIPYPPLG GRDEPWTWEE FVDVLRTVKK
VNKLPAAMSF SYSTERLFNY LAVRGVKVLD ENLDLVLDKD PRAKKVLQDF VDLFKEELVP
APEWIAQQSD INDFLGGITA VHWSGSWMCR SIIDIMKQTG KRFAPAYVPK DVDWFGINGG
HIFGVVRTGD KKREEEAIKF ALWIGQKGLG NDVFNKALLG ISPFKGHEID YGVPEMNEWI
PVFQTLIERA PSWIVPVRTC ELWARLYDPL RTQIAMVIGD QQNLDDALKN IRKEYETILE
ELGGKR
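
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The gene begins with GTG, which table 11 accepts as an alternative start codon and which is read as methionine (M) at the first position. The Python sketch below illustrates this on the first 30 bases of the gene. The function name `translate_cds` and the constant names are hypothetical, the codon table is built from the standard genetic code string in TCAG order, and the start-codon set is the one NCBI lists for table 11. It is a minimal illustration, not the method used to produce this record.

```python
# Standard genetic code in NCBI "TCAG" order; the amino acid
# assignments of table 11 are the same as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons listed by NCBI for translation table 11.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    A table 11 start codon in the first position is read as 'M'.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in TABLE11_STARTS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
head = translate_cds("GTGCGCTCTA GGTTGATGGT CCTGGGATTT")
print(head)  # MRSRLMVLGF
```

Passing the full gene sequence to the same function should yield the full protein sequence, since translation stops at the terminal TAA codon.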