Gene Tpet_1687 details


Gene Information

Locus tag: Tpet_1687
Symbol:
ID: 5170690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 1686824
End bp: 1688818
Gene Length: 1995 bp
Protein Length: 664 aa
Translation table: 11
GC content: 46%
IMG OID: 640564213
Product: extracellular solute-binding protein
Protein accession: YP_001245268
Protein GI: 148270808
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
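
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1686824, 1688818)
print(length, protein_length(length))
```

Under these assumptions the values agree with the listed gene length of 1995 bp and protein length of 664 aa.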


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.789812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTAC CTCAAAATTT CCGTGGAGGT GGTAGGATGA AGCGTTTTCT GGTGTTTCTT 
GTGGTTTTGC TCACTTTAAC AGCGGTTTTT GCAACAGAGT TGCCTCCTCT CACACAGTAC
AACCTTTCAG ACTTTGAAAA ACTCACCGGC AAGAAGATCA CTCAGTTCAA CGAAGCTCCG
ATTCTGAACG AACAGGTAAA GCAGGGCAAG TTACCCCCTG TAGAAGAACG ACTGCCGGAA
GATCCCGTTG TTCTCATTCC GTGGGAAAGC ACAGGAAAGT ACGGAGGAAC ATGGAACAGA
GCCTGGACAG GACCTGCCGA CAGACCACAG GCCGACAGGT TCATGCTCGA ATCTGCAATG
GTGTTTGATC CCCAGGGTAA GGAGCTCTAT CCGAACATCC TTGAGAAGAT CGAAATGTCT
TCCGATGGAA AAGAATTCAT CTGCACTCTC CGAAAAGGTT TGAAATGGTC TGATGGAGTT
CCTGTTACCA CAGAAGATGT GAGATTCTGG TACGAAGATG TCCTTCTCAA CGAGGAACTC
GTTCCTACAT TCCCAAGAGA TCTCATGGCT GGCGGCAAAC CTATGAAACT CGAAATCATA
GACGAATACA CCTACAAGAT CACTTTTGAA GAACCTTATC CTCTCTTCCT GTACCTGTAC
GCAACGAGAA AGGGAAGCTG GGGAATCCGT GGAATCATGC TTCCAGCGCA TTACCTGAAA
CAATTCCATC CAAAGTACGT GCCGCTGGAA AAAATCCAGA AGATGGCAGA AGAAAATGGG
TACGACAACT GGACGAACTA CTTCTGGTCT CTCGGTGATC ACAACGCTCA CATCTCCAAT
CCCGATCTTC CCGTGCTAGC TGCTTGGAAG TTGAAAGAAA TCACGGATGC AAAACTCGTA
ATAGAAAGGA ACCCGTACTA CTGGAAGATA GATCCCGAAG GGAATCAGCT TCCTTATATC
GATGAGATCG TATTCTGGAC CGTCCAGGAT AGACAGATGA TACTTCTGAA AGTCATGGCA
GGAGAAATCG ATATGCAAGC AAGACACCTG AGTCTGGAAG ACTACACACT CCTTGCCGCT
AACGCACAGA AGGGTGGTTA CAAAATCATC AAGTGGAAAC TTGCACGCGG AAGTGATGTT
ACACTCTGGT TGAATCAAAA CGTCAAAGAT CCTGTTCTCA GAGAACTCTT CCAGAACATC
AAATTCAGAC AGGCACTCTC ACTTGCCATC AATCGTGAAG AGATAAACTC CCTTGTCTAT
TACGGTCTCT GTGAACCGAG ACAGGCATCG TTTGTGAGCG GTGTTAAATT CTACGATCCT
GAATGGGAAA CAAGATTCGC CGAATACGAC CCTGAGACTG CAAATAAACT TCTGGATGAA
ATAGGCCTTA CAAAACGCAA CGCAGAAGGT TACAGATTGA GACCGGACGG TCAACCACTG
ATCCTGACGA TCGAATATCC CACGGGTATC TTCGGTGCAT GGGACAAAAC ACTCGAGATG
ATAGCTCAAT ACTTCCAAAA AATCGGGATA AAGGTCAATC TGAAACCAGA GGAGAGATCA
CTGTACATAA CAAGATGTAA CGGTGGTGAG CCTGAAATAG GCGTCTGGTT CTTTGACAGA
AACAAGTATC CAATGCTCGA TCCCGGAAGG CTTCTTGGAA CGGTAACCGA TGGGCCATGG
GCACCACTCT ACGGTCAGTG GTACACTTCG GGTGGAAAGG GTGGTGAAGA ACCACCCGAA
GGATCCGACA TCAGAAGAAT ATACGAACTC TGGGAAAAGG TCAAAATGAC AGTCGATGAG
AAAGAAAGAG ACAAACTCTT CAGAGAAGTC ATAAATGTTC ACAAGAAAAA CATTTTCTTC
ATAGGAACAG TTGGAGAACC AATCTGGCCG GTTGTTGTGA AGACTTATTT CAAGAATGTA
CCTGATTCAC CAGATTTTGT GTGGGAAAAC GAGGGTGATG GACAACACGC TGAACAGTAC
TACATGGACA AATAG
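
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed for comparison with the GC content field; the helper names are hypothetical, and only the first two blocks of the sequence are used as input here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, as printed in the record.
example = clean_sequence("ATGAGATTAC CTCAAAATTT")
print(len(example), gc_fraction(example))
```

Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the rounded percentage reported in the record.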
 
Protein sequence
MRLPQNFRGG GRMKRFLVFL VVLLTLTAVF ATELPPLTQY NLSDFEKLTG KKITQFNEAP 
ILNEQVKQGK LPPVEERLPE DPVVLIPWES TGKYGGTWNR AWTGPADRPQ ADRFMLESAM
VFDPQGKELY PNILEKIEMS SDGKEFICTL RKGLKWSDGV PVTTEDVRFW YEDVLLNEEL
VPTFPRDLMA GGKPMKLEII DEYTYKITFE EPYPLFLYLY ATRKGSWGIR GIMLPAHYLK
QFHPKYVPLE KIQKMAEENG YDNWTNYFWS LGDHNAHISN PDLPVLAAWK LKEITDAKLV
IERNPYYWKI DPEGNQLPYI DEIVFWTVQD RQMILLKVMA GEIDMQARHL SLEDYTLLAA
NAQKGGYKII KWKLARGSDV TLWLNQNVKD PVLRELFQNI KFRQALSLAI NREEINSLVY
YGLCEPRQAS FVSGVKFYDP EWETRFAEYD PETANKLLDE IGLTKRNAEG YRLRPDGQPL
ILTIEYPTGI FGAWDKTLEM IAQYFQKIGI KVNLKPEERS LYITRCNGGE PEIGVWFFDR
NKYPMLDPGR LLGTVTDGPW APLYGQWYTS GGKGGEEPPE GSDIRRIYEL WEKVKMTVDE
KERDKLFREV INVHKKNIFF IGTVGEPIWP VVVKTYFKNV PDSPDFVWEN EGDGQHAEQY
YMDK
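
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It relies on the fact that translation table 11 assigns sense and stop codons the same way as the standard genetic code; alternative start codons, which table 11 also permits, are not handled here, and this gene begins with ATG in any case. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw_dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGAGATTAC CTCAAAATTT CCGTGGAGGT"))  # prints MRLPQNFRGG
```

The output matches the first ten residues of the protein sequence listed above, and the final codons `TAC ATG GAC AAA TAG` translate to the closing residues `YMDK` followed by a stop.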