Gene Tpet_1545 details


Gene Information

Locus tag: Tpet_1545
Symbol:
ID: 5170636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 1536402
End bp: 1538075
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 49%
IMG OID: 640564072
Product: extracellular solute-binding protein
Protein accession: YP_001245129
Protein GI: 148270669
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
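
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the gene record above.
start_bp = 1536402
end_bp = 1538075
reported_gene_length = 1674      # bp
reported_protein_length = 557    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino acid count, assuming the CDS ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons are expected to hold under the assumptions stated above.
print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported gene length (1674 bp) and protein length (557 aa).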


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.2137
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGGT TTTTAGTCGT TCTCGTTCTG GTCCTGGCAC TGGTTTCGGT TTTCGGACGG 
ACTTTTGAGA GAAACAAAAC GCTCTACTGG GGTGGAGCGC TGTGGTCTCC TCCATCCAAC
TGGAACCCGT TCACACCATG GAACGCGGTT GCGGGAACCA TCGGTCTTGT CTATGAACCT
TTGTTCCTCT ACGATCCCCT GAACGACAAG TTTGAACCGT GGCTTGCAGA AAAAGGAGAA
TGGGTCAGCA ACAACGAGTA CGTACTCACG CTCAGAAAGG GTCTCAGATG GCAGGATGGA
GTTCCTCTCA CGGCAGACGA TGTGGTTTTC ACCTTCGAAA TCGCCAAGAA GTACACTGGT
ATCAGCTACA GTCCTGTGTG GAACTGGCTC GACAGGATCG AAAGGATCGA CGAACGAACG
CTGAAGTTTG TCTTCTCCGA CCCGAGGTAC CAGGAATGGA AACAGATGCT CATCAACACA
CCGATCGTAC CAAAACACAT CTGGGAAAAC AAAACAGAGG AAGAAGTCCT TCAGGCGGCT
AACGAAAATC CAGTTGGATC CGGTCCGTAC TACGTCGAGA GCTGGGCAGA CGACAGATGT
GTATTCAAGA AGAACGAGAA CTGGTGGGGC ATCAGAGAAC TCGGTTACGA TCCCAAACCT
GAAAGGATCG TGGAACTGAG AGTGCTCAGC AACAATGTCG CAGTAGGAAT GCTCATGAAA
GGAGAACTCG ACTGGAGCAA CTTCTTCCTG CCGGGTGTTC CGGTTTTGAA GAAAGCATAC
GGAATCGTCA CCTGGTATGA AAACGCTCCT TACATGCTCC CGGCCAACAC CGCAGGAATC
TACATCAACG TGAACAAGTA TCCTCTCAGC ATACCTGAAT TCAGAAGAGC AATGGCTTAC
GCTATCAATC CCGAAAAGAT CGTCACCAGG GCTTACGAGA ACATGGTGAC GGCTGCCAAT
CCCGCTGGAA TCCTGCCCCT TCCCGGTTAC ATGAAGTACT ATCCGAAAGA AGTCGTTGAT
AAGTACGGAT TCAAGTACGA TCCGGAGATG GCAAAGAAGA TCCTCGACGA GCTTGGATTC
AAAGATGTGA ACAAGGATGG GTTCAGAGAA GATCCGAACG GAAAGCCGTT CAAGCTCACG
ATTGAGTGTC CGTACGGATG GACCGACTGG ATGGTTTCTA TCCAGTCTAT TGCAGAAGAT
CTCGTGAAAG TCGGAATCAA CGTCGAACCC AAGTACCCCG ACTACTCCAA ATACGCAGAC
GACCTCTACG GTGGAAAGTT TGATCTCATA CTCAACAACT TTACAACCGG TGTTTCCGCT
ACCATCTGGT CCTACTTCAA CGGTGTGTTC TATCCAGATG CAGTAGAATC CGAGTACTCC
TACTCCGGAA ACTTTGGAAA GTACGCCAAT CCTGAAGTTG AGACTCTTCT CGACGAACTC
AACAGAAGCA ATGATGATGC TAAAATTAAA GAAGTAGTAG CCAAGCTTTC AGAGATACTG
CTCAAGGATC TGCCGTTCAT TCCTCTGTGG TACAACGGTG CATGGTTCCA GGCCTCTGAA
GCTGTGTGGA CCAACTGGCC AACGGAGAAG AATCCGTACG CTGTCCCGAT AGGCTGGAAC
GGCTGGTGGC AGTTCACAGG AATCAAGACA CTCTTCGGTA TTGAAGCAAA GTAA
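
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch, assuming plain A/C/G/T input, of how the blocks could be joined and a GC percentage computed. The helper names are hypothetical, and only the first line of the sequence is used here, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = (
    "ATGAAAAGGT TTTTAGTCGT TCTCGTTCTG GTCCTGGCAC TGGTTTCGGT TTTCGGACGG"
)

cleaned = clean_sequence(first_line)
print(len(cleaned))           # number of bases in the first line
print(gc_percent(cleaned))    # GC percentage of this fragment only
```

To compare against the reported GC content, the full sequence would be passed through `clean_sequence` in the same way.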
 
Protein sequence
MKRFLVVLVL VLALVSVFGR TFERNKTLYW GGALWSPPSN WNPFTPWNAV AGTIGLVYEP 
LFLYDPLNDK FEPWLAEKGE WVSNNEYVLT LRKGLRWQDG VPLTADDVVF TFEIAKKYTG
ISYSPVWNWL DRIERIDERT LKFVFSDPRY QEWKQMLINT PIVPKHIWEN KTEEEVLQAA
NENPVGSGPY YVESWADDRC VFKKNENWWG IRELGYDPKP ERIVELRVLS NNVAVGMLMK
GELDWSNFFL PGVPVLKKAY GIVTWYENAP YMLPANTAGI YINVNKYPLS IPEFRRAMAY
AINPEKIVTR AYENMVTAAN PAGILPLPGY MKYYPKEVVD KYGFKYDPEM AKKILDELGF
KDVNKDGFRE DPNGKPFKLT IECPYGWTDW MVSIQSIAED LVKVGINVEP KYPDYSKYAD
DLYGGKFDLI LNNFTTGVSA TIWSYFNGVF YPDAVESEYS YSGNFGKYAN PEVETLLDEL
NRSNDDAKIK EVVAKLSEIL LKDLPFIPLW YNGAWFQASE AVWTNWPTEK NPYAVPIGWN
GWWQFTGIKT LFGIEAK