Gene Tpet_0853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_0853 
Symbol 
ID5170523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp868962 
End bp870839 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content47% 
IMG OID640563372 
Productextracellular solute-binding protein 
Protein accessionYP_001244448 
Protein GI148269988 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.157029 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGT TTCTTGTAGT TTTGTTCGTC GTTTCCATGT TCAGTCTGTT CTTTGCACAG 
CTTCCACCCA ATATTCCGAG GAATGAGACT TTCATTGCGC AGGTTTTGAC CGGAAGAGCA
GCCAACCCCA CCAACTTCAA CGTGTGGACA GGATGGGTTT GGCAGGACAG GGGTGTTCAG
AACTTGCTTC TGGAACCGCT CTGGTACGTG GATTTCGCAA CTGGAGAGAT CATAAACGCA
CTTGCCGAAT CTCTCCCGAC CTACAATTCT GACTTCACAG AACTCACCAT CAAACTCAGA
AAAGGTGTAT ATTGGAGCGA TGGAGAACCA TTCACAGCGG ATGACGTTGT GTTCACAATC
GAAACAATCA TGAGCACTCC AGCGTTTGGA TACCATCAGG AACTGGTCAA CGAGGTTGAA
AACGTCGAGA AATTGGACGA CTACACCGTG AAAATAAAAC TCAAGAGACC AAACGCCAGG
TTCCACACTT ATTTCCTCGA CAGATGGGGC GGAATAAGAC CTATGCCAAA ACACGTTTTT
GAAAAGGTTG AAGATCCTGT TAACTTTGAA TTCAATCCAC CCGTTGGAAC CGGCCCATAC
GTGCTCCATT CTGTCGATCC AGGAGGATAC TGGACACTCT GGCAGAGAAG AGAAGACTGG
GACAGAACAC CGACTGGTAT GCTCTTTGGA ATGCCACAAC CCAAGTACGT ACTCTTCATA
GATTACGGTT CCCCGGAAAA ACAGGTCCTT GCCATGGCAC AGCATCAGCT TGACGAAGCA
ATCCTCACGA TAGAAGCGCT CAAAGCGGTT CTCAACAGAG TCAAAACAGC CAGAGCCTGG
AGGAAAAACT TCCCGTGGAC TGTAAACAAC GATCCATGTG TGACAGGATT TGTCTTCAAC
ACAGCGAAAG AGCCATTCAA CAACATCGAA GTTAGATGGG CACTCACACT CGCAATTGAT
ATTGTTGAAT ACGCTGCCAA CGCTTTCGAT GGTGCCGTAA CGCTTTCGCC TATCCACATA
CCACTTTCAA CCGCATACTA CAACTGGTAT TTCGCAAGGC TTGAAGACTG GCTCAAAAAT
TACGAAATCG ATCTTGGAAA CGGAGAGAAA TTCAAGCCAT ACGATCCAGA AGCAGGCCTT
AGGTTAGCAG AATATGCAAA GAAGAGAGGC TATTCTGTAC CTGATAATCC TGAAATAATT
AAGAGAACTT TCGGACCTGG TTGGTGGAAG TATGCCCCCG ATGTTGCAGC AAAACTTCTG
GAAAAGAACG GATTCTACAG AGATAAGAAT GGAAAATGGC ATCTGCCGAA TGGAGATCTG
TGGCAGATAA CAATAATTGC CCCCACCAAT CCATCTGATC CTGCTTACAG AAACGCATTT
GCTCTCTCCC AGGCGTGGAA GAAATTCGGA ATAGATGCGG TCGTTCAGAC TTCCGAAAAT
GCAAACTCGT TTGGTTCGGA GGGTAACTTC GATGTTCACA CCGCATGGCC AGCCGCAGAA
CCTTGGGGTG GTCATCCAGA CCTTTACAGG ACACTCTATC CATTCCATTC TGAGTACGTT
GTTCCAATAG GTGAAAATGC AACATGGGGT AATTACTGCA GGTGGTCCGA CCCAAGGCTT
GACAAAATAA TCGAAGAACT CAAGAATACA CCATGGGGTA ACACTCAAAA ACTCATAGAA
CTCGGTACCG AAGCTCTTAA GATAATCGTT GAAGGACTTC CAAGTGTTCC GACATTCAAC
TATCCTGGTG TTATCGCATG GGATGAATAC TACTGGACGA ATTATCCTGG AGCAGAAAAT
CCATACTCGC AGCCCTACCA GCACTGGCCG AACTTCAAGT ACATGCTCCC GAAGCTGAAA
CCAACCGGTA GAAAATGA
 
Protein sequence
MKKFLVVLFV VSMFSLFFAQ LPPNIPRNET FIAQVLTGRA ANPTNFNVWT GWVWQDRGVQ 
NLLLEPLWYV DFATGEIINA LAESLPTYNS DFTELTIKLR KGVYWSDGEP FTADDVVFTI
ETIMSTPAFG YHQELVNEVE NVEKLDDYTV KIKLKRPNAR FHTYFLDRWG GIRPMPKHVF
EKVEDPVNFE FNPPVGTGPY VLHSVDPGGY WTLWQRREDW DRTPTGMLFG MPQPKYVLFI
DYGSPEKQVL AMAQHQLDEA ILTIEALKAV LNRVKTARAW RKNFPWTVNN DPCVTGFVFN
TAKEPFNNIE VRWALTLAID IVEYAANAFD GAVTLSPIHI PLSTAYYNWY FARLEDWLKN
YEIDLGNGEK FKPYDPEAGL RLAEYAKKRG YSVPDNPEII KRTFGPGWWK YAPDVAAKLL
EKNGFYRDKN GKWHLPNGDL WQITIIAPTN PSDPAYRNAF ALSQAWKKFG IDAVVQTSEN
ANSFGSEGNF DVHTAWPAAE PWGGHPDLYR TLYPFHSEYV VPIGENATWG NYCRWSDPRL
DKIIEELKNT PWGNTQKLIE LGTEALKIIV EGLPSVPTFN YPGVIAWDEY YWTNYPGAEN
PYSQPYQHWP NFKYMLPKLK PTGRK