Gene Tpet_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1600 
Symbol 
ID5171444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1600711 
End bp1602216 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content47% 
IMG OID640564126 
Productextracellular solute-binding protein 
Protein accessionYP_001245182 
Protein GI148270722 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGT TAGTCTGGTT GTTTTTGATC CTGACAGTAA CGCTCTCCTT CGCGGCAAAA 
GACATCATCG TGGTGGGTAC AACGGACAAA ATCAGAACTC TCGATCCTGC CAACTGTTAC
GATTACTTTT CTTCCAACAT CCTTCAAAAC GTCATGGTTG GATTGGTTGA TTACGAAATA
GGAACAAGCG TTCTCAAACC CGTTCTGGCG GAGAGATGGG AAGTCGATGA AACAGGAACG
GTTTACACGT TCTATCTGAG AAAGGATGCA AAGTTCGAGG ATGGAACACC CATCGATGCT
CACGTCTTTA AGTACTCTTT CGACAGAGTG ATGAAACTGA ATGGAGATCC GGCATTCTTA
CTCTCTGATG TCGTCGAAAA AACAGAAGTA GTGGATGACT ACACCTTCCG TGTAACTCTG
AAGTACCCAT TCTCCGCGTT CGTCTCTGTC CTCGGTTACA CCGTTGCGTA TCCGGTGAAC
CCGAAGGTCT ATCCAGTCGA TTCCTTCTAC GAAGGTATCC CGTCTGCATC CGGCCCCTAC
AGGGTCAAAG AATGGATCAG AGATGTGAGA ATCGTTCTCG AAGCGAATCC GAACTACTTT
GGTGAAAAGC CAAAGACAAA GACCATCGTG ATCAATTTCT ACGAGAACGC TTCCACACTC
AGATTGGCAC TCGAGACGGG AGAAATCGAT GTTGCTTACA GACATCTCGA CCCCAGAGAT
GTTATCGATC TCGAGGGAAG AGAAGACATC GTCGTTTACA AAGGTAACAG TCCACAGATC
AGATATCTGG TGATCAACGT CACACAGCCC CCGTTCGACA ACGTGAAGGT GAGGCAGGCA
CTAGCTTATG CGGTGAACAG AGACGTCATA GTTGAAGACG TGTTCGTGGG GCTTGCAAAA
CCGTTGTACT CGATGATCCC GGAAGGCATG TGGGGACACA AGGACGTATT CCCGAAGAGG
GAGCTCGAAA AAGCGAAAGC TCTTCTCAAA GAAGCAGGCT ATGACGAGAA AAATCCTTTT
GTGATAGACC TGTGGTACAC ACCTTCACAC TACGGAACGA CAGAAGCGGA TGTTGCCCAG
GTGTTGAAGG AATCGTTCGA AGAAACGGGT GTCATAAAAG TGAATCTCAA GTACGCAGAG
TGGTCCACTT ACGTGGAGTA CTTCCTGAAC GGAACGATGG GTTTCTTCCT CCTCGGATGG
TACCCCGATT ACCTTGATCC GGACGATTAC GTGTGGCCGT TCCTCAGTGA AAGCGGTGCA
AAATCCCTCG GTAGCTTCTA CTCGAATCCT GAGGTAGAAA ACCTGATGAT AGAAGCAAGG
AAATTCACTG ATCTGGAAAA GAGAACAGAA ATCTACTACA AAGTCCAAGA GATCCTTGCA
AGGGATGTTC CTTACATACC GCTCTGGCAG GGAGTTGCAA CCTGTGCAGC GAAAAAACAG
GTGAAAGGAA TTCTGCTTGA ACCCACACAG ATATTCAGAT ACTACATACT CTACTGGGAA
GATTGA
 
Protein sequence
MKKLVWLFLI LTVTLSFAAK DIIVVGTTDK IRTLDPANCY DYFSSNILQN VMVGLVDYEI 
GTSVLKPVLA ERWEVDETGT VYTFYLRKDA KFEDGTPIDA HVFKYSFDRV MKLNGDPAFL
LSDVVEKTEV VDDYTFRVTL KYPFSAFVSV LGYTVAYPVN PKVYPVDSFY EGIPSASGPY
RVKEWIRDVR IVLEANPNYF GEKPKTKTIV INFYENASTL RLALETGEID VAYRHLDPRD
VIDLEGREDI VVYKGNSPQI RYLVINVTQP PFDNVKVRQA LAYAVNRDVI VEDVFVGLAK
PLYSMIPEGM WGHKDVFPKR ELEKAKALLK EAGYDEKNPF VIDLWYTPSH YGTTEADVAQ
VLKESFEETG VIKVNLKYAE WSTYVEYFLN GTMGFFLLGW YPDYLDPDDY VWPFLSESGA
KSLGSFYSNP EVENLMIEAR KFTDLEKRTE IYYKVQEILA RDVPYIPLWQ GVATCAAKKQ
VKGILLEPTQ IFRYYILYWE D