Gene Tpet_0966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_0966 
Symbol 
ID5170994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp998043 
End bp999221 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content48% 
IMG OID640563484 
Productextracellular solute-binding protein 
Protein accessionYP_001244560 
Protein GI148270100 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000011003 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAGAC TGCTCGTTTT AATGCTTGTT GTGGTTTCTG CCCTTGTGTT AGCACAAACA 
AAGCTCACCA TCTGGTGTTC CGAAAAGCAG GTTGACATCC TCCAGAAACT CGGGGAAGAA
TTCAAGGCAA AGTACGGAAT CCCTGTTGAA GTTCAGTACG TTGATTTTGG AAGCATCAAA
TCCAAATTCC TGACGGCGGC TCCACAGGGA CAGGGTGCAG ACATCATTGT TGGAGCGCAC
GACTGGGTAG GAGAACTCGC CGTCAACGGT TTGATCGAAC CCATTCCCAA CTTCTCTGAT
CTGAAGAATT TCTATGACAC GGCTCTCAAA GCTTTCTCTT ACGGTGGAAA ACTCTACGGA
GTCCCGTACG CCATGGAAGC GGTTGCTCTC ATCTACAACA AGGACTACGT TGATTCTGTT
CCTAAGACCA TGGACGAGCT CATAGAAAAA GCAAAACAGA TAGATGAGGA ATACGGAGGA
GAAGTCAGAG GTTTCATCTA CGATGTCGCC AACTTCTACT TCTCTGCGCC GTTCATTCTG
GGTTACGGAG GATACGTCTT CAAGGAAACA CCTCAGGGAC TCGACGTGAC AGACATTGGA
CTCGCGAACG AAGGAGCAAT CAAAGGTGCG AAACTCATAA AGAGAATGAT CGATGAAGGT
GTTCTCACCC CGGGTGACAA CTACGGAACG ATGGATTCCA TGTTCAAAGA AGGTCTCGCG
GCTATGATCA TCAACGGACC TTGGGCTATA AAATCTTACA AAGACGCGGG TATAAACTAC
GGAGTTGCTC CCATTCCTGA GCTCGAACCG GGTGTTCCTG CCAAACCATT CGTTGGTGTT
CAGGGATTCA TGATCAACGC CAAGTCTCCA AACAAAGTGA TCGCCATGGA ATTTCTCACG
AACTTCATTG CGAGAAAAGA GACCATGTAC AAGATATACC TCGCAGATCC AAGACTTCCT
GCAAGAAAAG ATGTCCTCGA ACTCGTCAAA GACAATCCTG ACGTTGTTGC GTTTACCCAG
AGTGCTTCCA TGGGAACACC GATGCCAAAC GTGCCGGAAA TGGCTCCTGT CTGGTCTGCC
ATGGGAGACG CTCTCAGCAT CATTATCAAC GGACAGGCCA GTGTCGAAGA TGCTCTCAAA
GAGGCTGTGG AAAAAATCAA GGCACAGATA GAAAAATAA
 
Protein sequence
MKRLLVLMLV VVSALVLAQT KLTIWCSEKQ VDILQKLGEE FKAKYGIPVE VQYVDFGSIK 
SKFLTAAPQG QGADIIVGAH DWVGELAVNG LIEPIPNFSD LKNFYDTALK AFSYGGKLYG
VPYAMEAVAL IYNKDYVDSV PKTMDELIEK AKQIDEEYGG EVRGFIYDVA NFYFSAPFIL
GYGGYVFKET PQGLDVTDIG LANEGAIKGA KLIKRMIDEG VLTPGDNYGT MDSMFKEGLA
AMIINGPWAI KSYKDAGINY GVAPIPELEP GVPAKPFVGV QGFMINAKSP NKVIAMEFLT
NFIARKETMY KIYLADPRLP ARKDVLELVK DNPDVVAFTQ SASMGTPMPN VPEMAPVWSA
MGDALSIIIN GQASVEDALK EAVEKIKAQI EK