Gene Tpet_1552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1552 
Symbol 
ID5171285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1543354 
End bp1544529 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content49% 
IMG OID640564078 
Productextracellular solute-binding protein 
Protein accessionYP_001245135 
Protein GI148270675 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000398035 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TTCTGGTGAT CGCTTTGCTT GTCGTTTCCC TCGTTGTCCT CGCTCAGCCG 
AAACTCACCA TCTGGTGCTC TGAGAAGCAG GTCGATATCC TTCAAAAACT CGGAGAGGAG
TTCAAGGCAA AGTACGGCGT AGAGGTTGAA GTGCAGTACG TGAACTTCCA AGACATCAAG
TCCAAGTTCC TCATAGCAGC TCCTGAAGGA CAGGGTGCGG ATATCATCGT TGGAGCACAC
GACTGGGTAG GCGAACTCGC AGTCAACGGT TTGATCGAAC CCATTCCGAA CTTCAGTGAT
CTGAAGAACT TCTATGAAAC TGCCCTCAAC GCGTTCTCTT ACGGTGGAAA ACTCTACGGT
ATTCCCTACG CCATGGAAGC AATAGCACTC ATCTACAACA AGGACTACGT TCCTGAACCC
CCAAAGACCA TGGACGAACT CATAGAGACA GCAAAACAGA TCGATGAAGA ATTTGGAGGA
GAAGTGAGAG GTTTCATCAC CTCAGCGGCC GAGTTTTACT ACATTGCTCC TTTCATTTTC
GGATACGGTG GATACGTATT CAAACAGACA GAAAAAGGAC TGGACGTCAA CGATATCGGA
CTGGCCAACG AAGGAGCCAT CAAGGGTGTG AAACTCCTCA AAAGATTGGT TGATGAGGGA
ATACTGGATC CCAGTGACAA TTATCAGATC ATGGATTCCA TGTTCAGGGA AGGCCAGGCG
GCGATGATCA TCAACGGACC GTGGGCCATT AAGGCGTACA AGGATGCAGG AATAGACTAT
GGTGTAGCCC CAATCCCCGA TCTGGAACCT GGCGTTCCTG CAAGACCTTT CGTTGGGGTC
CAGGGCTTCA TGGTGAACGC AAAATCCCCA AACAAACTCC TTGCCATCGA ATTCCTGACC
AGTTTCATTG CAAAAAAGGA AACGATGTAC AGAATCTACC TTGGAGATCC AAGACTTCCC
TCCAGAAAGG ACGTGCTCGA ACTTGTGAAA GATAACCCAG ACGTAGTTGG CTTCACACTG
AGCGCAGCCA ACGGTATTCC AATGCCCAAC GTTCCACAGA TGGCCGCTGT CTGGGCCGCT
ATGAACGATG CGCTCAATCT CGTTGTGAAC GGAAAAGCAA CGGTCGAAGA AGCGCTCAAA
AACGCCGTTG AAAGAATCAA AGCTCAGATT CAGTAA
 
Protein sequence
MKKFLVIALL VVSLVVLAQP KLTIWCSEKQ VDILQKLGEE FKAKYGVEVE VQYVNFQDIK 
SKFLIAAPEG QGADIIVGAH DWVGELAVNG LIEPIPNFSD LKNFYETALN AFSYGGKLYG
IPYAMEAIAL IYNKDYVPEP PKTMDELIET AKQIDEEFGG EVRGFITSAA EFYYIAPFIF
GYGGYVFKQT EKGLDVNDIG LANEGAIKGV KLLKRLVDEG ILDPSDNYQI MDSMFREGQA
AMIINGPWAI KAYKDAGIDY GVAPIPDLEP GVPARPFVGV QGFMVNAKSP NKLLAIEFLT
SFIAKKETMY RIYLGDPRLP SRKDVLELVK DNPDVVGFTL SAANGIPMPN VPQMAAVWAA
MNDALNLVVN GKATVEEALK NAVERIKAQI Q