Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1545 |
Symbol | |
ID | 5170636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 1536402 |
End bp | 1538075 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640564072 |
Product | extracellular solute-binding protein |
Protein accession | YP_001245129 |
Protein GI | 148270669 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.2137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGGT TTTTAGTCGT TCTCGTTCTG GTCCTGGCAC TGGTTTCGGT TTTCGGACGG ACTTTTGAGA GAAACAAAAC GCTCTACTGG GGTGGAGCGC TGTGGTCTCC TCCATCCAAC TGGAACCCGT TCACACCATG GAACGCGGTT GCGGGAACCA TCGGTCTTGT CTATGAACCT TTGTTCCTCT ACGATCCCCT GAACGACAAG TTTGAACCGT GGCTTGCAGA AAAAGGAGAA TGGGTCAGCA ACAACGAGTA CGTACTCACG CTCAGAAAGG GTCTCAGATG GCAGGATGGA GTTCCTCTCA CGGCAGACGA TGTGGTTTTC ACCTTCGAAA TCGCCAAGAA GTACACTGGT ATCAGCTACA GTCCTGTGTG GAACTGGCTC GACAGGATCG AAAGGATCGA CGAACGAACG CTGAAGTTTG TCTTCTCCGA CCCGAGGTAC CAGGAATGGA AACAGATGCT CATCAACACA CCGATCGTAC CAAAACACAT CTGGGAAAAC AAAACAGAGG AAGAAGTCCT TCAGGCGGCT AACGAAAATC CAGTTGGATC CGGTCCGTAC TACGTCGAGA GCTGGGCAGA CGACAGATGT GTATTCAAGA AGAACGAGAA CTGGTGGGGC ATCAGAGAAC TCGGTTACGA TCCCAAACCT GAAAGGATCG TGGAACTGAG AGTGCTCAGC AACAATGTCG CAGTAGGAAT GCTCATGAAA GGAGAACTCG ACTGGAGCAA CTTCTTCCTG CCGGGTGTTC CGGTTTTGAA GAAAGCATAC GGAATCGTCA CCTGGTATGA AAACGCTCCT TACATGCTCC CGGCCAACAC CGCAGGAATC TACATCAACG TGAACAAGTA TCCTCTCAGC ATACCTGAAT TCAGAAGAGC AATGGCTTAC GCTATCAATC CCGAAAAGAT CGTCACCAGG GCTTACGAGA ACATGGTGAC GGCTGCCAAT CCCGCTGGAA TCCTGCCCCT TCCCGGTTAC ATGAAGTACT ATCCGAAAGA AGTCGTTGAT AAGTACGGAT TCAAGTACGA TCCGGAGATG GCAAAGAAGA TCCTCGACGA GCTTGGATTC AAAGATGTGA ACAAGGATGG GTTCAGAGAA GATCCGAACG GAAAGCCGTT CAAGCTCACG ATTGAGTGTC CGTACGGATG GACCGACTGG ATGGTTTCTA TCCAGTCTAT TGCAGAAGAT CTCGTGAAAG TCGGAATCAA CGTCGAACCC AAGTACCCCG ACTACTCCAA ATACGCAGAC GACCTCTACG GTGGAAAGTT TGATCTCATA CTCAACAACT TTACAACCGG TGTTTCCGCT ACCATCTGGT CCTACTTCAA CGGTGTGTTC TATCCAGATG CAGTAGAATC CGAGTACTCC TACTCCGGAA ACTTTGGAAA GTACGCCAAT CCTGAAGTTG AGACTCTTCT CGACGAACTC AACAGAAGCA ATGATGATGC TAAAATTAAA GAAGTAGTAG CCAAGCTTTC AGAGATACTG CTCAAGGATC TGCCGTTCAT TCCTCTGTGG TACAACGGTG CATGGTTCCA GGCCTCTGAA GCTGTGTGGA CCAACTGGCC AACGGAGAAG AATCCGTACG CTGTCCCGAT AGGCTGGAAC GGCTGGTGGC AGTTCACAGG AATCAAGACA CTCTTCGGTA TTGAAGCAAA GTAA
|
Protein sequence | MKRFLVVLVL VLALVSVFGR TFERNKTLYW GGALWSPPSN WNPFTPWNAV AGTIGLVYEP LFLYDPLNDK FEPWLAEKGE WVSNNEYVLT LRKGLRWQDG VPLTADDVVF TFEIAKKYTG ISYSPVWNWL DRIERIDERT LKFVFSDPRY QEWKQMLINT PIVPKHIWEN KTEEEVLQAA NENPVGSGPY YVESWADDRC VFKKNENWWG IRELGYDPKP ERIVELRVLS NNVAVGMLMK GELDWSNFFL PGVPVLKKAY GIVTWYENAP YMLPANTAGI YINVNKYPLS IPEFRRAMAY AINPEKIVTR AYENMVTAAN PAGILPLPGY MKYYPKEVVD KYGFKYDPEM AKKILDELGF KDVNKDGFRE DPNGKPFKLT IECPYGWTDW MVSIQSIAED LVKVGINVEP KYPDYSKYAD DLYGGKFDLI LNNFTTGVSA TIWSYFNGVF YPDAVESEYS YSGNFGKYAN PEVETLLDEL NRSNDDAKIK EVVAKLSEIL LKDLPFIPLW YNGAWFQASE AVWTNWPTEK NPYAVPIGWN GWWQFTGIKT LFGIEAK
|
| |