Gene Information |
Locus tag | Tpet_0853 |
Symbol | |
ID | 5170523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 868962 |
End bp | 870839 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640563372 |
Product | extracellular solute-binding protein |
Protein accession | YP_001244448 |
Protein GI | 148269988 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
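The coordinate and length fields above are consistent with one another, and the check can be sketched in a few lines. This is a minimal sketch, not the database's own method: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the record for Tpet_0853.
length = gene_length_bp(868962, 870839)
print(length)                     # 1878
print(protein_length_aa(length))  # 625
```

Both results agree with the Gene Length (1878 bp) and Protein Length (625 aa) fields listed in the record.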
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.157029 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGT TTCTTGTAGT TTTGTTCGTC GTTTCCATGT TCAGTCTGTT CTTTGCACAG CTTCCACCCA ATATTCCGAG GAATGAGACT TTCATTGCGC AGGTTTTGAC CGGAAGAGCA GCCAACCCCA CCAACTTCAA CGTGTGGACA GGATGGGTTT GGCAGGACAG GGGTGTTCAG AACTTGCTTC TGGAACCGCT CTGGTACGTG GATTTCGCAA CTGGAGAGAT CATAAACGCA CTTGCCGAAT CTCTCCCGAC CTACAATTCT GACTTCACAG AACTCACCAT CAAACTCAGA AAAGGTGTAT ATTGGAGCGA TGGAGAACCA TTCACAGCGG ATGACGTTGT GTTCACAATC GAAACAATCA TGAGCACTCC AGCGTTTGGA TACCATCAGG AACTGGTCAA CGAGGTTGAA AACGTCGAGA AATTGGACGA CTACACCGTG AAAATAAAAC TCAAGAGACC AAACGCCAGG TTCCACACTT ATTTCCTCGA CAGATGGGGC GGAATAAGAC CTATGCCAAA ACACGTTTTT GAAAAGGTTG AAGATCCTGT TAACTTTGAA TTCAATCCAC CCGTTGGAAC CGGCCCATAC GTGCTCCATT CTGTCGATCC AGGAGGATAC TGGACACTCT GGCAGAGAAG AGAAGACTGG GACAGAACAC CGACTGGTAT GCTCTTTGGA ATGCCACAAC CCAAGTACGT ACTCTTCATA GATTACGGTT CCCCGGAAAA ACAGGTCCTT GCCATGGCAC AGCATCAGCT TGACGAAGCA ATCCTCACGA TAGAAGCGCT CAAAGCGGTT CTCAACAGAG TCAAAACAGC CAGAGCCTGG AGGAAAAACT TCCCGTGGAC TGTAAACAAC GATCCATGTG TGACAGGATT TGTCTTCAAC ACAGCGAAAG AGCCATTCAA CAACATCGAA GTTAGATGGG CACTCACACT CGCAATTGAT ATTGTTGAAT ACGCTGCCAA CGCTTTCGAT GGTGCCGTAA CGCTTTCGCC TATCCACATA CCACTTTCAA CCGCATACTA CAACTGGTAT TTCGCAAGGC TTGAAGACTG GCTCAAAAAT TACGAAATCG ATCTTGGAAA CGGAGAGAAA TTCAAGCCAT ACGATCCAGA AGCAGGCCTT AGGTTAGCAG AATATGCAAA GAAGAGAGGC TATTCTGTAC CTGATAATCC TGAAATAATT AAGAGAACTT TCGGACCTGG TTGGTGGAAG TATGCCCCCG ATGTTGCAGC AAAACTTCTG GAAAAGAACG GATTCTACAG AGATAAGAAT GGAAAATGGC ATCTGCCGAA TGGAGATCTG TGGCAGATAA CAATAATTGC CCCCACCAAT CCATCTGATC CTGCTTACAG AAACGCATTT GCTCTCTCCC AGGCGTGGAA GAAATTCGGA ATAGATGCGG TCGTTCAGAC TTCCGAAAAT GCAAACTCGT TTGGTTCGGA GGGTAACTTC GATGTTCACA CCGCATGGCC AGCCGCAGAA CCTTGGGGTG GTCATCCAGA CCTTTACAGG ACACTCTATC CATTCCATTC TGAGTACGTT GTTCCAATAG GTGAAAATGC AACATGGGGT AATTACTGCA GGTGGTCCGA CCCAAGGCTT GACAAAATAA TCGAAGAACT CAAGAATACA CCATGGGGTA ACACTCAAAA ACTCATAGAA CTCGGTACCG AAGCTCTTAA GATAATCGTT GAAGGACTTC CAAGTGTTCC GACATTCAAC TATCCTGGTG TTATCGCATG GGATGAATAC TACTGGACGA ATTATCCTGG AGCAGAAAAT 
CCATACTCGC AGCCCTACCA GCACTGGCCG AACTTCAAGT ACATGCTCCC GAAGCTGAAA CCAACCGGTA GAAAATGA
|
Protein sequence | MKKFLVVLFV VSMFSLFFAQ LPPNIPRNET FIAQVLTGRA ANPTNFNVWT GWVWQDRGVQ NLLLEPLWYV DFATGEIINA LAESLPTYNS DFTELTIKLR KGVYWSDGEP FTADDVVFTI ETIMSTPAFG YHQELVNEVE NVEKLDDYTV KIKLKRPNAR FHTYFLDRWG GIRPMPKHVF EKVEDPVNFE FNPPVGTGPY VLHSVDPGGY WTLWQRREDW DRTPTGMLFG MPQPKYVLFI DYGSPEKQVL AMAQHQLDEA ILTIEALKAV LNRVKTARAW RKNFPWTVNN DPCVTGFVFN TAKEPFNNIE VRWALTLAID IVEYAANAFD GAVTLSPIHI PLSTAYYNWY FARLEDWLKN YEIDLGNGEK FKPYDPEAGL RLAEYAKKRG YSVPDNPEII KRTFGPGWWK YAPDVAAKLL EKNGFYRDKN GKWHLPNGDL WQITIIAPTN PSDPAYRNAF ALSQAWKKFG IDAVVQTSEN ANSFGSEGNF DVHTAWPAAE PWGGHPDLYR TLYPFHSEYV VPIGENATWG NYCRWSDPRL DKIIEELKNT PWGNTQKLIE LGTEALKIIV EGLPSVPTFN YPGVIAWDEY YWTNYPGAEN PYSQPYQHWP NFKYMLPKLK PTGRK
|
| |
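The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short script. This is a hedged sketch under stated assumptions: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, the sequence is taken as shown (already in coding orientation, starting with ATG, although the gene lies on the minus strand), and the codon table is the standard one, whose amino acid assignments translation table 11 shares.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence shown in the record.
print(translate("ATGAAGAAGT TTCTTGTAGT TTTGTTCGTC"))  # MKKFLVVLFV
print(translate("CCAACCGGTA GAAAATGA"))               # PTGRK
```

The two fragments reproduce the first ten and last five residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.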