Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_0942 |
Symbol | |
ID | 5171124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 963502 |
End bp | 964758 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640563460 |
Product | extracellular solute-binding protein |
Protein accession | YP_001244536 |
Protein GI | 148270076 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.254009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGAAGA GAGTACTTCT TGTTATGCTC GTTGTTCTTT CTGTCTTTGC ATTCGCCGAG GTCAAGAAAA TAGTGTTCTG GACAGCACCA AACCCGAATC AGGAAACTTT CTGGAAAGAA CTCGTTGAAA AGTGGAACGC AGAACATCCG GATGTTCAGA TCGAGTGGTC GGTTATCCCA GCCGCTGGAA GTTCTGAAGA AGCCATTTTG AACGCTATCG CAGCTGGAAA CGCACCAGAT ATTTGTACCA ACATATTCAG TGGCTTTGCT GCCCAGCTCG CAGAAGAACT GGATGTTCTC GTTGCTTTCG ATGAGGAGTT TGGAGAAGAG TTTTGGAAAC TTGCGGACGC AAGGAAAATG AGAGGCATAC TCGAGGGCTG GAAGCTCAAC GGACATTACT ACGTCATACC CATTTACTCC AACCCCATGC TCTTCTGGTG GAGAGGAGAT CTTCTGAAAG AACTTGGATA CGAAAAACCT CCAAGAACTT ACTCTGAGAT TTACGAACTG GCAAAGAAAT GGGTTGTTCC GAAAGAAAAG TACGTTATAA GAGCAGTCGC TGGGAGAAAC TGGTGGGACA GATGGTTCGA CTTCATAACG TTCTACTATG CGGCAAGTGG TGGAAAACCC TACATCGAAA ATGGTAAAGC CGTCTTCAAT AACGAGTACG GAAAAGCCGT CGCTGAATTC ATCTACACAC TCTTCAAGAA CGGCTGGACC GCTGTGGATC TGGGTCAGGA TCCCTTCGAA AACGGTACGA TCCTTGGTCA GCTCATGGGA CCATGGCACC TGAACTACAC GAAAGAACAT TATCCTAAGG TGTACCCGCA CATAGTGATG ACACCTCCTC CTGTTCCAGA TAACTATCCT GAAAACAAAC CGATCTACAC TTTTGCAGAC ACCAAAGGAC TCGTTATGTT CAAACATTCT AAATACAAAA AGGAAGCCTT TGAATTCATC AAATGGGTCT TCTCCAACGC ACAGAATGAC GCACGCTGGA TCGAGCTCAC AAGGATGCCG CCTGCAAGAG AAGATCTTGG AACGAATCCT GTGTTTGCTG AGTACATGAA GGATCCATAT TTTGCAAAAA TAGCTGAGGC AGTTGCGTAT GCGGTCCCAC CCGCTCTAAT TACCAACACG ATAGACGTCC AGAACACCAT GACCACGTAT TTGATAGAAC CTCTCATGTA TCTGAAGACT ACACCCGAAG AAGCTCTAAA ACAGGCTGTT AAAGAAATCA ACGCACTCCT GTGGTGA
|
Protein sequence | MWKRVLLVML VVLSVFAFAE VKKIVFWTAP NPNQETFWKE LVEKWNAEHP DVQIEWSVIP AAGSSEEAIL NAIAAGNAPD ICTNIFSGFA AQLAEELDVL VAFDEEFGEE FWKLADARKM RGILEGWKLN GHYYVIPIYS NPMLFWWRGD LLKELGYEKP PRTYSEIYEL AKKWVVPKEK YVIRAVAGRN WWDRWFDFIT FYYAASGGKP YIENGKAVFN NEYGKAVAEF IYTLFKNGWT AVDLGQDPFE NGTILGQLMG PWHLNYTKEH YPKVYPHIVM TPPPVPDNYP ENKPIYTFAD TKGLVMFKHS KYKKEAFEFI KWVFSNAQND ARWIELTRMP PAREDLGTNP VFAEYMKDPY FAKIAEAVAY AVPPALITNT IDVQNTMTTY LIEPLMYLKT TPEEALKQAV KEINALLW
|
| |