Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_0172 |
Symbol | |
ID | 5170645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 166105 |
End bp | 169080 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640562673 |
Product | glycosyl transferase family protein |
Protein accession | YP_001243777 |
Protein GI | 148269317 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATCT TTGAAGAGAT AAGAAAAAAG ATTGAACAAA AAGAATTCTC GAAAGCAAAA GAAATGGCTG AAAAAATAGA AGATGAAGTG GAAAAATACA ACTCACTTGG AATCATACAC TACTACGAAG GTAAAGTCAG CGAAGCGCTT GAGTTTTTCA AAAAAGCCCT CGATATCAAT CCAGTTCACG ATGATGCTCT TTTCAACTAC TCGAAGGTGC TCTTCGAAAA GGGAGAATAC TTTGAATCCT GGAGATATTT GACGAGGATC AACAACAAAA CGTGGGAAGT CTACGACATG CTCGGAGACA CACAGCTGAA ACAGAACAAT CCAGCGATGG CTCTTTATTA TTACAAAAAG GCAGCAGAAC TTTCCAACAT ACCGGAAATG AAAGAAAAAT ACCAAATCCT GAAAGACCAG TTCAAAAAAG ACGTAAAGCT TGCCATATTC TGCCTTCCTG GTCTGGACAA CTTCATAAAG GATATAGCTC AGATTTTGTC GAACATCTAC GATGTGAAGC TTGTTGTTAC AACAGATGTC AGACAGATCC AGGAAGCTTA CAACTGGGCG GATATCGTTT GGCTCGAATG GGCGAACGAA ATGGCAGTGG AGGTAACAAA TAAGCTTCCA AAGGATGGGA AAAAAATTCT TTGTCGACTT CACAGCTATG AAGCATTGGC CAATTACCCT GAAAAAATAA ATTGGAAAAA CGTCGATAAA TTGATTCTTG TTGCAGAACA TATAGAACGT ATCCTTCGAG ATTACCATTC TGAAGTTTAC AAGCAAGTAA AAGACAAGAT AGTAATTGTT CCAAACGGTC TTGATCTGAA CAGACTTAAG TTCAAAGTTA GACAACCCGG ATTCAACATA GCAGTTGTGG CTCACATCAA TCATAAAAAA GATCCTGCTA TGTGGCTTCA AATTATAGGC ATGTTGAGAA AAATTGATGA AAGATACACT TTACACATAG CTGGGGATTT TCAGGAAATA AGGTACGCTA ATTATTTCAA ACACTTCATA AAGGACGCAG GCCTTGAAAA GAACGTAAAA CTCTACGGCT GGGTCGAGGA CGTGAACGCT TTTCTTGAAG ATAAAAATTA TCTGCTTTCA ACGAGTATTC ATGAAAGCTT CGGGTACAAC ATTGCTGAAG CGATGGCAAA AGGGATAAAA CCTATTATAC ATAACTACGC TGGGGCAAAG ACGCAGTGGC CAGATGATCT TGTATTTAAC TTTATCGACG AAGTAATTCG AATTGTAACC AGCAGAGATT ACAACTCAGA GAAATATAGA TCGTTTGTCG AAAAAAACTG TTCTCTTGAG AAGCAAATTA CATCAATATT GAGCATTGTC CAACTTCAAG ATAACAACAG AACAAAAAAT AAATCAATCC ATAAGAAATC AACCACAGAC ACAGAGAATA GTTTCGCGAA AATATGGAAA GAGTATAGTA AAATTGATTC TTTCACGATC ATGAACGATC TTCCAGGAAA AAGTCTTAGA TCAGAATTCG TAAGTTTATT AGAGCGCTTT TTTATTCTCA ATAAAGCACG GATTTTAGAA GTCGGAACTG GAACAGGGGC GTTCTCAATT GAACTTGCAC TCAGAGAAGC AGATGTCACT GGTATAGACA TCGATCCTAC TTCCATCGAA CTAGCAATCA GGATAAGCAA GGATTATAAT GTTGAAAACG TTGAATTCAA AGTAGGTGAT GGTTTCAAAC TAACAGAATC GTTCAAACCA CAGGAGTTTG ATATTGCTTT CAACATGGGA GTGGTTGAAC ACTTTAAAGA TGACGACATA ATCAAAATGT TAAAGCAGAT GGGTGAAGTT GCGAAATTCG TTGTAGTAGG CGTTCCGTAC AGTGGTTCGT TTGTTTACAA AACAGCAAAA GAAACCGCTC AAAAACTTGG TGCCTGGGAA TATGGTTTTG AAAGAGATTT TTTAACCTTG GAACCTCTTA TCAGACGAGC AGGTTTGATC CCTCTTCATG AAGAAGTAAT AGGAGTTCTG GCAGAACCGT TTTACCTGAG AAGGATAAAT CCAGAGTGGG TACCTTTAAA AATAGCTGAG AATTTGCAAA AATATTTTCA AGGTGAAAAA GTTGGTTCAT GGCTGATTTG TTTTGCAACA AAATGGCCAG GTTACGCAGA TGAATTTCTG AAATTAGATG ATCACAAGAA AATAAAGTTT GAAAGCACAC AAATAAGCTT ATTAACTGTC CCCAAGCCGC TGGTTTCTAT TGTTATACCT GTTTTGAACG GAGCAAATTA TGTAAAAAGA CTCGTGGATA ATATTAAACG GATCGATTAT GAGAATTTCG AAGTTGTATT AGTTGACGAT GGTTCAACTG ATGGAACAGC TGATTTATTT GAAAGGCTCA TAAAAGGAGA ACCTAAGTTA CGAGAGAAGA TTTTGATAAT TAGAAATAAA GAAAACGTTG GAACCTTTCA CTCAAGATTG ATAGGCGTTA AACACAGCCA AGGATCATTT GTTTTCTTTC ATGATATTGA TGATCTTGTT TACTCCAAAG GAGTCAAAAA ACTTCTGGAT GATTTGTTGA ATTTTCCGAA CAAAAAAACA TTACTCACTG TAACAAATGC TTTGATGTCA GGAGAACAGT TCAATGGAGA AATTTGGTGT AGTAATTTTT ACAAAAACAA AGAAGAATTG TTTGTTTCAG AAATCACTTC TCTTTCAGGG AAATTTTCGA TAATAGACAC TCTTATAGAA CGTATCCCTC TTCAAAAAGC TTATGAAGAA CTTGCCGAAG TTCTTCATAA AGTAGGAATA ATAAAAATGA CGATTGCCGA GGATACAATT CTTGCCGATT ATCTTTTATT GGAGCATTTT GTGGAAAAAA TGATTCCTAC TTTCTACACT TTCCTGGGCT ATGAGGTCGG TAACTTACAA TCGAGCTCAA AAAACTTTTG GAAAGGATCA AACAGATTCC CATACAAATC TCTTTCCTCA TGGTAA
|
Protein sequence | MDIFEEIRKK IEQKEFSKAK EMAEKIEDEV EKYNSLGIIH YYEGKVSEAL EFFKKALDIN PVHDDALFNY SKVLFEKGEY FESWRYLTRI NNKTWEVYDM LGDTQLKQNN PAMALYYYKK AAELSNIPEM KEKYQILKDQ FKKDVKLAIF CLPGLDNFIK DIAQILSNIY DVKLVVTTDV RQIQEAYNWA DIVWLEWANE MAVEVTNKLP KDGKKILCRL HSYEALANYP EKINWKNVDK LILVAEHIER ILRDYHSEVY KQVKDKIVIV PNGLDLNRLK FKVRQPGFNI AVVAHINHKK DPAMWLQIIG MLRKIDERYT LHIAGDFQEI RYANYFKHFI KDAGLEKNVK LYGWVEDVNA FLEDKNYLLS TSIHESFGYN IAEAMAKGIK PIIHNYAGAK TQWPDDLVFN FIDEVIRIVT SRDYNSEKYR SFVEKNCSLE KQITSILSIV QLQDNNRTKN KSIHKKSTTD TENSFAKIWK EYSKIDSFTI MNDLPGKSLR SEFVSLLERF FILNKARILE VGTGTGAFSI ELALREADVT GIDIDPTSIE LAIRISKDYN VENVEFKVGD GFKLTESFKP QEFDIAFNMG VVEHFKDDDI IKMLKQMGEV AKFVVVGVPY SGSFVYKTAK ETAQKLGAWE YGFERDFLTL EPLIRRAGLI PLHEEVIGVL AEPFYLRRIN PEWVPLKIAE NLQKYFQGEK VGSWLICFAT KWPGYADEFL KLDDHKKIKF ESTQISLLTV PKPLVSIVIP VLNGANYVKR LVDNIKRIDY ENFEVVLVDD GSTDGTADLF ERLIKGEPKL REKILIIRNK ENVGTFHSRL IGVKHSQGSF VFFHDIDDLV YSKGVKKLLD DLLNFPNKKT LLTVTNALMS GEQFNGEIWC SNFYKNKEEL FVSEITSLSG KFSIIDTLIE RIPLQKAYEE LAEVLHKVGI IKMTIAEDTI LADYLLLEHF VEKMIPTFYT FLGYEVGNLQ SSSKNFWKGS NRFPYKSLSS W
|
| |