Gene Tpet_0172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_0172 
Symbol 
ID5170645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp166105 
End bp169080 
Gene Length2976 bp 
Protein Length991 aa 
Translation table11 
GC content37% 
IMG OID640562673 
Productglycosyl transferase family protein 
Protein accessionYP_001243777 
Protein GI148269317 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATCT TTGAAGAGAT AAGAAAAAAG ATTGAACAAA AAGAATTCTC GAAAGCAAAA 
GAAATGGCTG AAAAAATAGA AGATGAAGTG GAAAAATACA ACTCACTTGG AATCATACAC
TACTACGAAG GTAAAGTCAG CGAAGCGCTT GAGTTTTTCA AAAAAGCCCT CGATATCAAT
CCAGTTCACG ATGATGCTCT TTTCAACTAC TCGAAGGTGC TCTTCGAAAA GGGAGAATAC
TTTGAATCCT GGAGATATTT GACGAGGATC AACAACAAAA CGTGGGAAGT CTACGACATG
CTCGGAGACA CACAGCTGAA ACAGAACAAT CCAGCGATGG CTCTTTATTA TTACAAAAAG
GCAGCAGAAC TTTCCAACAT ACCGGAAATG AAAGAAAAAT ACCAAATCCT GAAAGACCAG
TTCAAAAAAG ACGTAAAGCT TGCCATATTC TGCCTTCCTG GTCTGGACAA CTTCATAAAG
GATATAGCTC AGATTTTGTC GAACATCTAC GATGTGAAGC TTGTTGTTAC AACAGATGTC
AGACAGATCC AGGAAGCTTA CAACTGGGCG GATATCGTTT GGCTCGAATG GGCGAACGAA
ATGGCAGTGG AGGTAACAAA TAAGCTTCCA AAGGATGGGA AAAAAATTCT TTGTCGACTT
CACAGCTATG AAGCATTGGC CAATTACCCT GAAAAAATAA ATTGGAAAAA CGTCGATAAA
TTGATTCTTG TTGCAGAACA TATAGAACGT ATCCTTCGAG ATTACCATTC TGAAGTTTAC
AAGCAAGTAA AAGACAAGAT AGTAATTGTT CCAAACGGTC TTGATCTGAA CAGACTTAAG
TTCAAAGTTA GACAACCCGG ATTCAACATA GCAGTTGTGG CTCACATCAA TCATAAAAAA
GATCCTGCTA TGTGGCTTCA AATTATAGGC ATGTTGAGAA AAATTGATGA AAGATACACT
TTACACATAG CTGGGGATTT TCAGGAAATA AGGTACGCTA ATTATTTCAA ACACTTCATA
AAGGACGCAG GCCTTGAAAA GAACGTAAAA CTCTACGGCT GGGTCGAGGA CGTGAACGCT
TTTCTTGAAG ATAAAAATTA TCTGCTTTCA ACGAGTATTC ATGAAAGCTT CGGGTACAAC
ATTGCTGAAG CGATGGCAAA AGGGATAAAA CCTATTATAC ATAACTACGC TGGGGCAAAG
ACGCAGTGGC CAGATGATCT TGTATTTAAC TTTATCGACG AAGTAATTCG AATTGTAACC
AGCAGAGATT ACAACTCAGA GAAATATAGA TCGTTTGTCG AAAAAAACTG TTCTCTTGAG
AAGCAAATTA CATCAATATT GAGCATTGTC CAACTTCAAG ATAACAACAG AACAAAAAAT
AAATCAATCC ATAAGAAATC AACCACAGAC ACAGAGAATA GTTTCGCGAA AATATGGAAA
GAGTATAGTA AAATTGATTC TTTCACGATC ATGAACGATC TTCCAGGAAA AAGTCTTAGA
TCAGAATTCG TAAGTTTATT AGAGCGCTTT TTTATTCTCA ATAAAGCACG GATTTTAGAA
GTCGGAACTG GAACAGGGGC GTTCTCAATT GAACTTGCAC TCAGAGAAGC AGATGTCACT
GGTATAGACA TCGATCCTAC TTCCATCGAA CTAGCAATCA GGATAAGCAA GGATTATAAT
GTTGAAAACG TTGAATTCAA AGTAGGTGAT GGTTTCAAAC TAACAGAATC GTTCAAACCA
CAGGAGTTTG ATATTGCTTT CAACATGGGA GTGGTTGAAC ACTTTAAAGA TGACGACATA
ATCAAAATGT TAAAGCAGAT GGGTGAAGTT GCGAAATTCG TTGTAGTAGG CGTTCCGTAC
AGTGGTTCGT TTGTTTACAA AACAGCAAAA GAAACCGCTC AAAAACTTGG TGCCTGGGAA
TATGGTTTTG AAAGAGATTT TTTAACCTTG GAACCTCTTA TCAGACGAGC AGGTTTGATC
CCTCTTCATG AAGAAGTAAT AGGAGTTCTG GCAGAACCGT TTTACCTGAG AAGGATAAAT
CCAGAGTGGG TACCTTTAAA AATAGCTGAG AATTTGCAAA AATATTTTCA AGGTGAAAAA
GTTGGTTCAT GGCTGATTTG TTTTGCAACA AAATGGCCAG GTTACGCAGA TGAATTTCTG
AAATTAGATG ATCACAAGAA AATAAAGTTT GAAAGCACAC AAATAAGCTT ATTAACTGTC
CCCAAGCCGC TGGTTTCTAT TGTTATACCT GTTTTGAACG GAGCAAATTA TGTAAAAAGA
CTCGTGGATA ATATTAAACG GATCGATTAT GAGAATTTCG AAGTTGTATT AGTTGACGAT
GGTTCAACTG ATGGAACAGC TGATTTATTT GAAAGGCTCA TAAAAGGAGA ACCTAAGTTA
CGAGAGAAGA TTTTGATAAT TAGAAATAAA GAAAACGTTG GAACCTTTCA CTCAAGATTG
ATAGGCGTTA AACACAGCCA AGGATCATTT GTTTTCTTTC ATGATATTGA TGATCTTGTT
TACTCCAAAG GAGTCAAAAA ACTTCTGGAT GATTTGTTGA ATTTTCCGAA CAAAAAAACA
TTACTCACTG TAACAAATGC TTTGATGTCA GGAGAACAGT TCAATGGAGA AATTTGGTGT
AGTAATTTTT ACAAAAACAA AGAAGAATTG TTTGTTTCAG AAATCACTTC TCTTTCAGGG
AAATTTTCGA TAATAGACAC TCTTATAGAA CGTATCCCTC TTCAAAAAGC TTATGAAGAA
CTTGCCGAAG TTCTTCATAA AGTAGGAATA ATAAAAATGA CGATTGCCGA GGATACAATT
CTTGCCGATT ATCTTTTATT GGAGCATTTT GTGGAAAAAA TGATTCCTAC TTTCTACACT
TTCCTGGGCT ATGAGGTCGG TAACTTACAA TCGAGCTCAA AAAACTTTTG GAAAGGATCA
AACAGATTCC CATACAAATC TCTTTCCTCA TGGTAA
 
Protein sequence
MDIFEEIRKK IEQKEFSKAK EMAEKIEDEV EKYNSLGIIH YYEGKVSEAL EFFKKALDIN 
PVHDDALFNY SKVLFEKGEY FESWRYLTRI NNKTWEVYDM LGDTQLKQNN PAMALYYYKK
AAELSNIPEM KEKYQILKDQ FKKDVKLAIF CLPGLDNFIK DIAQILSNIY DVKLVVTTDV
RQIQEAYNWA DIVWLEWANE MAVEVTNKLP KDGKKILCRL HSYEALANYP EKINWKNVDK
LILVAEHIER ILRDYHSEVY KQVKDKIVIV PNGLDLNRLK FKVRQPGFNI AVVAHINHKK
DPAMWLQIIG MLRKIDERYT LHIAGDFQEI RYANYFKHFI KDAGLEKNVK LYGWVEDVNA
FLEDKNYLLS TSIHESFGYN IAEAMAKGIK PIIHNYAGAK TQWPDDLVFN FIDEVIRIVT
SRDYNSEKYR SFVEKNCSLE KQITSILSIV QLQDNNRTKN KSIHKKSTTD TENSFAKIWK
EYSKIDSFTI MNDLPGKSLR SEFVSLLERF FILNKARILE VGTGTGAFSI ELALREADVT
GIDIDPTSIE LAIRISKDYN VENVEFKVGD GFKLTESFKP QEFDIAFNMG VVEHFKDDDI
IKMLKQMGEV AKFVVVGVPY SGSFVYKTAK ETAQKLGAWE YGFERDFLTL EPLIRRAGLI
PLHEEVIGVL AEPFYLRRIN PEWVPLKIAE NLQKYFQGEK VGSWLICFAT KWPGYADEFL
KLDDHKKIKF ESTQISLLTV PKPLVSIVIP VLNGANYVKR LVDNIKRIDY ENFEVVLVDD
GSTDGTADLF ERLIKGEPKL REKILIIRNK ENVGTFHSRL IGVKHSQGSF VFFHDIDDLV
YSKGVKKLLD DLLNFPNKKT LLTVTNALMS GEQFNGEIWC SNFYKNKEEL FVSEITSLSG
KFSIIDTLIE RIPLQKAYEE LAEVLHKVGI IKMTIAEDTI LADYLLLEHF VEKMIPTFYT
FLGYEVGNLQ SSSKNFWKGS NRFPYKSLSS W