Gene Tpet_1559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1559 
Symbol 
ID5170333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1554653 
End bp1557907 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content49% 
IMG OID640564085 
Productglycoside hydrolase family protein 
Protein accessionYP_001245142 
Protein GI148270682 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000116157 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTACG AATGGGAAAA CCCACAGCTT GTGGGTGAAG GAACGGAGAA GCCACACGCT 
TCTTTCATAC CCTATCTGGA CCCGTTCAGC GGGGAATGGG AGTACCCTGA GGAATTCATC
TCTTTGAACG GGAACTGGGG GTTTCTCTTC GCGAAAAATC CCTTCGAGGT GCCGGAGGAT
TTCTTTTCAG AGAACTTCGA CGACTCGAAC TGGGATGAGA TAGAAGTTCC AAGCAACTGG
GAGATGAAAG GATATGGGAA GCCCATCTAC ACGAACGTGG TTTATCCATT TGAACCGAAC
CCTCCTTTTG TTCCAAAAGA CGACAATCCG ACCGGGGTGT ACAGAAGGTG GATCGAGATA
CCTGAGGATT GGTTCAAAAG GGAGATCTTT CTGCATTTTG AAGGTGTTCG ATCCTTCTTC
TATCTGTGGG TGAACGGGAA GAAGATCGGT TTCAGCAAAG ACAGCTGCAC ACCCGCTGAA
TTCAGACTCA CCGATGTTCT AAGGCCAGGG AAGAATCTGA TCACCGTTGA GGTTCTGAAG
TGGAGCGATG GAAGCTATCT CGAAGATCAG GACATGTGGT GGTTTGCGGG GATATACAGG
GACGTTTATC TGTACTCGCT GCCGAAATTT CACATCAGGG ACGTGTTCGT GAGAACGGAT
CTGGATGAAA ATTACAGAGA CGGAAAGATC TTTCTGGACG TAGAGATGAG AAATCTCGGT
GAGGAAGAAG AAAAAGACCT TGAAGTAACA CTCATCACAC CGGATGGAGA CGAAAAAACA
CTCGTGAAAG AGACAGTAAA GCCGGAGGAC AGAGTCCTTT CCTTTGCCTT TGACGTGAAA
GATCCGAGGA AGTGGTCCGC GGAGACACCA CATCTGTACG TTTTGAAACT GAAACTGGGA
GAAGACGAAA AGAAAGTCAA CTTCGGATTC AGGAAGATAG AGATAAAAGA CGGAATGCTT
CTTTTCAACG GGAAACCTCT CTACATAAAG GGAGTGAACA GACACGAGTT CGATCCTGAC
AGGGGTCATG CGGTGACGGT GGAGAGGATG ATTCAGGACA TAAAACTCAT GAAGCAGCAC
AACATAAACA CAGTTCGCAC ATCGCACTAT CCGAACCAGA CGAAGTGGTA CGATCTGTGT
GACTATTTTG GACTCTACGT GATAGACGAG GCAAACATCG AATCCCACGG TATAGGCTGG
GATCCTGAAG TGACACTTGC GAACAGACCG GAATGGGAGA AGGCACATCT CGACAGAATC
CAGAGGATGG TCGAGCGTGA CAAGAATCAT CCGTGTGTTA TCTTCTGGTC TCTTGGAAAC
GAAGCGGGAG ACGGAGTGAA TTTCGAAAAA GCCGCTCTCT GGATAAAGGA AAGAGACAAC
ACGCGGCTTA TCCATTACGA GGGAACAACA AGGAGGGGAG AATCGTACTA CGTGGATGTT
TTCTCTCTCA TGTACCCGAA GATAGACGTT CTTCTTGAGT ACGCCTCCAG AAAGAGGGAA
AAGCCTTTCA TCATGTGTGA GTATGCCCAC GCGATGGGAA ACAGTGTGGG AAATCTGAAG
GACTACTGGG ATGTGATAGA AAGGTATCCG TATCTTCACG GAGGGTGCAT CTGGGACTGG
GTGGACCAGG GAATCAGGAA GAAGGATGAA AACGGAAAGG AATTCTGGGC GTACGGTGGA
GATTTCGGCG ATGAACCAAA CAACAAGAAT TTCTGCTGCA ACGGAGTGGT CCTTCCCGAC
AGAACACCCG AGCCAGAGCT TTACGAGGTG AAGAAAATCT ATCAGAACAT CAAAGTGCGT
CAGATCTCAA TAGACACCTA CGAAGTGGAG AACGGGTATC TTTTCACAGA CCTTGAGATG
TTCGATGGAA CTTGGAGGAT CAGAAAGGAC GGTGAAGTGA TAGAAGAAGG AAGGTTCAAA
CTCTCAGCCG AGCCAGGAGA AAGGAAAATT TTTAAGATAC CACTTCCAGT GATGGAAGAC
TCGGAATACT TCCTTGAGAT CTGTTTTGCT CTCTCTGAAG ATACCCTCTG GGCGAAGAAG
GGACACGTTG TAGCGTGGGA ACAGTTCCTC CTGAAACCTC CCATCTTTCA AAAAAGCATT
GTTCAGGAAA AAGTAGATTT CTCAGAAGAT GGAAGATACC TTCTGGTGAG GACAAAAGAC
GCGGAGTTTA TCTTCTCGAA ACTCACCGGC CTTCTGGAGC ACATCGTGTA CAGGGGAAGA
AATATCCTGA CAGGATCGAT CGTTCCAAAT TTCTGGAGAG TTCCAACGGA CAACGATATC
GGAAACAGAA TGCCGGAAAG ACTCTCCATA TGGAAAAAGG CATCGAGCGA AAGAAAGCTC
TTCAAGATGT TCTGGAAGAG AAGGGAAAAC AGCGTTTCCG TTCAGAGTGT CTATCAGGTA
CCCGGAAACA GCTGGGTGTA CCTCACCTAC ACCATCTTTG GAAACGGTGA CATCCTCGTG
GATCTTTCCC TGATTCCCGC AGAAGGTGTA CCGGAGATTC CAAGGATCGG TCTTCAGTTC
ACGGTCCCTG AAGAGTTCGG CACCGTGGAG TGGTACGGAA GGGGACCGCA CGAGACTTAC
TGGGACAGAA AAGAAAGTGG CCTTTTCGCA AGGCACAGAA AAGCTGTCGA TGAGATGATA
CACAGGTACG TCAGGCCCCA GGAAACGGGG AACAGATCGG ACGTGAGATG GTTTGCGCTT
TCCGACGGTG AAACAAAACT CTTTGTGTCA GGCATGCCGC AGATAGACTT CAGCGTCTGG
CCCTTTTCCA TGGAGGATCT CGAGAGGGCT CAGCACATAA GTGAACTCCC GGAGAGGGAC
TTCGTCACCG TGAACGTGGA CTTCAGACAG ATGGGCCTTG GAGGAGACGA CAGCTGGGGT
GCGATGCCTC ATCTGGAGTA CAGGCTTCTA CCAAAGCCGT ATCGTTTTTC TTTCAGAATG
AGGATTAGCA AAGAGATTCC ATCCTGGAGG GTTCTTGCGG CGATCCCTGA AACGCTCCAT
GTTGAGATGT CCTCAGAAGA CGTGATACGC GAAGGAGACA CCCTGAGAGT GAAATTTTCC
CTTCTGAACG ACACTCCACT GAGCAAGGAA GAACAGGTGG TTCTCTTTGT TGATGGAAAC
GAATACTCGG TGAGGCGAGT GGTGATTCCA CCCTTCAAGA AGGAAGAGCT GGTGTTCAAA
GTAGAAGGAT TGAAGAAGGG AGAACATCTG ATACATACTA ATCTGAACAC GAGAAAAACT
ATCTACGTGA GGTGA
 
Protein sequence
MSYEWENPQL VGEGTEKPHA SFIPYLDPFS GEWEYPEEFI SLNGNWGFLF AKNPFEVPED 
FFSENFDDSN WDEIEVPSNW EMKGYGKPIY TNVVYPFEPN PPFVPKDDNP TGVYRRWIEI
PEDWFKREIF LHFEGVRSFF YLWVNGKKIG FSKDSCTPAE FRLTDVLRPG KNLITVEVLK
WSDGSYLEDQ DMWWFAGIYR DVYLYSLPKF HIRDVFVRTD LDENYRDGKI FLDVEMRNLG
EEEEKDLEVT LITPDGDEKT LVKETVKPED RVLSFAFDVK DPRKWSAETP HLYVLKLKLG
EDEKKVNFGF RKIEIKDGML LFNGKPLYIK GVNRHEFDPD RGHAVTVERM IQDIKLMKQH
NINTVRTSHY PNQTKWYDLC DYFGLYVIDE ANIESHGIGW DPEVTLANRP EWEKAHLDRI
QRMVERDKNH PCVIFWSLGN EAGDGVNFEK AALWIKERDN TRLIHYEGTT RRGESYYVDV
FSLMYPKIDV LLEYASRKRE KPFIMCEYAH AMGNSVGNLK DYWDVIERYP YLHGGCIWDW
VDQGIRKKDE NGKEFWAYGG DFGDEPNNKN FCCNGVVLPD RTPEPELYEV KKIYQNIKVR
QISIDTYEVE NGYLFTDLEM FDGTWRIRKD GEVIEEGRFK LSAEPGERKI FKIPLPVMED
SEYFLEICFA LSEDTLWAKK GHVVAWEQFL LKPPIFQKSI VQEKVDFSED GRYLLVRTKD
AEFIFSKLTG LLEHIVYRGR NILTGSIVPN FWRVPTDNDI GNRMPERLSI WKKASSERKL
FKMFWKRREN SVSVQSVYQV PGNSWVYLTY TIFGNGDILV DLSLIPAEGV PEIPRIGLQF
TVPEEFGTVE WYGRGPHETY WDRKESGLFA RHRKAVDEMI HRYVRPQETG NRSDVRWFAL
SDGETKLFVS GMPQIDFSVW PFSMEDLERA QHISELPERD FVTVNVDFRQ MGLGGDDSWG
AMPHLEYRLL PKPYRFSFRM RISKEIPSWR VLAAIPETLH VEMSSEDVIR EGDTLRVKFS
LLNDTPLSKE EQVVLFVDGN EYSVRRVVIP PFKKEELVFK VEGLKKGEHL IHTNLNTRKT
IYVR