Gene Tpet_0637 details

Gene Information

Locus tag: Tpet_0637
Symbol:
ID: 5171170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 641028
End bp: 642443
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 43%
IMG OID: 640563144
Product: glycoside hydrolase family protein
Protein accession: YP_001244233
Protein GI: 148269773
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
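
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record states neither convention explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 641028
END_BP = 642443
GENE_LENGTH_BP = 1416
PROTEIN_LENGTH_AA = 471


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # compare with Gene Length
print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.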


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000622549
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTTC TTTTTCTGAT GATTACGCTA ACAGCATTGA CAGGTTATAT TCTCGCCGAC 
GAACAACCCA CCTTTCGATG GGCAGTAGTA CATGATCCAT CAATTATTAA GGTAGGAAAC
ATGTATTACG TTTTTGGAAC ACATCTTCAA GTCGCAAAAT CGAAAGATCT AATGCATTGG
GAACAAATAA ATACGAGTGC TCATGACAAG AACCCCATCA TTCCTAATAT AAATGAAGAG
CTAAAGGAAA CCCTGAGTTG GGCAAGGACT CGAAACGACA TCTGGGCGCC TCAGGTTATC
CAACTTTCCG ATGGAAGATA CTACATGTAT TACTGCGCTT CCACCTTTGG TTCACCAAGA
TCTGCCATAG GAATCGCAGT CTCCGATGAT ATAGAAGGTC CGTATAAACA TTACGCAGTT
ATTGTGAAAT CCGGTCAGGT GTATTCTGTG GACGGTCCGA GTGAAGATGG GACACCATAC
GACTCCAGAA AACATCCCAA TGCACTCGAT CCTGGCGTTT TTTATGATAA AGAAGGGAAT
TTGTGGATGG TTTACGGGTC CTGGTTTGGA GGAATTTATA TTTTAAAGCT CGATCCTAAC
ACAGGCCTTC CCCTTCCTGG ACAGGGTTAT GGTAAAAGGT TAGTGGGTGG AAATCACAGT
TCCATGGAGG GGCCATACAT CCTTTACAGT CCTGATACAG ATTATTACTA TCTCTTTCTG
AGTTTTGGGG GCCTTGATTA CAGAGGAGGA TACAACATCA GAGTTGCAAG ATCCAAGAAC
CCAAACGGAC CTTACTACGA TCCCGAGGGA AAGAGTATGG AAAACTGTAT GGGAAGTAAA
ACAGTGATAT CAAATTATGG GGCAAAGTTA GTTGGTAATT TTATCTTGAG TGAGAGTAAT
ACTATCGATT TCAAAGCTTT TGGATACGTA TCTCCTGGAC ACAACTCTGC CTATTACGAT
CCAGAAACTG GGAAGTACTT CATCTTCTTC CACACGAGGT TCCCCGGTAG AGGAGAGACC
TACCAGCTCA GGGTCCACCA GCTTTTCCTC AACGAAGATG GGTGGTTTGT TATGGCTCCA
TTCCCATATG GTGGCGAAAC AGTCTCAAAA TTGCCCAATG AAGAAATAGT AGGTGAATAT
CAGTTCATTA ATCATGGGAA GGAGATAACC GATAAAATCA AACAGCCTGT GAGAATAAAA
CTAAACAGCG ATGGAAGCAT AACCGGAGCT GTCGAAGGAA GGTGGGAGAG AAAGGAACAC
TACATTACCT TGAAAATCAT CGAAGGAAAT ACAACTGTTA TTTACAAAGG AGTACTCCTG
AAACAGTGGC ATTATTCGGA GAAAAAATGG GTGACGGTGT TTACAGCTCT TTCCAACCAA
GGAGTTTCTG TGTGGGGAAT AAGAGTGGAA GAATGA
 
Protein sequence
MRFLFLMITL TALTGYILAD EQPTFRWAVV HDPSIIKVGN MYYVFGTHLQ VAKSKDLMHW 
EQINTSAHDK NPIIPNINEE LKETLSWART RNDIWAPQVI QLSDGRYYMY YCASTFGSPR
SAIGIAVSDD IEGPYKHYAV IVKSGQVYSV DGPSEDGTPY DSRKHPNALD PGVFYDKEGN
LWMVYGSWFG GIYILKLDPN TGLPLPGQGY GKRLVGGNHS SMEGPYILYS PDTDYYYLFL
SFGGLDYRGG YNIRVARSKN PNGPYYDPEG KSMENCMGSK TVISNYGAKL VGNFILSESN
TIDFKAFGYV SPGHNSAYYD PETGKYFIFF HTRFPGRGET YQLRVHQLFL NEDGWFVMAP
FPYGGETVSK LPNEEIVGEY QFINHGKEIT DKIKQPVRIK LNSDGSITGA VEGRWERKEH
YITLKIIEGN TTVIYKGVLL KQWHYSEKKW VTVFTALSNQ GVSVWGIRVE E
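
The sequences above are laid out in blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that step plus a GC-fraction helper; `join_blocks` and `gc_fraction` are hypothetical names, not part of any tool associated with this record, and only the first line of the gene sequence is used as demonstration input.

```python
def join_blocks(text):
    """Remove all whitespace so 10-character blocks become one contiguous string."""
    return "".join(text.split())


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record, as a small demonstration.
first_line = "ATGAGATTTC TTTTTCTGAT GATTACGCTA ACAGCATTGA CAGGTTATAT TCTCGCCGAC"
dna = join_blocks(first_line)
print(len(dna))                    # number of bases after joining
print(round(gc_fraction(dna), 2))  # GC fraction of this fragment only
```

Applying the same two functions to the full gene sequence would give values to compare with the Gene Length and GC content fields; the fragment alone is not expected to match the whole-gene GC content.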