Gene Tpet_1689 details


Gene Information

Locus tag: Tpet_1689
Symbol:
ID: 5170787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 1690094
End bp: 1691785
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 48%
IMG OID: 640564215
Product: Beta-glucuronidase
Protein accession: YP_001245270
Protein GI: 148270810
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
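
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch that assumes the start and end positions are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1690094
end_bp = 1691785
gene_length_bp = 1692
protein_length_aa = 563

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codon_count, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein_length = codon_count - 1

print(span_bp, remainder, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, divides evenly into codons, and the codon count minus the stop codon equals the listed protein length.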


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0566843
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAAAC CAAAGAACAC TCTTAAGAGA ATCGTACAAA ACCTGGATGG CTTCTGGGAT 
TGTGAGATAA AAGAAGAGAA CAGACCCATA GCCGTTCCTG GAAGCTGGAA CGAGCAGTAC
CAGGATCTGT GCTACGAAGA AGGACCCTTC ACCTACAAAA CCACCTTCTA CGTTCCAGAG
GAACTTTCAC AAAAACACAT CAGACTTTAC TTTGCTGCCG TGAACACAGA CTGCGATGTC
TTCCTAAACG GAGAGAAAGT GGGAGAGAAT CACATTGGAT ACCTTCCCTT CGAAGTAGAT
GTGACAGGGA AAGTGAAACC AGGAGAGAAC GAATTCAAGG TGGTTGTTGA GAACAGGTTG
AAGGTGGGAG GATTTCCCTC GAAGGTTCCA GACAGAGGCA CTCACACTGT GGGATTCTTC
GGAAGTTTTC CACCTGCAAA CTTCGACTTC TTTCCCTACG GAGGAATCAT AAGACCTGTT
CTGATAGAAT TCACAGACCA CGCGAGGATA CTCGACATCT GGGTGGACAC GAGTGAATCC
GAACCGGAGA AGAAACTTGG AAGAGTGAAA GTGAAGGTAG AAGTCTCAGA GGAAGCGCTG
GGACAGGAGA TGACGATCAA ACTTGGAGAA GCTGAGAAAA AGATCAAAAC ATTCGACAGG
TTCGTTGAAG AAGAGTTCAT CCTCGAAAAC GCCAGATTTT GGAGCCTCGA AGATCCGTAT
CTTTATCCTC TAAAAGTGGA ACTCGAAAGG GACGAGTACA CTCTGGACAT CGGAATCAGA
ACGATCAGCT GGGACGAGAA AAAGCTCTAC CTGAACGGAA AACCCGTCTT TTTGAAAGGC
TTTGGAAAAC ACGAAGAATT TCCCGTTCTG GGACAGGGCA CTTTCTATCC TCTGATGATA
AAAGACTTCA ACCTTCTGAA GTGGATCAAC GCGAATTCCT TCAGGACCTC TCACTACCCT
TACAGTGAAG AGTGGCTGGA TCTTGCCGAC AGGCTGGGAA TCCTTGTGAT AGACGAAGCC
CCGCACGTTG GTATCACAAG GTACCACTAC AACCCCGAGA CTCAGAAGAT AGCCGAAGAC
AACATAAGAA GGATGATCGA CAGAGACAAA AACCATCCCA GTGTGATCAT GTGGAGTGTA
GCGAACGAAC CAGAATCCAA CCATCCAGAC GCGGAGGGTT TCTTCAAAGC CCTTTACGAG
ACCGCCAAAG AAATGGATCG AACACGTCCT GTGGTCATGG TGAGCATGAT GGACACGCCA
GACGAAAGAA CAAGAGATGT GGCACTGAAG TACTTCGACA TCGTCTGTGT GAACAGGTAC
TACGGCTGGT ACATCTATCA GGGAAGGATA GAAGAAGGAC TTCAGGCTCT GGAAAAAGAC
ATAGAAGAAC TCTACGCAAG ACACAGAAAG CCCATCTTTG TCACGGAGTT CGGTGCAGAC
GCGATAGCCG GTATCCACTA CGACCCACCT CAGATGTTCT CCGAGGAGTA CCAGGCAGAG
CTCGTTGAAA AGACGATCAG GCTTCTTTTG AAAAAAGACT TCGTCATTGG AACACACGTG
TGGGCCTTCG CGGACTTCAA AACCCCTCAA AATGTGAGAA GGCCCATCCT CAACTACAAG
GGTGTCTTCA CAAGAGACAG ACAACCCAAA CTCGTTGCTC ATGTGCTGAG AAAACTGTGG
AGTGAGGTTT GA
 
Protein sequence
MLKPKNTLKR IVQNLDGFWD CEIKEENRPI AVPGSWNEQY QDLCYEEGPF TYKTTFYVPE 
ELSQKHIRLY FAAVNTDCDV FLNGEKVGEN HIGYLPFEVD VTGKVKPGEN EFKVVVENRL
KVGGFPSKVP DRGTHTVGFF GSFPPANFDF FPYGGIIRPV LIEFTDHARI LDIWVDTSES
EPEKKLGRVK VKVEVSEEAL GQEMTIKLGE AEKKIKTFDR FVEEEFILEN ARFWSLEDPY
LYPLKVELER DEYTLDIGIR TISWDEKKLY LNGKPVFLKG FGKHEEFPVL GQGTFYPLMI
KDFNLLKWIN ANSFRTSHYP YSEEWLDLAD RLGILVIDEA PHVGITRYHY NPETQKIAED
NIRRMIDRDK NHPSVIMWSV ANEPESNHPD AEGFFKALYE TAKEMDRTRP VVMVSMMDTP
DERTRDVALK YFDIVCVNRY YGWYIYQGRI EEGLQALEKD IEELYARHRK PIFVTEFGAD
AIAGIHYDPP QMFSEEYQAE LVEKTIRLLL KKDFVIGTHV WAFADFKTPQ NVRRPILNYK
GVFTRDRQPK LVAHVLRKLW SEV
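
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute the GC fraction, and translate the DNA so the result can be compared with the listed protein. It is an illustration, not the method used by the source database. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for codon-to-amino-acid mapping; table 11's alternative start codons are not handled here, which does not matter for this gene because it starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and of the protein sequence from this record.
gene_head = clean("ATGTTAAAAC CAAAGAACAC TCTTAAGAGA ATCGTACAAA ACCTGGATGG CTTCTGGGAT")
protein_head = clean("MLKPKNTLKR IVQNLDGFWD")

print(translate(gene_head) == protein_head)
```

To check the whole record, paste the full gene sequence into `clean` and compare `translate` with the cleaned protein sequence, and compare `gc_fraction` with the listed GC content.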