Gene Tpet_0483 details

Gene Information

Locus tag: Tpet_0483
Symbol:
ID: 5171691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 476045
End bp: 477385
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 45%
IMG OID: 640562992
Product: glycoside hydrolase family protein
Protein accession: YP_001244083
Protein GI: 148269623
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID:
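
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 476045
end_bp = 477385
gene_length_bp = 1341
protein_length_aa = 446


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both comparisons hold for this record: the span covers 1341 bp, which is 447 codons, or 446 residues plus the stop codon.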


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGAAC TGGCAAAAAA GATTGAAGAA GAGATTTTGA ATCACGTGAG AGAGCCCGAA 
ATACCCAATC GAGAGGTTAA CCTCCTCGAT TTTGGAGCGA GAGGGGATGA AAGAACCGAC
TGTTCTGAGA GCTTCAAAAG GGCCATAGAA GAACTTTCAA AACAGGGCGG AGGAAGACTG
ATTGTTCCCG AAGGTGTGTT TCTAACGGGA CCAATTCATT TGAAGAGCAA CATCGAACTC
CACGTGAAGG GAACCATAAA ATTCATTCCT GATCCTGAGA GATACCTTCC CGTCGTTCTC
ACCAGGTTCG AGGGAATCGA ACTGTACAAT TATTCTCCCC TGGTTTACGC CTTGGATTGT
AAAAACGTGG CTATCACCGG AAGTGGGGTT TTAGACGGTT CAGCAGACAA CGAACACTGG
TGGTCCTGGA AGGGAAAGAA AGATTTCGGA TGGAAGGAAG GACTTCCCAA CCAGCAGGAG
GATGTAAAAA AACTGAAAGA GATGGCAGAG AGAGGAACAC CAGTTGAAGA GAGAGTGTTC
GGAAAGGGAC ATTATCTGAG ACCGAGTTTT GTTCAGTTTT ACAGATGCAG GAATGTTTTG
GTAGAAGATG TGAAGATCAT CAACTCTCCT ATGTGGTGTG TACATCCTGT TCTTTCTGAA
AATGTGATCA TAAGAAACAT CGAAATTTCG AGCACGGGCC CAAACAATGA TGGTATCGAT
CCTGAATCCT GCAAGTATAT GCTCATTGAG AAATGCAGAT TCGACACAGG TGATGATTCT
GTGGTCATCA AATCGGGGAG AGACGCGGAC GGAAGGCGAA TCGGAGTGCC TTCTGAATAC
ATTCTTGTGA GGGACAACCT GGTGATCAGT CAGGCGAGTC ATGGTGGACT TGTGATTGGG
AGTGAAATGT CCGGTGGTGT GAGAAACGTC GTTGCAAGGA ACAACGTCTA CATGAATGTG
GAAAGGGCTC TCAGGTTGAA AACGAATTCC AGGCGTGGAG GATACATGGA GAACATCTTC
TTTATAGACA ACGTGGCTGT GAACGTTTCG GAAGAGGTGA TCAGAATAAA TCTCAGATAC
GATAACGAAG AGGGGGAATA TCTCCCTGTA GTCAGAAGCG TTTTTGTTAA GAACCTGAAG
GCGACAGGTG GAAAATACGC TCTACGGATT GAGGGTCTGG AGAATGATTA TGTAAAAGAT
ATCCTGATAT CTGATACTAT AATGGAAGGA GCGAAGATCT CTGTTCTTCT TGAGTTCGGT
CAGTTGGGGA TGGAGAATGT TATCATGAAT GGATCAAGAT TCGAAAAGCT TTACATCGAA
GGTAAAGCTC TGCTGAAATA A
 
Protein sequence
MEELAKKIEE EILNHVREPE IPNREVNLLD FGARGDERTD CSESFKRAIE ELSKQGGGRL 
IVPEGVFLTG PIHLKSNIEL HVKGTIKFIP DPERYLPVVL TRFEGIELYN YSPLVYALDC
KNVAITGSGV LDGSADNEHW WSWKGKKDFG WKEGLPNQQE DVKKLKEMAE RGTPVEERVF
GKGHYLRPSF VQFYRCRNVL VEDVKIINSP MWCVHPVLSE NVIIRNIEIS STGPNNDGID
PESCKYMLIE KCRFDTGDDS VVIKSGRDAD GRRIGVPSEY ILVRDNLVIS QASHGGLVIG
SEMSGGVRNV VARNNVYMNV ERALRLKTNS RRGGYMENIF FIDNVAVNVS EEVIRINLRY
DNEEGEYLPV VRSVFVKNLK ATGGKYALRI EGLENDYVKD ILISDTIMEG AKISVLLEFG
QLGMENVIMN GSRFEKLYIE GKALLK
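
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any analysis. The following is a minimal sketch of how one might do that, compute the GC fraction, and translate the gene. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard assignment; translation table 11 uses the same codon-to-amino-acid mapping and differs only in which start codons it permits, so this sketch does not model alternative starts.

```python
# Standard codon assignments, enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First display row of the gene sequence above (60 nt, i.e. 20 codons).
first_row = clean("ATGGAAGAAC TGGCAAAAAA GATTGAAGAA GAGATTTTGA ATCACGTGAG AGAGCCCGAA")
print(translate(first_row))  # MEELAKKIEEEILNHVREPE
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and the complete protein.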