Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_0483 |
Symbol | |
ID | 5171691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 476045 |
End bp | 477385 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640562992 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001244083 |
Protein GI | 148269623 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAC TGGCAAAAAA GATTGAAGAA GAGATTTTGA ATCACGTGAG AGAGCCCGAA ATACCCAATC GAGAGGTTAA CCTCCTCGAT TTTGGAGCGA GAGGGGATGA AAGAACCGAC TGTTCTGAGA GCTTCAAAAG GGCCATAGAA GAACTTTCAA AACAGGGCGG AGGAAGACTG ATTGTTCCCG AAGGTGTGTT TCTAACGGGA CCAATTCATT TGAAGAGCAA CATCGAACTC CACGTGAAGG GAACCATAAA ATTCATTCCT GATCCTGAGA GATACCTTCC CGTCGTTCTC ACCAGGTTCG AGGGAATCGA ACTGTACAAT TATTCTCCCC TGGTTTACGC CTTGGATTGT AAAAACGTGG CTATCACCGG AAGTGGGGTT TTAGACGGTT CAGCAGACAA CGAACACTGG TGGTCCTGGA AGGGAAAGAA AGATTTCGGA TGGAAGGAAG GACTTCCCAA CCAGCAGGAG GATGTAAAAA AACTGAAAGA GATGGCAGAG AGAGGAACAC CAGTTGAAGA GAGAGTGTTC GGAAAGGGAC ATTATCTGAG ACCGAGTTTT GTTCAGTTTT ACAGATGCAG GAATGTTTTG GTAGAAGATG TGAAGATCAT CAACTCTCCT ATGTGGTGTG TACATCCTGT TCTTTCTGAA AATGTGATCA TAAGAAACAT CGAAATTTCG AGCACGGGCC CAAACAATGA TGGTATCGAT CCTGAATCCT GCAAGTATAT GCTCATTGAG AAATGCAGAT TCGACACAGG TGATGATTCT GTGGTCATCA AATCGGGGAG AGACGCGGAC GGAAGGCGAA TCGGAGTGCC TTCTGAATAC ATTCTTGTGA GGGACAACCT GGTGATCAGT CAGGCGAGTC ATGGTGGACT TGTGATTGGG AGTGAAATGT CCGGTGGTGT GAGAAACGTC GTTGCAAGGA ACAACGTCTA CATGAATGTG GAAAGGGCTC TCAGGTTGAA AACGAATTCC AGGCGTGGAG GATACATGGA GAACATCTTC TTTATAGACA ACGTGGCTGT GAACGTTTCG GAAGAGGTGA TCAGAATAAA TCTCAGATAC GATAACGAAG AGGGGGAATA TCTCCCTGTA GTCAGAAGCG TTTTTGTTAA GAACCTGAAG GCGACAGGTG GAAAATACGC TCTACGGATT GAGGGTCTGG AGAATGATTA TGTAAAAGAT ATCCTGATAT CTGATACTAT AATGGAAGGA GCGAAGATCT CTGTTCTTCT TGAGTTCGGT CAGTTGGGGA TGGAGAATGT TATCATGAAT GGATCAAGAT TCGAAAAGCT TTACATCGAA GGTAAAGCTC TGCTGAAATA A
|
Protein sequence | MEELAKKIEE EILNHVREPE IPNREVNLLD FGARGDERTD CSESFKRAIE ELSKQGGGRL IVPEGVFLTG PIHLKSNIEL HVKGTIKFIP DPERYLPVVL TRFEGIELYN YSPLVYALDC KNVAITGSGV LDGSADNEHW WSWKGKKDFG WKEGLPNQQE DVKKLKEMAE RGTPVEERVF GKGHYLRPSF VQFYRCRNVL VEDVKIINSP MWCVHPVLSE NVIIRNIEIS STGPNNDGID PESCKYMLIE KCRFDTGDDS VVIKSGRDAD GRRIGVPSEY ILVRDNLVIS QASHGGLVIG SEMSGGVRNV VARNNVYMNV ERALRLKTNS RRGGYMENIF FIDNVAVNVS EEVIRINLRY DNEEGEYLPV VRSVFVKNLK ATGGKYALRI EGLENDYVKD ILISDTIMEG AKISVLLEFG QLGMENVIMN GSRFEKLYIE GKALLK
|
| |