Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1198 |
Symbol | |
ID | 5170014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 1218277 |
End bp | 1221192 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640563717 |
Product | TPR repeat-containing protein |
Protein accession | YP_001244788 |
Protein GI | 148270328 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000411911 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAT TTCTGAGGGA AACGTACTGG GGGGATGAAT TTCTCATAGA AGATGACGGT GAAGATCAAG TTTTGAAAAT CGTTAGAGTT CCCGTGGATA GATCTTTCTT CTTCAACGAA TACGCGAAAT TGAAACGTTT CAAGTTACCA AATGTTCTAC TGCCTGAAAA GTTGAAAATC TCTGAAGGAA AGTTCCTGCT GTTCTATCCC TACTACCACA ACCTTGCACC TCTCGAAAAT TTGAGTGAAC AGGTCGCAAA GCAGCTTTTC AGGTTGTTTC GCTTCCTCTC GAGAGCGGGC GTTACTATTC CCGCGCTCGG GATGGACGAC GTATTGGTGA ACGACGGTGT TTTTTTGATT CCCGCGCTCA TCTCGGACCT TTCTGAGGAT GTGAAAGGCG TTGTTTTTTC GAAAAAGAAA ACCCCCCAGG CAACTGAAGA AGTGTGCAGA CGTTTTTTGA AAATTCACGG AATAGAACCT CATTTGGAGG ACAGAGAGTT CTCTTTTGAG TCCAAAGAAA TCAGAATTCC TTACATTCAC AGGAAGGAAG AAATATTGAT AAAAAGAGAC ATAGAAGCCG CTCAAAAATT TCCTCTCTTC ATTTTGATCA CAGGTGAACA GAGAGTGGGA AAGACGAAAC TCGCCTCTGT TCTCGTTGAT GAGTTAAGAG AGGAAGGATA CCTGATTCAT CAGATTTCTT CTCTCGAAGA CCTCCGCGTA TGGTACGACG CTTCAGATGA ACTCGATCTT CTCACAAAGT TCGATGACGG TCAAAAAAAA GTGATTTTAA TCGACGACCT GCTCGAAGGT TCGGACTTGC TATCGTTTCT TGAGGAGTTT TTTGGACTCA CTACCAGTAC CAGAATAATC GTTTTGACGA CTTCCACAAA GGTTTTTCGG TTCTTTCACA GGGTGTACAG ACTCCCTCCT TTCACACTCG AAGAGACTCA GATCTTTCTC GAGAGAGCGT TTGGAAAGAC CAGTGAAGAT CAGGTAAAAC TAATGCACAG TCTCAGTAAA GGACTACCAG GATACATGGT GGAACTATTG AAATTCTTCA ACAAATCCAA TCTCAGAGAC AACATTGTTG GAATCTTCAA ACCTCTTCTT CGAGAGCTGG ATTCCGTAGA AATCAGAGAA CTGAGCGTTC TGGGACAGAA GTTCACTGGA AGTGAATTGA ATGTTCTTGA AAAGATAACA GGAAAAGACT ACCATGATAC TTTGATGTCT GCGTACGATT CTGGTGTGAT AACGACAGAA GAAGGGCTCT ACAGATTCAC ACTTAGAGAA TTCTGGAAAT ACTACTACAA TAAGCTTTCA GAGATCAAAA AGAAAGAACT CCACGAAAAA CTGTTCGAAA ATCTCCCGGA CGACCTCGCA ATAAAACACG CCCTATCTCT GGAAGATCCG AAATTAAGAT TCTTCTACGT GTTGAGATAC GTGAGAAAAC ACTTCTGGGA TTACGAAAAA ACCAGGTCAC TCATTGAATA TCTGAAAAGA CTGGAAGAAT CCTTCAAAAA GTCATATTAT TCCATAGAAA GCCTGAAGAT GAAGCTCGTT TTCAGGATAG ATCCTCTGGG TACAGAAAAG GAGAACTTCC ACTTTCGAAG GTTCAAAACT CTCGGAGAAG CAGATATCAA CTCAATCTTG CAAAAAGGCT CGCTGTCTTA TTATGATCTC TACAATCTCG TCGTTCTTTC CCGTTCCTGC AGAAGAGCAG GAAAAAAAGT CCCCCAGGAA ATTCTCAACA TTATCAAAAG AGAGCTCAAA GAGAAAGATT TCTCCACAAA AGAAAGGCTT TATCTGAAAG CTCATCTTCT CTACGAGCTC TTCTCCTTGA CTGAAGATCG CGAAACATTG AAAGATATGA TGAACATCGC TTCCTCGGAA GGTTTTCTCG ATCTTCAAGT GATGGGTTAC AGAGCTTTGG GACTTCTTTC AAGAACGAGA GCGATGAGTA ACTACTATTT CAATCACTCA TTAGAACTTT CAAGAAAAAT AGATCCATCG CTTTCTATTG TGGACGAAAG CAACTTGACA TGGAGCTTGC TCTATGAGGG AAAAATAACA AACTTCTTTG TTCAGCTTGA GAGGCTTAGA AAACAGGCAA GACTTTTTGA AAACGCACCT ATTCTTTCTT ACACCTATTT CCTCGAAGGG CTCTACCACA TTCACAGGAA AGACTTTCAG AAAGCAGAAG AAGTTTTCAG AACAGAACTC GAACTGGAAG AAAAACATGG AATAGAGAGA AGAGCACTCA GAGGGCTGGT CATAAATTAC CTGTTTTCTG GAGACATTGA ATCCGCCAAA AGGCTCCTGG AAAAAGATGA GCCCGAGTTC GATAGATTCG GATTCGACTT CTTCAAAAGG ATGGTTCTTG CAAAGGATGA TTCAGAATTC TTGAAAATCT GGAAAGAAAG GCTTGAGACT CCCCAGAAGT TTTTCAACGA AGAAATCGCT TACGTTTTTG CGGAAAAGTT AGCAAAACTG GATCCAGAAG GCTTCGAAGA ATTTCTCCTC GAACTTGAAA GGGAAAATGT CGAAAACTCC TCAAATCTCA CACTTGCTCT GGTTTATGAA ACATTTTACA AATTCTACAG TACGCTCAGA GAAACCTTCA AAGCCAAGCG TTATCTCAGA AAGGCGATAT TCGTCTACAA CCTGATAAGT TTGAGAGAAG TATCAGTCAA ACTGGAGGCA GAAGAAAAAA CTCAAATTGA AGGGAAAAAA CCGTCGTTTT ATCTATTGAT AGGCTTCATA GAAACAGAAA AGGAATTTTC TGATATGATG GAGTTTGCTT CGGCAAGGCT TTCAGAGGTG ATTCCGTACG AGGTTTTTTC AATTCGAATA ATCGAAAGAA CGAGGAGAGA CGAGAGGGGC GAAATTCAAA ATTTTTGGTT TTTTCAATTC GAATAA
|
Protein sequence | MKKFLRETYW GDEFLIEDDG EDQVLKIVRV PVDRSFFFNE YAKLKRFKLP NVLLPEKLKI SEGKFLLFYP YYHNLAPLEN LSEQVAKQLF RLFRFLSRAG VTIPALGMDD VLVNDGVFLI PALISDLSED VKGVVFSKKK TPQATEEVCR RFLKIHGIEP HLEDREFSFE SKEIRIPYIH RKEEILIKRD IEAAQKFPLF ILITGEQRVG KTKLASVLVD ELREEGYLIH QISSLEDLRV WYDASDELDL LTKFDDGQKK VILIDDLLEG SDLLSFLEEF FGLTTSTRII VLTTSTKVFR FFHRVYRLPP FTLEETQIFL ERAFGKTSED QVKLMHSLSK GLPGYMVELL KFFNKSNLRD NIVGIFKPLL RELDSVEIRE LSVLGQKFTG SELNVLEKIT GKDYHDTLMS AYDSGVITTE EGLYRFTLRE FWKYYYNKLS EIKKKELHEK LFENLPDDLA IKHALSLEDP KLRFFYVLRY VRKHFWDYEK TRSLIEYLKR LEESFKKSYY SIESLKMKLV FRIDPLGTEK ENFHFRRFKT LGEADINSIL QKGSLSYYDL YNLVVLSRSC RRAGKKVPQE ILNIIKRELK EKDFSTKERL YLKAHLLYEL FSLTEDRETL KDMMNIASSE GFLDLQVMGY RALGLLSRTR AMSNYYFNHS LELSRKIDPS LSIVDESNLT WSLLYEGKIT NFFVQLERLR KQARLFENAP ILSYTYFLEG LYHIHRKDFQ KAEEVFRTEL ELEEKHGIER RALRGLVINY LFSGDIESAK RLLEKDEPEF DRFGFDFFKR MVLAKDDSEF LKIWKERLET PQKFFNEEIA YVFAEKLAKL DPEGFEEFLL ELERENVENS SNLTLALVYE TFYKFYSTLR ETFKAKRYLR KAIFVYNLIS LREVSVKLEA EEKTQIEGKK PSFYLLIGFI ETEKEFSDMM EFASARLSEV IPYEVFSIRI IERTRRDERG EIQNFWFFQF E
|
| |