Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tter_2778 |
Symbol | |
ID | 8640807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobaculum terrenum ATCC BAA-798 |
Kingdom | Bacteria |
Replicon accession | NC_013526 |
Strand | + |
Start bp | 969286 |
End bp | 972222 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | PA14 domain protein |
Protein accession | YP_003324486 |
Protein GI | 269839793 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAACGCA CAAGATCCCT GTGGTGGATG CTCATCGCCG TGATGCTGGC GCTCGCCAGC GCTTACATAG CTGGGGGCGG GAGCGGCACG CTGTCCGCCA CGACCTACGT GCTGCTCTTC AGCTCCAGCA GCGATAGATC CAACGCACAA CCGCTGCAGG GACAGAGCGT GTACGGGAAC ACATACATCT TCACGTCCCC GAGCGATGGG GTCTCCACGG TCAGCTTCTA CATAGATGAC CCCGACGCTC TCGGGACACC CTATCGAGTA GAGAAGACCG CGCCCTACGA CCTCAACGGG GGAACGGTAT CCACCGCCTC TCCCTACAAC ACGACGCTGC TTGCCGACGG GCAGCACTCC GTTACCGCCC TCATCAAGCT CAGCAACGGC AGCACCCAGA AGATCACAGC GACCTTCAAC GTGATAAACT CCGCCCCACA GCTGCAGTTC GACAGGAGCG CCGTCGTCTG GTCCGTGGGC AGTGGCGGCA CGGCAACACA GCAGGTAGCC CTCACAGGTG GGAACCCACC GGCCAGCTAC ACGCTATCCT ATGATGCCTT CTGGCTTAGC CTCCAGCCGA CCACTGGCAC GACTCTGGCA ACTATCTCCC TGGCGGCCAA CACGCAAGGG CTGCAGCCAG GCATATATAC GTCCACGGTA CTAGCAACTG CCCCAAACTA TAAACCCGCG TCCCTTAGGG TAACCATTAC AGTAGGGGAG CAGATACACC TCTCCTGGAT GGGTGATCCC TCCAGAACTA TGACGATTGT ATGGCGCACC TTCGACACGT CCATACCCTC GCTGGTGCAG TACAGGCAGG CCGGAACCAC CACCTGGCAG CAAGCCTCCG GATCGCTCCG TACCTCAGGG ACACGCGGCA CCCTGCACGA GGTAACGCTA TCCTTGCTCA CGCCTTCGAC GAGCTACGAA TACAGGGTAA TGCTGGACGG CTCCACCTGG AGCGAGACCT ACACCACCCA CACTGCCCCA CTCAGGGGAC CAGCGGACCT GGACGTGATC TACGTTGCCG ATACCGGGCT GATAGGCAGG GAGGATGGGC TCGCCAGCGG CACGCAGCAG GTAATAGACG AGATAGCGAG GATGCATCCC GACGTGGTAT TGCTTGGTGG TGACTACGCT TACTACAGCA CTGACAATAG GTTCGGCTCC CTGGATAATT CCATAGATGC ATGGTTCAAT CAGATGCAGC GCATAGGCGC CAAGATACCA ATGATGCCCA CTTATGGCAA CCATGAGACC CTCCTGGGGG AGGGTTACTC CTACTGGGCT GCCAGGTTCG CTACCCCAAA CGGCTACAGC AACAGGCAGA ACTACTCCTT CGATATAGGG GACGTACATT TCGTGTCGAT ATACGCGGTC GAGAACTCTA ATGGGCTGTC CGACGGGCAG CTGCAGTGGA TCGAACAGGA TATCCTGGCA GCAAAGGCCG CGGGGCAGAG GTGGATCGTG CCGTTCTACC ACGTATCGCC CTTCGCCGAT GGGCGCAACC ACCCCTCGAA CCTCGCACTG AGAGCACAGT TGGGACCACT GTTCGAGAGG CTAGTAGTCA AGATCGCCGT GAGCTCGCAC GACCAGGCCT ACGAGCGGAC CTACCCTTTG GTCGATGTTC CAAATTCGAA CACCCCGACG TCCATGGCCA AGGATTGCTA CACGATGTCG GATGGCGTGA CCTGGGTGAA ATCCAGCCCC GGAGGCAAGG AGAGCAACAA GAACGGCTCC TTCTCCCAGT TCGGCACCAA CCCTCCGCCG TCGTGGACAG CCTACCGCGA CAACACGATG CACCACTTCC TGCGAATACG CTTCTCGGCA GACGGTACCA TGAGGGTCGA AGGCTACGGC GTAAGGGGTG ATGGCAGCCC ACCAGTCCTG CAGGACAGCT TTATGTACAC AACTGGTAGC TGCGGCAGCG CAGGGCCCGA GACGACGATC ACGGCTGGGC CAACAGGGCT GACCAACCAG ACTTCGGCCA CATTCCAGTT CACATCCTCC GAGGACAACT CCTCCTTCCT GTGCTCCCTG GATGGCTCGG CCTTCTCCCC CTGCTCATCC CCTGTGGCTT ACTCTGGCCT GCAGGATGGC AGCCACACCT TCCAGGTCAA GGCAGTGGAC CAGGCAGGCA ATCAGGATCC ATCTCCCGCC TCCAGGTCCT GGACCATCGA TGCTACCCCT CCCACCATCA CGGGTACCAC GCCTGTCAAC GGAGCCAACG AAGTGCCCAC GAACACGAAG GTCTCTATCG CCCTCTCCGA GAGGGCAGAT CCCGCGAGCA TAAATGGCAG CACGTTCTAC CTGATGAAAT CGGGAAGCAG CCTGCCCGTT CCCGCCCAAG TGAGCTACGA CGATGCCCTC AAGATCGCCG CCCTGCAGCC AGATGCGCCG CTGGAAGCCG GCAGCACCTA TACAGCCAGG GCCACGGGCG ACATCAGGGA CCTGGCGGGC AACAGGCTGG GTGCCGACTA TTCGTGGGCT TTCACGGTGT CCTCGAATCC TTCCGCTGGT GGACTGCTTG GCGAGTACTT CAACAACAGC GACCTGACCG ATCTGGTACT GACGCGGGTG GACCCTGTGG TGGACTTCAA CTGGGACTAT GGCTCGCCAG ACCCCTCCAT AGACCCTGAT ACTTACTCCG TGCGCTGGAC GGGCATGGTG AAGGCCGATC GTTCAGAAAC CTACACCTTC TACACTCGGA GCAACGACGG CGTCAGGCTG TGGGTGAACG GCAAGCTGCT GGTCAACAAC TGGACGAACC ACGCCGAGAC GGAGAACAAG GGGAGCATCA GCCTCACTGC GGGCACCTGG TACCAGATCA GGCTGGAGTA CTACGAGGGC ACAGGGAGAT CGATCATCAG GCTGCTGTAC TCCTCACCGA GCACGCCCAA GCAGATCATC CCGAGCGACC ACCTGAGAAC ACCGTGA
|
Protein sequence | MQRTRSLWWM LIAVMLALAS AYIAGGGSGT LSATTYVLLF SSSSDRSNAQ PLQGQSVYGN TYIFTSPSDG VSTVSFYIDD PDALGTPYRV EKTAPYDLNG GTVSTASPYN TTLLADGQHS VTALIKLSNG STQKITATFN VINSAPQLQF DRSAVVWSVG SGGTATQQVA LTGGNPPASY TLSYDAFWLS LQPTTGTTLA TISLAANTQG LQPGIYTSTV LATAPNYKPA SLRVTITVGE QIHLSWMGDP SRTMTIVWRT FDTSIPSLVQ YRQAGTTTWQ QASGSLRTSG TRGTLHEVTL SLLTPSTSYE YRVMLDGSTW SETYTTHTAP LRGPADLDVI YVADTGLIGR EDGLASGTQQ VIDEIARMHP DVVLLGGDYA YYSTDNRFGS LDNSIDAWFN QMQRIGAKIP MMPTYGNHET LLGEGYSYWA ARFATPNGYS NRQNYSFDIG DVHFVSIYAV ENSNGLSDGQ LQWIEQDILA AKAAGQRWIV PFYHVSPFAD GRNHPSNLAL RAQLGPLFER LVVKIAVSSH DQAYERTYPL VDVPNSNTPT SMAKDCYTMS DGVTWVKSSP GGKESNKNGS FSQFGTNPPP SWTAYRDNTM HHFLRIRFSA DGTMRVEGYG VRGDGSPPVL QDSFMYTTGS CGSAGPETTI TAGPTGLTNQ TSATFQFTSS EDNSSFLCSL DGSAFSPCSS PVAYSGLQDG SHTFQVKAVD QAGNQDPSPA SRSWTIDATP PTITGTTPVN GANEVPTNTK VSIALSERAD PASINGSTFY LMKSGSSLPV PAQVSYDDAL KIAALQPDAP LEAGSTYTAR ATGDIRDLAG NRLGADYSWA FTVSSNPSAG GLLGEYFNNS DLTDLVLTRV DPVVDFNWDY GSPDPSIDPD TYSVRWTGMV KADRSETYTF YTRSNDGVRL WVNGKLLVNN WTNHAETENK GSISLTAGTW YQIRLEYYEG TGRSIIRLLY SSPSTPKQII PSDHLRTP
|
| |