Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_0290 |
Symbol | |
ID | 5170867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 274216 |
End bp | 277194 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640562793 |
Product | polysaccharide export protein |
Protein accession | YP_001243895 |
Protein GI | 148269435 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAG TGCTGATCTT TCTGTTTCTC TTAGCTGCGG TGTTTTCTCT ATCTTACACC ATAAGAAAGG GAGACGTTCT GAGAGTAGAA GTGGTGGGAT ACCCGGACCT GACGAGAAAT TGCGCGGTGG ACATAGAAGG AGCGATCACT TTTCCGTACG TGGGACGTGT GAAGGTGGAA GGCCTCAGTG TGGACCAGGT GACAGAGTTT CTCAAAGAGA GGCTCTCCAA GAGTTTTTCT GATCCTGAGG TGATCGTGTC TCTTCAGCAG ATCGCACCAA GGAACGTCTA CGTCTCCGGT GTTGTGAACA GAGTGGTGGA CATGGGAATA GAAGATCTCA GCGTTTCTGA GCTTCTCTCT CTTCTCTCTG TGGATCTTTC CTCCGTTGAC CTTTCGAAGG TGAAGGTCCT CAGGGATGGG AAGGTCTTTG AACTGGATCT CTCCTCGCTT CTCTGGGGAG AAGCTCCCGA TAAGGACGTG GTGCTCCAGG AAGACGATCA GGTGATACTT CCAGAAAAGA GCTACACGGA GTTCGTGAAG GTCGTTGGGG CTGTGGCAAA GCCCGGGATC TACCCCTACA GAAGAGAGAT GACCCTGCTC GACGCCATCG CGGCAGCGGG TGGCACCACG CAGGAGAGTT CAGGAAAGAT AATCGTCCTC TCGAAGGACC AAACGACCGA GATATCTGAG AAAGATCTCT ATCAAAAGAA CCTTCTTCTG AAGCCGGGAG ATACGGTGCA CGTTCAAAAG CTCGACGAGA GGTTCGCTTA CGTTGTGGGA GCGGTGGCAA GACCGGGGAT GTACACCTTC TCCAGAGAAG AGTCTCTCAC ACTCAAGAAT CTCGTGGCGA AGGCCGGTGG CACCTCCGTT GAGGACAGGT ACATAGAAAA GGTTCTGATC ACAAGGGATG GAAAGACCAC CGAGTACACA CCGGAAGTGC TGAACGAAAA CGTTCAGCTG AATGTCGGAG ATGTCGTTGA GATAAAGAAG TACGAAGAAA CGAGGGTTTA CGTATCTGGA TACGTCTCAA GACCCGGTGT TTACGAGATC TCCCCGAAAG AGTCGGTGAC CTTAGAAAAG CTCCTCTCGA TGGTGGGAGG GTTCAAGGGA AGTGTGGAGG GAGTCGATTC GATCGTCATC ACAAGAGATG GGAGTGTCAT AGAGCTCTCA CCAAGCGAGC TGGACTTTCC GGTGAAACCG GGTGACATTG TGAACGTGAA GGAGTTCGTT CCGAAGAAGG CCTACATCCT CGGATACGTG AGAAATCCTG GACTTTACAC CTTTGGAAAG GGCGAGGCGT TCACACTGAG AAACCTCATA GCGAAGGCCG GTGGTTTCAT CGATGAAAAT CAGGTTGTCT CCGTGAAACT GTCTGGAAAG GAATACTCGC CGGATGAGAT CGTGAAGATC GATATCCTGC TCGAAGATGG CGTGTTCGTC TACGTGGAGA AGTACACGGA CAGGTTCGTT TACATGGTGG GAGACAATAC CGCGAGGAAC GGAAGGATGA GCTTTGCAAA AGACGAACCG TTCACTCTCT CGACGGCCCT GAAGAAGTAC GGTATCGAGG ATTTCTCTCT TGTGAAGAGT CTTTCACTTT TGAGAGACGG CGAGGAGAAG ATCTTTGATC CGAAGAAGAT ATTGACGGAG GATGTTTCCC TCAAGACCGG TGACACGGTT CTCGTGAAGA CCGTTCAGGC GAAAAGGGTC TATTTCACCG GAGACGTGTA CGGATACGTC GATTTCACTA AGGACGAAGA CATCACGCTC GAAAAGGCCC TCGCAAGGTT TGGAAAGATA CAGAAGAAGT ACATCGCGGG ATTGAAGGTT CACTCAAACG GAAAAGTGCA GGAGCTGACG GAAGTTGCCG ATCTTCCGCT CGAAGACGGC GCAGTTGTCG AGGTGGACGT GAAAGAGGCC GTCAGGGTGT ATGTGGACGG CTTCGTGAAA GTTCCTCAGA TGGTTGTCTT TGAGCCCGAT GAACCGGTAC TCCTGGACAG GGCCATTGTG AAAGCCGGAG GATACAAAGA AGACGCTCTC TTCGAAGCGG GAGACGTCGT TGTTCTGAGG GACGGCGGTG AGATCAACGT TCCCGAAAAC CAGCTGAGTT CGTTCGAGCT GAAGGACGGC GACCTCGTGT ACGTGAAGTA CACCGAAAGA CCACACGTTT ACGTCTTCGG AGAAGGTATC ACGAACACGC TTGTGACCTT CAGGGACGAA GAGACGCCTA CACTCAGGAA CGTTCTGGGC AAGGTGTGGG GCATAAAGAG CACAGGATCG AGAAAGATCG TCGTTGTGAG TCCATCGGGA GAGAAGAAAG AAGTGGACTA TGAAGACGTG ATCAACACGG GCGGTCCTGT TCTCGAGAGT GGAAGCGTGG TCTTCGTGCC TCTCGAGACG GAGAACTTCG CCTACGTTGT GGGAGAGGTG GCAAGGCCCG GTGCGTACGA ACTCAAAGGA GATGTGACAC TTCTCAAACT CATCGCACAG GCCGGGGGCT TGAGCAACTG GGCGCTGAAG ACGAAGGTGA TCTTGAGAAG GGGAGAAAAC GAGACAGCCT ACGACTTCAC GAACATGGAC GAGGTGCAGA AAGTAAAGAT AGAGCCGGGG GACGTGGTGT ACGTGCCACC CGTTGAGACG AACTACGTGT ACGTGCTCGG TAATGTGAGA ACACCAGGTA TCGTGAAGGT AGACAGGTAC TCGACGGTGT TCGATGTGGT GATGAGGGCC GGTGGGTTCA CAGACAGGGC CGCCACGGGA AGGATATTCC TCTTCAAAGG AGGGCCTCAG GGAGAGGTCA CCGTCTGCGA TCTCTCCGGA GTTCTCTCTG GAAAGGGTGG AGGAGTGAAC CCGAACGTCG CGCCGGGTGA TGTGGTCTTC GTTCCCGACA ACCCGCTCAT ACAGGTGACA GAAGCGCTCT CCATAGTGAA CACCATCCTG AACACGATAA GCAACGTCAG AGACTTCATG GGGTGGTAA
|
Protein sequence | MKRVLIFLFL LAAVFSLSYT IRKGDVLRVE VVGYPDLTRN CAVDIEGAIT FPYVGRVKVE GLSVDQVTEF LKERLSKSFS DPEVIVSLQQ IAPRNVYVSG VVNRVVDMGI EDLSVSELLS LLSVDLSSVD LSKVKVLRDG KVFELDLSSL LWGEAPDKDV VLQEDDQVIL PEKSYTEFVK VVGAVAKPGI YPYRREMTLL DAIAAAGGTT QESSGKIIVL SKDQTTEISE KDLYQKNLLL KPGDTVHVQK LDERFAYVVG AVARPGMYTF SREESLTLKN LVAKAGGTSV EDRYIEKVLI TRDGKTTEYT PEVLNENVQL NVGDVVEIKK YEETRVYVSG YVSRPGVYEI SPKESVTLEK LLSMVGGFKG SVEGVDSIVI TRDGSVIELS PSELDFPVKP GDIVNVKEFV PKKAYILGYV RNPGLYTFGK GEAFTLRNLI AKAGGFIDEN QVVSVKLSGK EYSPDEIVKI DILLEDGVFV YVEKYTDRFV YMVGDNTARN GRMSFAKDEP FTLSTALKKY GIEDFSLVKS LSLLRDGEEK IFDPKKILTE DVSLKTGDTV LVKTVQAKRV YFTGDVYGYV DFTKDEDITL EKALARFGKI QKKYIAGLKV HSNGKVQELT EVADLPLEDG AVVEVDVKEA VRVYVDGFVK VPQMVVFEPD EPVLLDRAIV KAGGYKEDAL FEAGDVVVLR DGGEINVPEN QLSSFELKDG DLVYVKYTER PHVYVFGEGI TNTLVTFRDE ETPTLRNVLG KVWGIKSTGS RKIVVVSPSG EKKEVDYEDV INTGGPVLES GSVVFVPLET ENFAYVVGEV ARPGAYELKG DVTLLKLIAQ AGGLSNWALK TKVILRRGEN ETAYDFTNMD EVQKVKIEPG DVVYVPPVET NYVYVLGNVR TPGIVKVDRY STVFDVVMRA GGFTDRAATG RIFLFKGGPQ GEVTVCDLSG VLSGKGGGVN PNVAPGDVVF VPDNPLIQVT EALSIVNTIL NTISNVRDFM GW
|
| |