Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_0559 |
Symbol | |
ID | 8567193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 631636 |
End bp | 635016 |
Gene Length | 3381 bp |
Protein Length | 1126 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | type II secretion system protein E |
Protein accession | YP_003289847 |
Protein GI | 268316128 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTGGA GTGATCTGTT CGGGCGCCGT CGTTCGCCTC GTGAATGGAA CGAAGAGCTG CGGCGGCTGA AGTTCGGAAC AAAGCAAGAG GAATCCGGCA AGGACGACGG GACCGACGCC GCCGGGAAAG CCTACCCGGA GGATGCCTCG GACCGGCCCG AGCCGAAGCC GGCTGAAGCG CAGCCGCTAT CGGGGCTCTG GCTGGAACTG GCCGGGGATG GGCATGCCGA GGACGCGCTG CCGGAATTGC CGGAGACCCC CGAGCCGGAG GAAGCCGAGG CCGAAACCGC GGAGGCCGAG CCTTCGGATC GGGCCGCTGC GCTGACGGAC GTGTCCGCCG AAGTGGCGTC GGCCCGGCCG CAGGCGAGTC TGGCCGATGC CACGCCGGGA CGCGAGGGGG CGCGGCAGGA GCCGCCTCGT CCGAAGCAGG AACCCACCGC TGAGCCGCCG GGTGCTTCGG TCGATCGGGG CGGCGTGCCG CCGACCGCCC GCGATTCGCG GACCGCTTCG CTGCCGGAGG GCGCCTTCAT GCCCGAACTG GCCGAGATGT TCGAAGACGA CGTCGTGATG GCGCTGCTCT ACGGCGGGGC GGTACGGCTG GATCAGATCG CGCGGGCCAT CGTGCTGCGG CGGCGTCGGC CGGAGCAACG GCTCTGGCGT TGCCTGCTGG AAGTGCCCGG TGTCGATCGG GAGACTGTAC TGGCCGAAGC GGCCCGTGTG GGTGGCATCG CACCGGCCCC GGTCGACGAA GAACGCCCCT CGGTCGAATT CATCAAGGCC ATTCTGTTGC TCCTGCCGCC TTCGGTGCAG CACATGCTGT TCGACTGGCA GCTGCTGCCC TGCGGATTCG CCCAGGGAAA AGAGGGGGGG CGCGTGCTGC TGCTGGCCAC GCACGACCCG ACGCGGCCGG AATTGAAGCA GGTGCTTCAG GAACTGCGGC TTCCCGCAGA ACTCCGGTAT GCGCCCGAGC GCATTCTGAC GAAGCGGCTG GCCGAATTGG AGAGCTTCCG ACCGCCTGTG CTCGAGCCCG AGCCGCCAAC CCCCGAAGCG CAACCACAGG TTCAGGCTCC GGAGCCTGAG ATCGATAGAA CGCCCGCGCA GCCAGAACCT GAGCCGCCGG AACCTGTCGC CGAAACGCCG ACGCCGGAGG CGGCCGATCC GGATGCGGTC CGGAAAGCAG CGCTCCCGGA GTCCCCGGAA GACGACGAGC CGGACGGGGG CGTCGGCCTG CAGGAAGCCG GAACGGCCGA GGTGTCCGAA CCGGCCGCGC CGGAAATCCA GCCTGCGGAA GAGGTACCGG CCACCTTCGC CCGGGAAGCT CCGCCGGTCG CCGATTCCCC GGCTGAACCG TTGTTCGAGC CGCTGCAGGG TGAGGCCCTG GAAGAACCGA AGGCGGAAAC CACGCCCGAG GCTCAGGCCG AAGCGGCGCT CGAAACCGCG CCGGAAGTCG CCGTGGAAAC GCTGGATGCC GCCTCCGACG CGATCGGCGA CGAAAAGCAG GAAGCCGACG CCCCGGAAGC GGCGGACGAT GGCGAGGACG TCCCGCCGCC GGTCAAGATC GTACTGGACG ACCTGCCTGA GCTGAAGGCC GCCATCGCAC GCGACCGGGT GGTGAGTCGG CTGCTTGAAA AAGACGTGGT GGCGCTGCAT CAGGTCTATC AGGCCTATCA GCGGCAACAG GACGAAGGGC TCAAGGAACC GCTCTGGCGC GTGCTGGCGC AGAGCGACCA CGTGCCGCAG GACGCCATCT ACGAAGAGGC CGCCCGGCTG TATGCCTTTC CCATTGCCAG GCTGGAGCCG GGCAAGCCCA GCCCGGAATT CGTGCGCTCG GTGATGGACA CCTTCGAAGA GGAGGTACGC GAGCGCCTGA TCGAGCTGCG CGTGGTGCCT TTCGAGGTCG ATCTGGATGC CCAGACCGGC GCTGTCAAGC TCGTGCTGGT CACGCACGAT CCCATGCGGC CCGAGGTGCA CCGCCTGGTG CACCGGCTCA AGCTGGAGCG CTTCGAACTC CAGTATGCGC CCCGGGGCGT GATCATGCAG GCGCTGATGG AGGCCTACCC GCGTCGCAAC GAATACCTGG AGCGCGTCAA AGAAGAAGCC GCCTACGATC TGGGCACGAG CTACGACGCC CAGACCGAGC TGATCGACGA AGACGCGCTG GAGGCCGAGA TCAACCGCTC CAAGCTGATC AACCTCTTCG AAGCGACGCT CGTCGAAGCC GTCCGCCAGG GGGCCTCCGA CATCCACATC TTCCCGAACA ATCAGAAGAA AGTAGAAATC CATTTCCGCA TCGATGGTCG GCTCACGCGC TGGCACGTGG AGGACAAAGT GCATCCCGAA GCGTTCATCG CGGTCATCAA GGACCAGTCG ATGAACGTGG ACCGCTTCGA GCGCGATGCG GCGCAGGATG GCTTCATCCA GCGCTGGATC GACGACCACC TCATCCGCTT CCGCGTCTCC GTACTCCCCA TCGCCAACGC GCTCGAAGAC CTGCGCTCGG AGTCGATCGT CATCCGTGTG CTCGACGACC GCAAGGTCAT CAAAGATCTG CGGCTGCTCG GGCTCAACAA GAAGGCGCTG GAGCGCTTCG AGCGGGCCAT TCGCCAGCCG CACGGCATGG TGATCGTCAC CGGGCCCACC GGCAGCGGTA AAAGCACCAC GCTCTACGCC GCCCTCCACC AGGTCGTCAG CCCCGAGGTG AACGTGCTGA CGATCGAGGA TCCGGTCGAG TACATCATCC CCGGCGTGCG CCAGATCAAG CTCAACCACA AGCTGGGGCT GGAGGAGGCG CTGCGGGCCA TCCTGCGGCA CGACCCCGAC ATCGTGATGG TCGGTGAGAT GCGCGATCGC CAGACGGCCG AGCTGGCCAT CAAGCTGGCC AACACGGGGC ACCTGACGTT TTCGACGCTG CACACGAACG ACGCCCCCAG TGCGGTGAGC CGGCTCTACA AGATGGGGAT CGAGCCGTTC CTGATCGCCT ATGCGATCAA CCTGGTCGTG GCCCAGCGCC TGATCCGCAA GGTGTGTCCG GCCTGCAAGG TGGAAGACAA AGACCCCGAC TACGTGATGC TCCGCAAACT GGGCTTCACG GACGAGGAGA TCGCGCGCAC CACCTTCTAC AAGGCCGGTC GCGACCGTAA CTGCAAGGTC TGCAAGGGCG TCGGCTACAA GGGCCGGCGG GCGATCGTCG AGGCTATGTA CTTCTCGCGG ACGATCCGAC ACATGATCGT CGAAGCGCAG GGATCCATCG ACGAAGACGC GCTGCGTGAG CAGGCGATCA AAGAGGGCAT GCAGACGCTG CGCGACGCCG CCCGCGAGGT CGTGCTGGCC GGCGAAACCA CCATTGAGGA GATGATCCGG GTGACGACGA CCGAGGAGTA G
|
Protein sequence | MSWSDLFGRR RSPREWNEEL RRLKFGTKQE ESGKDDGTDA AGKAYPEDAS DRPEPKPAEA QPLSGLWLEL AGDGHAEDAL PELPETPEPE EAEAETAEAE PSDRAAALTD VSAEVASARP QASLADATPG REGARQEPPR PKQEPTAEPP GASVDRGGVP PTARDSRTAS LPEGAFMPEL AEMFEDDVVM ALLYGGAVRL DQIARAIVLR RRRPEQRLWR CLLEVPGVDR ETVLAEAARV GGIAPAPVDE ERPSVEFIKA ILLLLPPSVQ HMLFDWQLLP CGFAQGKEGG RVLLLATHDP TRPELKQVLQ ELRLPAELRY APERILTKRL AELESFRPPV LEPEPPTPEA QPQVQAPEPE IDRTPAQPEP EPPEPVAETP TPEAADPDAV RKAALPESPE DDEPDGGVGL QEAGTAEVSE PAAPEIQPAE EVPATFAREA PPVADSPAEP LFEPLQGEAL EEPKAETTPE AQAEAALETA PEVAVETLDA ASDAIGDEKQ EADAPEAADD GEDVPPPVKI VLDDLPELKA AIARDRVVSR LLEKDVVALH QVYQAYQRQQ DEGLKEPLWR VLAQSDHVPQ DAIYEEAARL YAFPIARLEP GKPSPEFVRS VMDTFEEEVR ERLIELRVVP FEVDLDAQTG AVKLVLVTHD PMRPEVHRLV HRLKLERFEL QYAPRGVIMQ ALMEAYPRRN EYLERVKEEA AYDLGTSYDA QTELIDEDAL EAEINRSKLI NLFEATLVEA VRQGASDIHI FPNNQKKVEI HFRIDGRLTR WHVEDKVHPE AFIAVIKDQS MNVDRFERDA AQDGFIQRWI DDHLIRFRVS VLPIANALED LRSESIVIRV LDDRKVIKDL RLLGLNKKAL ERFERAIRQP HGMVIVTGPT GSGKSTTLYA ALHQVVSPEV NVLTIEDPVE YIIPGVRQIK LNHKLGLEEA LRAILRHDPD IVMVGEMRDR QTAELAIKLA NTGHLTFSTL HTNDAPSAVS RLYKMGIEPF LIAYAINLVV AQRLIRKVCP ACKVEDKDPD YVMLRKLGFT DEEIARTTFY KAGRDRNCKV CKGVGYKGRR AIVEAMYFSR TIRHMIVEAQ GSIDEDALRE QAIKEGMQTL RDAAREVVLA GETTIEEMIR VTTTEE
|
| |