Gene Information |
Locus tag | Rmar_1068 |
Symbol | |
ID | 8567709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 1221563 |
End bp | 1223956 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | glycoside hydrolase family 43 |
Protein accession | YP_003290348 |
Protein GI | 268316629 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
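The coordinate and length fields above agree with each other, and the check can be sketched in a few lines. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Rmar_1068.
length = gene_length_bp(1221563, 1223956)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths match the reported Gene Length (2394 bp) and Protein Length (797 aa).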
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000256501 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGATTG TCATCTTGAA CAGGCTTCGC TTTAAGAAAG CCTCCCGTGA ATGGCCGGTA ATTCTCGGGG GGTGGCTGAT TCTGGCCGGA TGGATGGCGC TGGGATTTCT GGTGCGGGCG GGGTATGCGC AGGAGCCAGG TCGTCCGGTG TACGTCAATC CGGTGATCCC CGGCGATCAT CCAGACCCCA CGCTGACCCG GGTCGGGGCT TACTATTACA CGTCCGGTTC ATCCTTCAAC GTCACCCCCA GGATTTACCG CTCGACGGAC CTGGTTCACT GGGAGATCGT CGGGCGGCCG GTTTCGGCCT CCTGGTCGCT GTATGGCGAC GTGCCTGCCG GTGGGGTATG GGGGGGCCAT ATGGTTTATT ACCAGGGACT GTACTGGCAC TTCTTCGGGC GTGGGACAGG AGATCGGGCC ATGTATTTCG TAACGGCGCA TCGGCCGGAG GGGCCCTGGG GTATGCCGGT TCGCATGAAC GTACCGCCCG GCATTCCCGG TCTGGGCGTG GATAATTCCA TCTTCATTGA TGAGGATGGA CGATGGTTTT TGCTGAGCAA GAATGGTCCC CAGAATAATT ACATCGTCGA ACTGGGACCG GATGGTCAGC CGGCCGGGGT TGTCTATGAC CTGACCTGGA TCAACCCGGA CTCGGCCGGC AATCCTTATG GATGGGCTGA AGGGCCGGTG ATGTGGAAGT ATCAGGGCTA TTACTACTAC AGTTTTGCGC AACATCTGGT GGGTAATCAG TACGTTATGC GCAGCGATAC CCTGTCGGAC GATCCTGCCG ACTGGGAAGG GCCCCGGTTG CTTTTCGAAA CAGTGCCCGA TCGGTACCAG CGGGTGTTTC GCAATCCCAA TCATTGCTCA CCGGCCGTTA CCGCCGACGA TGGGACGCAC TGGATGATCT GTCATGCCTA CGATCAGAGC GGGCCGGGTG AAGAATGGCG AGCGCTGGGA CGTCAGGGGC TTCTGGTTGA AGTGCGTTAC GAAGAGGGCT GGCCCGTAGC CCGCTTTCCA ACCACGGAGC CCGTGGAGGG ACCCGCTTTG CCCAGCAGCG GCATCCCCTG GGCGGTGCCC CGATCGGACT TCTTCGACAG GAGTCGACTG GCGGTGCACT GGTCTCTGCT GGGCTACACG CCGGAGGAAA CATACGATCT GACGGAACGT CCCGGCTGGC TGCGGCTGAC GCCCAAAGGA GGGCGCACGT TTCCGCCCAC GCCGGGACGG AATACTGTGC TGCAGGCCGC CGCCGAGCGG GCCTATTCCC TGATGACCCG GGTCGATTTT GATCCGGCGA CCACCTCGGA TGAAGCCGGA CTCTGGATTA TCAACGGTCC GGAGACGCTG CAGGCACGGT TATGCGTAAC GCGCAGCTCG GAGGGCGAAC GTGTCGTGGC TTTTCGTTTC GATACCCTTG CGCACAGTAC GCCCCTGCCC TCGGAAGAAC CGGTCTGGTT GAAGCTTGAA CGAGAAGGGC ATGAACTGAC CGCTTCCTTC AGTTTGGATG GGGCAAGCTG GGCCCCGGTA GCCGAGAAGG TGAACGTGGC GCGTCTGGAC CGCGAGCAGC CCGCTTCGGA GTCCGGTTAC GATTTTAATG CATTTACGGG CAATCAGCAG GGATTGTACG TGCTCGGCAA TACCCCGGCC TACTTTGACC TCTATATATA TCGGGACGCC TACTCACGCA TTCCGGCCCA GCATCCTACC AATTACAATG GCGTGATCAC TTCGAGAAAT GGACTACCGG CGCATGCGAA TTACCTGGCC GGCATCCACG ATGGAGAATG GGCCATGTAC 
GCCGGCGTGG AGTTCGGCGC GCCGGGAAGC GATTATTCCA GGATACCCCG CCAGGTTGTG GTGACGGCTT CCAGCGCTAC CGGAGGTGGT GTGGTCGAAG TCTGGGTGGA CGCGCTGGAT ACCGGCCAGA AAATCGCGGA AGTGCCGATC AAATCGACAG GGAGCTGGGA CGTGTATCAG GACTTTACGG CCGAAGTGGT GCCGGTTAGT GGCCGTCATG ATGTTTTTCT GCGGTTTCGG GGAAATCCCA CGGAGACGTT GTTTCGCATT CGTTCCCTGC TGTTCGAGGG CCAGCTGACG GAGACGGCAA CAGGACCTGG CGCCGCGGTC CGGCCACTCC TGGTGACGCA TTATCCGAAT CCGGTGCGCG ATGACGTGAC TTTTCTCGTT TCCCTGCCAC GCACAGGACC TGTTCGCCTG GTGCTGTACA ACGCACTGGG GCAGCAGGTG GCCACGCTGA TTGATGCGGT GCGTCCGGCG GGGCGGTATC CGTTGCGCTT CACCATCAAG CATCTCTCGC CGGGCCTTTA TTTCTATCAG CTTACCACGA AAGACCAGGT GGTAACAGGG CAACTCATCG TGATTTCCCG ATGA
|
Protein sequence | MWIVILNRLR FKKASREWPV ILGGWLILAG WMALGFLVRA GYAQEPGRPV YVNPVIPGDH PDPTLTRVGA YYYTSGSSFN VTPRIYRSTD LVHWEIVGRP VSASWSLYGD VPAGGVWGGH MVYYQGLYWH FFGRGTGDRA MYFVTAHRPE GPWGMPVRMN VPPGIPGLGV DNSIFIDEDG RWFLLSKNGP QNNYIVELGP DGQPAGVVYD LTWINPDSAG NPYGWAEGPV MWKYQGYYYY SFAQHLVGNQ YVMRSDTLSD DPADWEGPRL LFETVPDRYQ RVFRNPNHCS PAVTADDGTH WMICHAYDQS GPGEEWRALG RQGLLVEVRY EEGWPVARFP TTEPVEGPAL PSSGIPWAVP RSDFFDRSRL AVHWSLLGYT PEETYDLTER PGWLRLTPKG GRTFPPTPGR NTVLQAAAER AYSLMTRVDF DPATTSDEAG LWIINGPETL QARLCVTRSS EGERVVAFRF DTLAHSTPLP SEEPVWLKLE REGHELTASF SLDGASWAPV AEKVNVARLD REQPASESGY DFNAFTGNQQ GLYVLGNTPA YFDLYIYRDA YSRIPAQHPT NYNGVITSRN GLPAHANYLA GIHDGEWAMY AGVEFGAPGS DYSRIPRQVV VTASSATGGG VVEVWVDALD TGQKIAEVPI KSTGSWDVYQ DFTAEVVPVS GRHDVFLRFR GNPTETLFRI RSLLFEGQLT ETATGPGAAV RPLLVTHYPN PVRDDVTFLV SLPRTGPVRL VLYNALGQQV ATLIDAVRPA GRYPLRFTIK HLSPGLYFYQ LTTKDQVVTG QLIVISR
|
| |
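The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions: the sequences are pasted in as shown, with spaces between 10-character blocks. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted; because this gene starts with ATG, no start-codon handling is included. The helper names `translate` and `gc_fraction` are hypothetical.

```python
# Codon table built in the conventional TCAG order; codon-to-amino-acid
# assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
prefix = "ATGTGGATTG TCATCTTGAA CAGGCTTCGC"
print(translate(prefix))  # first ten residues of the protein
```

Running `translate` on the full gene sequence and comparing the result with the protein sequence (after removing spaces from both) checks the two fields against each other, and `gc_fraction` on the full gene sequence can be compared with the reported GC content.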