Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_0629 |
Symbol | |
ID | 8567265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 712614 |
End bp | 715700 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | peptidase S9B dipeptidylpeptidase IV domain protein |
Protein accession | YP_003289916 |
Protein GI | 268316197 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCCT TTCGCATTGC CGGCCTGCTG CTCCTGCTGA CAAGCTGGAG CCTCTCGGCC TGGGCCCAGT ACTTCGGCCG CAACAAGGTC CAGTACGAAT CCTTCAACTG GCGCGTGCTG CGCACGCCGC ACTTCGAGAT CTACTACTAC CCCGAAGAGG AGGTCGCTGT GCGCGATGCG GCCCGCATGG CCGAACGCTG GTATCAGCGT CACAGCCGCA CGTTTCTGCA CGAGTTCGGT GAGCGCAAGC CGATCATTTT CTATGCGGAC GACGCCGATT TCCATCAGAC GAACGCCATC AGCGGCGAGA TCGGCGAAGG CACCGGCGGC GTGACCGAGG CCATCAAAGA GCGGGTGATC ATGCCGTTTA CGGGCATCTA CCGGGAGAAC GATCACGTGC TCGGTCACGA GCTGGTGCAC TCCTTCCAGT ACGACATTGC GCTGAATCGA TCCGACAGCC TGAGCCGCTT CAATCTGGCG CTGCTGCCGC TGTGGCTCGT CGAAGGCATG GCCGAGTACC TGTCGCTGGG GCGCAACGAC CCGCACACGG CCATGTGGCT GCGCGATGCG GCGCTGCGCG ACGATCTGCC CACCATCCGG CAACTCACGC GCGACCTGCG CTACTTCCCC TATCGCTACG GCCAGGCCTA TCTGGCCTAC ATCGGCGGCA AGTACGGCGA TCAGGCCGTC ACCGAGCTGT ACAAGCTGGG CGGGATCGTG GGCGTCGATA CGGCCATTGC GATCACGCTG GGCATCACGC CCGACTCGCT TTCTAAGGAA TGGATTCAGG CGGTCAAGAA CACCTACCTG CCTTTGCTTA AAGACCGCAC CCCGCCTGAG CAGGCCGGAC GGAAGGTGCT GGCGCCCGAC CTCGACGCGG GCGAGATCAA CCTGGCTCCG GCGCTCAGTC CCGACGGCCG CTACGTGGCC TTCCTTTCGG AGCGCGACCT GTTCACGATC GATCTGTTCG TGGCCGACGC CGAGACCGGC AAGGTGCTGC GGCGGCTGAG CAGTAGTGCC AGCGATCCCC ACTTCGACGC CATCCGCTTC ATCAACTCCT CGGGCTCCTG GTCGCCTGAC GGCCAGCGCT TCGCGTTCAT CACGTTCGCT CAGGGCAACA ACGAAATCGC CATCTGGAAC CTGCGGAAGG GCAAACTGGA ACGCCGCATC GCGGTCGAAG GCGTCGGTGC CATCCACAAC CTGGCCTGGT CGCCCGACGG CCGCACCATC GCCTTTTCGG GGCTTTCAGG CGGCATCAGT GATCTGTACC TGCTGGACCT GGAGACGAAT CAGGTCCGGA AGCTCACCGA CGACCGCTAC GCCGACCTGC AGCCGGCCTG GTCGCCCGAC GGCCGCACCA TCGCCTTCGT GTCGGACCGG GGGCCGGACG GCACCGACTT CGAGATCCTG CGCTACGGGC ACGAACGCCT GGCCCTGCTC GACCTCGAAA CCGGCAAGGT GCGCGTCCTG CGGCCCTTCC GAAACGGCCA GCAGATCAAC CCGCAATTTT CACCGGATGG CCGAAGCCTG TATTTCATTT CCAACCACGA CGGCTTCAAG GACATCTACC GGATGGACCT GAACACCGGA GCCGTCTATC GCATCACCAG GTTGCAGACC GGCGTCAGCG GCATCACGAG CATCTCGCCG GCCATGAGCG TGGCCGCTCA GAACGGCCGC ATGATGTTCT CGGTCTTTAC CGACAACAAA TACCTGGTCT TTTCGCTGGA ACCCGACCAG TTGCAGGGCG AACCGGTTGA GCCGGAAAGT GACACAGGCA TTGCCAGCGC CGGTGTGCTT CCCCCGCTGC AACCGCCCAC CCAGGGGCTG GTCAGCAGCT ACCTGAACGA TCCGCTGACG GGCCTGCCGG ACGAACTGGC ACTCCGCCCC CAGCCCTATC GTCGCAAACT ACAGCTCGAC TACGTGGCAC CGCCCAGCTT CGGCGCCAGC GTCGGGGGGC CGTTCGGGAC GATGATCGCC GGTGGCGTGG CCTTCTTCTT CAGCGATATG CTGGGCGATC AGCAACTGGC CGTGGCTGCC CAGGCCAACG GCACCTTCAA GGACATCGGC GGCCAGGTGC TCTACATCAA TCAGGGCCAT CGCCTGAACT ACGGAGCGCT GGCCAGCCAC ATACCGCTGC TCTACGGCTA TGCCTATCTG GACGTGTGTA CGGATCCGGC CACCGGCCTG CAGGTCTACT GCTACATCCA GCGCCTGGAG CGCATCTACA TCGACGAAGC CGGCGTGCTG GGCTACTATC CGCTCAACAC CACCCAGCGC TTCGAGTTGA TAGTGGGATT CCAGCGCTAC GGCTTCGACT ACGACGCCGA GATCTACTAC CTCTACGGGG CCGGCTACCG ACGCTCCACG CAGAACCTAC CCGCACCGGA CCCGATCTAC TTCTTCCAGG CTTCGGGGGC GTTCGTGGGC GACTTTTCGT TCTTTGGCTT CACCTCGCCA GTACGTGGCT CGCGCTATCG GCTGCGGGTA ACCCCGTCGA TCGGGACCGA ACGTTTCGTA CAGGTATTGG TTGACGGGCG CAAGTATTTC TTCTTCCGCC CGCTGACGTT CGCCGTACGC CTGACGCACG TGGGCAACTA CGGCGCCCAA CCCCAGGGTT CGGATCGGCT CTACTTTGCC CAGGAATATC TGGGCTTCGG AAGCACGCTG ACGTTCGTAC GCGGCTACGG CTTTTACTCG CTGGAGCCGG ACGAATGCAC GCCCTCGACC TCAACGCCGA ACCTCTGCGC CGAAGTGGCC CGTCTGGTGG GCACGCACGT GGCCGTAGCC AGCGCCGAGC TACGCATCCC GCTGTTTGGC ACCGAGCGCT TCGGGCTGTT CAACTTCCCC TACCTGCCCA CCGAGCTATC GCTCTTTGCC GACGCGGGCG TGGCCTGGTC GAAAGGTGAT CTGCCCAAGT GGAAGTTCGT GCGTCACACG GGCGATCGAG TGCCCGTCTT CAGCACGGGC ATTTCGACAC GGTTCAACAT TCTGGGCGCC ATGGTACTGG AGCTGTTCTA CGTCTATCCG TTCCAGCGCC CCTATAAGGG CTGGCATCTG GGCGCCCAGC TCGTACCGGG CTGGTAG
|
Protein sequence | MRAFRIAGLL LLLTSWSLSA WAQYFGRNKV QYESFNWRVL RTPHFEIYYY PEEEVAVRDA ARMAERWYQR HSRTFLHEFG ERKPIIFYAD DADFHQTNAI SGEIGEGTGG VTEAIKERVI MPFTGIYREN DHVLGHELVH SFQYDIALNR SDSLSRFNLA LLPLWLVEGM AEYLSLGRND PHTAMWLRDA ALRDDLPTIR QLTRDLRYFP YRYGQAYLAY IGGKYGDQAV TELYKLGGIV GVDTAIAITL GITPDSLSKE WIQAVKNTYL PLLKDRTPPE QAGRKVLAPD LDAGEINLAP ALSPDGRYVA FLSERDLFTI DLFVADAETG KVLRRLSSSA SDPHFDAIRF INSSGSWSPD GQRFAFITFA QGNNEIAIWN LRKGKLERRI AVEGVGAIHN LAWSPDGRTI AFSGLSGGIS DLYLLDLETN QVRKLTDDRY ADLQPAWSPD GRTIAFVSDR GPDGTDFEIL RYGHERLALL DLETGKVRVL RPFRNGQQIN PQFSPDGRSL YFISNHDGFK DIYRMDLNTG AVYRITRLQT GVSGITSISP AMSVAAQNGR MMFSVFTDNK YLVFSLEPDQ LQGEPVEPES DTGIASAGVL PPLQPPTQGL VSSYLNDPLT GLPDELALRP QPYRRKLQLD YVAPPSFGAS VGGPFGTMIA GGVAFFFSDM LGDQQLAVAA QANGTFKDIG GQVLYINQGH RLNYGALASH IPLLYGYAYL DVCTDPATGL QVYCYIQRLE RIYIDEAGVL GYYPLNTTQR FELIVGFQRY GFDYDAEIYY LYGAGYRRST QNLPAPDPIY FFQASGAFVG DFSFFGFTSP VRGSRYRLRV TPSIGTERFV QVLVDGRKYF FFRPLTFAVR LTHVGNYGAQ PQGSDRLYFA QEYLGFGSTL TFVRGYGFYS LEPDECTPST STPNLCAEVA RLVGTHVAVA SAELRIPLFG TERFGLFNFP YLPTELSLFA DAGVAWSKGD LPKWKFVRHT GDRVPVFSTG ISTRFNILGA MVLELFYVYP FQRPYKGWHL GAQLVPGW
|
| |