Gene Rmar_0629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_0629 
Symbol 
ID8567265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp712614 
End bp715700 
Gene Length3087 bp 
Protein Length1028 aa 
Translation table11 
GC content64% 
IMG OID 
Productpeptidase S9B dipeptidylpeptidase IV domain protein 
Protein accessionYP_003289916 
Protein GI268316197 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGCCT TTCGCATTGC CGGCCTGCTG CTCCTGCTGA CAAGCTGGAG CCTCTCGGCC 
TGGGCCCAGT ACTTCGGCCG CAACAAGGTC CAGTACGAAT CCTTCAACTG GCGCGTGCTG
CGCACGCCGC ACTTCGAGAT CTACTACTAC CCCGAAGAGG AGGTCGCTGT GCGCGATGCG
GCCCGCATGG CCGAACGCTG GTATCAGCGT CACAGCCGCA CGTTTCTGCA CGAGTTCGGT
GAGCGCAAGC CGATCATTTT CTATGCGGAC GACGCCGATT TCCATCAGAC GAACGCCATC
AGCGGCGAGA TCGGCGAAGG CACCGGCGGC GTGACCGAGG CCATCAAAGA GCGGGTGATC
ATGCCGTTTA CGGGCATCTA CCGGGAGAAC GATCACGTGC TCGGTCACGA GCTGGTGCAC
TCCTTCCAGT ACGACATTGC GCTGAATCGA TCCGACAGCC TGAGCCGCTT CAATCTGGCG
CTGCTGCCGC TGTGGCTCGT CGAAGGCATG GCCGAGTACC TGTCGCTGGG GCGCAACGAC
CCGCACACGG CCATGTGGCT GCGCGATGCG GCGCTGCGCG ACGATCTGCC CACCATCCGG
CAACTCACGC GCGACCTGCG CTACTTCCCC TATCGCTACG GCCAGGCCTA TCTGGCCTAC
ATCGGCGGCA AGTACGGCGA TCAGGCCGTC ACCGAGCTGT ACAAGCTGGG CGGGATCGTG
GGCGTCGATA CGGCCATTGC GATCACGCTG GGCATCACGC CCGACTCGCT TTCTAAGGAA
TGGATTCAGG CGGTCAAGAA CACCTACCTG CCTTTGCTTA AAGACCGCAC CCCGCCTGAG
CAGGCCGGAC GGAAGGTGCT GGCGCCCGAC CTCGACGCGG GCGAGATCAA CCTGGCTCCG
GCGCTCAGTC CCGACGGCCG CTACGTGGCC TTCCTTTCGG AGCGCGACCT GTTCACGATC
GATCTGTTCG TGGCCGACGC CGAGACCGGC AAGGTGCTGC GGCGGCTGAG CAGTAGTGCC
AGCGATCCCC ACTTCGACGC CATCCGCTTC ATCAACTCCT CGGGCTCCTG GTCGCCTGAC
GGCCAGCGCT TCGCGTTCAT CACGTTCGCT CAGGGCAACA ACGAAATCGC CATCTGGAAC
CTGCGGAAGG GCAAACTGGA ACGCCGCATC GCGGTCGAAG GCGTCGGTGC CATCCACAAC
CTGGCCTGGT CGCCCGACGG CCGCACCATC GCCTTTTCGG GGCTTTCAGG CGGCATCAGT
GATCTGTACC TGCTGGACCT GGAGACGAAT CAGGTCCGGA AGCTCACCGA CGACCGCTAC
GCCGACCTGC AGCCGGCCTG GTCGCCCGAC GGCCGCACCA TCGCCTTCGT GTCGGACCGG
GGGCCGGACG GCACCGACTT CGAGATCCTG CGCTACGGGC ACGAACGCCT GGCCCTGCTC
GACCTCGAAA CCGGCAAGGT GCGCGTCCTG CGGCCCTTCC GAAACGGCCA GCAGATCAAC
CCGCAATTTT CACCGGATGG CCGAAGCCTG TATTTCATTT CCAACCACGA CGGCTTCAAG
GACATCTACC GGATGGACCT GAACACCGGA GCCGTCTATC GCATCACCAG GTTGCAGACC
GGCGTCAGCG GCATCACGAG CATCTCGCCG GCCATGAGCG TGGCCGCTCA GAACGGCCGC
ATGATGTTCT CGGTCTTTAC CGACAACAAA TACCTGGTCT TTTCGCTGGA ACCCGACCAG
TTGCAGGGCG AACCGGTTGA GCCGGAAAGT GACACAGGCA TTGCCAGCGC CGGTGTGCTT
CCCCCGCTGC AACCGCCCAC CCAGGGGCTG GTCAGCAGCT ACCTGAACGA TCCGCTGACG
GGCCTGCCGG ACGAACTGGC ACTCCGCCCC CAGCCCTATC GTCGCAAACT ACAGCTCGAC
TACGTGGCAC CGCCCAGCTT CGGCGCCAGC GTCGGGGGGC CGTTCGGGAC GATGATCGCC
GGTGGCGTGG CCTTCTTCTT CAGCGATATG CTGGGCGATC AGCAACTGGC CGTGGCTGCC
CAGGCCAACG GCACCTTCAA GGACATCGGC GGCCAGGTGC TCTACATCAA TCAGGGCCAT
CGCCTGAACT ACGGAGCGCT GGCCAGCCAC ATACCGCTGC TCTACGGCTA TGCCTATCTG
GACGTGTGTA CGGATCCGGC CACCGGCCTG CAGGTCTACT GCTACATCCA GCGCCTGGAG
CGCATCTACA TCGACGAAGC CGGCGTGCTG GGCTACTATC CGCTCAACAC CACCCAGCGC
TTCGAGTTGA TAGTGGGATT CCAGCGCTAC GGCTTCGACT ACGACGCCGA GATCTACTAC
CTCTACGGGG CCGGCTACCG ACGCTCCACG CAGAACCTAC CCGCACCGGA CCCGATCTAC
TTCTTCCAGG CTTCGGGGGC GTTCGTGGGC GACTTTTCGT TCTTTGGCTT CACCTCGCCA
GTACGTGGCT CGCGCTATCG GCTGCGGGTA ACCCCGTCGA TCGGGACCGA ACGTTTCGTA
CAGGTATTGG TTGACGGGCG CAAGTATTTC TTCTTCCGCC CGCTGACGTT CGCCGTACGC
CTGACGCACG TGGGCAACTA CGGCGCCCAA CCCCAGGGTT CGGATCGGCT CTACTTTGCC
CAGGAATATC TGGGCTTCGG AAGCACGCTG ACGTTCGTAC GCGGCTACGG CTTTTACTCG
CTGGAGCCGG ACGAATGCAC GCCCTCGACC TCAACGCCGA ACCTCTGCGC CGAAGTGGCC
CGTCTGGTGG GCACGCACGT GGCCGTAGCC AGCGCCGAGC TACGCATCCC GCTGTTTGGC
ACCGAGCGCT TCGGGCTGTT CAACTTCCCC TACCTGCCCA CCGAGCTATC GCTCTTTGCC
GACGCGGGCG TGGCCTGGTC GAAAGGTGAT CTGCCCAAGT GGAAGTTCGT GCGTCACACG
GGCGATCGAG TGCCCGTCTT CAGCACGGGC ATTTCGACAC GGTTCAACAT TCTGGGCGCC
ATGGTACTGG AGCTGTTCTA CGTCTATCCG TTCCAGCGCC CCTATAAGGG CTGGCATCTG
GGCGCCCAGC TCGTACCGGG CTGGTAG
 
Protein sequence
MRAFRIAGLL LLLTSWSLSA WAQYFGRNKV QYESFNWRVL RTPHFEIYYY PEEEVAVRDA 
ARMAERWYQR HSRTFLHEFG ERKPIIFYAD DADFHQTNAI SGEIGEGTGG VTEAIKERVI
MPFTGIYREN DHVLGHELVH SFQYDIALNR SDSLSRFNLA LLPLWLVEGM AEYLSLGRND
PHTAMWLRDA ALRDDLPTIR QLTRDLRYFP YRYGQAYLAY IGGKYGDQAV TELYKLGGIV
GVDTAIAITL GITPDSLSKE WIQAVKNTYL PLLKDRTPPE QAGRKVLAPD LDAGEINLAP
ALSPDGRYVA FLSERDLFTI DLFVADAETG KVLRRLSSSA SDPHFDAIRF INSSGSWSPD
GQRFAFITFA QGNNEIAIWN LRKGKLERRI AVEGVGAIHN LAWSPDGRTI AFSGLSGGIS
DLYLLDLETN QVRKLTDDRY ADLQPAWSPD GRTIAFVSDR GPDGTDFEIL RYGHERLALL
DLETGKVRVL RPFRNGQQIN PQFSPDGRSL YFISNHDGFK DIYRMDLNTG AVYRITRLQT
GVSGITSISP AMSVAAQNGR MMFSVFTDNK YLVFSLEPDQ LQGEPVEPES DTGIASAGVL
PPLQPPTQGL VSSYLNDPLT GLPDELALRP QPYRRKLQLD YVAPPSFGAS VGGPFGTMIA
GGVAFFFSDM LGDQQLAVAA QANGTFKDIG GQVLYINQGH RLNYGALASH IPLLYGYAYL
DVCTDPATGL QVYCYIQRLE RIYIDEAGVL GYYPLNTTQR FELIVGFQRY GFDYDAEIYY
LYGAGYRRST QNLPAPDPIY FFQASGAFVG DFSFFGFTSP VRGSRYRLRV TPSIGTERFV
QVLVDGRKYF FFRPLTFAVR LTHVGNYGAQ PQGSDRLYFA QEYLGFGSTL TFVRGYGFYS
LEPDECTPST STPNLCAEVA RLVGTHVAVA SAELRIPLFG TERFGLFNFP YLPTELSLFA
DAGVAWSKGD LPKWKFVRHT GDRVPVFSTG ISTRFNILGA MVLELFYVYP FQRPYKGWHL
GAQLVPGW