Gene Rmar_2405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_2405 
Symbol 
ID8569070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2788518 
End bp2791754 
Gene Length3237 bp 
Protein Length1078 aa 
Translation table11 
GC content66% 
IMG OID 
Productpeptidase S9B dipeptidylpeptidase IV domain protein 
Protein accessionYP_003291671 
Protein GI268317952 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.666802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCTCG GATGTCTTGT GGGGTTGCTG CTGTTTGCTT CGGCCATGCC GGCCGCACAG 
GCGCAGTACT TCCGCTTCGG CAAAAACAAA GTCCACTACC GGACGCCCAC CTGGTACTAC
ATCCAGTCGC AACACTTCGA CATCTACTAC TACGAAGGCG GCTACGAGCT GGCCAGCTTC
ACGGCCGAAG CCGCCGAAGC CGCCTATCAG GAGCTGGTCG AGCTGTTTCA GTATGAGCTT
TCCGGCCGCA TTCCCATTCT GGTCTATCAG AGCCATCACG ATTTCACCGT CACGAACGCG
GTCGATCTGC CGGATTACAG CGAGGGCATC GGAGGCGTCA CGGAACTGTA CAAGAACCGG
ATCGCCGTGC CCTTCATGGG CGACTACCGG GACTACCGCC GGGTGGTTCA CCACGAGCTG
GTCCATGCCG TACTGAACGA CATGTTCTAC GGCGGCTCGC TGCAATCCAT TCTCCAGAAC
AACCTGCAAC TGGTGCTGCC GCTCTGGTTC AACGAGGGGC TGGCCGAATA CGCCGCGCTG
GGCTGGGACA CGAACTCCGA TATGTACGTG CGCGAGGCGA TCCTGAACGA TCATCTTGAT
CCGATCCCCT ACCTGTCGGG CTACTTTGCC TACCGGGGCG GCCAGAGCGT CTGGGACTAC
ATCGCCGAGC AGTACGGCCG TGAAAAGATT GCCGAGATCC TCCAGCGCGT GCGCCTGACG
CACTCGGTGG AGGCCGGCAT CCGACAGGCG ACGGGCCTCT CGCTGCGCGA ACTCTCGGAG
CGCTGGCACA AGGCGCTGCG CGAGATCTAC TATCCGGAAC TGACCGCCCG CGAGCAGCTC
GACGACATCG GCCGACCGCT GCTCACGGCC CGCAACGCGG GCTACTACAA CACGAGTCCG
GCCCTCTCGC CTCAGGGCGA CCGCATCGCC TTCATCACGA CGCGCAACGG CCTGTTCGAC
GTGTACCTGG CCAGCGCCAA CGACGGCAAG ATCCTGCGCC GGCTGGTGGC GGGCCAGACC
AGCCCCGACT TCGAAAGCCT GCGCATTCTG ACGCCCGGCC TGACCTGGAG CCCGGACGGC
CGCTTTCTGG CCCTGGCCGT CAAGAGCGGT CCCACCGATG CCATCGCGGT GATCAACGTC
GAGACCGGCG CGCACGTGCG CTACCGCATC CCGGACGTCG AGCAGATCCT GTCGCTGGCC
TGGAGTCCGG ACGGCCGCCG GATCGCCTTC GCCGGAACGC AACGGGCTCA GAGCGACATC
TACGTGCTGG ACCTGCGCAC CGGCGAGACG ATCAACTACA CGAACGACGC CTTCAGCGAC
CACGAACCCG CCTGGCGCCC CGACGGTCGG GCGCTCGTGT TTCACAGCGA CCGGGGACCC
TACGTGGAGC CCGGCCGCTA TCAGGCCGGT CAGTTCGATC TGACCGCCCG GCTCTCGCGC
GGCTACGACC TCTACCTGCT GCACCTCGAC CCGGTGCGCA TCGAGCGGCT GACGACCACC
GAGCCCTGGG ACGACCGGAG CGGACGCTTC GGCAGCGACC CGGATCGACT GCTGTTCATC
TCGGACCGCA ACGGCATTCC GAACCTGTAC GAAAAAGACC TGCGCACCGG CGCCGAGCGG
CCGTTGACCG ACGTGGTCAT GGGCATCCAG CAGGTCTCGC TCTCAGCCGA CGGCCACAGG
GCCGCCGTGG TCAGCCTGCG CGAGGGCGTT CCTTCGATCT ACCTCATCAA GAATCCCTTC
GAGCGCCACC TGGCATCCGA TACGCTGGCG CCGACCGTCT GGGCCCAGCG GCGCCTGCGC
CAGGTGCCCC GGCCGGCCCC GGCGCTGGCG CTGGCCTCCG AAGCGCTGCG CCAGCGCAAT
CCCTTCCTGC GCGACGCCAG CTACACGACG CCGCCTGCTG GCCCGCTGCT GGCCGCCTCC
GAGCCGGCCG GCAGCAACGG CACCAACGGC CATGGCGAGG CGCCCGACTC CACACGCTAC
GGCACGCTCC GCGTGGACTT CCGCAACTAC GTGTTCAGCT CGGCCTTCGA CGAGGCGCGT
CCGCCCCGGG CCACACCTTC CTACTACAAT GCGGATCCCT TCGCACCAAA AGACAACGTG
GACGAAAGCG GCCGCTACCG TCCCCGGCGG TACCGCCTCT ACTTCACGCC CGATCTGGTC
TATGGCGCTG CCGGCTACGA CATGCTCTAT GGCGTGCAGA GCGTCACCCA GATGATGTTC
AGCGACATGC TGGGCAACCA CCGCATCTGG GCCGCCACAA ACCTGCTCGT GGACCTGCGT
AACTCCGATT ACCTCATCGC CTACAGCTAC CTGCCGCGCC GCACCGACTG GACGGTGGCC
GTCTACCACG TGGCCCGCCT GCTGCCGGAC TACGCGCTGC GCACGCTCTA CCGCTACCGG
CACTACGGGT TGAACCTCAG CGCCAGCTAC CCGCTCAACA AATTCGAGCG CTTCGACCTC
GGCCTGGCCT ACATGGGCGT CAACCAGACC GACATCGGCA ACCTGGCGCG GCCCCCGGTC
ACGCGCACGC TCTTCTATCC GTCCCTCACC TACACGCGCG ACGTGAGCGT GCCGGGACTG
CTGGCACCCA TCGGCGGCCA TCGGCTGGCC CTGCAGCTCT CCGGAAGCCC CGGCAACCTG
CTCTACGGCC GCCAGATCCG CTTCGTGACG CTGCTGGCCG ACGCGCGCAC CTACACTTCG
TTCGGCCGCG GACTCTACAG TTTCGCCTTC CGACTGGCCG GCGGCGCTTC GTTCGGACCG
AATCCCCAGC TCTTCTACTC GGCCGGGGTG GAAAACTGGA TCAACCGTCG CTTCGACAGC
TTCCCGATCG AAGACCTGAC CGACTTCGTC TTTGCCACGC CAGTCCTTCC GCTGCGCGCC
ACCGACATCA ACACGCTCAA AGGCCCCTAC TTCGGCCTGT TCAATGCCGA ATTCCGCTTT
CCGCTTGTAG CCGCCCTGCT GCCGGGTCCG CTCCCCCTCC TCCCGCTTTA CAACCTGCAG
GGCACGGCCT TTCTGGACGC GGGGGCCGTG TGGGGAAGCC CCTCGAACCG CCGCCTGAAC
CTCTTCCGGC GCGACGAACA CGGCCGCCAG GTGCTCGACG ACCTGCGCGT GGCCGGTGGC
CTGGGGCTGC GCACCATCCT CCTCGGCTTT CCGTTCCGCT TCGACTTCGC CTGGCCCTTC
GACGGCCGCC GCTTCCTCCA CCGACGGTTC TATTTCTCGG TAGGTCTTGA TTTTTGA
 
Protein sequence
MRLGCLVGLL LFASAMPAAQ AQYFRFGKNK VHYRTPTWYY IQSQHFDIYY YEGGYELASF 
TAEAAEAAYQ ELVELFQYEL SGRIPILVYQ SHHDFTVTNA VDLPDYSEGI GGVTELYKNR
IAVPFMGDYR DYRRVVHHEL VHAVLNDMFY GGSLQSILQN NLQLVLPLWF NEGLAEYAAL
GWDTNSDMYV REAILNDHLD PIPYLSGYFA YRGGQSVWDY IAEQYGREKI AEILQRVRLT
HSVEAGIRQA TGLSLRELSE RWHKALREIY YPELTAREQL DDIGRPLLTA RNAGYYNTSP
ALSPQGDRIA FITTRNGLFD VYLASANDGK ILRRLVAGQT SPDFESLRIL TPGLTWSPDG
RFLALAVKSG PTDAIAVINV ETGAHVRYRI PDVEQILSLA WSPDGRRIAF AGTQRAQSDI
YVLDLRTGET INYTNDAFSD HEPAWRPDGR ALVFHSDRGP YVEPGRYQAG QFDLTARLSR
GYDLYLLHLD PVRIERLTTT EPWDDRSGRF GSDPDRLLFI SDRNGIPNLY EKDLRTGAER
PLTDVVMGIQ QVSLSADGHR AAVVSLREGV PSIYLIKNPF ERHLASDTLA PTVWAQRRLR
QVPRPAPALA LASEALRQRN PFLRDASYTT PPAGPLLAAS EPAGSNGTNG HGEAPDSTRY
GTLRVDFRNY VFSSAFDEAR PPRATPSYYN ADPFAPKDNV DESGRYRPRR YRLYFTPDLV
YGAAGYDMLY GVQSVTQMMF SDMLGNHRIW AATNLLVDLR NSDYLIAYSY LPRRTDWTVA
VYHVARLLPD YALRTLYRYR HYGLNLSASY PLNKFERFDL GLAYMGVNQT DIGNLARPPV
TRTLFYPSLT YTRDVSVPGL LAPIGGHRLA LQLSGSPGNL LYGRQIRFVT LLADARTYTS
FGRGLYSFAF RLAGGASFGP NPQLFYSAGV ENWINRRFDS FPIEDLTDFV FATPVLPLRA
TDINTLKGPY FGLFNAEFRF PLVAALLPGP LPLLPLYNLQ GTAFLDAGAV WGSPSNRRLN
LFRRDEHGRQ VLDDLRVAGG LGLRTILLGF PFRFDFAWPF DGRRFLHRRF YFSVGLDF