Gene Rmar_1759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1759 
Symbol 
ID8568411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2053245 
End bp2056541 
Gene Length3297 bp 
Protein Length1098 aa 
Translation table11 
GC content64% 
IMG OID 
ProductASPIC/UnbV domain protein 
Protein accessionYP_003291031 
Protein GI268317312 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.208023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAAAC TCCTGTTGGT GCTCGGCCTT GCCGGCCTGT CCGCCTGTCG GCCCGCCGAA 
CCGCCGCTGT TCGAGCGGAT GGATCCTGAC AGGACCGGCA TCACCTTCGT CAACGAGGTG
CCCGTCGATA CGGCTTTTAA CATTATTAAT TATATGTACT ACTATGATGG GGCCGGCGTG
GCGGCGGGCG ACTTCAATGG CGACGGGTGG CCGGACCTGT ACTTTGTCGC CAATCGGGGA
CCGAACCGCC TGTTCCTGAA CCGGAGCGAC TGGCGTTTTG AGGACGTCAC CGACGCGGCT
GGCGTGGCCG GCTCGGGCAA CTGGAACACC GGCGTGGCCG TGGCCGACGT GGACGGCAAC
GGCTGGCTCG ATCTCTACCT GGTCACCTTC AGCAACTACC TGGATCGCAC CGGCCGCAAC
CAGCTCTTTC TGAACCAGGG GCCGGATGAG ACGGGCATCC CACGTTTTCG GGAAGCGGCG
GCTGAATTCG GACTGGACAT GGCCGCCTAC GGCACGCAGG CCGTGTTTTT CGACTATGAT
CGGGATGGCG ATCTGGACCT GTATTTGCTG AACCGGGCAC TGCACACCGA CGAGAGTTTC
GGCCCGGCCG AGCCGCTGCG CCATCGTTTC GATCCCAACG CCAGCGACCG GCTGCTGCGC
AACGACGACG GGCGCTTTGT GGACGTGACG GCCGAGGCCG GGATCGTGGA CGGCCTCATC
GGCTACGGAC TCGGTGTGGT GGTGAGCGAC CTGGATCAGG ACGGCTGGCC CGATCTGTAC
GTGGCCAACG ACTTCCATGA AGACGATCGG ATCTACCGCA ACAACGGCGA CGGCACCTTC
ACCGACGTGC TACGTACGGC CACGGCCTAC ATTTCGAAGG CGTCGATGGG GGTGGATGCG
GGCGACGTCG ACAACGACGG CCTGCCCGAT CTGATCGTGC TCGACATGAT GCCCTTCGAT
CCGATCGTTT TCAAAACGGC CGACGGACCG GAGTCGTTCG AGCTATTTCA GCGCAAGCGG
CAGTTCGGCT ATCATCCCCA GTATCCCCAC AACGTGCTGC TGCGCAATCT GGGCGCCTGG
CAGTTCGTGG ACGTGGCCTT TCGGGCCGGC GTGGCCGCTA CCGACTGGAG CTGGGCCGCG
CTGCTGGCCG ACCTGGACAA CGACGGCTAC CAGGACCTCT TCGTTACGAA CGGCATCTAT
CACCGCCCGA ACGATCTGGA CTACATCCGC TACGTCGGAC AGCCCGAAAT CCAGGAGGCC
CTGGCGCGCG GCATCACGCC GGAGCTGCTG GAGGACCTGT TGCGCCATAT GCCGCAGGTG
CCACAGCCCA ACTTTGCTTT TCACAACAAC GGCGACGGCA CCTTCACGAA CCGAACGCAG
GCGTGGGGGC TGGGGCGGCC GGGCTTTTCG ACCGGGGCCG TGTATGTGGA CCTGGACCGC
GACGGCGATC TGGATCTGGT CACCAGCGAG ATCAACGCCC CCGCGGCCGT GTATCGCAAC
CACACGCGTG AACGCCACAG GACCCATTAC CTGCGCGTCG TGCTGGAAGG CGAGGGGATG
AACCGGTGGG GTATCGGCGC CCGGGTGACT GTGCACTACG GCGACAGCCT GCAGCTTCGA
GAGCTGCAAC CCGTACGTGG CTGGCTCTCG TCGGTCGAGC CCGTGCTGCA CTTCGGGCTG
GGGGCCCGCA CGCAGGTGGA TTCGGTGACG GTGGTATGGC CCGACGGTCG TTATGAAGTG
CGCCGCAGCG TGGCGGCCGA TCAGACGCTG ACGTTCCGCC AGGCCGAGGC GCAGGTGCGT
TATCATCCGC CGGCGCTGCC ACGTCCGCTT TTCCAAGAAG TGTACGAGGC ATTGCCGTAT
CGCCACGAAG AGAACGCCTT TGTGGATTTT ACCCGCGAGC CGCTCCAACC GCACCGGCTC
TCGCGTGAGG GACCAGCACT GGCCGTGGGC GACGTGAACG GCGACGGGCT GGACGACGTG
TTTCTGGGCG GGGCCAAGTG GCAGTCGGCC CGGTTGCTCG TGCAGCAGCC GGACGGGACG
TTTCGGCCGA CCAACGAAGC ACTCTGGGAG GCCGAAAGCC GTTACGAGGA TGTGGATGCG
GCGTTTTTCG ACGCGGACGG CGACGGCGAT CTGGACCTGT ACGTGGTCAG CGCGGGCAAC
GAGTGGTGGG GCCAGGCCGA GGCGCTGCGC GATCGGCTCT ACCGCAACGA CGGCCGCGGG
CAGTTCCGCC GTGACGAGCA GGCGCTGCCG GATCTGTTTG TGAACGGCTG TTGCGTGCGG
GTGGCCGATT ACGACGGGGA TGGTGACCCG GATCTGTTTG TGGGCGGGCG GGTCGAGGCC
CGTCGCTACG GCGAAGCGCC GCGCAGTTTT CTGCTGGAAA ACCGGGGGGA CGGAACGTTT
GCGGACGTTA CCGAGGCGCG TGCGCCGGCA CTGGCCCGCG TGGGCATGGT GACCGATGCG
GTGTGGGAGG ATTTTAATGG GGATGGGCGG CTGGATTTGC TTGTGGTAGG CGAATGGATG
CCGCTAACGC TGCTTTCCCA GGACGCAGAC GGTCGTCTGA TGCCTGTCGC GCTGGAAAAC
ACCGAGGGCT GGTGGTTCAG CGTCCAGGCA GCCGATCTCG ACCAAGATGG CGATCTGGAC
TTTGTAGCCG GCAACCTAGG GCTGAATGCG TCGCTGCAGG CGACTCCTGA TCGGCCAGTG
ATGCTCTACC TACATGATTT TGATCGAGAT GGACAGACCG ATCCTGTGCT GGTAGCCTAC
TGGGACCGAC AGGCTTATCC GGTCGCAACG ATCGACCTGC TGGTGCGGCG TTTTCCGGAG
TTGGGACAGC AATTCGAGAG CTATCGGTCC TGGGGAGCAC GGACGCTGGA CGAGCTATTT
GGCCAAGAAG CACTGCGCCA GGCAACAGTT CGGCAGGCCT ATACCTTTGC TTCGGTATGG
GCTGAAAACG ACGGACAAGG ACATTTCACT TTGCACTCAC TGCCCGAACC GGCGCAGTGG
TTTCCGGTGC GGGCGTTGCA GATCAGCGAT GTGACAGGAG AGGGGCGGCC TGATATCATT
GCAGCCGGTA ACTTCGACGA GGCCAATCCG GCGCTTGGGC ACTACGGCCA TGGGCCGGGC
GCGGTGCTGG TACAGACGAA AACGGGAACG TTCATGCCTT TACGTCCCGA TGCTTCTGGC
TTGATCCTAC GCGGCCAGGT TCGTCACCTC AGCTGGCTAC AGCGCCCCGA TGGGCAGCGA
TGGTTGTTGG CCGCTCGAAA CGACACCTCC GTACAAGTGC TGGCGCTGCG CTACTAA
 
Protein sequence
MRKLLLVLGL AGLSACRPAE PPLFERMDPD RTGITFVNEV PVDTAFNIIN YMYYYDGAGV 
AAGDFNGDGW PDLYFVANRG PNRLFLNRSD WRFEDVTDAA GVAGSGNWNT GVAVADVDGN
GWLDLYLVTF SNYLDRTGRN QLFLNQGPDE TGIPRFREAA AEFGLDMAAY GTQAVFFDYD
RDGDLDLYLL NRALHTDESF GPAEPLRHRF DPNASDRLLR NDDGRFVDVT AEAGIVDGLI
GYGLGVVVSD LDQDGWPDLY VANDFHEDDR IYRNNGDGTF TDVLRTATAY ISKASMGVDA
GDVDNDGLPD LIVLDMMPFD PIVFKTADGP ESFELFQRKR QFGYHPQYPH NVLLRNLGAW
QFVDVAFRAG VAATDWSWAA LLADLDNDGY QDLFVTNGIY HRPNDLDYIR YVGQPEIQEA
LARGITPELL EDLLRHMPQV PQPNFAFHNN GDGTFTNRTQ AWGLGRPGFS TGAVYVDLDR
DGDLDLVTSE INAPAAVYRN HTRERHRTHY LRVVLEGEGM NRWGIGARVT VHYGDSLQLR
ELQPVRGWLS SVEPVLHFGL GARTQVDSVT VVWPDGRYEV RRSVAADQTL TFRQAEAQVR
YHPPALPRPL FQEVYEALPY RHEENAFVDF TREPLQPHRL SREGPALAVG DVNGDGLDDV
FLGGAKWQSA RLLVQQPDGT FRPTNEALWE AESRYEDVDA AFFDADGDGD LDLYVVSAGN
EWWGQAEALR DRLYRNDGRG QFRRDEQALP DLFVNGCCVR VADYDGDGDP DLFVGGRVEA
RRYGEAPRSF LLENRGDGTF ADVTEARAPA LARVGMVTDA VWEDFNGDGR LDLLVVGEWM
PLTLLSQDAD GRLMPVALEN TEGWWFSVQA ADLDQDGDLD FVAGNLGLNA SLQATPDRPV
MLYLHDFDRD GQTDPVLVAY WDRQAYPVAT IDLLVRRFPE LGQQFESYRS WGARTLDELF
GQEALRQATV RQAYTFASVW AENDGQGHFT LHSLPEPAQW FPVRALQISD VTGEGRPDII
AAGNFDEANP ALGHYGHGPG AVLVQTKTGT FMPLRPDASG LILRGQVRHL SWLQRPDGQR
WLLAARNDTS VQVLALRY