Gene Rmar_1068 details


Gene Information

Locus tag: Rmar_1068
Symbol:
ID: 8567709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodothermus marinus DSM 4252
Kingdom: Bacteria
Replicon accession: NC_013501
Strand:
Start bp: 1221563
End bp: 1223956
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table: 11
GC content: 60%
IMG OID:
Product: glycoside hydrolase family 43
Protein accession: YP_003290348
Protein GI: 268316629
COG category:
COG ID:
TIGRFAM ID:
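
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1221563
end_bp = 1223956

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the record: 2394 bp and 797 aa.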


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000256501
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGATTG TCATCTTGAA CAGGCTTCGC TTTAAGAAAG CCTCCCGTGA ATGGCCGGTA 
ATTCTCGGGG GGTGGCTGAT TCTGGCCGGA TGGATGGCGC TGGGATTTCT GGTGCGGGCG
GGGTATGCGC AGGAGCCAGG TCGTCCGGTG TACGTCAATC CGGTGATCCC CGGCGATCAT
CCAGACCCCA CGCTGACCCG GGTCGGGGCT TACTATTACA CGTCCGGTTC ATCCTTCAAC
GTCACCCCCA GGATTTACCG CTCGACGGAC CTGGTTCACT GGGAGATCGT CGGGCGGCCG
GTTTCGGCCT CCTGGTCGCT GTATGGCGAC GTGCCTGCCG GTGGGGTATG GGGGGGCCAT
ATGGTTTATT ACCAGGGACT GTACTGGCAC TTCTTCGGGC GTGGGACAGG AGATCGGGCC
ATGTATTTCG TAACGGCGCA TCGGCCGGAG GGGCCCTGGG GTATGCCGGT TCGCATGAAC
GTACCGCCCG GCATTCCCGG TCTGGGCGTG GATAATTCCA TCTTCATTGA TGAGGATGGA
CGATGGTTTT TGCTGAGCAA GAATGGTCCC CAGAATAATT ACATCGTCGA ACTGGGACCG
GATGGTCAGC CGGCCGGGGT TGTCTATGAC CTGACCTGGA TCAACCCGGA CTCGGCCGGC
AATCCTTATG GATGGGCTGA AGGGCCGGTG ATGTGGAAGT ATCAGGGCTA TTACTACTAC
AGTTTTGCGC AACATCTGGT GGGTAATCAG TACGTTATGC GCAGCGATAC CCTGTCGGAC
GATCCTGCCG ACTGGGAAGG GCCCCGGTTG CTTTTCGAAA CAGTGCCCGA TCGGTACCAG
CGGGTGTTTC GCAATCCCAA TCATTGCTCA CCGGCCGTTA CCGCCGACGA TGGGACGCAC
TGGATGATCT GTCATGCCTA CGATCAGAGC GGGCCGGGTG AAGAATGGCG AGCGCTGGGA
CGTCAGGGGC TTCTGGTTGA AGTGCGTTAC GAAGAGGGCT GGCCCGTAGC CCGCTTTCCA
ACCACGGAGC CCGTGGAGGG ACCCGCTTTG CCCAGCAGCG GCATCCCCTG GGCGGTGCCC
CGATCGGACT TCTTCGACAG GAGTCGACTG GCGGTGCACT GGTCTCTGCT GGGCTACACG
CCGGAGGAAA CATACGATCT GACGGAACGT CCCGGCTGGC TGCGGCTGAC GCCCAAAGGA
GGGCGCACGT TTCCGCCCAC GCCGGGACGG AATACTGTGC TGCAGGCCGC CGCCGAGCGG
GCCTATTCCC TGATGACCCG GGTCGATTTT GATCCGGCGA CCACCTCGGA TGAAGCCGGA
CTCTGGATTA TCAACGGTCC GGAGACGCTG CAGGCACGGT TATGCGTAAC GCGCAGCTCG
GAGGGCGAAC GTGTCGTGGC TTTTCGTTTC GATACCCTTG CGCACAGTAC GCCCCTGCCC
TCGGAAGAAC CGGTCTGGTT GAAGCTTGAA CGAGAAGGGC ATGAACTGAC CGCTTCCTTC
AGTTTGGATG GGGCAAGCTG GGCCCCGGTA GCCGAGAAGG TGAACGTGGC GCGTCTGGAC
CGCGAGCAGC CCGCTTCGGA GTCCGGTTAC GATTTTAATG CATTTACGGG CAATCAGCAG
GGATTGTACG TGCTCGGCAA TACCCCGGCC TACTTTGACC TCTATATATA TCGGGACGCC
TACTCACGCA TTCCGGCCCA GCATCCTACC AATTACAATG GCGTGATCAC TTCGAGAAAT
GGACTACCGG CGCATGCGAA TTACCTGGCC GGCATCCACG ATGGAGAATG GGCCATGTAC
GCCGGCGTGG AGTTCGGCGC GCCGGGAAGC GATTATTCCA GGATACCCCG CCAGGTTGTG
GTGACGGCTT CCAGCGCTAC CGGAGGTGGT GTGGTCGAAG TCTGGGTGGA CGCGCTGGAT
ACCGGCCAGA AAATCGCGGA AGTGCCGATC AAATCGACAG GGAGCTGGGA CGTGTATCAG
GACTTTACGG CCGAAGTGGT GCCGGTTAGT GGCCGTCATG ATGTTTTTCT GCGGTTTCGG
GGAAATCCCA CGGAGACGTT GTTTCGCATT CGTTCCCTGC TGTTCGAGGG CCAGCTGACG
GAGACGGCAA CAGGACCTGG CGCCGCGGTC CGGCCACTCC TGGTGACGCA TTATCCGAAT
CCGGTGCGCG ATGACGTGAC TTTTCTCGTT TCCCTGCCAC GCACAGGACC TGTTCGCCTG
GTGCTGTACA ACGCACTGGG GCAGCAGGTG GCCACGCTGA TTGATGCGGT GCGTCCGGCG
GGGCGGTATC CGTTGCGCTT CACCATCAAG CATCTCTCGC CGGGCCTTTA TTTCTATCAG
CTTACCACGA AAGACCAGGT GGTAACAGGG CAACTCATCG TGATTTCCCG ATGA
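
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used to keep the example short; pasting the full sequence in its place would allow a comparison with the reported GC content of 60%.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only (illustrative, not the whole gene).
first_line = "ATGTGGATTG TCATCTTGAA CAGGCTTCGC TTTAAGAAAG CCTCCCGTGA ATGGCCGGTA"
seq = clean_sequence(first_line)

print(len(seq), "bases")
print(f"GC content of this fragment: {100 * gc_fraction(seq):.1f}%")
```

The fragment's value will differ from the whole-gene figure, since GC content varies along the sequence.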
 
Protein sequence
MWIVILNRLR FKKASREWPV ILGGWLILAG WMALGFLVRA GYAQEPGRPV YVNPVIPGDH 
PDPTLTRVGA YYYTSGSSFN VTPRIYRSTD LVHWEIVGRP VSASWSLYGD VPAGGVWGGH
MVYYQGLYWH FFGRGTGDRA MYFVTAHRPE GPWGMPVRMN VPPGIPGLGV DNSIFIDEDG
RWFLLSKNGP QNNYIVELGP DGQPAGVVYD LTWINPDSAG NPYGWAEGPV MWKYQGYYYY
SFAQHLVGNQ YVMRSDTLSD DPADWEGPRL LFETVPDRYQ RVFRNPNHCS PAVTADDGTH
WMICHAYDQS GPGEEWRALG RQGLLVEVRY EEGWPVARFP TTEPVEGPAL PSSGIPWAVP
RSDFFDRSRL AVHWSLLGYT PEETYDLTER PGWLRLTPKG GRTFPPTPGR NTVLQAAAER
AYSLMTRVDF DPATTSDEAG LWIINGPETL QARLCVTRSS EGERVVAFRF DTLAHSTPLP
SEEPVWLKLE REGHELTASF SLDGASWAPV AEKVNVARLD REQPASESGY DFNAFTGNQQ
GLYVLGNTPA YFDLYIYRDA YSRIPAQHPT NYNGVITSRN GLPAHANYLA GIHDGEWAMY
AGVEFGAPGS DYSRIPRQVV VTASSATGGG VVEVWVDALD TGQKIAEVPI KSTGSWDVYQ
DFTAEVVPVS GRHDVFLRFR GNPTETLFRI RSLLFEGQLT ETATGPGAAV RPLLVTHYPN
PVRDDVTFLV SLPRTGPVRL VLYNALGQQV ATLIDAVRPA GRYPLRFTIK HLSPGLYFYQ
LTTKDQVVTG QLIVISR
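
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The record lists translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs from it only in the permitted start codons. The sketch below is a hand-rolled illustration, not the method used by the database; the `translate` helper is hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11),
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence (both start in frame).
first_line = "ATGTGGATTG TCATCTTGAA CAGGCTTCGC TTTAAGAAAG CCTCCCGTGA ATGGCCGGTA"
last_line = "CTTACCACGA AAGACCAGGT GGTAACAGGG CAACTCATCG TGATTTCCCG ATGA"

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end of the protein sequence
```

Both fragments start on a codon boundary because each full line of the listing holds 60 bases, a multiple of three. For routine work, a maintained library such as Biopython would be a more robust choice than this hand-built table.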