Gene Rmar_1069 details


Gene Information

Locus tag: Rmar_1069
Symbol:
ID: 8567710
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodothermus marinus DSM 4252
Kingdom: Bacteria
Replicon accession: NC_013501
Strand:
Start bp: 1224112
End bp: 1227105
Gene Length: 2994 bp
Protein Length: 997 aa
Translation table: 11
GC content: 61%
IMG OID:
Product: glycoside hydrolase family 10
Protein accession: YP_003290349
Protein GI: 268316630
COG category:
COG ID:
TIGRFAM ID:
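
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of this record or of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming one terminal stop codon and no splicing."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1224112, 1227105)
print(length)                     # 2994
print(protein_length_aa(length))  # 997
```

Both values agree with the Gene Length and Protein Length fields listed in this record.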


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000326536
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCACCG CTACTGATCT GCGTTTTTTA AGAAACGCGG TTGGAAACGC TTCTCGAAGC 
GCTATTTTCC TGTTTCTCGT TTTTTCGCTG GTGGGTCAGG CATGGGGCCA GACGCCGGCA
AATGTCAATG GAAGTTTTGA ATCAACACCA GCCGGGGTAG TGACGGATCT GGCCGGAGGC
GTGGAAGGCT GGGTGTTGAA CGTGGGCTCC TTGGTGACAA ATCCTCCGGT CTTTGAGGTT
GTCGAGGCGA CAGATGCGCC CCATGGTAGC AAGGTGTTGG CGGTGACGGT CAACGGGGCG
GGGAACAATC CCTGGGACAT CGAGGCGACG GCCTTCCCGG TGAACGTGGA GCCCGGCGTG
ACCTACACCT ACACGATCTG GGCGCGGGCC GAGCAGGACG GGGCGGTGGT CAGCTTCACG
GTGGGGAACC AGTCGTACCA GGAGTACGGG CGACTGCACG AGCAGCAGAT CACGACGGAG
TGGCAGCCGT ACACGTTCGA GTTTACGGTC AGTGACCAGG AGACGGTCAT TCGGGCGCCG
ATCCATTTTG GCTATGCGGC CAACGCTGGC AATACCATTT ATATCGATGC CCTCGTGATC
ATGGGTCCCG AGCCGGAGCC TGCCGGACCG GAGCTTGTCG CCAACATCAA CGGTGGATTC
GAATCGACGC CGGCCGGGGT GGTGACGGAT CTGGCCGAAG GTGTGGAGGG CTGGGATCTG
AACGTGGGCT CCTCGGTGAC GAATCCGCCG GTTTTTGAAG TGCTGGAGAC GTCCGATGCC
CCTGAAGGGA ATAAGGTGCT GGCGGTGACG GTCAACGGGG TGGGCAACAA CCCCTGGGAC
ATCGAGGCGA CGGCCTTCCC GGTGAACGTA CGTCCGGGCG TGACCTACAC CTACACGATC
TGGGCGCGGG CCGAGCAGGA CGGGGCGGTG GTCAGCTTCA CGGTGGGGAA CCAGTCGTTC
CAGGAGTACG GGCGACTGCA TGAGCAGCAG ATCACGACCG AGTGGCAGCC GTTCACGTTC
GAGTTTACGG TCAGTGATCA GGAGACGGTC ATTCGGGCGC CGATCCATTT TGGCTATGCG
GCCAACGTCG GCAATACCAT CTACATTGAT GGCCTGGCCA TTGTGGACTC GGTGGGAGCC
TGGCGGCCCG TGATTGTCGA GGCCGAAGAT GGAGAGCTTG GCTCGGAATG GGCGGTGGAG
ACCGAAGGGA ACGTCACCTA CATTACGATC ACGACCGACT ATAATGAAAC CACGGGGGAC
GCAGATCATC CCGGCGAGAA TCGGACGGCC ACTTACCAGG TGACCTTCCC GGCTCCGGGC
TGGTACGATC TCTACGCCCG GGTATATGTG GGACCGGAGA CTTTCAACGA CGACAGCTTC
TTCTATGCAG ATTCGTTTGG CGTGAAGGAT CCGGAATCGC CGGATGACTG GATCATTGCC
AACCAGTTGG CGGCGGCCGG ATATACGGAG CCCGATGAGT ATGTGACAGG CCTTGGGGCG
GCAGGTTCGG AGGTGTGGAA ATGGATAAAC CTTTCAGAGA ATAGTTTCAA CGACGTCCCC
TCCGATTCCT TCTATGTCTC GCCGGAATCG TTGACGGTGA CGTTCATGAT CGGCGCCCGG
GAAAACGGGC TCCGAATCGA CAAGCTGGCC TTTGGACGTT CGGACCTGCT CTATACGGTG
GCGGATCTGG ACACGGGCGG ACCGGGCTCG CCGGAGCCGG AAGAGCCTCC GGTGGTGTTG
CCGGAGCGGC CGCTTGCGGC TGATGTGGAT AAGTTCCTGG GTAATATTTA CAGCCCCTCT
CAGGTAGAGA ATTTCGAGTA TTACTGGAAC TGTGTGACGC CGGAGAATGC GGGCAAGTGG
GGCAGTGTCG AGGGTACGCG GGACCAGATG AACTGGAGCA GTCTGGACGC GGCGTATGCG
CTGGCTCGGG ACAATGGCTT CTGCTTCAAT TTCCATGTGC TGCTCTGGGG AGCGCAGCAG
CCTGCCTGGA TTTCGGAGCT GAGCCCGGAG GAGCAGCTGG AGGAAATTCA GGAGTGGTTC
CAGGCGGTGG CCGAGCGCTA CAGCTTCACC GCGTCGCCGT TCGACGTGGT GCAGGTGGTC
AATGAGCCGC TGCATCAGCC GCCGGATGGT CAGGAGGGAC GGGCCAACTA CATCGAGGCG
CTGGGTGGTG CCGGCGAGAC AGGCTGGGAC TGGGTGATCA CAGCCTTCGA ACTGGCCCGG
CAGATTTTCC CGGAGGGCAC GCGGCTGATG ATCAATGACT ACGGCATCCT CAGCAGTCTG
GAAACGGCCC AGCAATATCT GGAACTGATT CAACTGTTGA AGGAGCGCAA CCTGATCGAC
GTGATCGGGG TGCAGGGGCA TGCTTTCTCG ACGCGTTCCG GGGCGCCGAT TCAGGAAGTG
CTGGATCTGC TGGCCACGAC GGGATTGCCG ATTCAGGTTA CCGAGATGGA TATCGACGGC
AATCCCAATC AGAGCCCCTT CGTGACGCGG GAGCAGTCCG AGCAGAATCA GCTCCGGGAC
ATGCAGCGCA TCTTCCCGAC CGTATGGTAT CATCCGGCTG TGGAAGGGGT GACGTTCTGG
GGCTGGCGGC CCGGCCTGTG GCGCAATGAT TACGAAGCCT ACCTGGTGTA CAGCAACGGC
GCTGAGCGTC CGGCCATGGT GTGGCTGCGG GAATTCATGG AAGCCTATCG AGAGTCGTAC
CTGAGCGCCA ACGAGCCGGA AGGGACGTTG CCGGAGGAGC TTTCGGTGGT TTCCTGGCCG
AATCCGTCCC GTGGTCAGGT GCGCTTCCGC TACGCCCTGC CGTTCGAGGC GGAGGTGCGC
CTGCAGGTGT TCGACGTGCT GGGACGTGAA GTGATGACGC TGGCCTCCGG ACGGCATCGG
GCCGGGGTGT ACGAGGTGGC TTTCGACGGC CGGCATCTGC CCAGCGGACT CTACCTGTAT
CGACTGGAAG CGAATGGGCG GGTACGGAGT GGCCGGCTTG TGCTGATGCG GTAA
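
The gene sequence above is laid out in blocks of 10 bases, so the whitespace has to be removed before the GC content field can be recomputed. The sketch below shows one way to do that. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so its result is not expected to match the 61% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, used only as sample input.
first_line = "ATGCGCACCG CTACTGATCT GCGTTTTTTA AGAAACGCGG TTGGAAACGC TTCTCGAAGC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Pasting the full sequence into `clean_sequence` in place of `first_line` would give the value to compare against the GC content field.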
 
Protein sequence
MRTATDLRFL RNAVGNASRS AIFLFLVFSL VGQAWGQTPA NVNGSFESTP AGVVTDLAGG 
VEGWVLNVGS LVTNPPVFEV VEATDAPHGS KVLAVTVNGA GNNPWDIEAT AFPVNVEPGV
TYTYTIWARA EQDGAVVSFT VGNQSYQEYG RLHEQQITTE WQPYTFEFTV SDQETVIRAP
IHFGYAANAG NTIYIDALVI MGPEPEPAGP ELVANINGGF ESTPAGVVTD LAEGVEGWDL
NVGSSVTNPP VFEVLETSDA PEGNKVLAVT VNGVGNNPWD IEATAFPVNV RPGVTYTYTI
WARAEQDGAV VSFTVGNQSF QEYGRLHEQQ ITTEWQPFTF EFTVSDQETV IRAPIHFGYA
ANVGNTIYID GLAIVDSVGA WRPVIVEAED GELGSEWAVE TEGNVTYITI TTDYNETTGD
ADHPGENRTA TYQVTFPAPG WYDLYARVYV GPETFNDDSF FYADSFGVKD PESPDDWIIA
NQLAAAGYTE PDEYVTGLGA AGSEVWKWIN LSENSFNDVP SDSFYVSPES LTVTFMIGAR
ENGLRIDKLA FGRSDLLYTV ADLDTGGPGS PEPEEPPVVL PERPLAADVD KFLGNIYSPS
QVENFEYYWN CVTPENAGKW GSVEGTRDQM NWSSLDAAYA LARDNGFCFN FHVLLWGAQQ
PAWISELSPE EQLEEIQEWF QAVAERYSFT ASPFDVVQVV NEPLHQPPDG QEGRANYIEA
LGGAGETGWD WVITAFELAR QIFPEGTRLM INDYGILSSL ETAQQYLELI QLLKERNLID
VIGVQGHAFS TRSGAPIQEV LDLLATTGLP IQVTEMDIDG NPNQSPFVTR EQSEQNQLRD
MQRIFPTVWY HPAVEGVTFW GWRPGLWRND YEAYLVYSNG AERPAMVWLR EFMEAYRESY
LSANEPEGTL PEELSVVSWP NPSRGQVRFR YALPFEAEVR LQVFDVLGRE VMTLASGRHR
AGVYEVAFDG RHLPSGLYLY RLEANGRVRS GRLVLMR
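
The protein sequence can be cross-checked against the gene sequence by translating it. The following is a minimal sketch that builds the codon table by hand instead of relying on a bioinformatics library. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGCGCACCG CTACTGATCT GCGTTTTTTA"))  # MRTATDLRFL
```

The output matches the first block of the protein sequence listed above. Translating the full gene sequence the same way should give all 997 residues.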