Gene Rmar_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1941 
Symbol 
ID8568597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2266979 
End bp2269690 
Gene Length2712 bp 
Protein Length903 aa 
Translation table11 
GC content64% 
IMG OID 
Productglycoside hydrolase family 2 sugar binding protein 
Protein accessionYP_003291212 
Protein GI268317493 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.615536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATCT CCGGTTCTCG CTATCGGATG TTTTTAGGAG GCGGGCTGCT CCTGCTGCTG 
ACGATCCTGC CATTGCAGGC ACAACCCGCG CAGGAGCTGC TGCGCGAGCG GGTTTTGCTT
AATGAAGACT GGCGCTTCTT CAAGTACCCC TCCCGGGCGG AAGCCGACGG ACTGGTCTAC
GACGTGTGGC CGATGTACCG GATGGATGCA GACACCGCCG GAGCACCGAA GGAGGCGATC
GTCCTTAAGC CCTGGATTCT GCCCACGGGC AATCCTTTCA TCAAAGATCC GGCCCGACGT
TACAGGCGAC CCCCGGGACA TCCCGGGCGT GACTTCCCGT TCGTGCAGCC GGACTTCGAC
GATAGCGGCT GGGAACGAGT AGACCTGCCG CACGACTGGG CCATTCGGGG CCCCTTCGTG
GAACAACCAC ATCCCGAGGT GGACGGCGGC ATGGGCTGGC TGCCCAGCCC GGGCGTGGCC
TGGTACCGGA AGAAAATCTT CATCCCGGCC GCCGATTCGG AGCGGGCGAT CTACCTGGAC
GTCGAGGGCG CGATGGCTCA CGCCGTCGTC TGGCTGAACG GCTACCTGGT GGGTGGCTGG
CCCTACGGCT ACGCTTCCTG GCGCGTGGAT CTGACCCCAT ACGTCCGCTG GGGCGCGGAA
AACCAGCTTG CCATCCGACT TGAGGTGCTT CCACGCTCGT CGCGCTGGTA TCCCGGCGGT
GGATTGTACC GCAACGTCTG GCTGGTCAAG ACGCATCCGG TGCACGTGGC GCAATGGGGC
ACGTTCGTGC GCACACCCTA CGTCTCGACC GACTCGGCCA CCGTTGTGCT GGATGTGACC
ATCGAAAACC GATGGGCCCG GGCCGTCTCG GTGCGCGTTA CGACAGAGAT CTTCGAGCTG
GGCGAAGAGG ACCGGCCACG GGGCGATGCC GTCGCCGCGT TCCCGCCCCG GCAGGTGCGG
GTACCGGCGC AGGGGCAGGC TACGGCGTCC GGCATGGCGA CGGTTCGCAA CCCGAAGCTG
TGGGGACCTC CGCCCACGCA GACGCCTCAC CGCTACGTGG CCGTCACCAC GCTGTGGCTC
GACGGCCGCC CGGTCGATCG CTACGAAACC CGCTTCGGCA TCCGCTCGCT GCGGTTCGAT
CCGGAGCGCG GGCTGCTGGT CAACGGCGAG CGCATCGAAA TCAAAGGCGT TAACCAGCAT
CACGACCTGG GGCCGCTGGG CGCGGCCTTC AACCGAAGGG CGGCCGAGCG CCAGCTCGAA
CTGCTGCGTG AGATGGGCGC CAACGCCATT CGTACCGCCC ATAACCCGCC AGCTCCCGAG
CTGCTGGAGC TTACCGACCG CATGGGCTTC CTCGTGCTCG ACGAGATCTT CGACGTCTGG
GAGATGCGAA AAACCCCGCT CGACTTCCAC CTGATCTTTC CGGACTGGCA CGAGCAGGAC
CTGCGCGCCT TTATCCGCCG CGACCGCAAC CACCCGTCGG TCATTCTCTG GAGCTTCGGC
AATGAAGTGG GCGAGCAGGG AAGCGGCCGC GAGGGGGCCG AGCTGGCCCG GCGACTGGCC
CGCATCATCA AAGAGGAGGA TCCCACGCGG CCGGTGACGG CGTCTATGAA TTTCGCCCGG
CCGGGGACGC CGATGCCCGA GGTGGTGGAC GTCATCTGCC TGAACTATCA GGGTGAAGGG
ATCCGGGACA TGCCGGCCTA TACTGGTTTG CAGGGGATCA CCACACCACC CCTTTACAGC
GCATTTCGCA TACATTTCCC TACAAAAATG ATTATCAGTT GTGAAAATGC GGCCGCTGTA
AGTTCGCGGG GCGCCTATCT CTTTCCAGTT ACAGATGCCC TGAGTGCACC TGTTCGTGAA
GGCCAGGGGG GTGATCCGGT TACCAGGCAG GTAAGCGGCT ATGAGCTATA TACGGCGGAT
TTTGGCTCTT CGGTGGACAA GGTGTTTTTC GTGCAGGAAC TCCATCCTTT TGTGGCCGGC
GGTTTCGTCT GGAGCGGCTG GGACTATCTG GGCGAGCCCA CGCCCTACTA CAGTTCGCGC
AGCTCTTATT TCGGGATCAT CGATCTGGCC GGTTTCAAAA AGGATCGGTT TTACCTGTAT
CAGGCCCGCT GGCGTCCCGA GCTACCGATG GTGCATATCC TGCCCCACTG GAACTGGCCG
GATCGCGTGG GGGAGATCAC GCCCGTGCAT GTGTTCACCT CGGGCGACGA GGTGGAACTG
TTCCTCAATG GCCGATCGCT CGGCCGCAAG AAAAAAGGGC CCTACCAGTA CCGCCTGCGC
TGGGACGACG TGCGCTACGA GCCCGGCGAA CTACGCGCCG TGGCCTATAA AAACGGCCGG
AAGTGGGCCG AAGCCGTCGT GCAGACCACC GGCCAGCCGA CCGCGCTGGC CGCCGAGGCG
GACCGTGTCC GGATCCAGGC CGACGGCTAC GACCTGGCAT TCATCACGGT GCGGGTGGTC
GATGCAGAAG GGCGAACCGT GCCCACGGCG GACAATCCGG TGCGTTTCAC CGTCGAAGGA
CCCGGCGAGC TGGTGGCCAC GGCCAACGGC GACCCGACCA GCTTCATTCC GTTTTCTTCG
GATGAGCGGC CGGCCTTCAA CGGGCTGGTG CTGGCCATCG TGCGCGCCCG GCGTGGCATG
CCCGGCACTA TTACCGTCAC GGCCAGCGCC CCCGGTCTGC GTCCGGCTCG TATCGTGATA
GAAAGCCAGT GA
 
Protein sequence
MRISGSRYRM FLGGGLLLLL TILPLQAQPA QELLRERVLL NEDWRFFKYP SRAEADGLVY 
DVWPMYRMDA DTAGAPKEAI VLKPWILPTG NPFIKDPARR YRRPPGHPGR DFPFVQPDFD
DSGWERVDLP HDWAIRGPFV EQPHPEVDGG MGWLPSPGVA WYRKKIFIPA ADSERAIYLD
VEGAMAHAVV WLNGYLVGGW PYGYASWRVD LTPYVRWGAE NQLAIRLEVL PRSSRWYPGG
GLYRNVWLVK THPVHVAQWG TFVRTPYVST DSATVVLDVT IENRWARAVS VRVTTEIFEL
GEEDRPRGDA VAAFPPRQVR VPAQGQATAS GMATVRNPKL WGPPPTQTPH RYVAVTTLWL
DGRPVDRYET RFGIRSLRFD PERGLLVNGE RIEIKGVNQH HDLGPLGAAF NRRAAERQLE
LLREMGANAI RTAHNPPAPE LLELTDRMGF LVLDEIFDVW EMRKTPLDFH LIFPDWHEQD
LRAFIRRDRN HPSVILWSFG NEVGEQGSGR EGAELARRLA RIIKEEDPTR PVTASMNFAR
PGTPMPEVVD VICLNYQGEG IRDMPAYTGL QGITTPPLYS AFRIHFPTKM IISCENAAAV
SSRGAYLFPV TDALSAPVRE GQGGDPVTRQ VSGYELYTAD FGSSVDKVFF VQELHPFVAG
GFVWSGWDYL GEPTPYYSSR SSYFGIIDLA GFKKDRFYLY QARWRPELPM VHILPHWNWP
DRVGEITPVH VFTSGDEVEL FLNGRSLGRK KKGPYQYRLR WDDVRYEPGE LRAVAYKNGR
KWAEAVVQTT GQPTALAAEA DRVRIQADGY DLAFITVRVV DAEGRTVPTA DNPVRFTVEG
PGELVATANG DPTSFIPFSS DERPAFNGLV LAIVRARRGM PGTITVTASA PGLRPARIVI
ESQ