Gene Rmar_1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1750 
Symbol 
ID8568402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2033527 
End bp2036790 
Gene Length3264 bp 
Protein Length1087 aa 
Translation table11 
GC content62% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003291022 
Protein GI268317303 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACTGG TACGGGTATG CTGCACCTGG GTGGCCCTGC TTGTAGTCGG TGTTGCGACT 
GCATGGGCCC AGACAGGCAA GATTGCGGGC GTGGTGACCG ATGCGGAGAC GGGTGAGCCC
TTGCCCGGTG TGAACGTGGT CATCGAAGGG ACCACGATGG GTGCCACGAC CGACGTCGAG
GGCTACTACG TCATTCTGAA CGTGCCGCCC GGCGAATACC GGGTGCGGGC CTCGTTCGTC
GGTTACGCTC CGGTGGTAGC CGAGAACGTG GTGGTCAACA TCGGGCTGAC GACCGAACTC
AACTTCCAGT TGCAGCCGCA GGCGGTCGGT CTGGAGGAGG TGGTCGTGGT GGCCGAGGAG
CCGGTGGTGA AGCCGGACGT GTCGGCCTCG ATCGCCAACA TCCGGGCCGA GCAGGTGGAG
AACCTGCCGG TGGTCAGCGT GGCCGACGTG ATCAGCCTGC AGGCGGGGTT TGAGCCGGGG
CTGACGATCC GTGGCGCTGG CGGCGATCAG GTGGCCTTTG CCGTGGACGG GCTGACGCTG
GCCGACCCCC GTGGCAACAC GCCGATTCTG GGGGTCAGCT TTACGGCCAT CGAGGCGGTG
CAGGTTCAGA CCGGCGGCTT CAATGCGGAG TATGGCAACG TGCGGAGCGG TCTGATCAAC
GTGGTGACGA AAGAGCCGTC GACCGAGCGC TACTTCGGCG ACATCCTGAT GCGGTATTCC
ACCCCGTCAC GGCCCTACTT CGGCCCCCTG CCCAACGACA TGGAGGCCTA TTATATCAAG
CCCTTCCTTG ATACCACGCC GCTGGAGGGG TGCGCTCGCG GCGTAGCCTA TTGTGGCACG
AGTGTCTGGC CGAGGTGGAT GCAAGAGCAG TACCCGGATC ATCAGGGCTG GGATAACCTG
GCGCAGGGAA CGCCCTGGAC GCCCGAGCAG CTGTATTCGG CCTACAAGTG GCTGGTTCGG
AAGGATTTCA CCATTCGCGA ACCGAATTAT GAACTGGACG GCACCATCGG CGGACCGGTG
CCGCTGGTCA GCCGCTATCT GGGCAATCTG CGCTTTACGG CGTCCTATCG TCAGGTGCAG
ACGGCGCTGA TGTTTCCGGA GCAGCGGCCG GCCTATCAGG ATCGCATTTT CCAGGGGAAG
CTGGTCTCGG ACGTGGCCAG TGGCGTGAAG TTGACCATCG ACGGGCTTTA CCGGAAGCAG
AAGGGGCATG CAGCGCATCG CGACGGGCGC GGTACCATCC TGACCGGTGA GATGCCCCGG
TATCCCTGGG ACAACCGGGA AGATCTCCTG CCGGTCCAGA TGAACATTGG TTTCAACATC
AACATGGCGC TGTTTGGCGA CTGGGGCTTC AGTCCCACGA ACATCACCCA GTCGATGATC
GGGGCCAAGC TGACGCATGC GCTGAGTCCG GCCACGTTCT ACGAAGTGCA ACTGCAGCGC
ATCGAGTCGG ACTACTTCAC CTTCATGCCG CGTCCGCGTA AGGGCGGCAT CACCGGGCCG
AGCGAGATTG TCGTGTGTAT CCGTCGGGAT GGCACCTACA CCGACCCGGT GAACGGCCAG
TGTGCCGAAG GGGAGCTGGG CATGTCGGAA GCGCCCATGG GCTACAAGCA ATCCTATGAA
AACGCGCCAA CCCCGACACC CTTCGGGTTG CTCGGGTCGC AGGCCGGTTC GGCGCGCGAC
AGTTCGAACA TCGTACGCTA TGCCGCTCGT GTGGACCTGA CCAGTCAGGT TAACCGCGTG
CTGCAGGTCA AGACGGGGCT GGAATACATC TTCAGTGACT ACCACATCCG GCACGGCGTC
TACGACCCGG CCAACCCGCA CCACGAAAAC GAGAAGTTCC GCTGGGACCG CACGCCCGTG
CAGGCGGCCT ACTATGCGCA GGCTAAGCTG GAGTTCAAGG GCATGATCGC CAACCTGGGC
GTGCGCTTCG ATTACTTCAA CCCCAGCGGT AAGTGGTACG ACTATACGGC TTTCGACCGG
GCGCTCTCGG CAAGCGTGGG CATTCGGGGC ATCGACCAGG CCCTTTCGCG CAAGCCGGTG
AAGAAACAGC TCACGGTGAG CCCGCGCCTG GGCGTTTCGT TCCCGATCAC CGACAACAGC
AAGCTGTACT TCAACTACGG TCACTTCCGC CAGATGCTCA CGCCGCAGGA CCTGTTCCGT
GTCGAGTACA TCTCGAACGG AGCGATTTTC AGCATCGGTA ACCCGAACGT GCCGCTGCCG
CGCACGATTG CCTACGAGCT GGGCTTCGAA CAGAACATCG GTAACCAGTT CCTGCTGCGG
CTGGCGGGCT TCTACCGCGA CATGCATTAC CAGGCGCGTG AGGTCGAATA CATCAGTGTG
GACGATGCCG TGGATTACTT CCGCGTAGAG CCGCTCAATT ATGGCGACGT GCGTGGCTTC
GAGCTGACGC TCGAAAAGAA CCGGGGGCGC TGGATTCGCG GCTTCGTGAA CTACACGTAC
CTGGCGCGGA AGTTCGGCAA CTTCGGCTTC GGACAGATCA ACGAAAACCG GGCCGAATTC
CGGCAATACC TGACGACCAC GACGGACCAC TATCCCTGGG CGCCGGTACC GGAGCCGTTC
GCCCGGTTCA ACCTGGAAAT CATCGTACCC AGAGATTACG GGCTACTGCT GGGCGACTGG
CGCCTCAACC TGCTGGGCGA GTGGCGCGCC GGCGCCAAAG GCACCTGGAA CGGGCAATCT
TTCACCTTCG GCCCGGGCAA CGATCCGGAG ATTGCCTTCA ATACAAGCTG GAAGGACTAC
TACAACCTGG ACCTTCGCCT CAGCAAAAAC TTCGAGACGT CGGCAGGACG GCTCCAGTTC
TTCGTGGACG TAACCAACGT GCTCAATTTG AAGCGCATGT ACTGGAGCAA CGCGTCGCCG
TTTGAGGGAC CCAACGACAT GCTGAACTAC TTCCGCTCGT TGCACCTGCC AGGCGACATC
TTCGGCGAGG AGTTCGATCC GGGCTACGTC TGGGTGCCGG GCAACGACAA GCCGGGTGAT
TTCCGGAAGC CGGGCATACC CTACGACCCG ATCTATGCGG TGGTAGACAT CAACCAGGTG
ACGGAGCCGA TACCCGATGT CCTCTACTGG GATAAGGCAA CCGGCCAGTA TATGACCTAT
ACGAACGGCC AGTGGCAGCC CGCCGACCCG AAACGCGTGG ACTACGTGCT CAAGAACAAG
GCCTACATCG ACAACCCCGA TGAGACGTAC CTGGCCTTCC TGAATCCACG CGATGTCTAC
TTCGGCGTGC GGCTCACATT CTGA
 
Protein sequence
MRLVRVCCTW VALLVVGVAT AWAQTGKIAG VVTDAETGEP LPGVNVVIEG TTMGATTDVE 
GYYVILNVPP GEYRVRASFV GYAPVVAENV VVNIGLTTEL NFQLQPQAVG LEEVVVVAEE
PVVKPDVSAS IANIRAEQVE NLPVVSVADV ISLQAGFEPG LTIRGAGGDQ VAFAVDGLTL
ADPRGNTPIL GVSFTAIEAV QVQTGGFNAE YGNVRSGLIN VVTKEPSTER YFGDILMRYS
TPSRPYFGPL PNDMEAYYIK PFLDTTPLEG CARGVAYCGT SVWPRWMQEQ YPDHQGWDNL
AQGTPWTPEQ LYSAYKWLVR KDFTIREPNY ELDGTIGGPV PLVSRYLGNL RFTASYRQVQ
TALMFPEQRP AYQDRIFQGK LVSDVASGVK LTIDGLYRKQ KGHAAHRDGR GTILTGEMPR
YPWDNREDLL PVQMNIGFNI NMALFGDWGF SPTNITQSMI GAKLTHALSP ATFYEVQLQR
IESDYFTFMP RPRKGGITGP SEIVVCIRRD GTYTDPVNGQ CAEGELGMSE APMGYKQSYE
NAPTPTPFGL LGSQAGSARD SSNIVRYAAR VDLTSQVNRV LQVKTGLEYI FSDYHIRHGV
YDPANPHHEN EKFRWDRTPV QAAYYAQAKL EFKGMIANLG VRFDYFNPSG KWYDYTAFDR
ALSASVGIRG IDQALSRKPV KKQLTVSPRL GVSFPITDNS KLYFNYGHFR QMLTPQDLFR
VEYISNGAIF SIGNPNVPLP RTIAYELGFE QNIGNQFLLR LAGFYRDMHY QAREVEYISV
DDAVDYFRVE PLNYGDVRGF ELTLEKNRGR WIRGFVNYTY LARKFGNFGF GQINENRAEF
RQYLTTTTDH YPWAPVPEPF ARFNLEIIVP RDYGLLLGDW RLNLLGEWRA GAKGTWNGQS
FTFGPGNDPE IAFNTSWKDY YNLDLRLSKN FETSAGRLQF FVDVTNVLNL KRMYWSNASP
FEGPNDMLNY FRSLHLPGDI FGEEFDPGYV WVPGNDKPGD FRKPGIPYDP IYAVVDINQV
TEPIPDVLYW DKATGQYMTY TNGQWQPADP KRVDYVLKNK AYIDNPDETY LAFLNPRDVY
FGVRLTF