Gene Rmar_1758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1758 
Symbol 
ID8568410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2049764 
End bp2053033 
Gene Length3270 bp 
Protein Length1089 aa 
Translation table11 
GC content55% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003291030 
Protein GI268317311 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.208023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCTAC TGGCTGCTCT TGGGACAACC GGGGCATGGG CTCAGACCGG TAAGATTGCC 
GGTCGGGTGG TGGATGCAGA GACAGGGGAG CCATTGCCCG GGGTGAATGT CGTGATTGAA
GGAACTACGG TTGGGGCTAC GACGGACATT GACGGGTACT ATACGATCAT CAACGTCCGT
CCGGGCACCT ACACACTTCG CGCCTCTTTT GTGGGCTATG TGCCCCAGGT TGTAGAGAAC
GTGCAGGTAG ATATTGGCCT GACGACAGAG GTGAACTTCG CGTTGCGTCC GACGGCTATC
GGGCTGGAGG AAGTGGTGGT GCAGGCGGAG CGCCCGATCG TGCAACCGGA TATTTCAGCC
AGTATTGCTA ATATCGATGC GGCAGTCATT GAAGCGCTCC CGGTAGCTAC CGATGTAGTG
CAGGTGATTG GCTTGCAGCC AGGCTTTGAG CCGGGGTTGG TCGTGCGTGG TTTTGGAGGA
AATCAGGTGG CTTTTCTGCT TGATGGAATG AATCTGGCTG ATCCGCGTAC GAACGCGCCT
TTCACCGGTG TGAGCTTCAC GGCTGTTGAA GAGGTGCAGG CGCAGACGGG TGGCTTTACG
GCCGAGTACG GCAACGTGCG GTCAGGGTTG ATCAACGTAG TGATGAAGGA ACCGCGGACG
AATCGCTATA CGGTGGATGC CATTTTGCGC TATGCGCCGG CGCAACCCAA GACGTTTAAT
GGCGATGCCA ATGACAGGGA TTTCTTCTGG ATGCGGCCGT ACTTTGATCC CGAAGTAGCT
ATGATTGGCA CGGAAGCGGC CTGGGACGAT TGGACGGAGC GCCAGTATCC GAAATTTGAA
GGATGGGAGG AGGCGGCCAA TGATTATCCA GCCAATAATG ATAGCGATCC TTCTAATGAT
GTGACTGCAG CGCAGTTGCA GAAGGCTTAT GAGTGGATGA CGCGGAAGAA TACCGGCATT
GATGATCCAG AGTATCAGAT TGACGCTACA ATTACGGGGC CAGTGCCGGC CATTGGGGCT
TCACTGGGGA ATTTGCGCTT TCTGGTTTCA CATCGCCGCA CGCAGTCGGC TTACATCATC
CCCCAGCGTC GCCGCACTTA TCGGGACTAC ACCACCCAGG CCAAGCTAGT TTCTGATATT
GCGGCTGGTG CCAAGCTAGA GCTGGTGGGA CTCTGGTCCA AGCGGAAGGG GCTGGTGCGG
CCAATCGCAA TTGACCAGGG GGCAACGAGT ACCATGCTGA GTGGGGATCC CCCAGCCTAT
CCCTGGGATT GGCGCTATGA TTTGGAGAAC AGAATTCTGG GCGATGATGG GGTGGAAGGC
CACGTAGCGC GTGCCGCACT CTACGGCGAC TGGGTCATTA ATCCAATGGA TATTGACTAT
GCGCTCTATG GCGCCAAGTT TACGCATGCG CTGACGCCGA ACACCTTTTA TGAAGTGCAG
CTGCAGCGGG TGCAGACGGA TTATCTGACC GGACATATTC CGCCTCGCAA CCCGGATCCT
GTGGTATGTG TGACGCCGGA GCCTGCCATT CTTCCCATTG ATGATCCGGC CTGCCAGGCC
CCCAACGTCA TTCGTCTGAA TGAGGCGCCC TTTGGCTATG AGTCTAAGGG GGCGCAGGAT
GGTCTGAGTG CTAACGGGCT TCGGATCGGC GGGCACGGAG GGGCTGCTTT TGACACCTCT
TCTGTGTCAC GGTGGGTGAT CAATGCTTCG TTGACCAGTC AGGTGAACCG CTATCTGCAA
ATAAAGGGAG GCTTCGAGTT TCATCTGAGC GACTACCAGA TGAACTACGG GGAGTATGAT
CCCTTCTTTG TACACCATGC CAACCCCACG TATCGCTGGC ATCGTAAGCC GCAGCAGGGG
GCGGCCTTCC TGCAGAGCAA GCTGGAATTT AAGGGGTTGA TCGCCAATCT GGGGGTGCGG
CTGGATTACT TCAATCCAAG CGGCAAGTGG TATGTGTATG AGCCCTATGA TCGTGCCTTT
ACGGCAGTAT TTGGAGTCGA TAAGCTCGAT GAGGTGCTGT CTAAAGAGCC GGTAGACAAG
CAGCTTACGC TAAGCCCGCG GCTGGGCATC TCCTTCCCGG TGTCGGCCAA TAGTAAGTTG
TACTTCAACT ATGGGCATTT CCGGCAGATG CCCGACCCGG TACCGCTCTT TGAGATAGAG
CGGATTAATA CGGGAGCGGT GGCGCGCATT GGTAACCCGA ATCTACCCCT GCAAAAAACG
GTGGCCTACG AGCTGGGCTT TGAGCAGAAC TTGTTCGACA TGTTTCTGCT TCGCCTAGCC
GGTTACTACC GGGACGTATC CAACCAGGTG CGCTTCATCA ACTTCCAGAG CATTGATGGA
GAAGTAAACT ATTGGGTAGC GCGCCCTTGG AATTACGGAG ATGTGCGGGG GTTTGAGCTC
TCGCTGGCGA AAGATCGCGG GCGGTGGATT CAGGGCTTTA TTAACTACAC CTATATGGCC
TGGAAAGGGG GGAATTTTGG TTTTGAGTAC AATTACGAAA ACCTTGTGGC ACAGCAGAAC
TATCTGTTGA CCAGCACCGA CCATTATCAG AGCAAACCTG TACCTGAGCC GTATGCACGG
GTTAATATTG ATCTGGTCAC CCCCGAAGAC TATGGTCCTA ACTACAACGG AATGCGGCCA
CTGGCCGGGT GGCGCCTGAG CCTGCTGGGG ACCTGGCGTG CGGGGCAGGT GCTTACCTGG
ACGGGACAAC AGCTGGTTGC CGGGGGGAGT CCGATCCGAG GTATCGAACA GAATGTGCAG
TGGAAAGACT ACTACAATCT TGATCTCCGT CTGAGTAAGA CCTTTAACAT GAAGTTCGGA
GAAGTGCAGT TTTTCATGGA TGTCTCCAAT GTGCTTAATC TCCGATACAT GGATCGGATG
AGTGGCTTTA CCGGGGCAGA TGATCTGCTG GATTACATGA AGTCCCTGCA TCTTCCGGCT
AACACCTTTG AAGGGCTGGA GAATCCTCCG TATCAGTTTA TACCTGGCAA CGATCAGCCG
GGAGATTATC GGAAGCCTGG GGTTGAATTT GTGCCCATTG AAGTCTGTCC GCCTGCCGGG
TGCCAACCGG TTGGGGATGA GCCCACACGC CCGCTATACT ACCAGGATGG CACCTACTAC
AGCTACACGG GAAGTGAGTT TGTCGAAGCG GACCCAGACT TCGTGAAGAA GGTGTTGGAT
AATAAGGCCT ATATTGACAT GCCTAATGCG GATTATCTGA CCTTCTTCAA TCCGCGCCGT
GTATACTTTG GGCTGCGAAT TTCCTTCTAA
 
Protein sequence
MLLLAALGTT GAWAQTGKIA GRVVDAETGE PLPGVNVVIE GTTVGATTDI DGYYTIINVR 
PGTYTLRASF VGYVPQVVEN VQVDIGLTTE VNFALRPTAI GLEEVVVQAE RPIVQPDISA
SIANIDAAVI EALPVATDVV QVIGLQPGFE PGLVVRGFGG NQVAFLLDGM NLADPRTNAP
FTGVSFTAVE EVQAQTGGFT AEYGNVRSGL INVVMKEPRT NRYTVDAILR YAPAQPKTFN
GDANDRDFFW MRPYFDPEVA MIGTEAAWDD WTERQYPKFE GWEEAANDYP ANNDSDPSND
VTAAQLQKAY EWMTRKNTGI DDPEYQIDAT ITGPVPAIGA SLGNLRFLVS HRRTQSAYII
PQRRRTYRDY TTQAKLVSDI AAGAKLELVG LWSKRKGLVR PIAIDQGATS TMLSGDPPAY
PWDWRYDLEN RILGDDGVEG HVARAALYGD WVINPMDIDY ALYGAKFTHA LTPNTFYEVQ
LQRVQTDYLT GHIPPRNPDP VVCVTPEPAI LPIDDPACQA PNVIRLNEAP FGYESKGAQD
GLSANGLRIG GHGGAAFDTS SVSRWVINAS LTSQVNRYLQ IKGGFEFHLS DYQMNYGEYD
PFFVHHANPT YRWHRKPQQG AAFLQSKLEF KGLIANLGVR LDYFNPSGKW YVYEPYDRAF
TAVFGVDKLD EVLSKEPVDK QLTLSPRLGI SFPVSANSKL YFNYGHFRQM PDPVPLFEIE
RINTGAVARI GNPNLPLQKT VAYELGFEQN LFDMFLLRLA GYYRDVSNQV RFINFQSIDG
EVNYWVARPW NYGDVRGFEL SLAKDRGRWI QGFINYTYMA WKGGNFGFEY NYENLVAQQN
YLLTSTDHYQ SKPVPEPYAR VNIDLVTPED YGPNYNGMRP LAGWRLSLLG TWRAGQVLTW
TGQQLVAGGS PIRGIEQNVQ WKDYYNLDLR LSKTFNMKFG EVQFFMDVSN VLNLRYMDRM
SGFTGADDLL DYMKSLHLPA NTFEGLENPP YQFIPGNDQP GDYRKPGVEF VPIEVCPPAG
CQPVGDEPTR PLYYQDGTYY SYTGSEFVEA DPDFVKKVLD NKAYIDMPNA DYLTFFNPRR
VYFGLRISF