Gene Rmar_1762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1762 
Symbol 
ID8568414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2061694 
End bp2064765 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content61% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003291034 
Protein GI268317315 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTACCA TGAGAACATT GCTTTCGCGC ATGCTGGGGC TGAGCCTGCT GCTTGCATGC 
TGGAATGTAA GCGTTGTCAT TGCGCAACCA CGCACGGTGA CGGGTACCGT CAGTGATGCT
ACGACGGGGG AGCCGTTGCC CGGCGTCAAC ATCGTGGTGC TGGGCACGAT GACCGGTACG
ACCACCGACG TCGAAGGACG CTATCAGATC GAAGTGCCCG GACCCGAAGC CGTGCTGGTT
TTCTCGTTCG TGGGCTACGA ACAGGTGCAG GAGGTGGTAG GCGACCGAAC GGTCATCAAC
GTGCGCATGC AACCCACCGT CGAGGTGCTG GAAGAGATCG TCGTGGTCGG CTACGGCGTG
CAGCGGCGCG AAGATGTGAC CGGCTCGGTG GCGACGATCG ATGCGACGGA GATCAATCAG
GGCGTTTATA CCTCGCCCGA CCAGCTGTTG CAGGGCCAGG TGGCCGGTCT GACGATCATC
TCGAACAATG GCGAACCGGG CGCCGGATTG AACATCCGCC TGCGTGGTGG CACCTCCATC
AGCGCCAGCA ATGATCCGCT GATCGTCATC GATGGCGTGC CGATCGACAA CGTGCGCCTG
ATGCCGGAAG GGGCCGGGAT TGACGGTGCG CCGCCGCCGC CGCGCAACCC CCTGAGCCTG
ATCAATCCCA ACGACATCGA ATCGATCACA GTGCTGAAAG ACGCCGCGGC CACCGCCATC
TACGGATCGC GAGGTGCGAA CGGCGTGATC CTGATCGAGA CGAAGAAGGG CCGTCAGGGC
CAGCTTCAGG TGGACTATGA AGGGTATATT TCGGCGGCCT CTCCTTATAA GAAGCTGGAA
TTGCTGAATG GTGAGGAATA TCGGCGGTTC GTTCAGGAGC AGGTACAGGC CGGAAATCTT
TCGCAGGATG CGCTGAACGT ACTGGGTGAT GCCAATACGG ACTGGGAAGA AGCGGTTACG
CGGACGGGGA TCACGCACTT CCATAATCTG GCCTTTTCGG GCGGTACGAG CCAGACCCGC
TACCGGGCCT CGGTCAGCTA CCTGAATCAG CAGGGCCATG TCATCAGCTC CGGGCTGGAA
CGTCTGACCG GACGCCTGAA CGCCGACCAT CAGGCCTTCG ACGGGCGGCT GCGGCTTCAG
CTCAACCTGA CTTCGTCGTT CCAGCACGAC GACCTGCTGC CCTACAACCA GACGGCCGGC
TTCGAAGGCG GGGTGTTTAC GAACGTCTAT CAGATGAATC CCACCTATCC CATCTACGCC
GATCAAAACC TGGATGGGAC CCCGGCACTG GATGGCTATT TCGAAATTGC GGACGGCCGG
ACCTCGGTGC GCAATCCGGT GGCCATGGCC GAGCAGGTGC TGGACTTCGT AAAGACGACG
CGCTCGCTGG GAAACATCGG GGCCGAGCTG GATCTGCTGC CCGGCCTGAC GGCCCGGGTG
AACTTCGGCG CCGACCGGGC GCAGTCGAGT CGCCGCCAGT ACTTCCCGCA GCAGAATCCG
ACCGGTGCCC AGTACGAAGG GCGGGCGCTG CAGCGCAGCC GGGAACACTC GTCGCTGACG
TTCCAGAGCT ACCTGACCTA CCGTAACACA CTTGCCCAGG CGCACAATGT GGAGTTGCTG
GGCGGCTACG AGTTCAACGA GTACATGACC GAAGAATTCG GCGTGGAAGG GCAGGGATTC
GTCACGGACG TCACCACCTA CAACGCTATG CAGTCGGCCA GCCAGCTGGT CAAAGCCGGG
ACGTTTTCCT ACAAGGAAAA AAGCCGACTC ATTTCGTTCT TCACGCGCCT GAACTACAAC
TACCAGAGCC GGTACTACCT GACCGGCGTG CTTCGGTACG ACGGTTCCTC GCGCTTTGGC
GAAGGGAACA AGTGGGGGCT CTTCCCGGCC ATCTCGGCGG CCTGGCGTAT CAGTGGCGAA
CCGTTCATGC AGGGGCTGGA CTGGCTCACC GACCTGCGCT TGAAGGTGGG CTATGGCATC
ACGGGTAGTC AGGAGATCGG TAACTACCTG TCGCTGGCGC AGCTTGGCGC CAACGAAAGC
CTGCAGGCGG TTTTCGATCA GCAGCCGTAC ACCGGCTTTG CCCCGGTCAA CTATGCGAAT
CCGGACCTGA AGTGGGAAGA GACGGCCACC TTCAACATCG GGTTGGATTA CAGCCTGCTG
AACGGCAAAT TCTCGGGAAC GATCGAATAC TACGAGAAGA ACACCAGCAA CCTGTTGCTG
GAGATCCCTG TGCCGCAGCC GGCGCCGGTC CCGACCCGGA TCGAAAACGT CGGCAAGACG
CGCAACCGGG GCCTTGAGTT TTCGCTCGAT GCGCTGGCTG TCGATCGGCC CGGACTGAAC
GTGCTCTTCG GGTTGGTCTT CAGCACCAAC CGTAACGAAG TGGTCAGCCT GGGTGGTCGC
GATCAGATTA TTACCGGGAC GGTCAGTGGC CGTGGTCAGT CGGACACCTA CGCGCAGATT
CTGCTGCCGG GCGAGCCGAT CGGCACCTTC TACGGACCGG TCTTCCTGGG TGTCGATGCC
AACGGCCGCC AGCAGTTTGC CGACCTGGAC GGCGACGGTC AGGTCGAGAT CACCGGTGAT
GACCGGACCA TCATCGGCAA TGCCCAGCCG GACTTCACCT ACGGCTTCCG TACCAATATC
TACTGGGGCA ACTTCGACTT CTACGTGTTC ATCCGGGGCG AACAGGGCCG CGACGTCTTC
AACAACACGG CCCTGGTCTA TCAGACGAAG AGCGCCGTGC TCCAGAACCA GAACTTCCTG
AAAGCGGCCC TGGACGATCC GGACGCCCTG GACGAACCGG CCATCTACTC CTCGCGCTGG
ATCGAAGATG GCTCGTTTAT CCGGCTGGAC AACGTGACGG TCGGCTATAC CTTCAACAAT
CTGGGGCCAT GGAGCCGTTA TCTCCGGCGG GCACGTATCT ATGTGTCCGG ACAGAACCTG
CTGGTGATTA CCCCTTACAG CGGATACGAT CCGGAGGTTA ACACGAATGC CGGGCTGGCG
ACGCTGGGGA TCGACTACAC GAACTATCCC CGGGCGCGCA CCTTTACGGT CGGCATCAGC
CTGGGCTTCT AA
 
Protein sequence
MGTMRTLLSR MLGLSLLLAC WNVSVVIAQP RTVTGTVSDA TTGEPLPGVN IVVLGTMTGT 
TTDVEGRYQI EVPGPEAVLV FSFVGYEQVQ EVVGDRTVIN VRMQPTVEVL EEIVVVGYGV
QRREDVTGSV ATIDATEINQ GVYTSPDQLL QGQVAGLTII SNNGEPGAGL NIRLRGGTSI
SASNDPLIVI DGVPIDNVRL MPEGAGIDGA PPPPRNPLSL INPNDIESIT VLKDAAATAI
YGSRGANGVI LIETKKGRQG QLQVDYEGYI SAASPYKKLE LLNGEEYRRF VQEQVQAGNL
SQDALNVLGD ANTDWEEAVT RTGITHFHNL AFSGGTSQTR YRASVSYLNQ QGHVISSGLE
RLTGRLNADH QAFDGRLRLQ LNLTSSFQHD DLLPYNQTAG FEGGVFTNVY QMNPTYPIYA
DQNLDGTPAL DGYFEIADGR TSVRNPVAMA EQVLDFVKTT RSLGNIGAEL DLLPGLTARV
NFGADRAQSS RRQYFPQQNP TGAQYEGRAL QRSREHSSLT FQSYLTYRNT LAQAHNVELL
GGYEFNEYMT EEFGVEGQGF VTDVTTYNAM QSASQLVKAG TFSYKEKSRL ISFFTRLNYN
YQSRYYLTGV LRYDGSSRFG EGNKWGLFPA ISAAWRISGE PFMQGLDWLT DLRLKVGYGI
TGSQEIGNYL SLAQLGANES LQAVFDQQPY TGFAPVNYAN PDLKWEETAT FNIGLDYSLL
NGKFSGTIEY YEKNTSNLLL EIPVPQPAPV PTRIENVGKT RNRGLEFSLD ALAVDRPGLN
VLFGLVFSTN RNEVVSLGGR DQIITGTVSG RGQSDTYAQI LLPGEPIGTF YGPVFLGVDA
NGRQQFADLD GDGQVEITGD DRTIIGNAQP DFTYGFRTNI YWGNFDFYVF IRGEQGRDVF
NNTALVYQTK SAVLQNQNFL KAALDDPDAL DEPAIYSSRW IEDGSFIRLD NVTVGYTFNN
LGPWSRYLRR ARIYVSGQNL LVITPYSGYD PEVNTNAGLA TLGIDYTNYP RARTFTVGIS
LGF