Gene Rmar_1070 details


Gene Information

Locus tag: Rmar_1070
Symbol:
ID: 8567711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodothermus marinus DSM 4252
Kingdom: Bacteria
Replicon accession: NC_013501
Strand:
Start bp: 1227151
End bp: 1230126
Gene Length: 2976 bp
Protein Length: 991 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: TonB-dependent receptor
Protein accession: YP_003290350
Protein GI: 268316631
COG category:
COG ID:
TIGRFAM ID:
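
The coordinates and lengths listed above agree with one another, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1227151
end_bp = 1230126
gene_length_bp = 2976
protein_length_aa = 991

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks should print True for this record.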


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGGACA TGAAGCGGTT TTGTCTGGTG GTGCTCGGGC TGCTGTGGCT CGGCACAGCT 
TCGGAGGTGC TGGGGCAACG GACGGGACGC ATTACCGGCG TGGTTGTCGA TGCCAGTAAC
GGCATGCCGT TGCCCGGCGC CAATGTGCTG GTGGCAGGCA CGACGGTCGG GGCCGCCACC
GACCTGGAAG GGAAGTTTAT CATCCTGAAT GCGCCGGCCG GTCCTCAAAC GCTGGTGATT
TCCTACATCG GATACCAGCG AAAAGAGGTG CCGGTGGAAG TGGTGCCGGG AGGCGAGGTC
AGTGTGGAGG TAGCGCTTCA ATGGGCCGGT ATCGAAACCG GAGAGGTTGT GATCACCGCG
CAGGCGGCCG GCCAGCTTCA GGCGATCAAC GAGCAGCTCA CGGCACGCAA GATCGTCAAT
GTCGTATCAG CCGAACGCAT CCGTGAGCTG CCGGACGAAA GCGCGGCCGC CGCGGTCAGC
CGCCTGCCCG GTATCTCCAT CCAGAACGGC GATCAGATCG TCATTCGAGG CGTCGAGGCC
AAGTACAACA CCGTCACCGT CAACGGCATC CAGTTGCCGT CCACCACGCT CAACCGGACC
ACCGGGCTGG GATTCATTTC GGCCAACATG CTCTCCAGCA TTGAGGTGGC CAAGACGGTG
ACGCCCGATA TGGACGCCAA CACCATCGGG GGTAACGTCA ACCTGCGCCT CCGTGAGGCG
CCGGAAGGGC TGCACTATGA CGCGCTGGTC TTCGGCGACT ACAACACGCA GGACCATACG
GCCGACAACT ACCGGGCCTG GGCGAGCGTC AGCAATCGGT TCTGGAACAA CCGTCTGGGT
GTGTTTCTGC AGGCAAACGC CCGCCGTTTC AACGGCGGGG GAGACATTGC TTCGGCTACC
TGGGCCGAGT TGCCCCAGGC CGACCCGGTG GCCGGTAGAC GCCCCTACGG ACTGAACCAG
TACGACCTGG AAGATCAGGT CAACATCGAC AATGAGTACG GCGCCAGCAT GCTCGTGGAT
TACCGGCTGC CGAATCGCGG CAAGCTCATC CTGCAGAATA CCTACTCGGC CGAAGAGTTC
GACAACGTCA GCTTCATCGA CCGACTGTAC CTGACTACCG GCGAGCGGCG GTTCCGGATC
AATCGGGTGA TCGGCAGCCG CTATCTCCTG GTGAATGCCC TGCAGGGTGA ACACTGGCTC
GGGGATGTGG CCAAAGTGGA CTGGGCCCTT TCCCATGCAA AAAGTCGGCG CAAGGACGAT
CTGGGGTATG AGACGGAGTT TGCCGGCACG AACTACTTCC AAGGACAGCC GCTGACGTAC
TGGACCTCGG AAGACCAGGT CTTCGATATC GAACTGCAAC CAGGGGTCCC GGGAGCGGTG
GGCGACGGCC GCACCTTCTA CGAAGATTTC GGAGAGCGGC GGTTGGTCGG GGCCTTTAAC
ATCCGCGTGC CCATTACGGT AGGCCCCATT TCGGGTGCGC TGCAGGGCGG CGGCAAATAT
ACCCAGCTAA ACCGCGATCG GGACCTGCTC CAGTACTATC GCCGGCTGGG CGACGGGGGC
GGACAGAACG TCGGCGCCAA AGACTTTCTG GCGAGCATTG GAGCGGATCC GGAGGCCGCC
CTCAACCTTC GCTACTTCAT CGACAGCAGT TATGTCGACG AGCGGGGACA GTATTACCTG
GAGGGGCGCT GGCCTTACAG TGGCGCGCTG CGGGTAGATT ATCTGGACAC GTACTTCCGT
CTGGCACAGC AGGGATGGGC CACGCCGGCC CTGGCGCAGT CGAATCGGTA TGACTACGAG
GCGGAAGAGC GGGTCTCGGC CGGCTACATC ATGGCCGATC TGGACATCGG GCGGCACCTG
TCGGTGATCG GTGGGGTGCG CTACGAGAAG TTTAGCTTCA CGAATCGGGC GCCGTTCGTC
AATCAGGTGC TTTACGACGG ATCCGGTGAC GTTCGGGATA CCCTGGAGGT TTCGCGCTCG
CATCCCCAGT GGTTCCCGAA CATTCAGCTG CGCATCAGCC CGATCGAATG GCTCAACATC
CGGCTGGCCT ATACGAAGAC GACCTCTCGC CCGGACTATC AGTACCTGCT GCCCAGCACC
TGGGTCGACT CGGGTGAGCG CGGGGAGGCC GGCAACCCCA ACCTGAAGCC GACGCTGGCC
GACAACTACG ACGCGTACAT TTCGGTGCAC CACGACCGAA TCGGGCTCTT TACGGTGGGC
ATTTTCCGGA AGGTGCTCTC CAACGTGGTG CGTCCGATTT CCATCCAGCG GCGCACGCTC
GACCAGTTCG AGGGCACGTT CTGGGCGCCG GAGGCGGCCG GTTATCCGGA GTGCGACGAC
GGACGGAAAC ACATCTACTG CCCCGACGGC CCCCTGGTGC CGGATATCAA CCCCGTCGGT
CTGATCACTA CCTATGTCAA CAATCCCTAC AAAGGGTATA TCAACGGCTT TGAAATCGAC
TGGCAGACCA ACTTCTGGTA TCTGCCGCGG CCCTTCAACA GCCTGGTGCT CAACTTCAAC
TACACGCGCC TGCGCTCCAA GATGGACTAC CAGTCCATCT TCCTGGTGCG GACCAGTCCC
TTTAGCCCGC CTACCCAGGT AGATACGGTG CGAACGGGGC GACTCTATCA GCAGCCGGAC
GATATCCTGA ACATCACGAT CGGCGTGGAT ATCGGAGGCT TTTCGGGTCG CCTGTCCTTC
CGCTATCAGG GAGAAGTGCT GGCCAACCTG GACCAGCGCG ATCCGGCCAA CGACGCTTTC
ACACGGGCGA TTTATGGCTG GGATTTCTCC CTGCGGCAGC GGCTTCCGAT CAAAGGACTG
TCGCTCTTCT TCAACGGCAT CAACATCACG CATGCCGGTA GCTTCGATTA TCGGCGGCTG
GTCGTCGGAC CCAATGCCAC CGGGGTCAGC GAGGCCATCA CGCGCATGGC CTACTACCCG
CGGCGGTTCC AGCTGGGTAT CCGTTACGGG ATGTAA
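
The gene sequence above is laid out in blocks of 10 bases separated by spaces, so it needs to be joined before it can be measured or analyzed. The sketch below is one possible way to do that in Python and to compute a GC fraction comparable to the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if empty)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here as sample input.
first_line = "ATGTGGGACA TGAAGCGGTT TTGTCTGGTG GTGCTCGGGC TGCTGTGGCT CGGCACAGCT"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases in the sample line
print(seq[:3])                     # first codon of the gene
print(f"{gc_fraction(seq):.0%}")   # GC fraction of the sample line only
```

Pasting the full gene sequence into `first_line` would give the length and GC fraction for the whole gene, which can then be compared with the values in the Gene Information section.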
 
Protein sequence
MWDMKRFCLV VLGLLWLGTA SEVLGQRTGR ITGVVVDASN GMPLPGANVL VAGTTVGAAT 
DLEGKFIILN APAGPQTLVI SYIGYQRKEV PVEVVPGGEV SVEVALQWAG IETGEVVITA
QAAGQLQAIN EQLTARKIVN VVSAERIREL PDESAAAAVS RLPGISIQNG DQIVIRGVEA
KYNTVTVNGI QLPSTTLNRT TGLGFISANM LSSIEVAKTV TPDMDANTIG GNVNLRLREA
PEGLHYDALV FGDYNTQDHT ADNYRAWASV SNRFWNNRLG VFLQANARRF NGGGDIASAT
WAELPQADPV AGRRPYGLNQ YDLEDQVNID NEYGASMLVD YRLPNRGKLI LQNTYSAEEF
DNVSFIDRLY LTTGERRFRI NRVIGSRYLL VNALQGEHWL GDVAKVDWAL SHAKSRRKDD
LGYETEFAGT NYFQGQPLTY WTSEDQVFDI ELQPGVPGAV GDGRTFYEDF GERRLVGAFN
IRVPITVGPI SGALQGGGKY TQLNRDRDLL QYYRRLGDGG GQNVGAKDFL ASIGADPEAA
LNLRYFIDSS YVDERGQYYL EGRWPYSGAL RVDYLDTYFR LAQQGWATPA LAQSNRYDYE
AEERVSAGYI MADLDIGRHL SVIGGVRYEK FSFTNRAPFV NQVLYDGSGD VRDTLEVSRS
HPQWFPNIQL RISPIEWLNI RLAYTKTTSR PDYQYLLPST WVDSGERGEA GNPNLKPTLA
DNYDAYISVH HDRIGLFTVG IFRKVLSNVV RPISIQRRTL DQFEGTFWAP EAAGYPECDD
GRKHIYCPDG PLVPDINPVG LITTYVNNPY KGYINGFEID WQTNFWYLPR PFNSLVLNFN
YTRLRSKMDY QSIFLVRTSP FSPPTQVDTV RTGRLYQQPD DILNITIGVD IGGFSGRLSF
RYQGEVLANL DQRDPANDAF TRAIYGWDFS LRQRLPIKGL SLFFNGINIT HAGSFDYRRL
VVGPNATGVS EAITRMAYYP RRFQLGIRYG M