Gene Rmar_2472 details


Gene Information

Locus tag: Rmar_2472
Symbol:
ID: 8569138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodothermus marinus DSM 4252
Kingdom: Bacteria
Replicon accession: NC_013501
Strand:
Start bp: 2880750
End bp: 2883764
Gene Length: 3015 bp
Protein Length: 1004 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: TonB-dependent receptor
Protein accession: YP_003291737
Protein GI: 268318018
COG category:
COG ID:
TIGRFAM ID:
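
The length fields above are consistent with the listed coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are illustrative and not part of the record):

```python
# Coordinates copied from the Gene Information fields above.
START_BP = 2880750
END_BP = 2883764


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3015
print(protein_length(length_bp))  # 1004
```

Both values match the Gene Length and Protein Length fields.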


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAAAACT GGTGGCTTGC ACTGAGTCTG ACCCTGGGAG CGCTGACGGC CGTTCCGGCT 
GCACTGGCGC AGGGTACGGT CTATGGCGTG GTGACCGATT CGCTGACCGG CGAGGCGTTG
CCGGGAGCCA ACGTGTATCT GGTGGGCACG GGCAAAGGGA GTGCGACCGA TCTGGAAGGA
CGCTATCGCA TCACGGGCAT TGCACCCGGC ACCTACACGC TGCGCGTTTC CTATCTGAGC
TATCAGACGC GCGAATTGCC CCTGCGCATT CAGGACGGGG AGACGATCGT GCTGAACATT
GCCCTGGTGC CCGAGTCGAT CGGTGGAGAG GAGATCGTGA TTGTCGGACA GGTCGAAGGC
CAGGTGGCCG CCATCAACCA GCAGCTCAAC GCGAACACGA TCGTCAACGT GGTCTCGGAG
GAAAAAATTC AGGAACTGCC GGACGCCAAC GCGGCGGAAG CGCTTGGGCG GCTGCCCGGG
GTGGCCGTGC AGCGTTCCGG CGGCGAGGCC AACAAGATCG TACTGCGTGG GCTGAGCGAT
CGCTTCACAA CGATTACGGT CGACGGCGTG CGGTTGGCCG CGACCGACGC GGACGCCCGG
GGCCTCGATC TGAGCACGAT CTCGCAGAGT TCGCTGGCCG GCATTGAGCT GTACAAGGCG
TTGACCCCCG ACAAGGACGC CGACGCGATT GCCGGCAGCG TCAACCTGGT CACCAGGAAA
GCGCCCGAGC AGCGGCTCGT TCGCTTCGAT GTGCGGGGCG CTTACAATCA GCTGAACCAG
ACCTGGGGGC AGTACGATCT GAGCGCCCGC TACGGAGAGC GCTTCTGGCG CAAGCGGCTG
GGCGTGCAGC TCATCGGCAG CCTGGAGCGG CGCGATCGGA GTCGCGAGTA CTACGATCTC
GCCTACGATA TGGGGCTGGC CGGCGGCACG GACTACGAAA TCTCCGACAT CGAGCTGAAC
TTCCGCAACG AAATCCGCAC GCGCCAGGGC GTCAGTTTGT TGCTGGACCT GGATACGCCG
GATGGGGGGA CGATTCGTTT CAACAACGTC TTCAACGCCA CCCAGCGCAG CTTCATCGAC
TACTATCGCA ACTATCCCAC CGAAGGCTCC GAGATGTTCT ATGGCGCCCG GGATCGCGAG
CAGAACATCT ACACGTTCAC CAGTGCGCTG CGGGGCGAGC AGTATCTGAA GGGATGGCAG
CTTAACTGGG GGGCTTCCTA TGCCCGGTCG CAATCGCACG ATCCGTTCGA CTTCGATATC
AGCTTTACGG AGCCCTCGGC CACCGATCCC CAGGGCAATC CGATCGCCGG CATGGGTCGC
ATCCCGCCCG AAGAGCTGAA AGGCCCGCCG GAAAGCATTA TTTCCCATGC GTTGAACAAC
TTCGAACGGG CCTACTTCTA CACGGCGTTC TATCGGGAGG AGGAAAATCG CGAAGGGGAG
ACCACGCTGT ACCTGGATCT GGCGCGGGAC TACGCGCTGG GCGGGGGAGT GGCCGGACAG
TTCAAGCTGG GCGGGAAGTT CCGGAGCAAG ACCCGCTTCC GGGAGCGCGG TGAGCTGCTG
GCGCCCTATT ACAACGAAGC GTTTCCGCGC TACGTCCGGA CGGCCGATGG GCAGGTCGTG
CCCAAGGACT TTACAGGCAC CCCCTTCGAG CAGCTTCAGA TGGTGGGCGA CCGGCTGCTG
GTGACGAACT TTCTGGGGGC CAATCCGGAG GACCGCGACC TGTTCGACCG CTTCCGGCTC
TATCCGATGA TCGATCGGGA TCTGCTCCGC ACCTGGTGGG ATCTGAACCG AGACGGTTTT
TCGGATCAGG CCGGGACGAA CCCGGAGTTC GAGCGCAATC TGGAAGCCGA CGCTTATTTC
TATGACCTGA CCGAGCGGGT CTCGGCGGCC TATGTGATGA ACACGTTGCA TTTCGGGCCG
CGCGTTTCGC TCATTGCCGG GGTGCGCGTC GAGCGCGAAA ACAACGACTA CCTCACGCGC
TACACGCCGG ATGATCTGTC CGGCTTCCCG GTACCCAAGG GGGCCTATCG AGACACCAGT
AGCACGTTTT CGGAGACCAT CTGGCTGCCG AACGTTCACC TGACGCTGCG GCCCACCGAT
TTTCTCACGG TGCGGCTGGC CGCCTACAAA GCGCTGGCAC GCCCCAACTA CAACCAGCGC
CTGCCCAGCT TCGTGGCCCG TAAGGCCGGC ACGTTCTATC CCGGCAACTC GCTCTTCGTG
GGCAACCCGG ATCTGAAGGC GGCGCAGGCC TGGAATTACG AGGTGAACGT GGCGCTTTAC
GACGGCCGTT TCGGGTTGTT CTCGGTGTCG GCCTTCTACA AGGACATCAA AAACATGTAC
CACCAGATCA ATGGTGCGTT TTTCGATGCG GCTTCGGCCG ACTCGCTGTT CGACAGGCTG
GGGATCAACG TGACCAGTCC CTTCCAGAAC GAAGGGTTCG CCCTGACCTA TCCGTACAAT
TCCTCGAAGC CGACGCGGGT CTGGGGCGTG GAAATCGAGC ATCAGGCCAA CTTTCTGTTT
CTGCCGGGGG TGCTTCGCAA CCTGGTGCTG ACCTACAATC TGTCGTTCAT CCGATCGGAG
ACCTACATCC CGGGCACGCA AACCGAAGAG TACTGCGAGG AGATTCTACC GGGCATCTGT
GTGCCCAAGT TGCGCTATCA CTTTGTTGAG CGCAAGCACA AGCTGGAGCG CCAGCCGGAC
TTCCTCAGCA ACGTGGCGAT CGGCTACGAC TATCGGGGCT TTTCGGCGCG GCTTTCCATG
TTCCACCAGG GCGAATTCAA CACGCGCTTT TCGCCCAATG GCCGCGACGA CCGGGTGGTC
AAGAGCTTCA CACGCTGGGA CCTGGCGCTT CGCCAGCAGA TTCGTCCCAA CCTGTACGTC
CTGCTCAATG TGAACAACCT GACGAACGTC GAGGAGGGCA CCATTGTGCT GAACCGGGTG
CAGGGCTGGC GGCTGCCCAA CGATCGGGAG ATCTACGGAA CCACCGTGGA CTTCGGCCTG
CGGCTGGTAC TCTGA
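
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation. A minimal sketch, assuming the sequence has been pasted into a string, of removing the whitespace and computing a GC percentage comparable to the GC content field in the record (the helper names are hypothetical):

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGAAAAACT GGTGGCTTGC ACTGAGTCTG ACCCTGGGAG CGCTGACGGC CGTTCCGGCT"
print(len(clean_sequence(first_row)))   # 60
print(round(gc_content(first_row), 1))  # GC percentage of this row only
```

Passing the full gene sequence instead of the first row gives a whole-gene figure to compare against the record.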
 
Protein sequence
MKNWWLALSL TLGALTAVPA ALAQGTVYGV VTDSLTGEAL PGANVYLVGT GKGSATDLEG 
RYRITGIAPG TYTLRVSYLS YQTRELPLRI QDGETIVLNI ALVPESIGGE EIVIVGQVEG
QVAAINQQLN ANTIVNVVSE EKIQELPDAN AAEALGRLPG VAVQRSGGEA NKIVLRGLSD
RFTTITVDGV RLAATDADAR GLDLSTISQS SLAGIELYKA LTPDKDADAI AGSVNLVTRK
APEQRLVRFD VRGAYNQLNQ TWGQYDLSAR YGERFWRKRL GVQLIGSLER RDRSREYYDL
AYDMGLAGGT DYEISDIELN FRNEIRTRQG VSLLLDLDTP DGGTIRFNNV FNATQRSFID
YYRNYPTEGS EMFYGARDRE QNIYTFTSAL RGEQYLKGWQ LNWGASYARS QSHDPFDFDI
SFTEPSATDP QGNPIAGMGR IPPEELKGPP ESIISHALNN FERAYFYTAF YREEENREGE
TTLYLDLARD YALGGGVAGQ FKLGGKFRSK TRFRERGELL APYYNEAFPR YVRTADGQVV
PKDFTGTPFE QLQMVGDRLL VTNFLGANPE DRDLFDRFRL YPMIDRDLLR TWWDLNRDGF
SDQAGTNPEF ERNLEADAYF YDLTERVSAA YVMNTLHFGP RVSLIAGVRV ERENNDYLTR
YTPDDLSGFP VPKGAYRDTS STFSETIWLP NVHLTLRPTD FLTVRLAAYK ALARPNYNQR
LPSFVARKAG TFYPGNSLFV GNPDLKAAQA WNYEVNVALY DGRFGLFSVS AFYKDIKNMY
HQINGAFFDA ASADSLFDRL GINVTSPFQN EGFALTYPYN SSKPTRVWGV EIEHQANFLF
LPGVLRNLVL TYNLSFIRSE TYIPGTQTEE YCEEILPGIC VPKLRYHFVE RKHKLERQPD
FLSNVAIGYD YRGFSARLSM FHQGEFNTRF SPNGRDDRVV KSFTRWDLAL RQQIRPNLYV
LLNVNNLTNV EEGTIVLNRV QGWRLPNDRE IYGTTVDFGL RLVL
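
The protein sequence can be checked against the gene sequence by translating it codon by codon. A minimal sketch, assuming the amino-acid assignments of translation table 11 match the standard genetic code and that translation stops at the first stop codon (the table layout follows the usual TCAG ordering; the function name is illustrative):

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAAAAACT GGTGGCTTGC ACTGAGTCTG"))  # MKNWWLALSL
print(translate("CGGCTGGTAC TCTGA"))                  # RLVL
```

The two outputs agree with the first ten and last four residues of the protein sequence listed above.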