Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_2472 |
Symbol | |
ID | 8569138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 2880750 |
End bp | 2883764 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003291737 |
Protein GI | 268318018 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACT GGTGGCTTGC ACTGAGTCTG ACCCTGGGAG CGCTGACGGC CGTTCCGGCT GCACTGGCGC AGGGTACGGT CTATGGCGTG GTGACCGATT CGCTGACCGG CGAGGCGTTG CCGGGAGCCA ACGTGTATCT GGTGGGCACG GGCAAAGGGA GTGCGACCGA TCTGGAAGGA CGCTATCGCA TCACGGGCAT TGCACCCGGC ACCTACACGC TGCGCGTTTC CTATCTGAGC TATCAGACGC GCGAATTGCC CCTGCGCATT CAGGACGGGG AGACGATCGT GCTGAACATT GCCCTGGTGC CCGAGTCGAT CGGTGGAGAG GAGATCGTGA TTGTCGGACA GGTCGAAGGC CAGGTGGCCG CCATCAACCA GCAGCTCAAC GCGAACACGA TCGTCAACGT GGTCTCGGAG GAAAAAATTC AGGAACTGCC GGACGCCAAC GCGGCGGAAG CGCTTGGGCG GCTGCCCGGG GTGGCCGTGC AGCGTTCCGG CGGCGAGGCC AACAAGATCG TACTGCGTGG GCTGAGCGAT CGCTTCACAA CGATTACGGT CGACGGCGTG CGGTTGGCCG CGACCGACGC GGACGCCCGG GGCCTCGATC TGAGCACGAT CTCGCAGAGT TCGCTGGCCG GCATTGAGCT GTACAAGGCG TTGACCCCCG ACAAGGACGC CGACGCGATT GCCGGCAGCG TCAACCTGGT CACCAGGAAA GCGCCCGAGC AGCGGCTCGT TCGCTTCGAT GTGCGGGGCG CTTACAATCA GCTGAACCAG ACCTGGGGGC AGTACGATCT GAGCGCCCGC TACGGAGAGC GCTTCTGGCG CAAGCGGCTG GGCGTGCAGC TCATCGGCAG CCTGGAGCGG CGCGATCGGA GTCGCGAGTA CTACGATCTC GCCTACGATA TGGGGCTGGC CGGCGGCACG GACTACGAAA TCTCCGACAT CGAGCTGAAC TTCCGCAACG AAATCCGCAC GCGCCAGGGC GTCAGTTTGT TGCTGGACCT GGATACGCCG GATGGGGGGA CGATTCGTTT CAACAACGTC TTCAACGCCA CCCAGCGCAG CTTCATCGAC TACTATCGCA ACTATCCCAC CGAAGGCTCC GAGATGTTCT ATGGCGCCCG GGATCGCGAG CAGAACATCT ACACGTTCAC CAGTGCGCTG CGGGGCGAGC AGTATCTGAA GGGATGGCAG CTTAACTGGG GGGCTTCCTA TGCCCGGTCG CAATCGCACG ATCCGTTCGA CTTCGATATC AGCTTTACGG AGCCCTCGGC CACCGATCCC CAGGGCAATC CGATCGCCGG CATGGGTCGC ATCCCGCCCG AAGAGCTGAA AGGCCCGCCG GAAAGCATTA TTTCCCATGC GTTGAACAAC TTCGAACGGG CCTACTTCTA CACGGCGTTC TATCGGGAGG AGGAAAATCG CGAAGGGGAG ACCACGCTGT ACCTGGATCT GGCGCGGGAC TACGCGCTGG GCGGGGGAGT GGCCGGACAG TTCAAGCTGG GCGGGAAGTT CCGGAGCAAG ACCCGCTTCC GGGAGCGCGG TGAGCTGCTG GCGCCCTATT ACAACGAAGC GTTTCCGCGC TACGTCCGGA CGGCCGATGG GCAGGTCGTG CCCAAGGACT TTACAGGCAC CCCCTTCGAG CAGCTTCAGA TGGTGGGCGA CCGGCTGCTG GTGACGAACT TTCTGGGGGC CAATCCGGAG GACCGCGACC TGTTCGACCG CTTCCGGCTC TATCCGATGA TCGATCGGGA TCTGCTCCGC ACCTGGTGGG ATCTGAACCG AGACGGTTTT TCGGATCAGG CCGGGACGAA CCCGGAGTTC GAGCGCAATC TGGAAGCCGA CGCTTATTTC TATGACCTGA CCGAGCGGGT CTCGGCGGCC TATGTGATGA ACACGTTGCA TTTCGGGCCG CGCGTTTCGC TCATTGCCGG GGTGCGCGTC GAGCGCGAAA ACAACGACTA CCTCACGCGC TACACGCCGG ATGATCTGTC CGGCTTCCCG GTACCCAAGG GGGCCTATCG AGACACCAGT AGCACGTTTT CGGAGACCAT CTGGCTGCCG AACGTTCACC TGACGCTGCG GCCCACCGAT TTTCTCACGG TGCGGCTGGC CGCCTACAAA GCGCTGGCAC GCCCCAACTA CAACCAGCGC CTGCCCAGCT TCGTGGCCCG TAAGGCCGGC ACGTTCTATC CCGGCAACTC GCTCTTCGTG GGCAACCCGG ATCTGAAGGC GGCGCAGGCC TGGAATTACG AGGTGAACGT GGCGCTTTAC GACGGCCGTT TCGGGTTGTT CTCGGTGTCG GCCTTCTACA AGGACATCAA AAACATGTAC CACCAGATCA ATGGTGCGTT TTTCGATGCG GCTTCGGCCG ACTCGCTGTT CGACAGGCTG GGGATCAACG TGACCAGTCC CTTCCAGAAC GAAGGGTTCG CCCTGACCTA TCCGTACAAT TCCTCGAAGC CGACGCGGGT CTGGGGCGTG GAAATCGAGC ATCAGGCCAA CTTTCTGTTT CTGCCGGGGG TGCTTCGCAA CCTGGTGCTG ACCTACAATC TGTCGTTCAT CCGATCGGAG ACCTACATCC CGGGCACGCA AACCGAAGAG TACTGCGAGG AGATTCTACC GGGCATCTGT GTGCCCAAGT TGCGCTATCA CTTTGTTGAG CGCAAGCACA AGCTGGAGCG CCAGCCGGAC TTCCTCAGCA ACGTGGCGAT CGGCTACGAC TATCGGGGCT TTTCGGCGCG GCTTTCCATG TTCCACCAGG GCGAATTCAA CACGCGCTTT TCGCCCAATG GCCGCGACGA CCGGGTGGTC AAGAGCTTCA CACGCTGGGA CCTGGCGCTT CGCCAGCAGA TTCGTCCCAA CCTGTACGTC CTGCTCAATG TGAACAACCT GACGAACGTC GAGGAGGGCA CCATTGTGCT GAACCGGGTG CAGGGCTGGC GGCTGCCCAA CGATCGGGAG ATCTACGGAA CCACCGTGGA CTTCGGCCTG CGGCTGGTAC TCTGA
|
Protein sequence | MKNWWLALSL TLGALTAVPA ALAQGTVYGV VTDSLTGEAL PGANVYLVGT GKGSATDLEG RYRITGIAPG TYTLRVSYLS YQTRELPLRI QDGETIVLNI ALVPESIGGE EIVIVGQVEG QVAAINQQLN ANTIVNVVSE EKIQELPDAN AAEALGRLPG VAVQRSGGEA NKIVLRGLSD RFTTITVDGV RLAATDADAR GLDLSTISQS SLAGIELYKA LTPDKDADAI AGSVNLVTRK APEQRLVRFD VRGAYNQLNQ TWGQYDLSAR YGERFWRKRL GVQLIGSLER RDRSREYYDL AYDMGLAGGT DYEISDIELN FRNEIRTRQG VSLLLDLDTP DGGTIRFNNV FNATQRSFID YYRNYPTEGS EMFYGARDRE QNIYTFTSAL RGEQYLKGWQ LNWGASYARS QSHDPFDFDI SFTEPSATDP QGNPIAGMGR IPPEELKGPP ESIISHALNN FERAYFYTAF YREEENREGE TTLYLDLARD YALGGGVAGQ FKLGGKFRSK TRFRERGELL APYYNEAFPR YVRTADGQVV PKDFTGTPFE QLQMVGDRLL VTNFLGANPE DRDLFDRFRL YPMIDRDLLR TWWDLNRDGF SDQAGTNPEF ERNLEADAYF YDLTERVSAA YVMNTLHFGP RVSLIAGVRV ERENNDYLTR YTPDDLSGFP VPKGAYRDTS STFSETIWLP NVHLTLRPTD FLTVRLAAYK ALARPNYNQR LPSFVARKAG TFYPGNSLFV GNPDLKAAQA WNYEVNVALY DGRFGLFSVS AFYKDIKNMY HQINGAFFDA ASADSLFDRL GINVTSPFQN EGFALTYPYN SSKPTRVWGV EIEHQANFLF LPGVLRNLVL TYNLSFIRSE TYIPGTQTEE YCEEILPGIC VPKLRYHFVE RKHKLERQPD FLSNVAIGYD YRGFSARLSM FHQGEFNTRF SPNGRDDRVV KSFTRWDLAL RQQIRPNLYV LLNVNNLTNV EEGTIVLNRV QGWRLPNDRE IYGTTVDFGL RLVL
|
| |