Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_2444 |
Symbol | |
ID | 8569110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 2839523 |
End bp | 2842528 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003291710 |
Protein GI | 268317991 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTGCAC GGAAACGTCT GACACTGAAG CGCTGGCTGG CGCTGGGACT GTGGTTGCTG CTGGTGGGGG CGGCCCACGG GCAGGTGATC CGGGGAACGG TGACCGACCG GGACACGGGC GATCCGCTTC CGGGTGCCAA TGTCGTGATC AAAGGCACCG TGCGGGGGGC CGCCACCGAC GTGGACGGTC GGTTTGTCAT CCCCAACGTG GCGCCCGGCC AGTATACGCT GGTGATTTCG TACCTCGGCT ATCACACCGA AGAGGTGCCG GTGACGGTCG CGGAGGGCGC TTCGGAGGTC GAGGTCAACG TCCAGCTTGT CTGGGAAGGG ATTGTCGGGC AGGAGGTGGT CATCACGGCG CAGGTGGCCG GACAGCTGGC GGCCATCAAC GAGCAGTTCT CGGACGTGGT GGTCAAGAAC GTGGTCTCGC GTGACCGCAT TCTGGAGCTG CCCGACAACA ACGCGGCCGA GTCGATCGGA CGTCTGCCGG GCGTCGTGAT CCTGCGCTCC GGCGGTGAAG CCACGAACGT GGCCATCCGC GGCCTGCTGC CCAAATACAA CACGGTCACG GTCAACGGCG TGCGGTTGCC CGATACCGAC CCCTCCAATC GCCAGGTCGA TCTGTCGCTG ATCTCGTCGA ACATCCTGGA CGGGATCGAG GTGCGCAAGG CGATCACGCC GGACATGGAC GCCGACGCCG TGGGCGGCAA CATCGACCTG CGGCTTCGGA GCGCGCCCTC GGGCTGGCAT TTCGACGTGC TGGCCACCGG TGGCTATGCC GGGCTGCAGC AGTATCTGGG GAACTACAAG CTTGTCGGAA CAGCCAGCAA CCGCTTTCTG AACGAGCGGC TGGGTGTGAT CGCCACGTTC AACGCGGATC GCTACAACCG GAGCGCCGAC AAGCTGGGCA TCAACTGGAC GGCCGACGAC ATCAACCCGC TGACCGGCCA GCGTGAACCC CGGTTCAACA GCTTCAACCT GCGGGAAGAG ACCGTCTTCC GCGGACGTCT GGGCGGCAGT CTGCTGCTGG ACTATAACAT TCCCAACGGG CGCCTGCAGG GCAACATTTT CTACAACGAG CTACGCGACG ATGTATTTGT GCGCCGTTAT GCCCCTTCGG TCGGCAGTCT GGATGCCAGT GTGGAGGAGT ATGACACGCG CACGGCCATT CTCACCAGTG GGCTGGGGAT CGAGCAGGAC CGGGGCAGCT TCAAATACGA CGCCCAGGCC TTTTATACCA TCTCCCGGCG TCGGTCGCCG CACAACTACA TCTGGGAGTT CGGGCGCGAC GGCACGGCGC TGAGCGTGGG GCGGGCCGAG CTGTTCGGCC TGTCGCCCGA CAGCGTCTGG AGTCTGGTGC GTCACGATTC GACCATGCAG CTCTCCTCGA TCTGGGTGGA TTCCGAGCGC CTGGACGAGG ACCAGTACGG CATTCAGGCC AATTTCCAGC GGCCGTTCCA TTTCGGATGG ATTTCCGGCT ACGTGAAGCT GGGCGGTAAG CTGCGCTGGC TGTCGCGCAC GTTCGACCGT GAGCGCAACG GCCGGCAGGG CCTGCGGTAT CCGAGCGATA CGGTCGAGCA GTGTCTGCTG GAGACGCTGG GCCCCGAGTG GGAGGAACGC TATCAGATTG CCGACTCGGT CTATGGCGTG CCGGGGCTTC CGATCGCGCT CATCCAGAAG GATTACAAGC GCGAAGGGGA GTTTGGCGAA GGACAGTTCG GGCTGGGACC GATCGCCGAC GAAGACCTGC TCATGGAGCT GACGCGGGCG CTGCAGGCTT CGCCCTGCCA GGCCGAATAC CAGAACAACA CGATCGAGTC GCTGGGCCGT GATTACGACG GCATCGAGCG GTATCAGGCC GCCTATGTCA TGGCCCAGCT CAAGATCGGG CCCTACGTCA CGCTGATTCC GGGTATTCGC TACGAGCGGG ACTACTCGCG CTATACGGGA CAGCGCTTCC GGGAAGTTAC GTCCGGTTTT GTGTACGCGC CCCCGGCCGA TCTGGCCGAA CTGGAGGTGG AGCGCGAGAA CATCTTCTGG CTGCCCATGG TCCATCTGGA CGTGCGGCCA CTCGACTGGC TGGCCCTCAA GCTGGCCCGC ACCGAGACCA TCTCGCGGCC CAACTACTAT CAGTACGCGC CGATCACCTC GATCAACACG TGGCGCTCGT TCATCTGGGC GGCCAACTCG AAGCTGCGTC CCTCGCATGC CACGAACTAC GACGCCACGC TGCAGCTGGC CAGCTCCCGC TTCGGATTGT TCGGAATCTC GGCCTTCTAC AAACGCATCG ACGATCTGCT GATCGAGGTC GAATTCCCCG CCCAGCTGTT CCGGGATGTC AACGGCGATA CGGTGGTGAT CGGTGTGCCG GAAGGCACGA ACGTGCCGCG TGAGTGGCTG GAAGGTGCCA GCCCGCAGCT CCAGACCACG GTCAACAACG ACGAACCGGC CAAATACTGG GGCTACGAGC TGGAATGGCA GACCAACTTC TCCTATCTGC CCGGTGTGCT CAAAGGCCTC GTGCTGAGCC TGAACTACAC GCGGGGCTTT TCGGAGACGA CCTATCACTA CTACCGCAAA GAGCGGCAGA TTCTGCCGGG ACGTCCCCCG CGTACCATCT ACTCCATCAT CGATACGACC CGCACCGGGC GCATGCCGGG ACAGGCGGCC CACGTGTTCA ACATGACCAT CGGCTTCGAT TACCGCGGCT TTTCGGCGCG GCTGTCTTAC CTGTACCAGA GCGATATTGC CAGCTGGGTC AACCCGCGTG AGCCGTTGAA CGATGTGTTC GTGGGGCCCT ATTCCCGATT CGATCTGTCG GTGCGGCAGA AGATCGGGAC GGGAATGGAG CTCTATGCCA ACTTCAACAA CCTGAATAAT CGACCCGACG AGCAATACAC CGGCCAGAAT ACGCAGGACC CGGATTATAG CTTTACGCGC CGCTATCTGG CCTACAAAGA ACTTTACGGC TATACGATCG ACGTCGGATT CCGCTATCGA TTCTGA
|
Protein sequence | MCARKRLTLK RWLALGLWLL LVGAAHGQVI RGTVTDRDTG DPLPGANVVI KGTVRGAATD VDGRFVIPNV APGQYTLVIS YLGYHTEEVP VTVAEGASEV EVNVQLVWEG IVGQEVVITA QVAGQLAAIN EQFSDVVVKN VVSRDRILEL PDNNAAESIG RLPGVVILRS GGEATNVAIR GLLPKYNTVT VNGVRLPDTD PSNRQVDLSL ISSNILDGIE VRKAITPDMD ADAVGGNIDL RLRSAPSGWH FDVLATGGYA GLQQYLGNYK LVGTASNRFL NERLGVIATF NADRYNRSAD KLGINWTADD INPLTGQREP RFNSFNLREE TVFRGRLGGS LLLDYNIPNG RLQGNIFYNE LRDDVFVRRY APSVGSLDAS VEEYDTRTAI LTSGLGIEQD RGSFKYDAQA FYTISRRRSP HNYIWEFGRD GTALSVGRAE LFGLSPDSVW SLVRHDSTMQ LSSIWVDSER LDEDQYGIQA NFQRPFHFGW ISGYVKLGGK LRWLSRTFDR ERNGRQGLRY PSDTVEQCLL ETLGPEWEER YQIADSVYGV PGLPIALIQK DYKREGEFGE GQFGLGPIAD EDLLMELTRA LQASPCQAEY QNNTIESLGR DYDGIERYQA AYVMAQLKIG PYVTLIPGIR YERDYSRYTG QRFREVTSGF VYAPPADLAE LEVERENIFW LPMVHLDVRP LDWLALKLAR TETISRPNYY QYAPITSINT WRSFIWAANS KLRPSHATNY DATLQLASSR FGLFGISAFY KRIDDLLIEV EFPAQLFRDV NGDTVVIGVP EGTNVPREWL EGASPQLQTT VNNDEPAKYW GYELEWQTNF SYLPGVLKGL VLSLNYTRGF SETTYHYYRK ERQILPGRPP RTIYSIIDTT RTGRMPGQAA HVFNMTIGFD YRGFSARLSY LYQSDIASWV NPREPLNDVF VGPYSRFDLS VRQKIGTGME LYANFNNLNN RPDEQYTGQN TQDPDYSFTR RYLAYKELYG YTIDVGFRYR F
|
| |