Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_2398 |
Symbol | |
ID | 8569063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 2780859 |
End bp | 2783759 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003291664 |
Protein GI | 268317945 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAGCG GACTGGAAGG CTTCCGGAAA CGGTCGGCCC GGTGGTGGGG CGTGCTGGGA CTGCTCGTGC TGTTGCCGAC GGGCGTCGTC GCGCAGAACG TGGGACGTAT CGCGGGCGTG GTGACCGATG CGACTACCGG TGACCCGTTG CCCGGCGTGA ACGTGACGAT CGAGGGGACG ACGCTGGGAG CGGCCTCCGA CATCGACGGC CAGTATTACA TTCTCAACGT GCCGCCCGGA CGCTACACGG TGCGGGCCAG CATGATCGGC TATCAGCCCG TCGTGGTCGA AAACGTGGTG GTGCATGCCG ACCGGACCAC CGAACTGAAC TTCGCGCTGC AGGAAGCGAC CGTCGAGATC GGCGAGATCG TGGTGCAGGC GGTACGGCCG GACGTGGAGC GCGACAAGAC CTCGACCAGT CAGATCGTAC GCTTCGACGA AGTGCAGGCC ATCCCAGGCA TCCGGGACAT CGGCGATGTG CTGACGCTGG CGGCCGACGT CATCGACGGA CACTTCCGGG GCGGACGTCA GGGCGAGGAA TACTATACGC TCCAGGGCAT GGGGATCGTC AATCCCCTCG ACAATTCCTC CGCCTTCATG CCGATCATGA GCGCGGTGGA AGAAGTGGAG GTGATCACCA GCGGCTTCAG CGCGCGCTAT GGGAATGCGC AGTCCGGCGT GGTGCGCATC TCCATGAAAG AGGGAAGCCG GACCCACTGG TCGACGCGCA TCGACTCGCG CTTTCGGGCG CCCGGCCGCA AGCACTTCGG CCCGAGCGTC TTCGATCCCG AGGCCAATCC CTATCTGAAG CTGCTCTACT ACAACGAAGG CAACATCTGG CTGACCGGCG ATCCCGGGTC CGATACGCCG CAGCCTTTCT ACGGAGCGAT GGCCTCCGGG CTGACCAGCT ATTTCGCCGG AGATACGCTG GCGCAACTGG CCATGGCGCG GGCGCTGTAC GAGCAGATGC GTCGGGATAT AAACCGGAAG TACGGCGACG AGATCGACTA CCAGCTCGAG CTGGCTACCG GCGGGCCGAT CAACGAGCGC ATGCGCATGT TCATGGCGCT CGGCATACGA AAAGAATGGC CGTTTCTGCC CACGGAAAAT CCTGACGTCG AATACCAGGC CATGGGCAAC GTGGTGGTGG ATGTGACGCA GAACACCACG TTTCGGCTCA GTGGGGGCAT GGCGCACCAG CGAAACAACG TGTTTCCGGG CCGAAACAGC GTGAGCGGCT ACCAGCGCTG GCTGTGGGAT CGCATTGTGG GCATCGAGGA CCGGCGGCGC ACCAACGTCC AGCTCGGCGG TCGCTTTACG CATGTGTTGA GCCCCAGCAC CTATTACGAA CTGCAACTCA GCACGCTGCA CACCTTCGAT GAGATCGGAT CGGCCCCCAC GCCCCCGGTG TTGCCCGATA CGGTCGATCT CAACTGGGCG GTCGGAACCC TCTCCTGGCC CAATAACAAC TCTCCCGACG GCATCAACTA TCAAGTGGGC AACGATCTGT TTCAGCGAGA AAAAACGCGC ACCATTTCCT TCGAGGGATC TTTCACCAGT CAGGTGACGC CCGCCCATCT GGTGCAGGGA GGCGTGCAGA TCAACAGCTA CCTGATCGAT GTGTCCAATT TTATCAATGT ACGCTCGACA AGGTATGAAG AAAACTATCG GGCCAGACCC TTCGAAGGCG CCGTATATCT TCAGGACAAA ATGGAATTTG AAGGGCTCAT CGCCAACGTA GGATTGCGCC TGGATGTGTG GTACTCCGGC ATGGACTATT ATGTGGATCT TTTCGAGCCG TTCGGGAAGC CGGATTCTGT AGGCAGGTTT GATCCGGGGA AGGGGGTTCG TGAGAAACCT CCGGTACATG TGCGCCTGCA GCCACGGCTG GGCATTTCGT TTCCGATTTC GTCCACCACC GTGTTTCATC TGAACTACGG TTCGTTCATG CAGCGCCCCT CGTTCCAGTA CATCGTGTCG CGTCAGATCG GGCAACTGCG AAACGAGCCG ATTTATCTGG GCAACCCCCG CCTGCGGCCG GAAACCACGA ACAGCTACGA TGTGGGCTTT GTGCAGGCGC TGGGCGGCGG CTTTACCCTG GACGTGAGTG GATACTACAA AGACGTCAAA AACCTGGTCC AGCAGGCAGA TTTCATCGAC GATCGGGCCG GCTATCAGGT CAGTTCGTAT TTCAACCTGG ACTATGCGGA TATCCGCGGC TTTCGCATCG CGCTGACGAA GCGGCGGGGT TCGTTCACCG GCGCCATCAA CTACCAGTAC AGTCACGCCA CAGGAAAGAG TCCCACCGCT ACGGCCGCCA CGCCGATTTT CAACCGGGAC ACGCTGGGCG TGGTGACCAC CGATCTGACC AACGTCCCCA CGCGCGATAT TCTGCTCGAC TTTGACCGGC GTCACAACCT GATCGTCACG GCTACCTATG CGACCGGAGC AAACTGGGGA CCCAGAATTT TTGGAAAATA TGTACTGAAT AATATGACAT TTTCTGTGTA TTCTACGTTG CGAAGTGGCA GGCCCTATAC ATCTCCATCG GACCTTCGCC GTATTAATGC AAAGAGCACC CCTGCTGAGT ACAATACGGA TCTTAAAATC AGCAAGAGAT TTCGGAATTT CTTCGGGGCT TCTGCTTCCT TCTATTTTGA GGTATTCAAT CTGTTCAATA ACAAAATATT GAACTACAGC TACCTCTTCC GCAGGCCTAC GCCGACCAAC CCGAATTTGC CGTTGCAATA CTATGAACAA TATGGTATCG ACGACAGGGA AAACGGCGTG CGGTACTGGT GGGACAAAGG GCGGCAGGGG CCGTTCGCCG TCGATCAATC CTTCCTGATC TACAGCAATG AGCCGCGCTC TTACCATTTC GGGATGATTC TGGAATTCTA A
|
Protein sequence | MVSGLEGFRK RSARWWGVLG LLVLLPTGVV AQNVGRIAGV VTDATTGDPL PGVNVTIEGT TLGAASDIDG QYYILNVPPG RYTVRASMIG YQPVVVENVV VHADRTTELN FALQEATVEI GEIVVQAVRP DVERDKTSTS QIVRFDEVQA IPGIRDIGDV LTLAADVIDG HFRGGRQGEE YYTLQGMGIV NPLDNSSAFM PIMSAVEEVE VITSGFSARY GNAQSGVVRI SMKEGSRTHW STRIDSRFRA PGRKHFGPSV FDPEANPYLK LLYYNEGNIW LTGDPGSDTP QPFYGAMASG LTSYFAGDTL AQLAMARALY EQMRRDINRK YGDEIDYQLE LATGGPINER MRMFMALGIR KEWPFLPTEN PDVEYQAMGN VVVDVTQNTT FRLSGGMAHQ RNNVFPGRNS VSGYQRWLWD RIVGIEDRRR TNVQLGGRFT HVLSPSTYYE LQLSTLHTFD EIGSAPTPPV LPDTVDLNWA VGTLSWPNNN SPDGINYQVG NDLFQREKTR TISFEGSFTS QVTPAHLVQG GVQINSYLID VSNFINVRST RYEENYRARP FEGAVYLQDK MEFEGLIANV GLRLDVWYSG MDYYVDLFEP FGKPDSVGRF DPGKGVREKP PVHVRLQPRL GISFPISSTT VFHLNYGSFM QRPSFQYIVS RQIGQLRNEP IYLGNPRLRP ETTNSYDVGF VQALGGGFTL DVSGYYKDVK NLVQQADFID DRAGYQVSSY FNLDYADIRG FRIALTKRRG SFTGAINYQY SHATGKSPTA TAATPIFNRD TLGVVTTDLT NVPTRDILLD FDRRHNLIVT ATYATGANWG PRIFGKYVLN NMTFSVYSTL RSGRPYTSPS DLRRINAKST PAEYNTDLKI SKRFRNFFGA SASFYFEVFN LFNNKILNYS YLFRRPTPTN PNLPLQYYEQ YGIDDRENGV RYWWDKGRQG PFAVDQSFLI YSNEPRSYHF GMILEF
|
| |