Gene Information |
Locus tag | Rsph17029_0089 |
Symbol | |
ID | 4895258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 100998 |
End bp | 103097 |
Gene Length | 2100 bp |
Protein Length | 699 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640110672 |
Product | TonB-dependent siderophore receptor |
Protein accession | YP_001041981 |
Protein GI | 126460867 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4774] Outer membrane receptor for monomeric catechols |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor [TIGR03304] outer membrane insertion C-terminal signal |
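The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the stop codon; both assumptions are consistent with the figures in this record but are not stated by it. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Protein length implied by a CDS length, assuming the CDS
    includes one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 100998, End bp 103097.
length = gene_length_bp(100998, 103097)
protein = expected_protein_length_aa(length)
print(length, protein)
```

Under these assumptions the result agrees with the Gene Length (2100 bp) and Protein Length (699 aa) fields. Because the gene lies on the minus strand, the listed sequence is presumably the reverse complement of the replicon region, which does not affect the length arithmetic.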
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.26085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGA AACACCGGTC GATGCTGCTC GCAGGCGCGA GCACCATTGC TCTGGCCCCG GCCGCCCTCT TCGCCCAGGA GGCGATCGAG CTCGACGCGA TCGTGGTCAC GGCCACCACC GACGTCACAA CCCAGGCCGA TGGCTACAAG GCCGACTACA ACCAGTCCGC CACCAAATCC GACACCCCCG TGGCCGAGAC GACCCAATCG GTCTCGGTCG TGACGGCAGA GCAGATCAAG GATCAGGGCG CGGAAACCCT CGGGCAGGCG CTCCGCTACA GCCCCGGCGT GCTCGGCGAC CCCTACGGCG TCGATCCGCG CTTCGACAGC CCCACGATCC GCGGCTTCGA GGCGCGCGGC TCGCAATATG TCAACGGCCT GCGGCAGCTG CGCTACATGG GTGCCCCGGC CTACGAGACC TTCGCGCTCC AGCAGATCGA GGTGCTGAAC GGTCCCAACT CCTCGCTCTA TGGCGCGGGC TCGCCGGCCG GGATCATCAA CCAGGTCCAG AAACGCGCGC AGGGCTTCGA CTTCGGCGAA CTGGGCGTGG GCCTCGACGA CAACGGCTCG CGCCAGAGCT TCTTCGACTG GAACCGCACG GTGAGCGACA CGCTCTCGTT CCGCGCGACC GGGATCGCCA AGGACTACGA GAGCCAGGTG GAGGAGATCG GCCTCGAACG CGGGTATCTG GGCCTCGCCG CCCGCTGGAA GCCCACCGAC CGCACCACGC TCGACATCAT CTCCTCCTAT ACCGACGACG CGCCGACTTC GCCGCCGGGC ATCCCCTTCG CGCTCACCGG GCAGGGGAAC GACAAGTATC TGCGCGAGCT CTACACCGGC GAGCCGGGCT GGGACGACCA TGACCGCCAG ATCTTCAACA TCGGCTACGA GCTGAGCCAC GAGTTCGACA GCGGCTGGAC CTTCAGCCAG GGCTTCCGCT ACGAGAAGTT CGACTGGGAA TATACCGGCC ACTATGTGAC CGGCATCGAC GCCAGCGGCA CCGGGATCAC CCGGGGCGCC AACTATCAGC GCGAGAACAC CACCGGCCTG AGCCTCGATT CACGGCTTGC CGGAGAGGTG CTGACCGGCG GCATGGAGCA CAAGCTGCTC TTCGGGCTCG ACCTGCGCAA ATACGACGCC GACACCGTCA CCGAGTTCTA CAACGCCACC GGCGGCGTCA CGAACCTCGA CTGGCGTAAC CCGATCTATG GCGGCGTTCC CACCGGCGCG CCCTGGTATG TCAGCACGCC CGACGTGACG CAGACCCAGA TCGGCCTCTA CGCGCAGGAC GAGATCACCG CCGGGCGCTG GCGCGGGTCG ATCGCGCTGC GCCACGACTG GTCGAAGCAG GAGGGCACGA CCTACACGAA CTTCGCGGGC GAAGGCGAGA TCGACCAGTC GGACAAGGCG CTGTCGGGTC GCGCGGGCCT TGGCTACGAG ATCGCGCCCG GCGCGCTCGT CTATGCCAAC TACTCCACCT CCTTCGACCC GGAGATCGGC GTGGACGGCG CGGGCGAGCA GCTGGAGCCG ACGACCGGCA AGCAATGGGA GCTGGGCGTG AAATATCAGC CCGACAGCTT CAACGCGCTC TTCACTGCGG CGATCTACGA TCTGCGGCAG GAGAACCTGA CGGTGAATCT CGGCGGGGCC GAGGGCCGTC GTCAGGTCGG CGAGGTCAAG TCCTCCGGCC TTGAACTCGG CGCGGTGGGC GAGCTGGCAC CGGGGCTGAA CCTGCGCGCA AGCTACGCCT ACAACGACAC CGAGCAGGTC GATCCGAGCG GCGCCAACGA CGGCAACGAG 
ATGCCCAACG CGCCGCGCCA TCTGGCGAGC CTCTGGCTCG ACAAGGCCTT CGACAACGGC GTGAGCCTCG GCGGCGGCCT GCGCTACATC GGCGAGCGCG AGGGCGATCT GGCCAACCTC TATTCGCTCG ACTCCGTGAC GCTGCTCGAC CTCGCCGTGG GCTACAGCCG CGAGAACATG GAGGCCTCGA TCAACCTCAA CAACCTGTCC GACGAGGTCT ACCTCGCCAA CTGCGGCTCC TTCGGCTGCT ACTACGGCGA GGGCCGGACG ATCTCGGCCA AGATCAGCTA CAAGTGGTAG
|
Protein sequence | MKTKHRSMLL AGASTIALAP AALFAQEAIE LDAIVVTATT DVTTQADGYK ADYNQSATKS DTPVAETTQS VSVVTAEQIK DQGAETLGQA LRYSPGVLGD PYGVDPRFDS PTIRGFEARG SQYVNGLRQL RYMGAPAYET FALQQIEVLN GPNSSLYGAG SPAGIINQVQ KRAQGFDFGE LGVGLDDNGS RQSFFDWNRT VSDTLSFRAT GIAKDYESQV EEIGLERGYL GLAARWKPTD RTTLDIISSY TDDAPTSPPG IPFALTGQGN DKYLRELYTG EPGWDDHDRQ IFNIGYELSH EFDSGWTFSQ GFRYEKFDWE YTGHYVTGID ASGTGITRGA NYQRENTTGL SLDSRLAGEV LTGGMEHKLL FGLDLRKYDA DTVTEFYNAT GGVTNLDWRN PIYGGVPTGA PWYVSTPDVT QTQIGLYAQD EITAGRWRGS IALRHDWSKQ EGTTYTNFAG EGEIDQSDKA LSGRAGLGYE IAPGALVYAN YSTSFDPEIG VDGAGEQLEP TTGKQWELGV KYQPDSFNAL FTAAIYDLRQ ENLTVNLGGA EGRRQVGEVK SSGLELGAVG ELAPGLNLRA SYAYNDTEQV DPSGANDGNE MPNAPRHLAS LWLDKAFDNG VSLGGGLRYI GEREGDLANL YSLDSVTLLD LAVGYSRENM EASINLNNLS DEVYLANCGS FGCYYGEGRT ISAKISYKW
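The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be cleaned before any computation. The following is a minimal sketch, assuming only that whitespace is the sole formatting in the field; the helper names are hypothetical. The stop codons listed are those of translation table 11, and the start codons are a commonly used subset of that table's initiators, not the full list.

```python
# Stop codons of translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Subset of table 11 start codons; an assumption made for brevity.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if seq begins with a listed start codon and ends with a stop codon."""
    return seq[:3] in START_CODONS and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence field, used as a small example.
head = clean_sequence("ATGAAGACGA AACACCGGTC")
print(len(head), gc_fraction(head))
```

Applying `clean_sequence` to the complete Gene sequence field and passing the result to `gc_fraction` gives a value that can be compared with the GC content field, and `has_start_and_stop` offers a quick sanity check on the CDS boundaries.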