Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4871 |
Symbol | |
ID | 6412557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5235086 |
End bp | 5237359 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714748 |
Product | TonB-dependent receptor |
Protein accession | YP_001993835 |
Protein GI | 192293230 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.782925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTGTC ACATCAAACG CGGCCTTGCG GCGTCGAGCT CTGCCACGCT GCTGTTAGCC GCTCTGAGTT CACCAAGTCT GGCGCAAAGC GCTGCGCCGG CCAGCCGCGG CGCGACACCA CTTCCAGAAA TCGACGTGAT CCAGCCGCAG CGTGCGCCCC ACCCGGCTCG ACGCCCAAAG ACTCAAACCG CAGCCCGGGC GCGCCGCGGG GCAGCGACAG CCTCGCCGCA GCACGACGCC CAAACGCCTG CGCAAACTGC GGCACAAGCC ATCGCGGCCA AACATGCGGG CTTCGATGCT GCGCGGCAGA CGATCTTTGC GCCGAATGGC GCGAGCGCCT TCGACGTCAA TCATGAGGCC ATCCTGGCAC TGCCGCAGGG CGCCAATGCG ACGCTCGACA AGGTGCTGTT GCAGGCGCCC GGCGTCTCGC AAGACTCCGC GGCCAGCGGC GATCTGCACG TCCGCAACGA GCACGCCAAT GTGCAGTACC GGATCAACGG CATCGCTTTG CCCGACGGCG TCAGCGGCTT CGGCCAGATG CTCGATACGT CGCTGGTCGG ACGCTTAACG CTGCTGACCG GCGCGCTGCC GGCGCAATAC GGCCTCCGCA CCGCCGGCGT CGTCGACATC ACCACGCGCA CCGACGCCTT CAACAACAGC GGCACCGTCA GCGTCTATGG CGGCAGCCGC CAGACCATCA CGCCGAGCGT CGAATATGGC GGCACCGCCG GCAACACCCA GTATTTCATC TCTGGCCGCT ACTTCGGGAC CGGGCTCGGG CTGGAGAACC CGACCTCGTC GCCGAACGCG ATCCATGACG ATTCCCAGCA GGGCAGAGGC TTCGCCTATC TGTCGACCGT GATCGACGAC ACGACGCGGC TGACCTTTAT CGGCGGCGGT TCGGCCAACA ACTACCAGAT CCCCAACAAC CCGGGCCAGA CGCCCGGCTT TACGGCGTTC GGCGTGTCGA ATTTCGACTC GGCGCAGCTC AACGAGACCC AGCGCGAACG CAACGCCTTC GGCGTGCTGG CGCTGCAGAA GTCGATCAAT GGCTTCGACC TGCAGATGTC GGCATTCTCG CGCTATAGCA TGCTGCACTT CACGCCAGAC ACGGTCGGCG ATCTGGTGTT CAACGGCGTC GCCTCTGACG TCTATCGCCG AAGCGTCGCC AGCGGCATCC AGGCCGACGG CTCGTACCGG CTCAGCGACG CGCATACGCT GCGCGGCGGC TTCCAGGTCA CCGCCGAGCA AAGCCGGGTG ACCAACTCCT CGGTGGTGCT GCCGCTCGAC GACAGCGGCA ACCCGCTGGA CGCGCCGTTC GGCGTCGTCG ACTCCAGCAG CAAGCTCGGC TGGCTGTTCA GCACCTATCT GCAGGACGAA TGGCGGGTCA CCAACACGGT GACGCTGAAC TCCGGCCTGC GCTTTGATCA AATGAACGAA TACACCAACG CCAATCAGCT CAGCCCGCGC ATCAGCCTGA CCTGGAAACC GACCGAGGAC ACCACGTTCC ACGCCGGCTA TTCGCGCAAC TTCACGCCGC CCGCGCAGGT GCTGGCGGCG CCGACCAATC TGGCGCTGGT GCAGAACACC ACGCAGCAGC CCGCCGTCAA CGCCAACAGC CCGGTGCTGC CGGAGCGGTC GAACGTGTTT GATGTCGGCG TCACGCAGAA GCTGCTGCCC GGCCTCGAGG TCGGCATCGA CACCTACTAC AAGACCGCGC GCGATCTGCT CGACGATGGC CAGTTCGGCG CCGCCTATGT GCTGTCGGCG TTCAACTACG ACCGCGCCGA GAATGTCGGC ATCGAGTTCA AGGGCGCGTA CACCAACGGC AACTTCCGGA TCTACGGCAA TCTGGCGCTG GCGCGGCAGA TCGCCACCAA GGTTGTCTCG AACCAGTATT TGTTCGATCC GGACGAGCTC GCCTACATCG CCAGCAACTA CATCTACACC GACCACGCGC AACTGGTGAC GGCGTCAGCC GGCGCATCGT ATCGCTGGCA CGATACCAAT TTCAGCGCGT CGATGATCTA TGGCAGCGGA TTGCGCTCCG GCTTTGCCAA TATCGGATCG CTGCCGTCCT ACACCCAGGT CAATCTGGGC GTATCGCACG ATTTCTATCT GGTGAGCGCG ACCAAGCCGA CCACGGTACG GTTCGACGTC GTCAACCTGT TCGACAGCGT GTACGAAATC CGCGACGGCT CGGGGATCGG CGTGTTCGCG CCGCAATATG GTCCGCGGCG CGGCTTCTAT GTCGGCGTGG CGCAGAAGTT CTGA
|
Protein sequence | MFCHIKRGLA ASSSATLLLA ALSSPSLAQS AAPASRGATP LPEIDVIQPQ RAPHPARRPK TQTAARARRG AATASPQHDA QTPAQTAAQA IAAKHAGFDA ARQTIFAPNG ASAFDVNHEA ILALPQGANA TLDKVLLQAP GVSQDSAASG DLHVRNEHAN VQYRINGIAL PDGVSGFGQM LDTSLVGRLT LLTGALPAQY GLRTAGVVDI TTRTDAFNNS GTVSVYGGSR QTITPSVEYG GTAGNTQYFI SGRYFGTGLG LENPTSSPNA IHDDSQQGRG FAYLSTVIDD TTRLTFIGGG SANNYQIPNN PGQTPGFTAF GVSNFDSAQL NETQRERNAF GVLALQKSIN GFDLQMSAFS RYSMLHFTPD TVGDLVFNGV ASDVYRRSVA SGIQADGSYR LSDAHTLRGG FQVTAEQSRV TNSSVVLPLD DSGNPLDAPF GVVDSSSKLG WLFSTYLQDE WRVTNTVTLN SGLRFDQMNE YTNANQLSPR ISLTWKPTED TTFHAGYSRN FTPPAQVLAA PTNLALVQNT TQQPAVNANS PVLPERSNVF DVGVTQKLLP GLEVGIDTYY KTARDLLDDG QFGAAYVLSA FNYDRAENVG IEFKGAYTNG NFRIYGNLAL ARQIATKVVS NQYLFDPDEL AYIASNYIYT DHAQLVTASA GASYRWHDTN FSASMIYGSG LRSGFANIGS LPSYTQVNLG VSHDFYLVSA TKPTTVRFDV VNLFDSVYEI RDGSGIGVFA PQYGPRRGFY VGVAQKF
|
| |