Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A2855 |
Symbol | |
ID | 3836295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 3287041 |
End bp | 3289854 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637826966 |
Product | hypothetical protein |
Protein accession | YP_427939 |
Protein GI | 83594187 |
COG category | [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACCA GCCGCCTGCG GCCGACCTCT CCGCTGGTTC GCCAAGCGGC GGCTTGCCTG GATGCCGGAG AGGCCGATCA TGCCGATGTT CTTCTTGATC TTCACCTGTC GCGCCATCCC GAGGACTCCG CCGGGCTGAT CCTGTTCGGT TTGACCCGGG TCGCCCTTGG GCGCCCCCTG GAAGCCGAGC AGCCGCTGCG CCGGGCCCTC GCCCTTGATC CCATCGATGA TCCCTTTCAT GCCCGCGCCG TCTGTGCGCT GGTCGATGTT CTCAAGGCCC AGGGGCGGAT GGCCGAGGTC TCCGAGGTTT TGGAGCGGGC GGGGGATGTC GCCCCGATGA ACGCGGTGTT TCCCCGCATG CTCGGCGCCC ATTACCGGGC CCTGGGGCAG GTGGGGGCGG CGATCACGGC GGCGCGCCGC GCCGTGGCCG TGGAGTCCGC TGATGGCGAG AATTGGCGGG GGTTGGCCGC CACCTTGCTG GGCGGCGCTC CAGGCGAGGC GCTTGAGGCC CTGGAGCGGG CGAAGACCTG CGGCGACGAC GGCGCCGACT GGTGGAATAT CCGCGGCGTC GCCCTGGCGG CCTTGGGCGA GGGCGAGGCG GCGCGGGAGG CCTTCAAAAA CGCCATCCTG CGCGATGGCG CCGCGCCGAT GGCCTGGACC AATCTCGGCA ATCTCGAAAT CCGCGCCGCG CGCTCGGCCG ACGCCCTGGA GCGCGCCGTC GAGGCCTATG GTCAGGCCAT CCGTCGCGCC CCCGATAGCT TCGAGGCCCA CAACAATCTG GCCCAGGCCC TGCGCGATCT GGGCCGGCGC GACGAGGCCC TGGTCCATGC CGAGCGGGCG GCGTCGTTGC GTCCGCACGA TGTGACCGTG CTCAACACCC AGGCCAATAT CCTGCGCGGC CTGAGGCGGG TTGACGAGGC CGTGGCGATT CTGGAACGCG CCCTGGCGTG TGATGGCCGT TCGGCGGAAA CCCACAGCAA TCTGGGGCTG GCCCTGCTCG CCGCCCAGGA TCGCCAGAGG GCGGAGGAGC ATTTCCGCAA GGCGGCGGCT CTGGCGCCCG ATTGCGTGGA TATCATCGTC AACCTCGCCT GCTTTCTCAT TCACATCAAT GCCGGCAAGG AGGTTCTTGG CGTTCTGGAA CCGGCGCTCG CGCGGTTTCC CGGCGATGTG GGGCTTCTTG CCACCCTGGG GCTCCATTAC TTCCAGGAAA ACGCCTATGA GGACTGCGCG ACGGTTTTCG AGACCGTTCT CGCCAAAAAA CCCGACCATG TTGAAGCGCG CGCCTCGCTG GGGGTGGTGC GCTGGTCGCA AGGCCGGCTG GTCGAGTCGC TGGCCCTCGC CCAGTCCATC CGCGAGGATT TTCCCGAGGA TATACGCACC TTGTTCCTGC TTGGCTGCGT CCATAGCGAC CTGATGGAAA ACGCCTCCGC CCTGGAGGTT TTCGATCTCG CCGTGGAAAA AGGCGGCGAG CGGGTGCCGC CTTTCCATTA CAGCAATTGC TGCTTCAGCT TTCATTACGT CGAAGCCATG GAGAAGGACG ATCTTTTCGC CCGCCACCGG CGCTGGGACG CCCTGTATGG CGCCGGGGCG ACCGAGCTCT CCGATGTCAC CCATCACAAC ACCCGTGATC CCGAGCGGCG TTTGAGAATC GGCTATGTCT CGCCCGACTT CCGTCGTCAC TCGGTCGCCT ATTTTTTCCT GCCGCTGGTT GGCGCCCATG ATCGGGCCGC GGTCGAGGTC ACCTGCTATT CCCTGTCGAC CTTGACCGAT CAGATGACCG ATCTGATCCG CGAGGGCTGC GACCGCTGGC GCGATATCAC CACCCTGCCG GCGGAAAAGG CGGCCGAGAG GGTGCGCGAG GACGAAATCG ATATCCTGGT CGATCTCTCG GGCCATACCG CCAACAACGG CCTGTCGATC TTCGCTCTCA AGCCCGCACC CGTCCAGGTC ACCTATCTGG GCTATCCCAA TACCACCGGG CTTTCGGCGA TCGATTACCG GCTGACCGAT GGCTTCGCCG ACCCCTTGGG GGTGGACGAG GATCCGGCTT CGGAAACCCT GTGGCGGTTG CCACGCAGTT TTTTGCTGTT TGACGAGGTA CCCGGACTGC CCGATCCGGC GCCGCCGCCG GTTCTGAGCC GGGGCACCAT CACCTTCGGC TCGTTCAATA ATATGGCCAA GGTCAATCGC GGCTGTGTCG CGGTCTGGAA GCGCATCCTC GATGCGGTGC CGGGAAGCCG GCTGTTGCTC AAGGCCAAGG GGTTCCGCGA TCCCGCCACC GCCAAGGTCT TGATCGAGCG TCTGATCGAC TGGGGGGTGG ACCCCGAGCA GGTGAGCTAT GCCCCTTACG CCAAGAACCT GATGGAGCAT GTGGCGGTTT ATGCCGAGGT CGATATCGCG CTGGACACCT TCCCCTATAA CGGCACCACC ACCACCTTCG AAGCCCTGCA CATGGGCGTA CCGGTGATCG GCCTGCGCGG CCATCGCCAT TCGGGGCGGG TGGGGGCGAG CATCCTTGGC AATCTTGGTC TCGCCGATCG TCTGCTTGGC GAGGACGTCG ACGATATGGT GGCCAAGGCG GTTGCCTTGG CCGGCGATCT GGAGGGATTG ATGCGGATGC GCGCGGGCTT GCGCGAGCGT CTGCGCGCCT CGCCCTTGCA AGACGGACCC GGCTTCGTGG CCGACCTTGA AGGAGCCTAT CGCGCCATGT GGCGGCGCTG GTGCGCCGGG CCGCCGACCT TCCACCGCCA ACCAAAGAAT GGCATGTGGG CGGTCGAGGA AATGGAAGAC GCCATCGAAC CCGTGCTGGG ATGA
|
Protein sequence | MDTSRLRPTS PLVRQAAACL DAGEADHADV LLDLHLSRHP EDSAGLILFG LTRVALGRPL EAEQPLRRAL ALDPIDDPFH ARAVCALVDV LKAQGRMAEV SEVLERAGDV APMNAVFPRM LGAHYRALGQ VGAAITAARR AVAVESADGE NWRGLAATLL GGAPGEALEA LERAKTCGDD GADWWNIRGV ALAALGEGEA AREAFKNAIL RDGAAPMAWT NLGNLEIRAA RSADALERAV EAYGQAIRRA PDSFEAHNNL AQALRDLGRR DEALVHAERA ASLRPHDVTV LNTQANILRG LRRVDEAVAI LERALACDGR SAETHSNLGL ALLAAQDRQR AEEHFRKAAA LAPDCVDIIV NLACFLIHIN AGKEVLGVLE PALARFPGDV GLLATLGLHY FQENAYEDCA TVFETVLAKK PDHVEARASL GVVRWSQGRL VESLALAQSI REDFPEDIRT LFLLGCVHSD LMENASALEV FDLAVEKGGE RVPPFHYSNC CFSFHYVEAM EKDDLFARHR RWDALYGAGA TELSDVTHHN TRDPERRLRI GYVSPDFRRH SVAYFFLPLV GAHDRAAVEV TCYSLSTLTD QMTDLIREGC DRWRDITTLP AEKAAERVRE DEIDILVDLS GHTANNGLSI FALKPAPVQV TYLGYPNTTG LSAIDYRLTD GFADPLGVDE DPASETLWRL PRSFLLFDEV PGLPDPAPPP VLSRGTITFG SFNNMAKVNR GCVAVWKRIL DAVPGSRLLL KAKGFRDPAT AKVLIERLID WGVDPEQVSY APYAKNLMEH VAVYAEVDIA LDTFPYNGTT TTFEALHMGV PVIGLRGHRH SGRVGASILG NLGLADRLLG EDVDDMVAKA VALAGDLEGL MRMRAGLRER LRASPLQDGP GFVADLEGAY RAMWRRWCAG PPTFHRQPKN GMWAVEEMED AIEPVLG
|
| |