Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1668 |
Symbol | |
ID | 4022148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1882247 |
End bp | 1884118 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637961863 |
Product | flagellar hook-associated protein |
Protein accession | YP_568806 |
Protein GI | 91976147 |
COG category | [N] Cell motility |
COG ID | [COG1256] Flagellar hook-associated protein |
TIGRFAM ID | [TIGR02492] flagellar hook-associated protein FlgK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.121705 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCTCG GAGACGCACT TTCGATCGCA ATGGCCGGCC TGCGCGCCAA CCAGGCCTCG ATGTCGCTGG TGTCGTCCAA CGTCGCCAAC GCCGAGACGC CGGGTTACGT CCGCAAGACC GTCGATCAGA TCACCACCAC TGCCGGCCCG TCTGGCAGCG GTGTTTCGAT CATCGGCGTC AACCGCGAAC TCGACGCCTA TCTGCAGTCG CAGCTTCGCA CCGAAACCTC GGGCGCCTCC TACGCCTTGC TGCGCTCCGA CTTTCTGAAG CAATTGCAGG GCCTGTATGG CAACCCGAAC TCGACCGGCA CCCTTGAGAA CGCGTTCAAC AGTTTGACCG CCGCGGTACA GGCGCTCGGC ACCAGCCCCG ACAGCACCTC GGCGCGAATC GGCGTGCTCA ACGCCGCGCG GGTGGTGGCG GGCGGGCTCA ACGCGACATC CAACGGAATC CAGTCGCTCC GCTCCGGCGC CGAGACCGGA CTGGCCGACA GCGTCAACAC GGCGAACAAT CTGCTGCAGC GGATTGCATC GATCAACAAC AACATCCGCA CCAATCCCGC GGGGGGCACC TCGACCGACG TGGCGACCGC GTCGCTGCTC GACCAGCGTG ACGCGGCGAT CAGCCAGCTC TCGCAACTGA TGGACATCCG CGTCGTCACC GACGGCTCCA ATCGGGCCAC GGTGTTCACC GGCTCCGGAA TGCAGCTCGT CGGTATGCAG GCGGCCAAGC TGTCCTTCGA TGCGCAGGGC ACCGTGACGC CGAGCACGAC CTGGAGCTCG AACTCGGCGA CGAGCCAGCT CGGTTCGGTC AAGATCACCT ATGCGGATGG TGGCACGATC GATCTCACCA GTTCGCTGAA ATCGGGCACG ATTGCGGCCT ATATCGAGCT GCGCGACAAG ACTCTGGTGC AGGCCCAGAC CCAGCTCGAT CAATTCGCGG CGTCGATGGC GAGCGCTTTG TCCGACAAGA CCACCGCCGG AACCCCGGCG ACGTCGGGCG CGCAGGCCGG TTTCGCGCTC GATCTGACCA ACATGAAGCC CGGCAACACC TTCAACATCA GCTACACCGA CACGACGACA GGCGCGCAGC GCACGGTGTC GGTGATGCGG GTCGACGATC CCTCGGTGCT GCCGCTGCCG CAGACCGCGA CGCTCGATCC CAACGACTAT GCGGTCGGCA TCGACTTCTC GGGCGCGTCG GGATCGATCA CCGCACAGCT CAACGCCGCG CTGAACGCCA AGAACCTGGA GTTCACCGGC ACGTCGCCGA ACATCACCGT GCTCAACAAT CCAGGCTTCT CGACGGTGAC TGCGGCCTCG GTGACCACGA CCGAAACCTC GCTGACCGGC GGCAGCGCCG AGGTGCCGTT GTTCACCGAC GGCTCGTCGG CCTACACCGG CGTGCTCAGC GGCACCGGAG CGCAGATGAC CGGCTTCGCG CAGCGCATCG CGGTCAATAC CGGGCTGATC ATCGATCCGT CGCGGCTGGT GGTGTATTCG ACCACGCCGC CCACCGCGGC CGGCGACACC ACGCGGCCGG ACTTCCTCAC CAAACAGCTC ACCACCAGCA AGTATCTGTA CTCGGCGGCG ACCGGGATCG GTTCGACCAG TGCGCCGTAT AACGGCACGC TGTCGAGCTA CCTGCAGCAG TTCGTCGGTC AGCAAGGCTC CGACGCATTG GCGGCATCGC AACTCGCCGA GGGGCAGAGC GTCGTGCTGA ACACGCTGCA GCAAAAGTAT TCGACCAGCT CCGGCGTCAA CATGGACGAA GAGATGGCGC ATCTGCTGTC GCTTCAAAAC GCGTATTCGG CGAATGCACG GGTGATGTCG ACGGTGAACC AGATGTATCA GGCCCTGATG CAGGTGATGT GA
|
Protein sequence | MGLGDALSIA MAGLRANQAS MSLVSSNVAN AETPGYVRKT VDQITTTAGP SGSGVSIIGV NRELDAYLQS QLRTETSGAS YALLRSDFLK QLQGLYGNPN STGTLENAFN SLTAAVQALG TSPDSTSARI GVLNAARVVA GGLNATSNGI QSLRSGAETG LADSVNTANN LLQRIASINN NIRTNPAGGT STDVATASLL DQRDAAISQL SQLMDIRVVT DGSNRATVFT GSGMQLVGMQ AAKLSFDAQG TVTPSTTWSS NSATSQLGSV KITYADGGTI DLTSSLKSGT IAAYIELRDK TLVQAQTQLD QFAASMASAL SDKTTAGTPA TSGAQAGFAL DLTNMKPGNT FNISYTDTTT GAQRTVSVMR VDDPSVLPLP QTATLDPNDY AVGIDFSGAS GSITAQLNAA LNAKNLEFTG TSPNITVLNN PGFSTVTAAS VTTTETSLTG GSAEVPLFTD GSSAYTGVLS GTGAQMTGFA QRIAVNTGLI IDPSRLVVYS TTPPTAAGDT TRPDFLTKQL TTSKYLYSAA TGIGSTSAPY NGTLSSYLQQ FVGQQGSDAL AASQLAEGQS VVLNTLQQKY STSSGVNMDE EMAHLLSLQN AYSANARVMS TVNQMYQALM QVM
|
| |