Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3816 |
Symbol | |
ID | 3911619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4356673 |
End bp | 4358544 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885717 |
Product | flagellar hook-associated protein |
Protein accession | YP_487421 |
Protein GI | 86750925 |
COG category | [N] Cell motility |
COG ID | [COG1256] Flagellar hook-associated protein |
TIGRFAM ID | [TIGR02492] flagellar hook-associated protein FlgK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.308813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTCG GAGACGCACT TTCGATCGCA ATGGCGGGCC TGCGCGCCAA CCAGGCCTCG ATGTCGCTGG TGTCGTCGAA CGTCGCCAAC GCCGAGACGC CGGGTTACGT CCGCAAGACC GTCAATCAGG TCACGACGCT GTCCGGCCCG TCGGGCAGCG GCGTTTCGAT CACCGGTGTC AACCGCGAAC TCGACGCCTA TCTGCAGGCG CAGCTCCGCA CCGAGACGTC GGGCGCGTCC TACGCCTCGT TGCGCTCCGA CTTCCTGCAG CAATTGCAGG GACTGTTCGG CGATCCGAAC TCGAACGGCA CGCTGGAGGA CGCGTTCAAC GGTCTCACCG CGGCGACGCA GGCGCTCGCC ACCAGCCCCG ACAGCACCTC GGCGCGGATC GGCGTGCTCA ACGCCGCGCA GGTGGTGTCC GGCGTGCTGA ATTCGATGTC GAACGGCATC CAGACGCTGC GCACCGGCGC CGAGACCGGC CTGACCGACG CCGTCAACAC GGCGAACAAT CTGCTGCAGC AGATCGCCTC GATCAACAAC AACATCCGCA CCAACCCGGC CGGCGGCACC TCCACCGACG CGGCGACCGC GTCGCTGCTC GACCAGCGCG ACGCCGCGAT CAACCAGCTC GCGCAGCTGA TGGACATCCG CGTCGTCACC GACGCCTCCA ACCAGGTCAC GGTGTTCACC GGTTCGGGCA TGCAACTCGT CGGCATGCAG GCCGCCCAGC TCAGCTTCGA CGCCCAGGGC ACCGTGACGC CGAGCACCAC CTGGAATCCC AACACCTCGG CGAGCGAGCT CGGCTCGGTC CGGATCGTCT ACCCTGACGG CAGCACGGCC GACCTGACCA ATTCGCTGAA GTCCGGCAAG ATGGCGGCCT ATGTCGAGCT GCGCGACAAC ACGCTGGTGC AGGCGCAAAC CCAGCTCGAC CAGTTCGCCG CGGCGATGGC CAGCGCGCTG TCGGACAAGA CCACCGCCGG CACGCCGGCG ACCTCCGGCG CGCAGACGGG TTTCGACCTC GATCTGACCG ACATGAAGGC CGGCAACACG GTCAACATCA CTTACACGGA CACCACGACC GGCGCGCAGC GGACCGTCTC GGTGATGCGC GTCGACGATC CGACGGCGCT GCCGCTGCCG CAGACCGCGA CGCTCAATCC CAACGACTAC GTGGTCGGTA TCGATTTCTC CGGCGCCTCG GGGTCGGTCA CCGCGCAGCT CAACGCCGCG CTGAACGCGA GAAACCTGCA GTTCACCGGG ACCTCGCCGA ACATCACCGT CCTGAACAAT CCCGGCTTCT CGACGGTGAA TTCCGCCTCG GTGACGTCGA CGGTGACGTC GCTGACCGGC GGCAGCGCCG AGGTGCCGCT GTTCACCGAC GCCGGCTCGC CCTACACCGG CGCGATCAGC GGCAACGGCA CGCAGATGAC CGGCCTCGCG CAGCGCATCT CGGTCAATCC CGCGCTGGTC ACCGATCCGT CGCGGCTGGT GGTGTATTCG ACCACGCCGC CGACCGCGGC CGGCGACACC ACGCGGCCGG ATTTCATCAC CAAGCAGCTC ACCAGCAGCA AATATCTGTA CTCGGCGACG ACCGGCATCG GATCGAATGC CGCGCCCTAC AACGGCACGC TGGAGAGCTT CCTGCAGCAA TTCGTCAGCC AGCAGGGCTC GAATGCGCAG GCGGCGACAC AGCTCGCCAG CGGACAAAGC GTCGTCCTGA ATACGCTGCA GCAGAAATAC GCGACCAATT CCGGCGTCAA CATGGACGAG GAAATGGCGC ATCTGCTTTC GCTGCAGAAC GCCTATTCGG CGAACGCGCG AGTGATGTCG ACGGTCAACC AGATGTATCA GACGCTGATG CAGGCGATGT GA
|
Protein sequence | MSLGDALSIA MAGLRANQAS MSLVSSNVAN AETPGYVRKT VNQVTTLSGP SGSGVSITGV NRELDAYLQA QLRTETSGAS YASLRSDFLQ QLQGLFGDPN SNGTLEDAFN GLTAATQALA TSPDSTSARI GVLNAAQVVS GVLNSMSNGI QTLRTGAETG LTDAVNTANN LLQQIASINN NIRTNPAGGT STDAATASLL DQRDAAINQL AQLMDIRVVT DASNQVTVFT GSGMQLVGMQ AAQLSFDAQG TVTPSTTWNP NTSASELGSV RIVYPDGSTA DLTNSLKSGK MAAYVELRDN TLVQAQTQLD QFAAAMASAL SDKTTAGTPA TSGAQTGFDL DLTDMKAGNT VNITYTDTTT GAQRTVSVMR VDDPTALPLP QTATLNPNDY VVGIDFSGAS GSVTAQLNAA LNARNLQFTG TSPNITVLNN PGFSTVNSAS VTSTVTSLTG GSAEVPLFTD AGSPYTGAIS GNGTQMTGLA QRISVNPALV TDPSRLVVYS TTPPTAAGDT TRPDFITKQL TSSKYLYSAT TGIGSNAAPY NGTLESFLQQ FVSQQGSNAQ AATQLASGQS VVLNTLQQKY ATNSGVNMDE EMAHLLSLQN AYSANARVMS TVNQMYQTLM QAM
|
| |