Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1670 |
Symbol | |
ID | 4022150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1886025 |
End bp | 1888313 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637961865 |
Product | flagellin |
Protein accession | YP_568808 |
Protein GI | 91976149 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.940906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0365182 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGTA TCGTTCTCTC GAACGCCGTC CGCCAGAATC TTTCGTCGCT TCAGGCCACG GCCGATCTGC TCGCCACCAC CCAGAGCCGT CTTTCGTCCG GCAAGAAGGT GAACTCGGCT CTCGACAATC CGACCAACTT CTTCACCGCG TCGGGTCTCG ATGCCCGCTC CAGCGACATT AACAACCTGC TCGACGGTAT CGGCAACGGC GTGCAGATCC TGCAGGCCGC CAACACCGGC ATCACCTCGC TGACCAAGCT GGTCGATAGC GCCAAGTCGA TCGCCAACCA GGCCCTGCAG ACCGTCTCGG GCTATTCGAC GAAGTCGAAT GTCTCGACCA CGATCACCGG TGCCACTGCC AACGACCTCC GCGGCACTAC CAGCTACTCC AGCACTTCCG CGGCGGGTAA CGTGCTGTAT TCGGGCGCCG CCGGCGGTGC CACGGCTGCG ACCTCGGCTG CGACCCTCGG TGGCACTGCT GGTTCGCTTG TCGGGTCTGG CGTGGTCAAC AACAACCTCA CGGTTCCGGT GGCAATCGAT TCGACCACCC GTCTGTTCGC AGCGGGCGGT GGCGGCACTG CCGGTCTTAC CACCCAGGCA AACACCACCT TCACCGATGG TTCGAAGCTG TCGGTCAACG GCAAGACCAT CACCTTCAGT GCGACGGCGG TTCCCGGTGC CAGCGCCGTT GCGGCCGGCT CCAGCCTGTC CAGCACGAAC GTCGTGACCG ACTCCGGCGG CAACTCGACG GTTTATCTCG GGACCGCTGC AGACTCAGCG GCCACGGTCG GCGATCTGAT GGCCGCGATC GATGTCGCCA GCGGCGCACA GTCGATCACC GCGATCAACG CCACCACCAA GATCGCGACC TTGACCGGCG GTGCGGGCGC ATCTTCGATC ACCGGCGGCA CCGTCACGTT GAAAAGCTCG ACCGGCGCCG ATCTGTCGAT CTCTGGCACC GCAGACATGC TGGCGTCCCT GAAACTCACG GCGTCGCTGG GTTCGAGCGT CACGACTGTC GCCGCTGCTC GTGCCACCTC GTCGTCCAGC CTCGGCAGCC TGATTGAGGA CGGCTCGACG CTGAACGTCA ATGGCAAGAC CATCACCTTC AAGAACACGC TGTCGACCGA CGTGAATGCG ATTCCGACCG GCTTCGGCAA GCCGAGCGGC GCGCACTACG CCACCGACGG CAACGGCAAT TCGACCGTGT TCCTGCAGGA CGCGACGGCT GCCGACATGC TGTCGGCGAT CGACCTCGCC ACCGGCACCA AGAGCGCGAC CATTGCCACC AGCGTCGCCA CAGTGACGAC CCCGGCCGGC AACGTCGCGT CGACGGTGCT GAGCGGCGCG CTCAAGCTGT CGACCGGTAC CGCGGCCGAC CTCTCGATCA CTGGTACCGG CAACGCACTC GCCGCCCTCG GCCTCAACGG CCCGACAGGC ACCGACACCT CGTTCAACGC ATCGCGGACG GCGAGCGCCG GCAATGTCAG CGGCAAGTCC TTGACCTTCA CCTCCTTCAA GGATGGTGCG GCGGTGAATG TCACCTTCGG TGACGGCACC AACGGCACCG TCAAGTCGCT CGCCCAGCTC AACACTGCGC TTGCGGCCAA CAACATGGTG GCGGTGGTTG ACAATGCGAC CGGCAAGCTG ACGATCTCGG CGTCGAACGA TTTCGCTTCC CACACATTGG GAAGCAGCGA CGGCGGCGCG ATCGGTGGCA CACTGAGTTC GACACTGACC TTCTCGTCTG CGTCGGCTCC GGTGGCTGAT ACCAATGCCC AGAACACCCG CGCCGGCCTG GTCAAGCAAT ACAACGACAT CATGGACCAG ATCAAAACCA CGGCCCAGGA TGCCTCGTTC AACGGCGTCA ACCTGCTCGA CGGTGACACG CTGAAGCTGG TGTTCAACGA AACCGGCAAG TCGACGATCT CGATCCAAGG CGTCAGCTAC AATCCGACCG GCCTCGGCCT GTCGACCCTG ACTTCGGGCA CCGACTTCAT CGACAACGAT GCGACCAACT CCGTGTTGGC CAAGCTGAGC ACCGCATCCA CGACCCTGCG GTCGCAGGCC TCGGCGTTCG GTTCGAACCT CTCGATCGTC CAGGCGCGTC AGGACTTCTC GAAGAACCTG ATCAACGTGC TGCAGACCGG CTCGTCGAAC CTGACGCTGG CCGACACCAA CGAGGAAGCG GCCAACAGCC AGGCGCTGTC GACCCGCCAG TCGATCGCGG TGTCCGCGCT GTCGCTCGCC AACCAGTCTC AGCAGGGCGT GCTCCAGCTG CTGCGCTGA
|
Protein sequence | MSGIVLSNAV RQNLSSLQAT ADLLATTQSR LSSGKKVNSA LDNPTNFFTA SGLDARSSDI NNLLDGIGNG VQILQAANTG ITSLTKLVDS AKSIANQALQ TVSGYSTKSN VSTTITGATA NDLRGTTSYS STSAAGNVLY SGAAGGATAA TSAATLGGTA GSLVGSGVVN NNLTVPVAID STTRLFAAGG GGTAGLTTQA NTTFTDGSKL SVNGKTITFS ATAVPGASAV AAGSSLSSTN VVTDSGGNST VYLGTAADSA ATVGDLMAAI DVASGAQSIT AINATTKIAT LTGGAGASSI TGGTVTLKSS TGADLSISGT ADMLASLKLT ASLGSSVTTV AAARATSSSS LGSLIEDGST LNVNGKTITF KNTLSTDVNA IPTGFGKPSG AHYATDGNGN STVFLQDATA ADMLSAIDLA TGTKSATIAT SVATVTTPAG NVASTVLSGA LKLSTGTAAD LSITGTGNAL AALGLNGPTG TDTSFNASRT ASAGNVSGKS LTFTSFKDGA AVNVTFGDGT NGTVKSLAQL NTALAANNMV AVVDNATGKL TISASNDFAS HTLGSSDGGA IGGTLSSTLT FSSASAPVAD TNAQNTRAGL VKQYNDIMDQ IKTTAQDASF NGVNLLDGDT LKLVFNETGK STISIQGVSY NPTGLGLSTL TSGTDFIDND ATNSVLAKLS TASTTLRSQA SAFGSNLSIV QARQDFSKNL INVLQTGSSN LTLADTNEEA ANSQALSTRQ SIAVSALSLA NQSQQGVLQL LR
|
| |