Gene Information |
Locus tag | RPC_1489 |
Symbol | |
ID | 3972273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1627342 |
End bp | 1629582 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637924604 |
Product | flagellin |
Protein accession | YP_531370 |
Protein GI | 90423000 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
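The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length. It also assumes that the protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 1627342
end_bp = 1629582

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# The gene is an unspliced CDS, so each codon encodes one residue
# except the final stop codon, which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2241 746
```

Both results agree with the Gene Length (2241 bp) and Protein Length (746 aa) fields listed above.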
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.671791 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTTA TCCTCTCCTC AGCCGTGCGT CAGAACCTGC TCTCGCTGCA GTCCACTGCC GACCTGCTTG CTACCACCCA GAGCCGCCTG TCGACCGGCA AGAAGGTCAA CACCGCGCTC GACAACCCGA CCAACTTCTT CACCGCCTCC TCTCTCGACA GCCGCGCCAG CGACATCAAC AATTTGCTCG ATGGCATCGG CAACGGCGTG CAGATCCTGC AGGCCGCCAA CACCGGCATC ACCTCGCTCA ACAAACTCGT TGACTCCGCG AAGTCGATCG CCAACCAGGC GCTGCAGACC GTGTCGGGCT ACACCACCAA GTCGAACGTC TCGACCACGA TCGCCGGCGC CACCGCCGAC GACATCCGCG GCACCACCAC TTACGCCAAC GCGTTCGCAG TCAGCGGCGT CGTCACCGAC GGCACCTCCG GCGGCGCTTC GCCGATCACG ACGGCCACCA CCCTCGGCGG CGTTGCTGGA GCTCTGGTTG GAGTTGCGGC CACCGCGGGC GACGGCTCCA CCGCTCTGTC CGGCACCGTC ACACTGGCTG CCGGCGCCAC TGCGACCACT TTGTTGGCCG CCGCGGCTCC GAAAGACGGC GACGTTCTGG TCGTCAACGG CAAGTCCATC ACCTTCAAGG GCGGTGCGGC TCCTGCGGCG GCCGGCGTTG CGACTGGCTC GGGCGTCAGC GGCAACATCG TCACCGATGG ATCCGGCAAC TCGCAGATCT ATCTGACCGG CGGCACCGTT GCCGACGTTC TCAAGGCCGT CGATCTGGCC AGCGGCGTTG CCAAGACCGT GAATAGCGCG GGCGCGGCGA CCATCACCGG CTCGACCGCG GCGTTTGCCG CCGGCGTGCT GACGGTCAAC TCCTCGACCG GTGCCGATCT CAGCGTCACC GGCAAGGCCG ATCTGCTCAA TGCCCTCGGC CTGACCACCT CGATCGGGTC CGGCAACGTC TCGGTCGGCG CCGCTCGCAC CACCGCTGCG ACCACCACCT CGAACCTGAT CCAGGACGGT TCGACCCTGA ATGTGAACGG CAAGACGATC ACCTTCTCGA ACGCTTACCA GCCGCTTCCG GCCAACGTTC CGTCCGGTTC CGGCATCGAG GCCGGCAGTC ATCTGCAGAC CGATGGCAAG GGCAACTCGA TCGTCTATCT GCAGGGTGCG ACCATCGCTG ACACGTTGAA AGCGATCGAC ATTGCGACCG GCGTGCAGAC GGCGACGAAT GCCGCCGGCA TCTCGACCCT GGCGACCGCC AGCGGCGCCT CGGCGTCCAC CGTGGCGGCG AACGGCACGC TGAAGCTCAG CACCGGCACG TTGTCGGATC TCAGCATTGC CGGCGGCGTC GGCAATGCAT TGTCGGCACT TGGTCTCGAC GGCCCGACCC ACACCAGCAG CACGTTCACT GCGACGCGTG CGGCCGGCGC CGGCGGCATC GACGGCAAGA CCCTGACCTT CTCTTCCTTC AACGGCGGAA GCGCGGTCAA TGTCACGCTC GGCGATGGCA CCAACGGAAC CGTCAAGACG CTGGCCCAGC TCAAGACTGC GCTGCAAGCC AACAACCTCG ACGCGACCGT GGATGCCTCC GGCAAGCTGA CGATCTCGGC GGGCAACGAC TATGCGTCGT CGACGCTGGG TTCGACCCTG TCCGGCGGCT CGATCGGCGG CACCTTGACC ACCTCGACCA CCTTCACCAT CGCGGCGGCG CCGGTTGCCG ACACGGTGGC GCAGACCACT CGCGCCAATC TCGTGACGCA GTACAACAAC ATCCTGAACC AGATCAACAC CACGGCGCAG 
GATGCCTCGT TCAACGGCGT CAACCTGTTG AACGGCGACA ACCTCAAGCT GACCTTCAAC GAAACCGGCA AGTCCACGCT GAACGTGCAG GGCGTGACCT TCAACGCAGC AGGCCTCGGC CTGTCGAACC TGACTGGCGG CACCGACTTC ATCGACAACT CGAACACCAA CAAGACTTTG GCAACTCTGA CGACCGCCAG CACCGCGCTG CGTTCGCAGG CCTCGGCCTT GGGTTCGAAC CTGTCGATCG TGCAGTTGCG TCAGGACTTC TCCAAGAGCC TGATCAACGT GCTGCAGACC GGCGCGTCGA ACCTGACGCT GGCCGACACC AACGAGGAAG CGGCCAACAG CCAGGCGCTG TCGACCCGCC AGTCGATCGC GGTGTCAGCC TTGTCGCTGG CCAACACCTC GCAGCAGAGC GTGTTGCAGC TGCTGCGCTA A
|
Protein sequence | MSVILSSAVR QNLLSLQSTA DLLATTQSRL STGKKVNTAL DNPTNFFTAS SLDSRASDIN NLLDGIGNGV QILQAANTGI TSLNKLVDSA KSIANQALQT VSGYTTKSNV STTIAGATAD DIRGTTTYAN AFAVSGVVTD GTSGGASPIT TATTLGGVAG ALVGVAATAG DGSTALSGTV TLAAGATATT LLAAAAPKDG DVLVVNGKSI TFKGGAAPAA AGVATGSGVS GNIVTDGSGN SQIYLTGGTV ADVLKAVDLA SGVAKTVNSA GAATITGSTA AFAAGVLTVN SSTGADLSVT GKADLLNALG LTTSIGSGNV SVGAARTTAA TTTSNLIQDG STLNVNGKTI TFSNAYQPLP ANVPSGSGIE AGSHLQTDGK GNSIVYLQGA TIADTLKAID IATGVQTATN AAGISTLATA SGASASTVAA NGTLKLSTGT LSDLSIAGGV GNALSALGLD GPTHTSSTFT ATRAAGAGGI DGKTLTFSSF NGGSAVNVTL GDGTNGTVKT LAQLKTALQA NNLDATVDAS GKLTISAGND YASSTLGSTL SGGSIGGTLT TSTTFTIAAA PVADTVAQTT RANLVTQYNN ILNQINTTAQ DASFNGVNLL NGDNLKLTFN ETGKSTLNVQ GVTFNAAGLG LSNLTGGTDF IDNSNTNKTL ATLTTASTAL RSQASALGSN LSIVQLRQDF SKSLINVLQT GASNLTLADT NEEAANSQAL STRQSIAVSA LSLANTSQQS VLQLLR
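The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds a codon table from the amino acid assignments of the standard genetic code, which translation table 11 shares for internal codons. The sketch does not model the alternative start codons that table 11 permits, which is harmless here because this gene begins with ATG. The function name `translate` and the constants are hypothetical, and the input is only the first three 10-base blocks of the gene sequence above.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
# Translation table 11 uses these same assignments for internal codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_blocks = "ATGTCCGTTA TCCTCTCCTC AGCCGTGCGT"
print(translate(first_blocks))  # MSVILSSAVR
```

The output matches the first block of the protein sequence. For real analyses, an established library would be a safer choice than a hand-built table.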