Gene Information |
Locus tag | RPC_1488 |
Symbol | |
ID | 3972272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1624824 |
End bp | 1627031 |
Gene Length | 2208 bp |
Protein Length | 735 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637924603 |
Product | flagellin |
Protein accession | YP_531369 |
Protein GI | 90422999 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
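The coordinate and length fields above are related by simple arithmetic, and a short sketch can make that relationship explicit. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but they are not stated by the source. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1624824, 1627031)
print(length, protein_length_aa(length))  # prints "2208 735"
```

The results match the Gene Length (2208 bp) and Protein Length (735 aa) fields.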
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.723636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.726949 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGCA TTACACTCTC GTCTTCCGTC CGCCAGAACC TGCTCTCTCT GCAGTCGACC GCGGAACTAC TCTCCTCGAC CCAGAGCCGC CTCTCCACCG GTAAGAAGGT GAACTCGGCG CTCGACAACC CCACCAACTT CTTCACCGCC TCCGGCCTCG ACGCCCGTGC TTCCGACATC AACAATCTGT TGGACGGCAT CGGCAACGGC GTGCAGATCC TGCAGGCCGC CAACACCGGC ATCACCTCGC TGCAGAAGCT GGTGGACTCG GCCAAGTCGA TCGCCAACCA GGCGCTGCAG ACCGTCGCCG GCTACACCAG CAAGTCCAGC GTCTCGACCA CGATCGGCGG CGCCACCGCC GACGATCTGC GCGGCACTGC GACCTATTCC AACGGCTCGG CGCAGAGCCT TGCCTTGCAG GACGGTCAGG CCGCTGCCGG CGTGATTTCC GGCGCGACTC TGCTCGGCGG TACCGCTGCG ACCAAGACCG GCGCAACGGT CACGGGTCTC GCGACCACCA CCTTGCTGAG CGCGCTCGGC ACCAACAAGC CGGTAGCCGG CAACAGCCTG ACGGTGAATG GCCACACGGT CACCTTCGCC AATGGCGACG CGCCGGCCTC GACCACTCTG CCGGCCGGTT CTGGTGTCAA CAATCAGCTG GTCACCGACG GAAAGGGCAA CTCGACCGTC TATCTGAACA GCGGCACCGT GCAAGACGTG ATGAACGCCA TCGATCTTGC GAGCGGCGTG CAAAACGCGA CGATCACCGG CGGTGCTGCG ACGGTGGCAA ACAGCTCGGG TGCCAATGCG TCGATCGCGG CCAACGCGCT GGTGTTCAGC ACCTCGACTG GTTCGGATCT GTCGATCTCC GGCAACAACA CTCTGCTGTC GGCGTTCGGT CTGAATTCGG GAGCCACCGG CGTCGGCAGC TTCACGGCTG ATCGAACCGC TACCGCGGCC GCTGGGGTCG GCGTCAGCCG CGCCGGAATG GTGCAGGCTG GCTCGACCCT GACCGTGAAC GGCAAGACCA TCACGTTCCA GGACGCGGCT ATCCCGGCCG CTGCTGATTA CGGTTCCGGT AAAGTGACCG GCAAGAACGT GATCACCGAC GGCAGCGGCA ATTCGACCGT CTACCTGCAG GGCGGCACGA TGAACGATGT GCTCACCGCC ATCGACATCG CGAGCGGCGC GCAGGTTGCC CCGGTCAGCA ACGGTGCCGC CACGTTGGCG GTGGCCGCGG GCAGCGAGGC GTCCAAGGTG CTCTCCGGTG GTCAATTGCA GATCAGCTCG GGCCTGGCAG GCGATCTCAA GCTCTCCGGC ACCGGCAACG CGCTGTCGGC ATTGGGTCTG GCCGGCGCCC AGGGAACCGG GACCAGCTTC AACGTGGCCC GCACGGCATC CGCCGGCGGC ATCAGCGGCA AGACGCTGTC GTTCGAATCG TTCAATGGCG GTACCGCCGT CAACATCACG ATCGGCGACG GCACCAACGG CACCGTCAAG TCGCTCACCG ACCTGAATAC GGCGCTGAAG GCCAACAATA TGCAGGCCTC GATCGACACC ACCGGCAAGC TGACGATCTC GGCGGCGAAC GACTACGCCT CCTCGACGCT CGGCTCCACG CTGTCCGGCG GCAAGATCAG CGGCACCGCG GCGTCGCTCT TCTCGACGCC GACCGACCCT GTTGCCGACC TGGTCGCCCA GACCACCCGC GGCAACCTGG TCAAGCAGTA CAACGACGTC ATGGACCAGA TCAAAACCAC GGCCCAGGAC GCTTCGTTCA ACGGCGTCAA CCTGCTCGGC 
GGCGACACCC TGAAGCTGGT GTTCAACGAA ACCGGCAAGT CCACCCTGAG CATTCAGGGC GTGACCTTCG ACCCGACCGG CCTCGGCCTG TCGAAGTTGA CGTCCGGCAC CGACTTCATC GACAACGCCG CCACCAACAA GACCTTGGCG ACGCTGACCA CCGCTGCAAC CACGCTGCGG TCGCAGGCTT CGGCTTTGGG TTCCAACCTC TCGATCGTGC AGACCCGTCA GGACTTCTCG AAGTCCCTGA TCAACGTGCT GCAGACCGGT TCGTCGAACC TGACGCTGGC CGACACCAAC GAGGAAGCGG CCAACAGCCA GGCGCTGTCG ACCCGCCAGT CGATCGCGGT GTCCGCGCTG TCGTTGGCCA ACAGCTCGCA GCAGGGCGTG TTGCAGCTGC TGCGCTAA
|
Protein sequence | MSGITLSSSV RQNLLSLQST AELLSSTQSR LSTGKKVNSA LDNPTNFFTA SGLDARASDI NNLLDGIGNG VQILQAANTG ITSLQKLVDS AKSIANQALQ TVAGYTSKSS VSTTIGGATA DDLRGTATYS NGSAQSLALQ DGQAAAGVIS GATLLGGTAA TKTGATVTGL ATTTLLSALG TNKPVAGNSL TVNGHTVTFA NGDAPASTTL PAGSGVNNQL VTDGKGNSTV YLNSGTVQDV MNAIDLASGV QNATITGGAA TVANSSGANA SIAANALVFS TSTGSDLSIS GNNTLLSAFG LNSGATGVGS FTADRTATAA AGVGVSRAGM VQAGSTLTVN GKTITFQDAA IPAAADYGSG KVTGKNVITD GSGNSTVYLQ GGTMNDVLTA IDIASGAQVA PVSNGAATLA VAAGSEASKV LSGGQLQISS GLAGDLKLSG TGNALSALGL AGAQGTGTSF NVARTASAGG ISGKTLSFES FNGGTAVNIT IGDGTNGTVK SLTDLNTALK ANNMQASIDT TGKLTISAAN DYASSTLGST LSGGKISGTA ASLFSTPTDP VADLVAQTTR GNLVKQYNDV MDQIKTTAQD ASFNGVNLLG GDTLKLVFNE TGKSTLSIQG VTFDPTGLGL SKLTSGTDFI DNAATNKTLA TLTTAATTLR SQASALGSNL SIVQTRQDFS KSLINVLQTG SSNLTLADTN EEAANSQALS TRQSIAVSAL SLANSSQQGV LQLLR
|
| |
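The sequence fields above are printed in space-separated blocks of ten characters. A minimal sketch of turning such a field into a contiguous string and computing its GC fraction follows. The helper names are hypothetical, and the sketch does not claim to reproduce how the database computed or rounded the GC content value reported in this record.

```python
from collections import Counter


def clean_sequence(field: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the result."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    counts = Counter(seq)
    return (counts["G"] + counts["C"]) / len(seq)


# The first two blocks of the gene sequence, used only as a short illustration.
fragment = clean_sequence("ATGTCAGGCA TTACACTCTC")
print(len(fragment), gc_fraction(fragment))
```

Passing the full Gene sequence field through `clean_sequence` first also makes it easy to check that its length agrees with the Gene Length field.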