Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2370 |
Symbol | |
ID | 4022859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2645785 |
End bp | 2647998 |
Gene Length | 2214 bp |
Protein Length | 737 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637962563 |
Product | flagellin |
Protein accession | YP_569503 |
Protein GI | 91976844 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGTA TCGTTCTCTC CAATGCGGTC CGCCAGAATC TTTCCTCGCT GCAGGCCACC GCGGACTTGC TCGCCACCAC CCAGAGCCGG CTTTCGTCGG GCAAGAAGGT GAACTCGGCG CTCGATAATC CCACCAACTT CTTCACCGCA GCTTCGCTCG ATTCGCGCTC CAGCGACATC AACAACCTGC TCGACGGCAT CGGCAACGGC ATTCAGATCA TTCAGGCCGC CAACACCGGC ATCAGCTCGC TGACCAAGCT GGTGGACAGC GCCAAGTCGA TCGCCAACCA GGCGCTGCAG TCGGTCGCCG GCTACAGCAG CAAATCGAGC GTCACGACCA CGATCGCCGG CGCCACCGCG GACGACCTGC GCGGCACCTC GACCTATTCC AACGGCCTCG CGCAAAGCAT CGGTCTGCAG GACGGCCAGG GCACGCCCGG CGTTGTCGAT GGTGATACCC TGCTCGGCGG CGTCGCTGCG ACCAAGACCG GCGGAACCGT CGGTGGTAGT GGCATCACCG CAGGCACCGC GCTGAGCGCG CTGGGCGCGA ACAAGCCGGT GGCCGGCGAC ACCATGACGG TGAACGGTCG CACCATCACC TTCGCAAGCG GCGGCGCTCC GGACAAGGCT ACCCTGCCGA CCGGCTCGGG TGTCGAAGGT CAGCTCGTCA CAGACGGCAA GGGCAACTCC ACCGTCTTTC TGGACAGCGG TACTGTTCAG GACGTGATGA ACGCGATCGA CCTCGCCAGC GGCGTTCAGA AAGTGACGAT CACCGGTGGC GACGCTACGC TGGCGCCCAG TTCTGGTACT GCCGCTGCGG TCACGTCGAA CGCGCTTGTG CTGTCGACCT CGACGGGTTC GGATCTGTCG ATCTCCGGCA ACAACACGCT GTTGTCGGCC TTCGGCTTGA ATTCGGGCGC CACCGGCGCC GGTACCTTCA AGGCCGAACG CACTGCCAGC CCTGCTGCAG GCGACGGCGT CAGCCGCGCC AACATGATCC AGGCCGACTC GACGCTCAGC ATCAACGGCA AGACCATCAC CTTCAAGGAT GCCGCGATCC CTGCAAACGC TGACTATGGC TTCGGCAAGG TCGGCAGCCA GAACGTCATC ACCGACGGCA ACGGCAACTC CACTGTCTAT CTGCAGGGCG GCACGATCAA GGACGTGCTC ACCGCGGTCG ACATCGCGAG CGGCGCGCAG ACCGCGCCGG TCAGCAACGG CGCAGCCTCC CTCGCTGTGA CAGCGGGCAG CGAAGCCTCC AAGGTGCTCA GTGGCGGTCA GTTGCAGATC AGCTCCGGTC TGGCCGGCGA TCTGAAGATC AGCGGCACCG GCAATGCGCT GTCGGCGCTG GGCCTCGCCG GCAATCAGGG AACCGCGACC AGCTTCTCGG TCGCGCGGAC CGCCACTGCC GGCGGAATCA CCGGCAAGAC GTTGTCGTTC GAAGCCTTTA ATGGCGGCAC CGCGGTCAAC GTGACCATCG GCGACGGCAC CAACGGCACC GTGAAGTCGC TGGCCGACTT GAACTCGGCG CTGTCGGTCA ACAATCTGGC GGCGTCGATC GACACCACCG GCAAGCTGAC CATCTCGGCG TCCAACGACT ATGCCTCCTC GACGATCGGC TCGACCGAAT CGGGCGGCAA GATCGGCGGC ACCGCGGCGT CGCTGTTCTC GACGGCTTCG GCTCCTGTTG CCGACGTCAA CGCCCAGAAC ACCCGCGCCA ATCTGGTGAC GCAGTACAAC AACATCATTC AGCAGATCAA AACCACTGCT CAGGATGCGT CGTTCAACGG CGTCAACCTG CTCGGCGGCG ACACGCTGAA GCTGGTGTTC AACGAAACCG GCAAGTCCAC CCTGAGCATT CAGGGCGTCA CCTTCGACCC GGCCGGCCTC GGCCTGTCGA GCCTGAAGTC GGGCAAGGAC TTCATCGACA ATGCGAACAC CAACAAGGTG CTGTCGTCTC TGAACACCGC GTCGAGCACG CTGCGTTCGC AGGCCTCGGC GTTGGGCTCG AACCTGTCGA TCGTGCAGAC CCGTCAGGAC TTCTCGAAGA ACCTGATCAA CGTGCTGCAG ACCGGCTCGT CCAACCTGAC GCTGGCCGAC ACCAACGAGG AAGCGGCCAA CAGCCAGGCG CTGTCGACCC GCCAGTCGAT CGCGGTGTCC GCGCTGTCGC TGGCCAACTC GTCGCAGCAG AGCGTGCTGC AGCTACTGCG TTAA
|
Protein sequence | MSGIVLSNAV RQNLSSLQAT ADLLATTQSR LSSGKKVNSA LDNPTNFFTA ASLDSRSSDI NNLLDGIGNG IQIIQAANTG ISSLTKLVDS AKSIANQALQ SVAGYSSKSS VTTTIAGATA DDLRGTSTYS NGLAQSIGLQ DGQGTPGVVD GDTLLGGVAA TKTGGTVGGS GITAGTALSA LGANKPVAGD TMTVNGRTIT FASGGAPDKA TLPTGSGVEG QLVTDGKGNS TVFLDSGTVQ DVMNAIDLAS GVQKVTITGG DATLAPSSGT AAAVTSNALV LSTSTGSDLS ISGNNTLLSA FGLNSGATGA GTFKAERTAS PAAGDGVSRA NMIQADSTLS INGKTITFKD AAIPANADYG FGKVGSQNVI TDGNGNSTVY LQGGTIKDVL TAVDIASGAQ TAPVSNGAAS LAVTAGSEAS KVLSGGQLQI SSGLAGDLKI SGTGNALSAL GLAGNQGTAT SFSVARTATA GGITGKTLSF EAFNGGTAVN VTIGDGTNGT VKSLADLNSA LSVNNLAASI DTTGKLTISA SNDYASSTIG STESGGKIGG TAASLFSTAS APVADVNAQN TRANLVTQYN NIIQQIKTTA QDASFNGVNL LGGDTLKLVF NETGKSTLSI QGVTFDPAGL GLSSLKSGKD FIDNANTNKV LSSLNTASST LRSQASALGS NLSIVQTRQD FSKNLINVLQ TGSSNLTLAD TNEEAANSQA LSTRQSIAVS ALSLANSSQQ SVLQLLR
|
| |