Gene Information |
Locus tag | RPB_0765 |
Symbol | flgE |
ID | 3909253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 858206 |
End bp | 859471 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882657 |
Product | flagellar hook protein FlgE |
Protein accession | YP_484387 |
Protein GI | 86747891 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] flagellar hook-basal body proteins |
|
|
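The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon (the record states neither explicitly). The function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length(858206, 859471)
print(length, protein_length(length))  # prints: 1266 421
```

Under these assumptions the result agrees with the listed Gene Length (1266 bp) and Protein Length (421 aa).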
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTGT CAGGCGCACT TTCGTCCGCG ATCTCCGCGT TGAACGCGCA GAGTTCGGCG CTGGCGATGA TCTCCGACAA CATCGCGAAT TCGTCGACGG TCGGCTACAA GACCTCCTCG GCCTCGTTCG AGTCGCTGGT CACCAACACG TCGGGCGCGT CGTCCTACTC GTCCGGCGGC GTCACGGCGT CGGCGACGTC GAACATCACC ACGCAGGGCC TGCTGACCTC GACCACGACG GCGACCAACA TCGCGATCCA GGGCAACGGC TTCTTTCCGG TGACGACGTC GCTCACCGGC GGATCGACGC TCTACACCAG GAACGGCGCG TTCACGATCG ATTCCGACGG TTATCTCGTC AACAACGGCG CCTATCTGCT CGGCTGGCGG ACCGACACCG ACGGCAACGT CATCGGCACC GCCAGCGAAG GCAATCTGGT TCCGATCGAC ACCAGCGTCG CCAAGACCAG CAGCGGCGCG ACCACCACGA CGTCGTTCGC GGCGAATCTG CCGGCCGACG CCGCGGTCAA CGACACCTTC ACGACGTCGA TGCCGCTGTA TGATTCGCTC GGCACGTCGA GCACGATGCA GGTGACGTGG ACGAAGACCG CGGAGAACAC CTGGAGCGCG AGCTTCGGCA AGCCGACGCT GGCGTCGGAT TCGACCACGA CGCTGGCGGC GGCGCCGACC GACACGATCT CGGTCTCGTT CAACAGCGAC GGTTCGCTGG CCAGCACCAG CCCGAGCCCG GCGACCGTCA CGATCAGCGA CTGGACCACC GGCGCCGCCA ACAGCACGAT CACGCTCGAC CTCGGCACGG TGGGCGGAAA AGACGGCCTG ACGCAACTAT CCTCGGACCT TTCGACGCCG GCCGTGACCA TCGACAGCAT CGAGTCCGAC GGCCTTGCCT TCGGCAAGCT GAGCAGCGTC GCGGTCGGCG ACGACGGCAC CGTCAACGCC ACCTACTCCA ACGGCGAGAC CATCGCGATC TACAAGATCC CGGTCGCGAC CTTCACGGCG TCGGCCGAGC TGGAGGCGCA GAGCGGCGGG TTGTACGCGA CCACCGCCGC GTCCGGCTCG GCGACGCTGC AGGAATCCGG CGCCAACGGC GCCGGCACGA TCTACGGCAG CGAGCTCGAA TCCAGCACCA CCGACACCAA CGAACAATTC AGCAGCATGA TCTCCGCGCA GCAGGCGTAT TCGGCCGCGT CGCAGGTCAT CACGGCGGTC AACAAGATGT TCGACACCCT GATCGCGGCG ATCTGA
|
Protein sequence | MSLSGALSSA ISALNAQSSA LAMISDNIAN SSTVGYKTSS ASFESLVTNT SGASSYSSGG VTASATSNIT TQGLLTSTTT ATNIAIQGNG FFPVTTSLTG GSTLYTRNGA FTIDSDGYLV NNGAYLLGWR TDTDGNVIGT ASEGNLVPID TSVAKTSSGA TTTTSFAANL PADAAVNDTF TTSMPLYDSL GTSSTMQVTW TKTAENTWSA SFGKPTLASD STTTLAAAPT DTISVSFNSD GSLASTSPSP ATVTISDWTT GAANSTITLD LGTVGGKDGL TQLSSDLSTP AVTIDSIESD GLAFGKLSSV AVGDDGTVNA TYSNGETIAI YKIPVATFTA SAELEAQSGG LYATTAASGS ATLQESGANG AGTIYGSELE SSTTDTNEQF SSMISAQQAY SAASQVITAV NKMFDTLIAA I
|
| |
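The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be stripped before any length or composition check. The following is a minimal Python sketch of that clean-up plus a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in a nucleotide sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence from the record, as a small example.
prefix = clean_sequence("ATGAGTTTGT CAGGCGCACT")
print(len(prefix), gc_fraction(prefix))  # prints: 20 0.5
```

Applied to the full gene sequence, the same two helpers give values that can be compared against the Gene Length (1266 bp) and GC content (66%) fields listed in the record.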