Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1111 |
Symbol | flgE |
ID | 3969545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1213546 |
End bp | 1214781 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637924222 |
Product | flagellar hook protein FlgE |
Protein accession | YP_530994 |
Protein GI | 90422624 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.397372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.647831 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTGT ATGGCGTGAT GCGCACCGGC GGCTCCGGAA TGACCGCGCA GTCGAACAAG CTGTCGGCGG TGGCCGACAA TATCGCCAAC GTCAACACCA CCGGCTACAA GCGCGCCTCC ACCGAATTCT CGTCGCTGAT TTTGAAGAGC GGTTCCGGCA GCTACAATTC CGGCAGCGTC GAAACCCAGG TGCGCTATGC GATCTCCGAC CCCGGGACGC TGCAATACAC CACCTCGGCG ACCGATCTGG CGATCCAGGG CAACGGCTTC TTCGCGGTGA GCAACGCCGG CGGCACGCCG TTCCTGACCC GCGCCGGCTC CTTCGTGCCG GATGGTCAGG GAAATCTGTA TAATGCGGCG GGCTACTATC TGATGGGCTA CAACCTCAAG AACGGCCCGC CCAACGTCGT CGCCAATGGC CTGACCGGAT TGGAGGTCGT CAACATCGGG CAGACCGCGC TGGAAGGCAA TCCGTCGACC AAGGCCGCGG CGACCGCCAA CCTCAACGCC AACGCCGGCA TTACCGCGGG TGCGCCCAAC TACACGTCGA AGACCTCGCT CGTCACCTAC GACAATATCG GCAACAAGGT GACGCTCGAC ATCTATGCCT ACAAGACCGC GGCCAACGCC TGGACCATGG AGGTCTACGA CAGCGCCGAC TCGACCGGCG GCGGCTTTCC CTATACGGGC GCGCCGCTGG CGACGGGCAA CTTCAGCTTC GACGTCAGCG CCACCGGCAA GGGCCGGCTG GCGGGGGGCA GCCCGACCTC GCTGTCGCTG ACCATCCCGA ACGGCTCGCC GTTCACACTC GATCTGTCGT CGATGACCCA GGTTGCCGCG GACTTCCAGT TCAAGGCGAC GGTCGACGGC AATTCGCCGA GTTCGATCGA CAAGGTGGAA GTCGCCACCG ACGGCACCAT GTACGCGATC TACAACGATG GCACCCGGAT TGCGACCTAC AAGATTCCGC TGGCGACGGT GCCGAGCCCG GACAACCTCG TGCCGGAAGT CGGCAACGTC TATTCGATCG GGATCAACTC CGGAAACGTG CAGGTCGATT TCGCCGGGCA CAGCGGGCTT GGCATCGTCA AGGCGGAAGC GCTCGAAGCC TCCAACGTCG ACCTCGCCAA CGAACTGACG GCGATGATCG AGTCGCAGCG TGGCTATACG GCGAATTCCA AAGTGTTCCA GACCGGCGCG GATCTGCTCG ACGTCCTCGT CAATCTGAAA CGTTGA
|
Protein sequence | MSLYGVMRTG GSGMTAQSNK LSAVADNIAN VNTTGYKRAS TEFSSLILKS GSGSYNSGSV ETQVRYAISD PGTLQYTTSA TDLAIQGNGF FAVSNAGGTP FLTRAGSFVP DGQGNLYNAA GYYLMGYNLK NGPPNVVANG LTGLEVVNIG QTALEGNPST KAAATANLNA NAGITAGAPN YTSKTSLVTY DNIGNKVTLD IYAYKTAANA WTMEVYDSAD STGGGFPYTG APLATGNFSF DVSATGKGRL AGGSPTSLSL TIPNGSPFTL DLSSMTQVAA DFQFKATVDG NSPSSIDKVE VATDGTMYAI YNDGTRIATY KIPLATVPSP DNLVPEVGNV YSIGINSGNV QVDFAGHSGL GIVKAEALEA SNVDLANELT AMIESQRGYT ANSKVFQTGA DLLDVLVNLK R
|
| |