Gene Information |
Locus tag | RPB_3794 |
Symbol | flgI |
ID | 3911597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4329776 |
End bp | 4330897 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885695 |
Product | flagellar basal body P-ring protein |
Protein accession | YP_487399 |
Protein GI | 86750903 |
COG category | [N] Cell motility |
COG ID | [COG1706] Flagellar basal-body P-ring protein |
TIGRFAM ID | |
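The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information record.
START_BP = 4329776
END_BP = 4330897


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS: number of codons minus the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp)  # 1122, matching "Gene Length"
print(length_aa)  # 373, matching "Protein Length"
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.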
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.132686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCGAGCG TTTCCGCCGT GATCCTGAAG CTGGCCGCAG CCGCCCTGTC CGCGCTGCTG CTGTCGGGCG TGGCCGCCAA CGCCACCTCG CGGATCAAGG ACCTCGCCAA TATCGAGGGC GTGCGGCAGA ACCAGTTGAT CGGCTACGGC CTCGTGGTCG GCCTCAACGG CACCGGCGAC ACCCTCAACA ACATTCCCTT CACCAAGCAG TCGCTGCAGG CGATGCTGGA GCGGATGGGC GTCAACATCC GCGGCGCCAC CATCCGCACC GGCAACGTCG CGGCCGTGAT GGTCACCGGC AATCTGCCCG CCTTCGCCAC CCAGGGCACC CGGATGGACG TCACCGTCTC GGCGCTCGGC GACGCCAAGA ATCTGCAGGG CGGCACCCTG CTGGTCACGC CGCTGCTCGG CGCCGACGGC AATGTCTACG CGGTGGCCCA GGGCTCGCTC GCGATCGGCG GTTTCCAGGC CGAGGGCGAG GCCGCCAAGA TCACCCGCGG CGTGCCGACC GTCGGCCGCA TCGCCAACGG CGCGATCATC GAGCGCGAGA TCGAATTCGC GCTGAACCGG CTGCCGATGG TGCGGCTGGC GCTGCGCAAC GCCGATTTCA CCACCGCCAA GCGGATCGCC GCCGCGGTCA ATGATTTCCT CGGCACCAAG AGCGCCGAGC CGATCGACCC CTCGACCGTG CAGCTCACGA TCCCGGCGGA ATTCAAAGGC AACGCGGTCG CCTTCGTCAC CGAGATCGAG CAGTTGCAGG TCGAGCCGGA CCAGGCCGCC AAGATCATCA TCGACGAGCG CAGCGGCATC ATCGTGATGG GCCGCGACGT CCGCGTCGCC ACCGTCGCGG TGGCGCAGGG CAACCTCACG GTCTCGATCT CCGAAAGCCC GCAGGTCAGC CAGCCCAATC CGTTGGCGAA CGGCCGCACC GTGGTCACGC CGAATTCGCG GATCGGCGTC ACCGAGGACG GCAAGAAGCT GGCGCTGGTC AAGGACGGCG TGTCGCTGCA ACAGCTCGTC GATGGCCTCA ATGGCCTGGG CATCGGCCCG CGCGACCTGA TCGGCATCCT GCAGGCGATC AAGGCCGCCG GCGCCATCGA AGCCGATATC GAGGTGATGT GA
Protein sequence | MPSVSAVILK LAAAALSALL LSGVAANATS RIKDLANIEG VRQNQLIGYG LVVGLNGTGD TLNNIPFTKQ SLQAMLERMG VNIRGATIRT GNVAAVMVTG NLPAFATQGT RMDVTVSALG DAKNLQGGTL LVTPLLGADG NVYAVAQGSL AIGGFQAEGE AAKITRGVPT VGRIANGAII EREIEFALNR LPMVRLALRN ADFTTAKRIA AAVNDFLGTK SAEPIDPSTV QLTIPAEFKG NAVAFVTEIE QLQVEPDQAA KIIIDERSGI IVMGRDVRVA TVAVAQGNLT VSISESPQVS QPNPLANGRT VVTPNSRIGV TEDGKKLALV KDGVSLQQLV DGLNGLGIGP RDLIGILQAI KAAGAIEADI EVM
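The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short standard-library sketch. It assumes the sequences are given as shown here (uppercase DNA in whitespace-separated blocks of ten) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used as a worked example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; the amino acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and normalize case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = "ATGCCGAGCG TTTCCGCCGT GATCCTGAAG"
print(translate(fragment))    # MPSVSAVILK, the first block of the protein sequence
print(gc_fraction(fragment))  # 0.6 for this 30-base fragment
```

Passing the full gene sequence to the same functions should give the complete protein sequence and a GC fraction comparable to the GC content field, assuming that field is computed over the gene alone.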