Gene RPB_3794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3794 
SymbolflgI 
ID3911597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4329776 
End bp4330897 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content68% 
IMG OID637885695 
Productflagellar basal body P-ring protein 
Protein accessionYP_487399 
Protein GI86750903 
COG category[N] Cell motility 
COG ID[COG1706] Flagellar basal-body P-ring protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.132686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGCG TTTCCGCCGT GATCCTGAAG CTGGCCGCAG CCGCCCTGTC CGCGCTGCTG 
CTGTCGGGCG TGGCCGCCAA CGCCACCTCG CGGATCAAGG ACCTCGCCAA TATCGAGGGC
GTGCGGCAGA ACCAGTTGAT CGGCTACGGC CTCGTGGTCG GCCTCAACGG CACCGGCGAC
ACCCTCAACA ACATTCCCTT CACCAAGCAG TCGCTGCAGG CGATGCTGGA GCGGATGGGC
GTCAACATCC GCGGCGCCAC CATCCGCACC GGCAACGTCG CGGCCGTGAT GGTCACCGGC
AATCTGCCCG CCTTCGCCAC CCAGGGCACC CGGATGGACG TCACCGTCTC GGCGCTCGGC
GACGCCAAGA ATCTGCAGGG CGGCACCCTG CTGGTCACGC CGCTGCTCGG CGCCGACGGC
AATGTCTACG CGGTGGCCCA GGGCTCGCTC GCGATCGGCG GTTTCCAGGC CGAGGGCGAG
GCCGCCAAGA TCACCCGCGG CGTGCCGACC GTCGGCCGCA TCGCCAACGG CGCGATCATC
GAGCGCGAGA TCGAATTCGC GCTGAACCGG CTGCCGATGG TGCGGCTGGC GCTGCGCAAC
GCCGATTTCA CCACCGCCAA GCGGATCGCC GCCGCGGTCA ATGATTTCCT CGGCACCAAG
AGCGCCGAGC CGATCGACCC CTCGACCGTG CAGCTCACGA TCCCGGCGGA ATTCAAAGGC
AACGCGGTCG CCTTCGTCAC CGAGATCGAG CAGTTGCAGG TCGAGCCGGA CCAGGCCGCC
AAGATCATCA TCGACGAGCG CAGCGGCATC ATCGTGATGG GCCGCGACGT CCGCGTCGCC
ACCGTCGCGG TGGCGCAGGG CAACCTCACG GTCTCGATCT CCGAAAGCCC GCAGGTCAGC
CAGCCCAATC CGTTGGCGAA CGGCCGCACC GTGGTCACGC CGAATTCGCG GATCGGCGTC
ACCGAGGACG GCAAGAAGCT GGCGCTGGTC AAGGACGGCG TGTCGCTGCA ACAGCTCGTC
GATGGCCTCA ATGGCCTGGG CATCGGCCCG CGCGACCTGA TCGGCATCCT GCAGGCGATC
AAGGCCGCCG GCGCCATCGA AGCCGATATC GAGGTGATGT GA
 
Protein sequence
MPSVSAVILK LAAAALSALL LSGVAANATS RIKDLANIEG VRQNQLIGYG LVVGLNGTGD 
TLNNIPFTKQ SLQAMLERMG VNIRGATIRT GNVAAVMVTG NLPAFATQGT RMDVTVSALG
DAKNLQGGTL LVTPLLGADG NVYAVAQGSL AIGGFQAEGE AAKITRGVPT VGRIANGAII
EREIEFALNR LPMVRLALRN ADFTTAKRIA AAVNDFLGTK SAEPIDPSTV QLTIPAEFKG
NAVAFVTEIE QLQVEPDQAA KIIIDERSGI IVMGRDVRVA TVAVAQGNLT VSISESPQVS
QPNPLANGRT VVTPNSRIGV TEDGKKLALV KDGVSLQQLV DGLNGLGIGP RDLIGILQAI
KAAGAIEADI EVM