Gene RPD_1667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1667 
Symbol 
ID4022147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1880370 
End bp1882202 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content65% 
IMG OID637961862 
Producthypothetical protein 
Protein accessionYP_568805 
Protein GI91976146 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE
[COG4786] Flagellar basal body rod protein 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.551124 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.12233 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATCT TCGGCGCTCT CACCACCTCG GTCGCGGGCC TGCGCGCAAA TTCCTACGCG 
CTGGAGAACA TCTCCGGAAA CATCGCGAAC TCGCAGACCA CCGCCTTCAA ACGGGTCGAC
ACCGCATTCC TCGATCTGAT CCCGCAGGCC GGCACCAACG CGCAGGTCGC GGGCTCGGTG
ACTTCCGAAT CGCGGCTCAC CAACACGCTG TCGGGGTCGG TGCAATCGGC CTCGGTGTCG
ACCTACATGG CGATCAACGG CGAGGGCTTC TTCGCGGTGC AGAAGCCCGG CTCGTTCTCC
GACAACAGCC CGGTGTTCAC CGGCGTCAAC AACTACACCC GGCGCGGCGA CTTCTCGCTC
GACAAGAACG GCTATCTGGT CAACGGCGCC GGCTATTATC TGCAGGGCGT GGCGATCGAT
CCGACCACCG GCAACCCGGT CGGCAGCACG CCGACGGTGC TGAAATTCCA GAACGACTTC
CTGCCGTCGC AGGCGACCAC CAAGATCAGC TATCGCGCCA ATCTGGCGAG CTATCCGCTC
ACTACCAAGA ATGACAAGTC GGTCCCGGGG TCGGAGCTGT TGCGCCCTGC GGATTTCACG
TCCAACCCGC AGGTCGCCGG CACCGCACCG CCGCCGTTCG CCGACAACAC CAAGGTCGGC
CTGCAAAAGA ACAGCAAGGC CGGCACGGCG ATCACCGCCG CGACGCTGTT GAAGGGCGCC
GCCGCGACCA ACTCGGCCAG CGCCGACTTC ACCATCACCG ACACCATCAC GGTCGGCACC
GGCGGCTCCG CCAAGACGAT CGCGTTCTAC GATTCCGGCG CCGGCGGCTC GGCCGGCGCG
GCGCCGAACA CCACCTATCT CGATCTCGCA ACGGCCACGG TCGGCAATCT GCTCGGCGCG
ATCGACACCG CGAACGGCAA TGGTGGCACC CCCTCGTCGG TGGCCACCGG CGCGATCACG
CTGCACACCG GCATCGCGGC CGACCTGACG CTGACATCGA CCTCCGCCGG CTTCGCGTCG
CTCGGCCTGA CCAGCCCGGT CAGCGTCGCC CGCCTCGGCG GCGGCAGCGT GGGCACCGGC
CAGGTGACCG GCGCGGAGAA CCAGACCTTC CTCGACGAAT CGATTTCGGG CGGCGCGACC
ACCGCCTATG ACGGCAGCGG CGCGCCGGTG AACGTGCAGT TCCGCTGGGC CAAGATCGAC
TCGGCCGCGC TCGGCGTCGG CCATAACGAC ACCTGGAACC TGTTCTACCA GGTCAATCCA
GGCGCCACCG GCAGCCAGGT CGCCTGGCAG AACGTCAACA CCAATTTCAG CTTCAACTCC
ACCGGCCAGA TGAACCCGGT GATCGGCCAG CTCGCGCTTT CGAACCTCAC GGTGTCCGGC
GTCTCGCTCG GTAACGTCAC GATGTCGTTC GGCACCGGCG GCCTGACCCA GTTCTCCGAC
ACCAACGGCA ACGTCCAGGT CAACCAGCTG CAGCAGGACG GCTACGCCGC CGGGCAATTG
ACCAGCGTCT CGGTCAGCAA CGAGGGACGC GTCGTCGGCG CCTATTCCAA CGGTCGCAAT
ATCGACCTCG CCGAAGTCAG CGTCGCCACC TTCAACGGCG CCAATTTCCT CAAGCGCATC
GACGGCGGTG CGTTCGAAGT GACCAACGAA TCCGGCGAGG CTCTGTACGG CAAGGGCGGC
AGCATCTCGG GCTCGTCGCT GGAATCGTCC AACACCGATA TCGCCGACGA ATTCACCAAG
CTGATCGTCA CCCAGCAGGC CTATTCGGCC AACACCAAGG TCATCACCAC GGCCAACACC
ATGGTGCAGG ATCTGCTCAA CGTGATGCGC TGA
 
Protein sequence
MGIFGALTTS VAGLRANSYA LENISGNIAN SQTTAFKRVD TAFLDLIPQA GTNAQVAGSV 
TSESRLTNTL SGSVQSASVS TYMAINGEGF FAVQKPGSFS DNSPVFTGVN NYTRRGDFSL
DKNGYLVNGA GYYLQGVAID PTTGNPVGST PTVLKFQNDF LPSQATTKIS YRANLASYPL
TTKNDKSVPG SELLRPADFT SNPQVAGTAP PPFADNTKVG LQKNSKAGTA ITAATLLKGA
AATNSASADF TITDTITVGT GGSAKTIAFY DSGAGGSAGA APNTTYLDLA TATVGNLLGA
IDTANGNGGT PSSVATGAIT LHTGIAADLT LTSTSAGFAS LGLTSPVSVA RLGGGSVGTG
QVTGAENQTF LDESISGGAT TAYDGSGAPV NVQFRWAKID SAALGVGHND TWNLFYQVNP
GATGSQVAWQ NVNTNFSFNS TGQMNPVIGQ LALSNLTVSG VSLGNVTMSF GTGGLTQFSD
TNGNVQVNQL QQDGYAAGQL TSVSVSNEGR VVGAYSNGRN IDLAEVSVAT FNGANFLKRI
DGGAFEVTNE SGEALYGKGG SISGSSLESS NTDIADEFTK LIVTQQAYSA NTKVITTANT
MVQDLLNVMR