Gene RPD_1668 details

Gene Information

Locus tag: RPD_1668
Symbol:
ID: 4022148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1882247
End bp: 1884118
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 65%
IMG OID: 637961863
Product: flagellar hook-associated protein
Protein accession: YP_568806
Protein GI: 91976147
COG category: [N] Cell motility
COG ID: [COG1256] Flagellar hook-associated protein
TIGRFAM ID: [TIGR02492] flagellar hook-associated protein FlgK
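
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative Python and not part of the database record. It assumes that the start and end values are 1-based inclusive positions and that the reported protein length excludes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1882247
END_BP = 1884118
PROTEIN_LENGTH_AA = 623


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1872
print(expected_protein_length(length_bp))  # 623
```

Under those assumptions, 1884118 - 1882247 + 1 gives 1872 bp, and 1872 / 3 - 1 gives 623 residues, matching the listed Gene Length and Protein Length.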


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.121705
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCTCG GAGACGCACT TTCGATCGCA ATGGCCGGCC TGCGCGCCAA CCAGGCCTCG 
ATGTCGCTGG TGTCGTCCAA CGTCGCCAAC GCCGAGACGC CGGGTTACGT CCGCAAGACC
GTCGATCAGA TCACCACCAC TGCCGGCCCG TCTGGCAGCG GTGTTTCGAT CATCGGCGTC
AACCGCGAAC TCGACGCCTA TCTGCAGTCG CAGCTTCGCA CCGAAACCTC GGGCGCCTCC
TACGCCTTGC TGCGCTCCGA CTTTCTGAAG CAATTGCAGG GCCTGTATGG CAACCCGAAC
TCGACCGGCA CCCTTGAGAA CGCGTTCAAC AGTTTGACCG CCGCGGTACA GGCGCTCGGC
ACCAGCCCCG ACAGCACCTC GGCGCGAATC GGCGTGCTCA ACGCCGCGCG GGTGGTGGCG
GGCGGGCTCA ACGCGACATC CAACGGAATC CAGTCGCTCC GCTCCGGCGC CGAGACCGGA
CTGGCCGACA GCGTCAACAC GGCGAACAAT CTGCTGCAGC GGATTGCATC GATCAACAAC
AACATCCGCA CCAATCCCGC GGGGGGCACC TCGACCGACG TGGCGACCGC GTCGCTGCTC
GACCAGCGTG ACGCGGCGAT CAGCCAGCTC TCGCAACTGA TGGACATCCG CGTCGTCACC
GACGGCTCCA ATCGGGCCAC GGTGTTCACC GGCTCCGGAA TGCAGCTCGT CGGTATGCAG
GCGGCCAAGC TGTCCTTCGA TGCGCAGGGC ACCGTGACGC CGAGCACGAC CTGGAGCTCG
AACTCGGCGA CGAGCCAGCT CGGTTCGGTC AAGATCACCT ATGCGGATGG TGGCACGATC
GATCTCACCA GTTCGCTGAA ATCGGGCACG ATTGCGGCCT ATATCGAGCT GCGCGACAAG
ACTCTGGTGC AGGCCCAGAC CCAGCTCGAT CAATTCGCGG CGTCGATGGC GAGCGCTTTG
TCCGACAAGA CCACCGCCGG AACCCCGGCG ACGTCGGGCG CGCAGGCCGG TTTCGCGCTC
GATCTGACCA ACATGAAGCC CGGCAACACC TTCAACATCA GCTACACCGA CACGACGACA
GGCGCGCAGC GCACGGTGTC GGTGATGCGG GTCGACGATC CCTCGGTGCT GCCGCTGCCG
CAGACCGCGA CGCTCGATCC CAACGACTAT GCGGTCGGCA TCGACTTCTC GGGCGCGTCG
GGATCGATCA CCGCACAGCT CAACGCCGCG CTGAACGCCA AGAACCTGGA GTTCACCGGC
ACGTCGCCGA ACATCACCGT GCTCAACAAT CCAGGCTTCT CGACGGTGAC TGCGGCCTCG
GTGACCACGA CCGAAACCTC GCTGACCGGC GGCAGCGCCG AGGTGCCGTT GTTCACCGAC
GGCTCGTCGG CCTACACCGG CGTGCTCAGC GGCACCGGAG CGCAGATGAC CGGCTTCGCG
CAGCGCATCG CGGTCAATAC CGGGCTGATC ATCGATCCGT CGCGGCTGGT GGTGTATTCG
ACCACGCCGC CCACCGCGGC CGGCGACACC ACGCGGCCGG ACTTCCTCAC CAAACAGCTC
ACCACCAGCA AGTATCTGTA CTCGGCGGCG ACCGGGATCG GTTCGACCAG TGCGCCGTAT
AACGGCACGC TGTCGAGCTA CCTGCAGCAG TTCGTCGGTC AGCAAGGCTC CGACGCATTG
GCGGCATCGC AACTCGCCGA GGGGCAGAGC GTCGTGCTGA ACACGCTGCA GCAAAAGTAT
TCGACCAGCT CCGGCGTCAA CATGGACGAA GAGATGGCGC ATCTGCTGTC GCTTCAAAAC
GCGTATTCGG CGAATGCACG GGTGATGTCG ACGGTGAACC AGATGTATCA GGCCCTGATG
CAGGTGATGT GA
 
Protein sequence
MGLGDALSIA MAGLRANQAS MSLVSSNVAN AETPGYVRKT VDQITTTAGP SGSGVSIIGV 
NRELDAYLQS QLRTETSGAS YALLRSDFLK QLQGLYGNPN STGTLENAFN SLTAAVQALG
TSPDSTSARI GVLNAARVVA GGLNATSNGI QSLRSGAETG LADSVNTANN LLQRIASINN
NIRTNPAGGT STDVATASLL DQRDAAISQL SQLMDIRVVT DGSNRATVFT GSGMQLVGMQ
AAKLSFDAQG TVTPSTTWSS NSATSQLGSV KITYADGGTI DLTSSLKSGT IAAYIELRDK
TLVQAQTQLD QFAASMASAL SDKTTAGTPA TSGAQAGFAL DLTNMKPGNT FNISYTDTTT
GAQRTVSVMR VDDPSVLPLP QTATLDPNDY AVGIDFSGAS GSITAQLNAA LNAKNLEFTG
TSPNITVLNN PGFSTVTAAS VTTTETSLTG GSAEVPLFTD GSSAYTGVLS GTGAQMTGFA
QRIAVNTGLI IDPSRLVVYS TTPPTAAGDT TRPDFLTKQL TTSKYLYSAA TGIGSTSAPY
NGTLSSYLQQ FVGQQGSDAL AASQLAEGQS VVLNTLQQKY STSSGVNMDE EMAHLLSLQN
AYSANARVMS TVNQMYQALM QVM
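
The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The following is a minimal Python sketch, not part of the database record, that strips the block spacing, computes GC fraction, and translates a coding sequence. The helper names are hypothetical. The codon table is the standard genetic code, whose codon-to-amino-acid assignments are shared by translation table 11 (table 11 differs in the start codons it permits).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove block spacing and line breaks; upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGGGTCTCG GAGACGCACT TTCGATCGCA")
print(translate(first_blocks))  # MGLGDALSIA
```

Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` allows the result to be compared against the listed GC content and protein sequence.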