Gene RPB_3816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3816 
Symbol 
ID3911619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4356673 
End bp4358544 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content66% 
IMG OID637885717 
Productflagellar hook-associated protein 
Protein accessionYP_487421 
Protein GI86750925 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.308813 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTCG GAGACGCACT TTCGATCGCA ATGGCGGGCC TGCGCGCCAA CCAGGCCTCG 
ATGTCGCTGG TGTCGTCGAA CGTCGCCAAC GCCGAGACGC CGGGTTACGT CCGCAAGACC
GTCAATCAGG TCACGACGCT GTCCGGCCCG TCGGGCAGCG GCGTTTCGAT CACCGGTGTC
AACCGCGAAC TCGACGCCTA TCTGCAGGCG CAGCTCCGCA CCGAGACGTC GGGCGCGTCC
TACGCCTCGT TGCGCTCCGA CTTCCTGCAG CAATTGCAGG GACTGTTCGG CGATCCGAAC
TCGAACGGCA CGCTGGAGGA CGCGTTCAAC GGTCTCACCG CGGCGACGCA GGCGCTCGCC
ACCAGCCCCG ACAGCACCTC GGCGCGGATC GGCGTGCTCA ACGCCGCGCA GGTGGTGTCC
GGCGTGCTGA ATTCGATGTC GAACGGCATC CAGACGCTGC GCACCGGCGC CGAGACCGGC
CTGACCGACG CCGTCAACAC GGCGAACAAT CTGCTGCAGC AGATCGCCTC GATCAACAAC
AACATCCGCA CCAACCCGGC CGGCGGCACC TCCACCGACG CGGCGACCGC GTCGCTGCTC
GACCAGCGCG ACGCCGCGAT CAACCAGCTC GCGCAGCTGA TGGACATCCG CGTCGTCACC
GACGCCTCCA ACCAGGTCAC GGTGTTCACC GGTTCGGGCA TGCAACTCGT CGGCATGCAG
GCCGCCCAGC TCAGCTTCGA CGCCCAGGGC ACCGTGACGC CGAGCACCAC CTGGAATCCC
AACACCTCGG CGAGCGAGCT CGGCTCGGTC CGGATCGTCT ACCCTGACGG CAGCACGGCC
GACCTGACCA ATTCGCTGAA GTCCGGCAAG ATGGCGGCCT ATGTCGAGCT GCGCGACAAC
ACGCTGGTGC AGGCGCAAAC CCAGCTCGAC CAGTTCGCCG CGGCGATGGC CAGCGCGCTG
TCGGACAAGA CCACCGCCGG CACGCCGGCG ACCTCCGGCG CGCAGACGGG TTTCGACCTC
GATCTGACCG ACATGAAGGC CGGCAACACG GTCAACATCA CTTACACGGA CACCACGACC
GGCGCGCAGC GGACCGTCTC GGTGATGCGC GTCGACGATC CGACGGCGCT GCCGCTGCCG
CAGACCGCGA CGCTCAATCC CAACGACTAC GTGGTCGGTA TCGATTTCTC CGGCGCCTCG
GGGTCGGTCA CCGCGCAGCT CAACGCCGCG CTGAACGCGA GAAACCTGCA GTTCACCGGG
ACCTCGCCGA ACATCACCGT CCTGAACAAT CCCGGCTTCT CGACGGTGAA TTCCGCCTCG
GTGACGTCGA CGGTGACGTC GCTGACCGGC GGCAGCGCCG AGGTGCCGCT GTTCACCGAC
GCCGGCTCGC CCTACACCGG CGCGATCAGC GGCAACGGCA CGCAGATGAC CGGCCTCGCG
CAGCGCATCT CGGTCAATCC CGCGCTGGTC ACCGATCCGT CGCGGCTGGT GGTGTATTCG
ACCACGCCGC CGACCGCGGC CGGCGACACC ACGCGGCCGG ATTTCATCAC CAAGCAGCTC
ACCAGCAGCA AATATCTGTA CTCGGCGACG ACCGGCATCG GATCGAATGC CGCGCCCTAC
AACGGCACGC TGGAGAGCTT CCTGCAGCAA TTCGTCAGCC AGCAGGGCTC GAATGCGCAG
GCGGCGACAC AGCTCGCCAG CGGACAAAGC GTCGTCCTGA ATACGCTGCA GCAGAAATAC
GCGACCAATT CCGGCGTCAA CATGGACGAG GAAATGGCGC ATCTGCTTTC GCTGCAGAAC
GCCTATTCGG CGAACGCGCG AGTGATGTCG ACGGTCAACC AGATGTATCA GACGCTGATG
CAGGCGATGT GA
 
Protein sequence
MSLGDALSIA MAGLRANQAS MSLVSSNVAN AETPGYVRKT VNQVTTLSGP SGSGVSITGV 
NRELDAYLQA QLRTETSGAS YASLRSDFLQ QLQGLFGDPN SNGTLEDAFN GLTAATQALA
TSPDSTSARI GVLNAAQVVS GVLNSMSNGI QTLRTGAETG LTDAVNTANN LLQQIASINN
NIRTNPAGGT STDAATASLL DQRDAAINQL AQLMDIRVVT DASNQVTVFT GSGMQLVGMQ
AAQLSFDAQG TVTPSTTWNP NTSASELGSV RIVYPDGSTA DLTNSLKSGK MAAYVELRDN
TLVQAQTQLD QFAAAMASAL SDKTTAGTPA TSGAQTGFDL DLTDMKAGNT VNITYTDTTT
GAQRTVSVMR VDDPTALPLP QTATLNPNDY VVGIDFSGAS GSVTAQLNAA LNARNLQFTG
TSPNITVLNN PGFSTVNSAS VTSTVTSLTG GSAEVPLFTD AGSPYTGAIS GNGTQMTGLA
QRISVNPALV TDPSRLVVYS TTPPTAAGDT TRPDFITKQL TSSKYLYSAT TGIGSNAAPY
NGTLESFLQQ FVSQQGSNAQ AATQLASGQS VVLNTLQQKY ATNSGVNMDE EMAHLLSLQN
AYSANARVMS TVNQMYQTLM QAM