Gene RPB_3814 details


Gene Information

Locus tag: RPB_3814
Symbol:
ID: 3911617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4352087
End bp: 4354753
Gene Length: 2667 bp
Protein Length: 888 aa
Translation table: 11
GC content: 66%
IMG OID: 637885715
Product: flagellin
Protein accession: YP_487419
Protein GI: 86750923
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
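
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions, although they agree with the reported 2667 bp and 888 aa. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(4352087, 4354753)
print(length)                  # 2667
print(protein_length(length))  # 888
```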


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGGTA TCGTTCTCTC GAATGCCGTC CGCCAGAACC TCTCGTCGCT TCAGGCCACG 
GCAGACTTGC TCGCCACCAC CCAGAGCCGG CTCTCGTCCG GTAAGAAGGT GAACTCGGCG
CTCGACAATC CGACCAACTT CTTCACCGCC GCCTCGCTCG ACGCCCGCGC CAGCGACATC
AACAACCTGC TCGACGGCAT CAGCAGCGGC GTGCAGATCC TGCAGGCCGC CAACACCGGT
ATCACTTCGC TGACGAAGCT CGTCGACAGC GCCAAGTCGA TCGCCAACCA GGCCCTGCAG
ACGACCTCGG GCTATGCCAC GAAGTCGAAC GTGTCCGCCA CCATCTCCGG CGCCACCGCC
GAGGATATCC GCGGCACCCA GAGCTTCGAC AATGCGGTTG CGACCGGCAA CGTGATCTTC
GACGGCACCA CCGGCGGCAC CACGGCTGCG GCCGGCACCG ATACGCTCGG TGGCGCGATC
GTCAGCATCG CGGCTTCGGC GGCTGTGACC GTCCTCGGTG CGGTCGACGC GACGGCGACC
GGCGCCGTGC TCTCGGTCGG CACCGCCGCC GCGACCGCGG CCGGCACCAA CCTGATCAGC
GAACTGACCA ACGGTTCGAC CGTGACGGCG ACCGGCCCTG CCGCGGGTGA CTCGATCACC
GTCAACGGCA AGACCATCAC CTTCACGACC GCCGGCGCGT CCAGCAAGGA CTCCTCGGGC
AACTACACGA TCGGTCTCGA CCAGACCGTC AACTCGCTGC TCGACACCAT CGACACCGTC
AACGGCAACA CCGGCCTGTC CTCGGCCGTT ACCGCCGGCC GGCTCGAACT GCACTCGGGC
ACCAACAGCC CGCTGACCGT TGGTGATAAC GCCGGTGGCG CCGTTCTGGC CAAGCTCGGC
CTGACCGCGC AGACCGTCGA TACCGCTGCC GCCACCGCGT CGGCGAACAT CTCGGCGAGC
ACGCAGCTGT TCAACACCCA TGGTGGCCTG ACCACCACGG CGATCGCCGA CGGCACGACC
TTGTCGGTCA ACGGCAAGAC CATCACCTTC AAGACCGCCG ACGCGCCGCA GGGCAACAAC
ATCCCGACCG GCACCGGCGT CCTCGGCCGC ATCGGCACCG ACGGCAACGG CAACTCGACG
ATCTATCTCG GCGACCAGAC CAAGTTCACC AACGCGACGG TGGGCGATCT GTTGACCGCG
ATCGATCTCG CGAACGGCGT CAAGGCGGCG TCGATCTCGT CGGGTGTCGC GACCATCAGC
ACCAACTCCG GCCAGACGGC TTCCGCCGTC GCCGCCGGCA TCACCACCAT CCAGAGCTCG
ACCGGTGGCG ATCTGAACGT CACGGCCTTC ACCGACCTGT TCAAGAACCT CGGCCTGACC
ACTTCGAGTG GTACTGGTCC GCTGACCCTG ACCAAGCAGC GCACCACCAG CGGCACCACG
ATGGGCACGC TGATCGCCGA CGGCTCGACC TTGAACGTGA ACGGTAAGAC CATCACCTTC
AAGAACGCCG CGGTTCCGAC CGCGTCGTCG AGCCACACCG GCATCTCCGG CAACGTCGAA
ACCGACGGCA GCGGCAATTC GACCGTGTAC CTGCAGAAGG GTACGCTCGA TGACGTGCTG
AAGGCGATCG ACCTCGCCAC CGGCGTCCGC GTCGCCACGC TCGGGATCTC CGGCGCGACG
ATTGCGACCG CCAATGGCTC CGCCAACTCG TCGATCACCA GCGGTTCGCT GAAGCTGTCG
ACCGGCCTCG CGTCGGATCT GACCATCAAC GGCACCGGCA ATGCGCTGGC CGCCCTCGGC
CTCACCGGCC CGAGCGGGAC CTCGACCTCG TTCACCGCGA CCCGCGGCGT TGCCGCCGGC
AGCCTGAACG GCAAGAGCTT GACCTTCTCC TCCTTCAATG GCGGTTCGGC GGTCAATGTC
ACGCTCGGCG ATGGCAGCAA CGGTACCGTC AAGTCGCTGG CCCAGCTCAA CGTCGCGCTG
GCTGCCAACA ACCTGACGGC GTCGATCGAC AACGCCTCCG GCAAGCTGAC GATCGCAGCG
TCGAACGACT ACGCCTCCCG CACGCTGGGC GGCGCTGACG GCGGCGTGCT CGGCGGCACT
CTGGCCTCGC AGTTGACCTT CACCGTGCCG ACCGCACCGG TGGCCGACGT CAACGCCCAG
AACACCCGCG CCGGCCTGGT CAAGCAGTTC AACGACGTGC TCGACCAGAT CAAGACCACC
GCGCAGGACT CTTCGTTCAA CGGCGTCAAC CTGCTGAACG GCGACAACCT GAAGTTGGTG
TTCAACGAAA CCGGCAAGTC CACGATCTCG ATCCAGGGCG TGACCTTCAA CCCGACCGGC
CTCGGCCTGT CGACCCTGGC CTCGGGCACG GACTTCATCG ACAACAACGC CACCAACTCG
GTGTTGACCA AGCTGAGTGC CGCCTCGACC GCGCTGCGGT CGCAGTCGTC CGCTTTCGGT
TCGAACCTGT CGATCGTGCA GGCCCGTCAG GACTTCTCGA AGAGCCTGAT CAACGTGCTG
CAGACCGGCT CGTCCAACCT GACGCTGGCC GACACCAACG AAGAAGCGGC GAACAGCCAG
GCGCTGACGA CCCGTCAGTC GATCGCGGTC TCGGCGCTGT CGCTGGCCAA CCAGTCGCAG
CAGAGCGTGC TCCAGCTGCT GCGCTAA
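
The record reports a GC content of 66% for this gene. The sketch below shows one way such a figure could be recomputed from a sequence printed in 10-base blocks like the one above. The helper name `gc_fraction` is hypothetical, and the example uses only the first block of the gene, so its result says nothing about the whole sequence.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 10 bases of the gene sequence only (not the full gene).
print(gc_fraction("ATGTCAGGTA"))  # 0.4
```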
 
Protein sequence
MSGIVLSNAV RQNLSSLQAT ADLLATTQSR LSSGKKVNSA LDNPTNFFTA ASLDARASDI 
NNLLDGISSG VQILQAANTG ITSLTKLVDS AKSIANQALQ TTSGYATKSN VSATISGATA
EDIRGTQSFD NAVATGNVIF DGTTGGTTAA AGTDTLGGAI VSIAASAAVT VLGAVDATAT
GAVLSVGTAA ATAAGTNLIS ELTNGSTVTA TGPAAGDSIT VNGKTITFTT AGASSKDSSG
NYTIGLDQTV NSLLDTIDTV NGNTGLSSAV TAGRLELHSG TNSPLTVGDN AGGAVLAKLG
LTAQTVDTAA ATASANISAS TQLFNTHGGL TTTAIADGTT LSVNGKTITF KTADAPQGNN
IPTGTGVLGR IGTDGNGNST IYLGDQTKFT NATVGDLLTA IDLANGVKAA SISSGVATIS
TNSGQTASAV AAGITTIQSS TGGDLNVTAF TDLFKNLGLT TSSGTGPLTL TKQRTTSGTT
MGTLIADGST LNVNGKTITF KNAAVPTASS SHTGISGNVE TDGSGNSTVY LQKGTLDDVL
KAIDLATGVR VATLGISGAT IATANGSANS SITSGSLKLS TGLASDLTIN GTGNALAALG
LTGPSGTSTS FTATRGVAAG SLNGKSLTFS SFNGGSAVNV TLGDGSNGTV KSLAQLNVAL
AANNLTASID NASGKLTIAA SNDYASRTLG GADGGVLGGT LASQLTFTVP TAPVADVNAQ
NTRAGLVKQF NDVLDQIKTT AQDSSFNGVN LLNGDNLKLV FNETGKSTIS IQGVTFNPTG
LGLSTLASGT DFIDNNATNS VLTKLSAAST ALRSQSSAFG SNLSIVQARQ DFSKSLINVL
QTGSSNLTLA DTNEEAANSQ ALTTRQSIAV SALSLANQSQ QSVLQLLR
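
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, assuming the plain-Python codon lookup shown here rather than any particular library; the name `translate` is illustrative. Table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits, a detail this sketch does not model because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the
# standard code; table 11 differs only in its allowed start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGTCAGGTA TCGTTCTCTC GAATGCCGTC"))  # MSGIVLSNAV
print(translate("CAGAGCGTGC TCCAGCTGCT GCGCTAA"))     # QSVLQLLR
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.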