Gene RPB_3800 details

Gene Information

Locus tag: RPB_3800
Symbol:
ID: 3911603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4333358
End bp: 4336018
Gene Length: 2661 bp
Protein Length: 886 aa
Translation table: 11
GC content: 67%
IMG OID: 637885701
Product: flagellin
Protein accession: YP_487405
Protein GI: 86750909
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
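
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration; it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 4333358
end_bp = 4336018
protein_length_aa = 886

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one stop codon ends the CDS and is not counted as an amino acid.
coding_codons = gene_length_bp // 3 - 1

print(gene_length_bp)                      # 2661
print(gene_length_bp % 3)                  # 0, so the CDS is a whole number of codons
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the reported gene length (2661 bp) and protein length (886 aa) agree with the start and end coordinates.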


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.139024
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCTGATG TGGTTCTTTC AGCAGCAGTT CGCCAGAATC TGCTTTCGCT GCAATCGACG 
GCGGATCTGC TCTCGACCAC CCAGAATCGT CTTGCGAGCG GCAAGAAGGT CAACACCGCG
CTCGACAATC CGACCAACTT CTTTACCGCC GCGGGGCTCG ACAGCCGCGC CAGCGACATC
AACAACCTGC TCGACGGCAT CAGCAACGGC GTCCAGATCC TGCAGGCGGC CAACACCGGC
ATCACCTCGC TGAACAAGCT GATCGACACC GCCAAGTCGA TCGCCAACCA GGCGCTGCAG
TCGAACGTCG GCTACTCCAC GAAATCCAAC GTCTCGGCGA CCATCGCGGG CGCCACCCCG
GACGATCTGC GCGGCACCCA GACCTTCGCC AGCGCGACCG CCACCAGCAA CGTGGTCTAT
GATGGCACCG CCGGCGGCAC CAACGGCGTG TCGCTGACGG ATACGCTCGG CGGCGGCGTC
GGCAGCATCA CCGGCACCAA CATCACCAAG GCGGTCGCGG CCGACGCGAC GGCCACCGGC
GGCGTACTCT ACACCGGCAC GGCGACGGCG ACGGCGACCA GCGCCGACCT GATCAGTTCG
CTGACCAACG GCTCGACCGT GACGCCGACC GGCCCGCAGG CCGGCGACAT CATCGTGGTC
AACGGCAAGA ACATCACCTT CACCACCACG GGCTCGGCGA CCGCAGACAG CAACGGCAAC
TATACGATCG GCATCAACCA GCCGATCAGC GCGCTGCTGG CGAGCATCGA CACCATCAAC
GGCAACACCA GCAACCCGTC GGTCGTCGAC GCCAACGGCC ATATCCAGCT CCACACCGGC
ACCAACCGTT CGCTGTCGAT CAGCGACACC AGCAGCGGCA CGGTGCTGGC GAAGCTCGGC
TTCGGTTCGA CGGTCACGGT CCCGCTCGGC ACCGGCGCCG CCACGGCGAT CACCGCGACC
ACGAAGCTGT TCAATTCGGT CGGCGGCCTC GGACCGGCGA TCGCCGACGG CACCACGCTG
ACGGTCAACG GCAAGTCCGT CACCTTCAAG GCGAGCGATC CGCCGAGCGC CGCGGGCCTG
CTCGCGGGCT CCGGCGTGCT CGGCAATATC GTCACGGATA CGGCCGGCAA CTCCACCATC
TATATGGGGA CGAGCAACAC CTACACCTCG GCCACTGTCG GCGACGTGCT CACCGCGATC
GATCTCGCCA GCGGCGTCAA GTCGGCGACG ATCGCCAACG GCATCGCGAC CTTCGCGGCC
AACGGCACGC CGTCGCAGAT CTCCGCCGGC GGCGCGGTGA CGCTGCAGAC TTCGACCGGC
GCCGATCTCA GCATCACCGG CCCGGCCGAC TTCCTGAGCT CGCTGAACCT GACCGCGTCG
ACCGGCCCGG GCCCGGCGAC GCTCACCGCC ACCCGTTCGA CCGGCGCCGG CACCATCGGC
ACGCTGATCG AGGACGGCTC GACGCTGAAC GTCAACGGCA ACATCATCAC CTTCAAGAAC
GCCCCGGTGC CGCTCGCCTC GGCCAGCCAC ACCGGCATCA GCGGCCATGT CGAGACCGAC
GGTCTCGGCA ATTCGACCGT GTATCTGCAG GGCGGCACCA TGGCCGACGT GCTGAAGGCG
ATCGACCTCG CCACCGGCGT GCAGACGGCG ACGCTGTCGC AGACCGGCGC GACGCTGACG
ACGCAGACCG GCTCGGCCAA CTCGTCGCTG TCCAGCGGCT CGCTGAAGGT CTCGACCGGC
AGCGCCTCCG ACCTCACCAT CAGCGGCACC GGCAACGCGA TGCTGGCGCT GGGCCTCGCC
GGCAACACCG GCACCTCGAC CGAGTTCAAG GCGTCGCGCT CGTCCGGCAC CGGCGGCGTC
AGCGGCAAGA CGCTGAGCTT CACCTCGTTC AAGGGCGGCA CCCCGGTCAG CGTCACCTTC
GGCGACGGCA CCGGCGGCAC CGTGAAGACG CTGTCGCAGC TCAACGTCAA GCTGGCGACC
AACAACATGA TCGCGCAGAT CGACGCCAAC GGAAAGCTGA CGATCTCGTC GAACAACGAC
TACGCCTCCG CGACGCTCGG CTCGACCACG GACGGCGGCA CGCTCGGCGG CACCATCACC
GCGACGCTGA CCTTCTCGAC GCCGAACCCG CCGGAACCGG ACGTCACCGC GCAGGTGGCG
CGCGCCAAGC TGGTCGAACA GTACAACAAC GTCATCCAGC AGATCACCAC GACGTCGCAG
GACGCGTCGT TCAACGGCGT CAATCTGCTC AACGGCGATA CGCTGAAGCT GGTGTTCAAC
GAGACCGGCA AGTCGACGCT GAACATCGTC GGCACCGCGC TGAGCCCGGC GGCGCTCGGC
CTGCCGACGC TGGTGTCGGG CGTCGACTTC ATCGACAACG CCTCGACCAA CAAGACGCTG
GCCTCGCTCA ACACCGCGGC GACCACGCTG CGGTCGCAGG CGTCGTCCTA CGGTTCCAAC
CTGTCGATCG TGCAGATCCG GCAGGACTTC GCCAAGAACC TGATCAACGT GCTGCAGACC
GGCTCGTCGA ACCTGACGCT GGCCGACACC AACGAGGAAG CCGCCAACAG CCAGGCGCTG
TCGACCCGCC AGTCGATCGC GGTCTCGGCG CTGGCGCTGG CCAACCAGTC GCAGCAGAGC
GTGCTGCAGC TGCTGCGATA A
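
A GC content figure such as the one reported in the gene information can be recomputed from a sequence printed in this grouped layout. The following is a minimal sketch, not the database's own method; the function name `gc_fraction` is hypothetical, and it assumes the input contains only the bases A, C, G and T separated by whitespace.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGTCTGATG TGGTTCTTTC AGCAGCAGTT CGCCAGAATC TGCTTTCGCT GCAATCGACG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the complete gene sequence (all lines joined into one string) gives the value to compare against the reported GC content.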
 
Protein sequence
MSDVVLSAAV RQNLLSLQST ADLLSTTQNR LASGKKVNTA LDNPTNFFTA AGLDSRASDI 
NNLLDGISNG VQILQAANTG ITSLNKLIDT AKSIANQALQ SNVGYSTKSN VSATIAGATP
DDLRGTQTFA SATATSNVVY DGTAGGTNGV SLTDTLGGGV GSITGTNITK AVAADATATG
GVLYTGTATA TATSADLISS LTNGSTVTPT GPQAGDIIVV NGKNITFTTT GSATADSNGN
YTIGINQPIS ALLASIDTIN GNTSNPSVVD ANGHIQLHTG TNRSLSISDT SSGTVLAKLG
FGSTVTVPLG TGAATAITAT TKLFNSVGGL GPAIADGTTL TVNGKSVTFK ASDPPSAAGL
LAGSGVLGNI VTDTAGNSTI YMGTSNTYTS ATVGDVLTAI DLASGVKSAT IANGIATFAA
NGTPSQISAG GAVTLQTSTG ADLSITGPAD FLSSLNLTAS TGPGPATLTA TRSTGAGTIG
TLIEDGSTLN VNGNIITFKN APVPLASASH TGISGHVETD GLGNSTVYLQ GGTMADVLKA
IDLATGVQTA TLSQTGATLT TQTGSANSSL SSGSLKVSTG SASDLTISGT GNAMLALGLA
GNTGTSTEFK ASRSSGTGGV SGKTLSFTSF KGGTPVSVTF GDGTGGTVKT LSQLNVKLAT
NNMIAQIDAN GKLTISSNND YASATLGSTT DGGTLGGTIT ATLTFSTPNP PEPDVTAQVA
RAKLVEQYNN VIQQITTTSQ DASFNGVNLL NGDTLKLVFN ETGKSTLNIV GTALSPAALG
LPTLVSGVDF IDNASTNKTL ASLNTAATTL RSQASSYGSN LSIVQIRQDF AKNLINVLQT
GSSNLTLADT NEEAANSQAL STRQSIAVSA LALANQSQQS VLQLLR
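
The relationship between the gene sequence and the protein sequence can be checked by translating codons. The sketch below is illustrative only. It builds the standard codon table, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may start translation. It does not handle alternative start codons, which is not a problem here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above (both begin in frame).
gene_start = "ATGTCTGATG TGGTTCTTTC AGCAGCAGTT CGCCAGAATC TGCTTTCGCT GCAATCGACG"
gene_end = "GTGCTGCAGC TGCTGCGATA A"

print(translate(gene_start))  # MSDVVLSAAVRQNLLSLQST
print(translate(gene_end))    # VLQLLR
```

The two outputs match the first twenty and last six residues of the protein sequence listed above.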