Gene RPC_1489 details

Gene Information

Locus tag: RPC_1489
Symbol:
ID: 3972273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 1627342
End bp: 1629582
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 65%
IMG OID: 637924604
Product: flagellin
Protein accession: YP_531370
Protein GI: 90423000
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
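
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1627342
END_BP = 1629582
REPORTED_GENE_LENGTH = 2241      # bp
REPORTED_PROTEIN_LENGTH = 746    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1   # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_bp == REPORTED_GENE_LENGTH)      # 2241 True
print(length_aa, length_aa == REPORTED_PROTEIN_LENGTH)   # 746 True
```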


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.671791
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTTA TCCTCTCCTC AGCCGTGCGT CAGAACCTGC TCTCGCTGCA GTCCACTGCC 
GACCTGCTTG CTACCACCCA GAGCCGCCTG TCGACCGGCA AGAAGGTCAA CACCGCGCTC
GACAACCCGA CCAACTTCTT CACCGCCTCC TCTCTCGACA GCCGCGCCAG CGACATCAAC
AATTTGCTCG ATGGCATCGG CAACGGCGTG CAGATCCTGC AGGCCGCCAA CACCGGCATC
ACCTCGCTCA ACAAACTCGT TGACTCCGCG AAGTCGATCG CCAACCAGGC GCTGCAGACC
GTGTCGGGCT ACACCACCAA GTCGAACGTC TCGACCACGA TCGCCGGCGC CACCGCCGAC
GACATCCGCG GCACCACCAC TTACGCCAAC GCGTTCGCAG TCAGCGGCGT CGTCACCGAC
GGCACCTCCG GCGGCGCTTC GCCGATCACG ACGGCCACCA CCCTCGGCGG CGTTGCTGGA
GCTCTGGTTG GAGTTGCGGC CACCGCGGGC GACGGCTCCA CCGCTCTGTC CGGCACCGTC
ACACTGGCTG CCGGCGCCAC TGCGACCACT TTGTTGGCCG CCGCGGCTCC GAAAGACGGC
GACGTTCTGG TCGTCAACGG CAAGTCCATC ACCTTCAAGG GCGGTGCGGC TCCTGCGGCG
GCCGGCGTTG CGACTGGCTC GGGCGTCAGC GGCAACATCG TCACCGATGG ATCCGGCAAC
TCGCAGATCT ATCTGACCGG CGGCACCGTT GCCGACGTTC TCAAGGCCGT CGATCTGGCC
AGCGGCGTTG CCAAGACCGT GAATAGCGCG GGCGCGGCGA CCATCACCGG CTCGACCGCG
GCGTTTGCCG CCGGCGTGCT GACGGTCAAC TCCTCGACCG GTGCCGATCT CAGCGTCACC
GGCAAGGCCG ATCTGCTCAA TGCCCTCGGC CTGACCACCT CGATCGGGTC CGGCAACGTC
TCGGTCGGCG CCGCTCGCAC CACCGCTGCG ACCACCACCT CGAACCTGAT CCAGGACGGT
TCGACCCTGA ATGTGAACGG CAAGACGATC ACCTTCTCGA ACGCTTACCA GCCGCTTCCG
GCCAACGTTC CGTCCGGTTC CGGCATCGAG GCCGGCAGTC ATCTGCAGAC CGATGGCAAG
GGCAACTCGA TCGTCTATCT GCAGGGTGCG ACCATCGCTG ACACGTTGAA AGCGATCGAC
ATTGCGACCG GCGTGCAGAC GGCGACGAAT GCCGCCGGCA TCTCGACCCT GGCGACCGCC
AGCGGCGCCT CGGCGTCCAC CGTGGCGGCG AACGGCACGC TGAAGCTCAG CACCGGCACG
TTGTCGGATC TCAGCATTGC CGGCGGCGTC GGCAATGCAT TGTCGGCACT TGGTCTCGAC
GGCCCGACCC ACACCAGCAG CACGTTCACT GCGACGCGTG CGGCCGGCGC CGGCGGCATC
GACGGCAAGA CCCTGACCTT CTCTTCCTTC AACGGCGGAA GCGCGGTCAA TGTCACGCTC
GGCGATGGCA CCAACGGAAC CGTCAAGACG CTGGCCCAGC TCAAGACTGC GCTGCAAGCC
AACAACCTCG ACGCGACCGT GGATGCCTCC GGCAAGCTGA CGATCTCGGC GGGCAACGAC
TATGCGTCGT CGACGCTGGG TTCGACCCTG TCCGGCGGCT CGATCGGCGG CACCTTGACC
ACCTCGACCA CCTTCACCAT CGCGGCGGCG CCGGTTGCCG ACACGGTGGC GCAGACCACT
CGCGCCAATC TCGTGACGCA GTACAACAAC ATCCTGAACC AGATCAACAC CACGGCGCAG
GATGCCTCGT TCAACGGCGT CAACCTGTTG AACGGCGACA ACCTCAAGCT GACCTTCAAC
GAAACCGGCA AGTCCACGCT GAACGTGCAG GGCGTGACCT TCAACGCAGC AGGCCTCGGC
CTGTCGAACC TGACTGGCGG CACCGACTTC ATCGACAACT CGAACACCAA CAAGACTTTG
GCAACTCTGA CGACCGCCAG CACCGCGCTG CGTTCGCAGG CCTCGGCCTT GGGTTCGAAC
CTGTCGATCG TGCAGTTGCG TCAGGACTTC TCCAAGAGCC TGATCAACGT GCTGCAGACC
GGCGCGTCGA ACCTGACGCT GGCCGACACC AACGAGGAAG CGGCCAACAG CCAGGCGCTG
TCGACCCGCC AGTCGATCGC GGTGTCAGCC TTGTCGCTGG CCAACACCTC GCAGCAGAGC
GTGTTGCAGC TGCTGCGCTA A
 
Protein sequence
MSVILSSAVR QNLLSLQSTA DLLATTQSRL STGKKVNTAL DNPTNFFTAS SLDSRASDIN 
NLLDGIGNGV QILQAANTGI TSLNKLVDSA KSIANQALQT VSGYTTKSNV STTIAGATAD
DIRGTTTYAN AFAVSGVVTD GTSGGASPIT TATTLGGVAG ALVGVAATAG DGSTALSGTV
TLAAGATATT LLAAAAPKDG DVLVVNGKSI TFKGGAAPAA AGVATGSGVS GNIVTDGSGN
SQIYLTGGTV ADVLKAVDLA SGVAKTVNSA GAATITGSTA AFAAGVLTVN SSTGADLSVT
GKADLLNALG LTTSIGSGNV SVGAARTTAA TTTSNLIQDG STLNVNGKTI TFSNAYQPLP
ANVPSGSGIE AGSHLQTDGK GNSIVYLQGA TIADTLKAID IATGVQTATN AAGISTLATA
SGASASTVAA NGTLKLSTGT LSDLSIAGGV GNALSALGLD GPTHTSSTFT ATRAAGAGGI
DGKTLTFSSF NGGSAVNVTL GDGTNGTVKT LAQLKTALQA NNLDATVDAS GKLTISAGND
YASSTLGSTL SGGSIGGTLT TSTTFTIAAA PVADTVAQTT RANLVTQYNN ILNQINTTAQ
DASFNGVNLL NGDNLKLTFN ETGKSTLNVQ GVTFNAAGLG LSNLTGGTDF IDNSNTNKTL
ATLTTASTAL RSQASALGSN LSIVQLRQDF SKSLINVLQT GASNLTLADT NEEAANSQAL
STRQSIAVSA LSLANTSQQS VLQLLR
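
The relationship between the two sequence blocks can be illustrated with a minimal sketch that translates the nucleotide sequence and computes its GC fraction. It assumes the sense codons of translation table 11 match the standard genetic code (the table differs from it in permitted start codons), and it ignores ambiguity codes. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first line of the gene sequence is used here as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGTCCGTTA TCCTCTCCTC AGCCGTGCGT CAGAACCTGC TCTCGCTGCA GTCCACTGCC"
print(translate(first_line))  # MSVILSSAVRQNLLSLQSTA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the reported GC content.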