Gene RPD_1670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1670 
Symbol 
ID4022150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1886025 
End bp1888313 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content64% 
IMG OID637961865 
Productflagellin 
Protein accessionYP_568808 
Protein GI91976149 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.940906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0365182 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGTA TCGTTCTCTC GAACGCCGTC CGCCAGAATC TTTCGTCGCT TCAGGCCACG 
GCCGATCTGC TCGCCACCAC CCAGAGCCGT CTTTCGTCCG GCAAGAAGGT GAACTCGGCT
CTCGACAATC CGACCAACTT CTTCACCGCG TCGGGTCTCG ATGCCCGCTC CAGCGACATT
AACAACCTGC TCGACGGTAT CGGCAACGGC GTGCAGATCC TGCAGGCCGC CAACACCGGC
ATCACCTCGC TGACCAAGCT GGTCGATAGC GCCAAGTCGA TCGCCAACCA GGCCCTGCAG
ACCGTCTCGG GCTATTCGAC GAAGTCGAAT GTCTCGACCA CGATCACCGG TGCCACTGCC
AACGACCTCC GCGGCACTAC CAGCTACTCC AGCACTTCCG CGGCGGGTAA CGTGCTGTAT
TCGGGCGCCG CCGGCGGTGC CACGGCTGCG ACCTCGGCTG CGACCCTCGG TGGCACTGCT
GGTTCGCTTG TCGGGTCTGG CGTGGTCAAC AACAACCTCA CGGTTCCGGT GGCAATCGAT
TCGACCACCC GTCTGTTCGC AGCGGGCGGT GGCGGCACTG CCGGTCTTAC CACCCAGGCA
AACACCACCT TCACCGATGG TTCGAAGCTG TCGGTCAACG GCAAGACCAT CACCTTCAGT
GCGACGGCGG TTCCCGGTGC CAGCGCCGTT GCGGCCGGCT CCAGCCTGTC CAGCACGAAC
GTCGTGACCG ACTCCGGCGG CAACTCGACG GTTTATCTCG GGACCGCTGC AGACTCAGCG
GCCACGGTCG GCGATCTGAT GGCCGCGATC GATGTCGCCA GCGGCGCACA GTCGATCACC
GCGATCAACG CCACCACCAA GATCGCGACC TTGACCGGCG GTGCGGGCGC ATCTTCGATC
ACCGGCGGCA CCGTCACGTT GAAAAGCTCG ACCGGCGCCG ATCTGTCGAT CTCTGGCACC
GCAGACATGC TGGCGTCCCT GAAACTCACG GCGTCGCTGG GTTCGAGCGT CACGACTGTC
GCCGCTGCTC GTGCCACCTC GTCGTCCAGC CTCGGCAGCC TGATTGAGGA CGGCTCGACG
CTGAACGTCA ATGGCAAGAC CATCACCTTC AAGAACACGC TGTCGACCGA CGTGAATGCG
ATTCCGACCG GCTTCGGCAA GCCGAGCGGC GCGCACTACG CCACCGACGG CAACGGCAAT
TCGACCGTGT TCCTGCAGGA CGCGACGGCT GCCGACATGC TGTCGGCGAT CGACCTCGCC
ACCGGCACCA AGAGCGCGAC CATTGCCACC AGCGTCGCCA CAGTGACGAC CCCGGCCGGC
AACGTCGCGT CGACGGTGCT GAGCGGCGCG CTCAAGCTGT CGACCGGTAC CGCGGCCGAC
CTCTCGATCA CTGGTACCGG CAACGCACTC GCCGCCCTCG GCCTCAACGG CCCGACAGGC
ACCGACACCT CGTTCAACGC ATCGCGGACG GCGAGCGCCG GCAATGTCAG CGGCAAGTCC
TTGACCTTCA CCTCCTTCAA GGATGGTGCG GCGGTGAATG TCACCTTCGG TGACGGCACC
AACGGCACCG TCAAGTCGCT CGCCCAGCTC AACACTGCGC TTGCGGCCAA CAACATGGTG
GCGGTGGTTG ACAATGCGAC CGGCAAGCTG ACGATCTCGG CGTCGAACGA TTTCGCTTCC
CACACATTGG GAAGCAGCGA CGGCGGCGCG ATCGGTGGCA CACTGAGTTC GACACTGACC
TTCTCGTCTG CGTCGGCTCC GGTGGCTGAT ACCAATGCCC AGAACACCCG CGCCGGCCTG
GTCAAGCAAT ACAACGACAT CATGGACCAG ATCAAAACCA CGGCCCAGGA TGCCTCGTTC
AACGGCGTCA ACCTGCTCGA CGGTGACACG CTGAAGCTGG TGTTCAACGA AACCGGCAAG
TCGACGATCT CGATCCAAGG CGTCAGCTAC AATCCGACCG GCCTCGGCCT GTCGACCCTG
ACTTCGGGCA CCGACTTCAT CGACAACGAT GCGACCAACT CCGTGTTGGC CAAGCTGAGC
ACCGCATCCA CGACCCTGCG GTCGCAGGCC TCGGCGTTCG GTTCGAACCT CTCGATCGTC
CAGGCGCGTC AGGACTTCTC GAAGAACCTG ATCAACGTGC TGCAGACCGG CTCGTCGAAC
CTGACGCTGG CCGACACCAA CGAGGAAGCG GCCAACAGCC AGGCGCTGTC GACCCGCCAG
TCGATCGCGG TGTCCGCGCT GTCGCTCGCC AACCAGTCTC AGCAGGGCGT GCTCCAGCTG
CTGCGCTGA
 
Protein sequence
MSGIVLSNAV RQNLSSLQAT ADLLATTQSR LSSGKKVNSA LDNPTNFFTA SGLDARSSDI 
NNLLDGIGNG VQILQAANTG ITSLTKLVDS AKSIANQALQ TVSGYSTKSN VSTTITGATA
NDLRGTTSYS STSAAGNVLY SGAAGGATAA TSAATLGGTA GSLVGSGVVN NNLTVPVAID
STTRLFAAGG GGTAGLTTQA NTTFTDGSKL SVNGKTITFS ATAVPGASAV AAGSSLSSTN
VVTDSGGNST VYLGTAADSA ATVGDLMAAI DVASGAQSIT AINATTKIAT LTGGAGASSI
TGGTVTLKSS TGADLSISGT ADMLASLKLT ASLGSSVTTV AAARATSSSS LGSLIEDGST
LNVNGKTITF KNTLSTDVNA IPTGFGKPSG AHYATDGNGN STVFLQDATA ADMLSAIDLA
TGTKSATIAT SVATVTTPAG NVASTVLSGA LKLSTGTAAD LSITGTGNAL AALGLNGPTG
TDTSFNASRT ASAGNVSGKS LTFTSFKDGA AVNVTFGDGT NGTVKSLAQL NTALAANNMV
AVVDNATGKL TISASNDFAS HTLGSSDGGA IGGTLSSTLT FSSASAPVAD TNAQNTRAGL
VKQYNDIMDQ IKTTAQDASF NGVNLLDGDT LKLVFNETGK STISIQGVSY NPTGLGLSTL
TSGTDFIDND ATNSVLAKLS TASTTLRSQA SAFGSNLSIV QARQDFSKNL INVLQTGSSN
LTLADTNEEA ANSQALSTRQ SIAVSALSLA NQSQQGVLQL LR