Gene RPB_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1008 
Symbol 
ID3909132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1154444 
End bp1156507 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content69% 
IMG OID637882901 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_484629 
Protein GI86748133 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.597439 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCCA GGCCGTCTTC CCAAACGATC TCCGACGACC GCAATCCCGA CGGGATCGAT 
TTCAGGAACG TCGCCGGCAT TCTGGCGCGG CGCAAGACCT GGGTGTTCGG CGTTCCGCTG
GCGCTGTGCG CGGTGGTCCT GGCCTATCTC CTGGTCGCGC AGCCGTCCTA CACCGGATGG
GCGCAGGTGT TCGTCGATCC GCGCGATCAG TACACGCCGA AGGACGACCC GCTGCAGAAT
TCGGTGCCGG GCGACGGCCT GCTGCTGGTC GAGAGCCAGC TCAAGATCAT CACCTCGAAC
GAGGTGCTGA ACCGCGTCAT CGAGCAGATG AATCTGCAGA ACGATCCGGA GTTCAACGGC
GAGCGGATGG GGCTCGGCCG GCTGGTGAAG GCGCTGATCG GGCTCGGCAA GACCGAGGAC
CGCGCCCTCG TCACGCTGCG CAATCTGCGC AAGAAGGTCG CCACCAAGCG GGTCGACCGC
TCCTTCGTGA TCGACATCAT GGCCTCGGCC GACACCGCGC CGCGCGCGGC CGCGCTCGCC
AATGCGGTGG CGACCGCCTA TCTCGACGAG CAGGCCGGCG CCAACGCCGC GTTTCAGCGC
CGAACCTCGG AAGCGATCTC GGCGCAGCTC GGCAAGCTGC GGCAGGAGGT CAAGCGCGGC
GAGGAAGCCG TCGCCGCCTA CAAGGCGGCC AACAATCTGG TCGGCGCGCG CAGCCGGATG
GTGAGCGAGC AGCAGCTCGA CGAAGCCAAC ACCCAGCTCA CCAACGCCAA GACCCGGCTG
GCCGATGCGC AGGCGCGGGT CCGGCTGATC GAAACCATCG AGCACGGCGA CGCCGGCCTC
GAGGCGGTGC CCGAGGCGAT GCAGTCGGCC GCGATCGTGC AGTTGCGCGG GCGGCTGGCC
GACGCGTCGC GCGAGGAGGC GCAACTCGCG CAGATCGACG GCCCCAATCA TCCGGCGCTG
CAGGGCGCGC GGGCGCAGGT GCGTGACGTT CAGGCCGCGA TCCAGCGCGA GCTGAAGACG
ATCGCGCGCT CGGTGCGCAA CACCTACGCC AGCGAACGCA CCAATGTGCA GACCCTGCAG
GCCAATTTCG ACGCTCTGAA GACGCAGTCG CAGGCCAACG AGAAACTGCT GGTGCCGCTG
CGCGAGCTGG AGCGCAAGGC GGAATCCAGC CGCATCGTCT ACGAGAACTT CCTCGCCAAG
GCGAAGACCG CCGAGGAGCG GCAGGGCATC GACACCACCA ACATCCGGCT GATCTCGCGC
GCCACCACGC CGGAAAACAA GAGCTGGCCG CCGACGCTGA TCATGCTGGC CGCCGCGATC
TTCGCCGGGC TGACCATCGG CATCGCGCTG GCGCTGGCGC GCGATCACTT CGAGCGCCCG
GACCGTGGAC CCGAGCCGGA GGCCGTCGAC GAAGTCGATC CTCCCGTCGC GGTCGCGGTC
GCGCCCGTCC CGGCGCCGCG GCCGGTGATG GCGCAGCCCC GCACCGGCCG GCTGAAGGCG
CTGAGCGCGG ACCTGCTCGC GGCGCCGAAG GGCCACACCA TCGTGCTGGT CCAGGTGCAA
CGCGCCGCGT GGCTCGACGA CGTCGCGCTG CAACTCGCGC GGACCGTGAT CGCCGCCGAG
ATGGACGTGA TGCTGGTCGA CGCCGATCTG GCGCGGCATC ACACCACGTC GCGGCTCGGC
TTCGACGGTG CGCCCGGCCT GCGTGACGTG ATGGCCGGAA CCGCCGCGAT CAACGAGGTC
GTGAAGTTGC ACCAGCCGAC CGCGATGCGG ATCGTGCCGG TCGGGCTGTC GGCCGTCGGC
AATCGCGATC CGCGCGCCCG GCAGGCGCTG CAGTCGGCGG TGCAGCAGCT GCGCGCGTTC
GACCGCGTCA TCGTCGACGG CGGCGAGATC GGATCGACCG CGTCCGAATT CGGGCTGTAC
TACATGGCCG ACGAAGTCGT GTTCCTGGCG CAGGGCCCCG GCGGCAAGAG CGAGGACGCC
GCCATCCTGG TCGATCTGCT GCAATTGCGT CAGGTCAAGG CGCGGATCGT GTTCGTCGAG
CCGGACGTCG CGGTGGCGGC ATGA
 
Protein sequence
MSPRPSSQTI SDDRNPDGID FRNVAGILAR RKTWVFGVPL ALCAVVLAYL LVAQPSYTGW 
AQVFVDPRDQ YTPKDDPLQN SVPGDGLLLV ESQLKIITSN EVLNRVIEQM NLQNDPEFNG
ERMGLGRLVK ALIGLGKTED RALVTLRNLR KKVATKRVDR SFVIDIMASA DTAPRAAALA
NAVATAYLDE QAGANAAFQR RTSEAISAQL GKLRQEVKRG EEAVAAYKAA NNLVGARSRM
VSEQQLDEAN TQLTNAKTRL ADAQARVRLI ETIEHGDAGL EAVPEAMQSA AIVQLRGRLA
DASREEAQLA QIDGPNHPAL QGARAQVRDV QAAIQRELKT IARSVRNTYA SERTNVQTLQ
ANFDALKTQS QANEKLLVPL RELERKAESS RIVYENFLAK AKTAEERQGI DTTNIRLISR
ATTPENKSWP PTLIMLAAAI FAGLTIGIAL ALARDHFERP DRGPEPEAVD EVDPPVAVAV
APVPAPRPVM AQPRTGRLKA LSADLLAAPK GHTIVLVQVQ RAAWLDDVAL QLARTVIAAE
MDVMLVDADL ARHHTTSRLG FDGAPGLRDV MAGTAAINEV VKLHQPTAMR IVPVGLSAVG
NRDPRARQAL QSAVQQLRAF DRVIVDGGEI GSTASEFGLY YMADEVVFLA QGPGGKSEDA
AILVDLLQLR QVKARIVFVE PDVAVAA