Gene RPD_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1111 
Symbol 
ID4021587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1262993 
End bp1265059 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content67% 
IMG OID637961303 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_568250 
Protein GI91975591 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCCC AGCCGTCTTC GCCCGATGCG CCGACCGACC GGACCTCCGA TGGGCTCGAT 
TTCAGGAATG TCCTCGGCAT TCTGGCGCGG CGAAAGAACT GGGTGATCGG CGTCCCGCTG
GCGCTATGCG CTATGGTGGC CGCCTATCTG TTGATCGCCC AGCCGGCCTA TACCGGGATG
GCGCAGGTCT TCGTCGATCC GCGCGATCAG TACACGCCGA AGGACGATCC GCTGCAGAAT
TCGGTCCCCG GTGACGGTCT GTTGCTGGTC GAAAGCCAGC TCAAGATCAT CACCTCGAAC
GAGGTGCTGA ATCGCGTCAT CGAACAGATG GATCTGCAGA ACGATCCGGA ATTCAACGGC
CAGCGCTTCG GCATCGGCAG ATTCGTGAAA TCGCTGATCG GGCTCGGCAA GACCGAAGAC
CGCGCGCTGA CCACGCTGCG CAATCTGCGT CTCAAGGTCG CAACCAGGCG GATCGATCGC
TCCTTCGTGA TCGACATTCT GGCTTCCGCC GACACCGCGC CGCGCGCCGC CGCGCTCGCC
AATGCGGTGG CGACCTCGTA TCTCGAAGAA CAGGCCGGCG CCAATTCGGC GTTCCAGCGC
CGAACCTCGG AGGCGATCTC GGCGCAGCTC GGCAAGCTGC GGCAGGAGGT GAAGCGCGGT
GAGGAGGCGG TGGCGGCGTT CAAGGCGGCC AACAATCTGG TCGGGGCGCG CAGCCGCTTG
GTGAGCGAGC AGCAGCTCGA CGAAGCCAAC ACCCAATTGA CCAATGCCCG GACCCGGCTG
GCCGATGCGC AGGCGCGCGT GCGGCTGATC GAAACCGTCG AGCAGGGCAA TGCCGGTCTC
GATTCGCTGC CCGAGGCGAT GCAGTCGGCC GCGATCGTGC AGCTGCGCGG ACGCGCGGTC
GATGCGTCGC GCGAGGAGGC GCAGCTCGCG CAGATCAACG GGCCCAACCA TCCGGCGCTG
CAGGCGGCGC GGGCGCAGGT GCGTGACGTT CAGGCGGCGA TCCAGCGCGA GCTGAAGACG
ATCGCCCGGT CGGTCCGCAA CACCGAGGCC AGCGAGCGAA CCAATGTGCA GAACCTGCAG
GCCAATTTCG ACGCGCTGAA AGCCCAATCG CAGGCCAATG ACAAATTGAT GGTCCCGCTG
CGCGAACTCG AGCGCAAGGC GGAGTCGAGC CGCATCGTTT ACGAAAGCTT CCTCGCCAAA
GCGAAGACGG CCGAAGAACG GCAGGGCATC GACACCACCA ACATTCGGCT GATCTCCCGC
GCCACGGCGC CGGAAAACAA GAGCTGGCCG CCGACGCTGA TCATGCTCGC GGCCGCGATC
TTCGCGGGCC TGACGATCGG AATAGCGCTC GCGCTGGCGC GTGATTACTT CGAGCGGCCT
GTCAGGGACC CCGAGCCGGA TGCCGTCGCC GTGTCCGATG CGCCGTCTCT GGCCCCGGCC
GATCCGGCGC CGGTGTTTCG GCCGGCGGTG AAGGCGCAGA GCCACGCCGG CCGGTTGAGT
GCGCTGAGCG AGGAGTTGCT CGCGGCGCCG AAGGGCCACA CCGTCGTGCT GGTTCAGGTC
CAGCGCGGCG CGTGGCTGGA CGACGTCGCA TTGCAGCTCG CCCGCACCGT GATCGCCAAC
CAGATGGACG TCATGCTGGT CGACGCCGAC CTCGCGCGGC ATCAAACGAC CTCGCGGCTG
GGCTATGACG ACGCGCCCGG CCTGCGCGAC GTCATGGCCG GCGATGCCGC GATCGGCGAC
GTCGTCCGAT TGCACAAGCC GACCGCGATG CGCGTCGTTC CGGTGGGGCT CGCCGCCGTC
GGCAGTCGCG ATCCGCGGGC GCGGCAGGCG TTGTCCCTGG CCTTTCAGCA GCTTCGGGTA
TTCGATCGCG TGATCGTCGA TGGCGGCGAG ATGGGATCGA CGGCCTCCGA ATTCGGCATG
TACTACATGG CCGACGAAGT GGTGTTCCTC GCGCAGGCGC CGGGCGGCAA AAGCGACGAC
GCGGCGATCC TGGTCGATCT GCTGCAGCAC CGCCAGATCA AGGCGCGGGT CGTGTTCGTG
GAGCCGGACG TTTCGGTGGC GGCATGA
 
Protein sequence
MSPQPSSPDA PTDRTSDGLD FRNVLGILAR RKNWVIGVPL ALCAMVAAYL LIAQPAYTGM 
AQVFVDPRDQ YTPKDDPLQN SVPGDGLLLV ESQLKIITSN EVLNRVIEQM DLQNDPEFNG
QRFGIGRFVK SLIGLGKTED RALTTLRNLR LKVATRRIDR SFVIDILASA DTAPRAAALA
NAVATSYLEE QAGANSAFQR RTSEAISAQL GKLRQEVKRG EEAVAAFKAA NNLVGARSRL
VSEQQLDEAN TQLTNARTRL ADAQARVRLI ETVEQGNAGL DSLPEAMQSA AIVQLRGRAV
DASREEAQLA QINGPNHPAL QAARAQVRDV QAAIQRELKT IARSVRNTEA SERTNVQNLQ
ANFDALKAQS QANDKLMVPL RELERKAESS RIVYESFLAK AKTAEERQGI DTTNIRLISR
ATAPENKSWP PTLIMLAAAI FAGLTIGIAL ALARDYFERP VRDPEPDAVA VSDAPSLAPA
DPAPVFRPAV KAQSHAGRLS ALSEELLAAP KGHTVVLVQV QRGAWLDDVA LQLARTVIAN
QMDVMLVDAD LARHQTTSRL GYDDAPGLRD VMAGDAAIGD VVRLHKPTAM RVVPVGLAAV
GSRDPRARQA LSLAFQQLRV FDRVIVDGGE MGSTASEFGM YYMADEVVFL AQAPGGKSDD
AAILVDLLQH RQIKARVVFV EPDVSVAA