Gene RPB_2661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2661 
Symbol 
ID3910454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3043165 
End bp3045486 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content67% 
IMG OID637884561 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_486274 
Protein GI86749778 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.305468 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTCG CGTTCTGGCG TCGGACGAAG GGCGAGACGA CGAGCAATGA ACCGGCTTCG 
CCCGCGCATC CGGCCGCCGC TACGGCCTCG AAGGCAGAGC GTATCAAGGC CGCGGCGAAG
CAGCCGGTCT TCAACAAGTC TTCGCCCGAC GAGCCGGTCA TCGTGCGGAG GACGCCGGCG
CCGATCGAGC CCCCGACCAA CGGCGATATC GACCTGCGGC TGATCGGGCA GGCGCTGGCG
CGCAAGAAGC ACCTGATCAT CGCACCGACA TTGCTGGCGC TGGTGCTGTC GCTCGCCATC
GTCAATCTGA TCACGCCGCG TTACAAATCT GAAGCCCGCA TCCTGATCGA CGGCCGCGAA
AACATCTTCC TGCGCCCGAC CGGCGAACGC GACGAGCAAC GCAATGCGGT CGATCCGGAA
GCAGTCACCA GCCAGGTGCA GTTGCTGCTG TCGCGCGAGC TCGCGCTCGA AGTCATCAAG
CAGAACAAGC TGGCCGAGCG TCCCGAATTC GATCCCGTCC TGAAGGGCAT CAACCCGCTG
AAGTCTCTTT TGGCGATGGT CGGTATCGGC CGCGATCCGT TCTCCATGAC GCCGGAAGAG
CGCGTTCTCG ACGCGTATTA CGAGCGGCTC ACCGCCTATG CGGTCGACAA GTCGCGCGTG
ATGGTCGTCG AATTCCAGTC GCAGGATCCC GAGCTCGCGG CGCGCGTCGC CAATTCCATC
GCCGACGACT ATCTGGTGCT GCAGCAGAAC GCGCGCCAGG CGCAGGCGCG GTCCGCGGGG
CAATGGCTGT CCGGCGAGAT CGAATCGTTG CGCAAGAAGG TCGCCGAGGC CGAATCCAAG
GCCGAGGATT TCCGATCGAA ATCGAGTCTG TTCATCGGGA CCAACAACAC CACGCTGTCG
AACCAGCAGC TCGGCGAGCT CAACACCCAG CTCGGCAATG CGCGCGCCCT GAAATCCGAC
GCCGAGTCCA AATCACGGCT GATCAAGGAG ATGCTGCAGG GCGGTCGTCC GATCGAAGTG
TCGGACGTGC TGAACTCCGA TGTGATGCGG CGATTGTCGG AGCAGCGTGT GATGCTGCGC
ACGCAGCTCG CCGAACAATC GTCGACGCTG CTCGACAATC ACCCGCGGAT CAAGGAGCTG
AGGGCGCAGC TCGCCGATCT CGACCGGCAA TTGCGCGACG AGGCGATGAA GCTGTCGCGT
TCGTTCGAGA GCGACGCGCG GATCGCGAGC GGACGGGTCG ACAGCCTGAT CGCCAGTCTC
GAGCAGCTGA AGAAACAGGC GTCTTCGACC AATGGTCAGG ACGTCGAACT GCGCGCGCTC
GAGCGCGAAG CCAAGGCGCA GCGCGATCTG CTGGAATCCT ACCTGGCGAA ATACCGCGAA
GCGACCACCC GAGAGACCAT CGATCAGGCG CCGGCCGACG GCCGTATCAT TTCGCGGGCC
ATCGTGTCGA ACACGCCGGC CTATCCGAAG AAGTTGCCGA TCGTGCTGAT CGCCACGCTG
GCGACGCTGA TCCTGACCGC CGGCGGCATC GCCACCGGCG AATTGCTGCG AATGACCCAG
CCGCGCGCCG CCGGCCTCGC GATTCCGGCG GCCGAGCCGG CGCGGACGCC CGCCGCGATG
CAGGCACCGA TGTTCGTCAC GCCTGCCGCA GCAGCTTCGC CGCCGATGGC GCCGGCCCGC
GCCGGCACCG GCGAACCCGC GGCCGAACGA GCCGCCGATG CGGACGACAT CGAGGCGTTG
GCGCATCGGC TGCGCAGCGG GGGCGAGGCT GCACGCAAGC TGACCGTGCT CGGCACCGGC
GACACTGCCG ATGTCACGGC GACGGCTTTG AGCCTGGCGC GTCTGTTGTC GCGCGACGCC
AGGGTGGTGC TGGTCGATCT GTCGGAATCC TCCGCGATGT TGAAAGCGGC CTCGGCCGAT
CCGGCCGCGC CGGGACTCGC GGAACTGATG CAGGGCGAAG CGTCGTTCGG CCAGGTCATC
ACCCGCGACC GCAGCACGGC GCTGCATCTC GTCAGTGCGG GACGGCCCGG CTTCGATCGC
AACCTGCTAC AATCGCCGCG GCTGGTCGTG GCGCTGAACG CGCTGCTGCG GGTGTACGAT
CACGTCCTGC TCGACGCCGG GACCGCCGCC GATCTGCCCG CCGAAATGTT GACGGCGCAG
GCGCGGGCCG TGGTGGTGCC GGCTTCCGAC ATGCCGGCGG ACGCCCGTCT CAAGATGGCG
GACCAGCTCA GGGCGGTCGG CTTCTCGGAG GCGACGATGG TGCGCGCCGC GGCGCGGCCG
TCGGGTCGCA TCGAGCCCGG CGCGCGCACC GTCGCCGCGT AA
 
Protein sequence
MRFAFWRRTK GETTSNEPAS PAHPAAATAS KAERIKAAAK QPVFNKSSPD EPVIVRRTPA 
PIEPPTNGDI DLRLIGQALA RKKHLIIAPT LLALVLSLAI VNLITPRYKS EARILIDGRE
NIFLRPTGER DEQRNAVDPE AVTSQVQLLL SRELALEVIK QNKLAERPEF DPVLKGINPL
KSLLAMVGIG RDPFSMTPEE RVLDAYYERL TAYAVDKSRV MVVEFQSQDP ELAARVANSI
ADDYLVLQQN ARQAQARSAG QWLSGEIESL RKKVAEAESK AEDFRSKSSL FIGTNNTTLS
NQQLGELNTQ LGNARALKSD AESKSRLIKE MLQGGRPIEV SDVLNSDVMR RLSEQRVMLR
TQLAEQSSTL LDNHPRIKEL RAQLADLDRQ LRDEAMKLSR SFESDARIAS GRVDSLIASL
EQLKKQASST NGQDVELRAL EREAKAQRDL LESYLAKYRE ATTRETIDQA PADGRIISRA
IVSNTPAYPK KLPIVLIATL ATLILTAGGI ATGELLRMTQ PRAAGLAIPA AEPARTPAAM
QAPMFVTPAA AASPPMAPAR AGTGEPAAER AADADDIEAL AHRLRSGGEA ARKLTVLGTG
DTADVTATAL SLARLLSRDA RVVLVDLSES SAMLKAASAD PAAPGLAELM QGEASFGQVI
TRDRSTALHL VSAGRPGFDR NLLQSPRLVV ALNALLRVYD HVLLDAGTAA DLPAEMLTAQ
ARAVVVPASD MPADARLKMA DQLRAVGFSE ATMVRAAARP SGRIEPGART VAA