Gene RPB_2820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2820 
Symbol 
ID3910613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3212823 
End bp3215360 
Gene Length2538 bp 
Protein Length845 aa 
Translation table11 
GC content63% 
IMG OID637884720 
Productsurface antigen (D15) 
Protein accessionYP_486433 
Protein GI86749937 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4775] Outer membrane protein/protective antigen OMA87 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR03303] outer membrane protein assembly complex, YaeT protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.163282 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.985968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCTG GAATGCGATT GTTGCGGGGG GGCCTTCTTG CTGCCACCCT GGTGTTTTTC 
GCCGTACCGG TCGCCACGAC GGCGACCGCT GTATTGACGG CGTCGCCTGC GGCCGCGCAG
TCTGCCTCTT CGATTCAGGT CGAGGGCAAC CGCCGGGTCG AGGCCGACAC CATTCGCTCC
TATTTCAAGC CAGGCCCGAG CGGCCGGCTC GACCAGGGCA GCATCGACGA CGGCCTCAAG
GCGCTGATCG AGACGGGGCT GTTCCAGGAC GTCCGGATCA ATCAGGGTGG CGGCGGTCGC
CTCGTCGTCA GCGTCGTCGA AAACCCGGTG ATCGGCCGGC TCGCTTTCGA GGGCAACAAG
AAGATCAAGG ACGAGCAGCT TTCGGCGGAA ATCCAGTCGA AGCCGCGTGG CACGCTGTCG
CGTCCGATGA TCCAGTCCGA CGCGCTGCGG ATCGCTGAAA TCTACCGCCG GTCGGGCCGT
TACGACGTTC GCGTCGATCC GCAGATCATC GAACAGCCGA ACAACCGCGT CGATCTGGTG
TTCGTCGTCA ACGAAGGCGA CAAGACCGGC GTCAAGTCGA TCGAATTCAT CGGCAACAAG
GCGTTCTCGT CCTATCGGCT GAAGGACGTC ATCAAGACCC GCGAATCCAA CCTGCTGAGC
TTCCTCGGCT CGGGCGACGT CTACGATCCG GATCGGGTCG AGGCGGACCG CGATCTGATC
CGGCGCTTTT ATCTGAAGAA CGGCTATGCC GACGTTCAGG TGGTGGCCGC GCTGACCGAA
TACGATCCGG AGCGCAAGGG CTTCCTCGTC TCCTTCAAGA TCGAGGAAGG TCAGCAATAT
CGCGTCGGCT CGGTGAGCTT CGAATCGACG ATTCCGAATT TCGACGCCAA TTCCCTGAGC
AGCTATTCGC GGGTGAATGT CGGCTCGCTG TACAACGCCG AGGCGCTCGA GAAGTCCGTC
GAGGAAATGC AGATCGAGAT GTCGCGGCGC GGCTATGCAT TCGCGACGGT GCGTCCGCGT
GGCGATCGTA ATTTCGAATC CCATACCGTC TCGATCGTGT TCTCGATCGA GGAGGGCGCT
CGGGTCTACA TCGAGCGGAT CAACGTCGTC GGCAACACCC GGACCCGCGA CTACGTCATC
CGGCGCGAGT TCGATATCGC GGAAGGCGAT GCCTACAACC GCGCGCTGGT CGACCGGGCC
GAGCGCCGGC TGAAGAACCT CGACTTCTTC AAGTCCGTGA AGATCTCGAC CGAACCCGGC
TCGTCGAGCG ACCGCGTCAT CCTGGTGGTC AATCTCGAAG AGAAATCGAC CGGCGACTTC
TCGGTCTCCG GCGGCTATTC GACCAGCAAC GGCGCGATGG GCGAAGTCAG CGTCTCGGAG
CGCAACTTCC TCGGCCGCGG CCTGTTCGCC AAGGCGACCG TGCAATACGG CCAGTATGCG
CGCGGCTACT CGCTGTCGCT CGTCGAGCCC TATCTGCTCG ACTACCGCGT CGCGCTCGGC
CTCGACCTGT ATCAGCGCGA GCAGCTCGCC AACAGCTACA TCTCGTACGG CACCAAGACG
CTCGGCATCA GCCCGCGGCT CGGCTTCGCC CTGCGCGAAG ACCTGACCCT GCAGCTGCGC
TATTCGCTGT ACCGGCAGGA AATCACGCTG CCGTCGTACC TGAACAATTG TAACAACAAT
CTCGGCTCGG CGAACTACTT CCCGACGCCT CAGTTCATCG CGGCCGGCAA TCCGAACAAC
ACCGGCTACG GCGTGCTCGG CTGCTACGGC GACGGCGAAG CCTCGCTTCC GGTCCGCATC
GGCCTGTCCA ACGGCGCCTA CTGGACCTCC TCGGTCGGCT ACACCCTGAC CTACAACACG
CTGGACAACA CCCGGAACCC GACCAACGGT CTGCTGGTCG ACTTCCGTCA GGACTTCGCC
GGCGTCGGCG GCGACGTGAA GTTCCTGAAG TCGGCGTTCG ACGCCAAGTA CTACACCCCG
CTGGTGTCGG ACATCGTCGG CATCGTCCAC CTGCAGGCCG GCAATCTCAG CACCTATGGC
GGCAACCAGC TGCGCATGCT CGACCACTTC CAGATGGGTC CGAACCTGGT CCGCGGCTTC
GCGCCGAACG GTATCGGTCC GCGCGACATC GGCCAGTACG CCTTCTACGG CTACGGCGGC
GACGCGCTCG GCGGCACCAA CTACTGGGGC GCATCGGTCG AGTTGCAGAT GCCGTTCTGG
TTCCTGCCGA AGGAAGTCGG GCTCAAGGGC GCCGTCTATG CCGACGCCGG CTCGCTGTTC
GACTACAAGG GCCCGACGTC GTGGACGCTC ACCAACGAAG TCAACGCGCC CGGTTGTACG
CCGGCGAGCC AGACCTCGAT CGGGACCTGC GCCGGCCTGA ATTACGACGA CACCAATCTG
GTCCGCACCT CGGTGGGTGT CGGCCTGATC TGGGCCTCGC CGTTCGGTCC GCTGCGGTTC
GACTACGCTG TCCCGATCAC CAAGGGTAAG TACGACCGCG TCCAGGAATT CAAATTCGGC
GGCGGGACTT CGTTCTAA
 
Protein sequence
MNAGMRLLRG GLLAATLVFF AVPVATTATA VLTASPAAAQ SASSIQVEGN RRVEADTIRS 
YFKPGPSGRL DQGSIDDGLK ALIETGLFQD VRINQGGGGR LVVSVVENPV IGRLAFEGNK
KIKDEQLSAE IQSKPRGTLS RPMIQSDALR IAEIYRRSGR YDVRVDPQII EQPNNRVDLV
FVVNEGDKTG VKSIEFIGNK AFSSYRLKDV IKTRESNLLS FLGSGDVYDP DRVEADRDLI
RRFYLKNGYA DVQVVAALTE YDPERKGFLV SFKIEEGQQY RVGSVSFEST IPNFDANSLS
SYSRVNVGSL YNAEALEKSV EEMQIEMSRR GYAFATVRPR GDRNFESHTV SIVFSIEEGA
RVYIERINVV GNTRTRDYVI RREFDIAEGD AYNRALVDRA ERRLKNLDFF KSVKISTEPG
SSSDRVILVV NLEEKSTGDF SVSGGYSTSN GAMGEVSVSE RNFLGRGLFA KATVQYGQYA
RGYSLSLVEP YLLDYRVALG LDLYQREQLA NSYISYGTKT LGISPRLGFA LREDLTLQLR
YSLYRQEITL PSYLNNCNNN LGSANYFPTP QFIAAGNPNN TGYGVLGCYG DGEASLPVRI
GLSNGAYWTS SVGYTLTYNT LDNTRNPTNG LLVDFRQDFA GVGGDVKFLK SAFDAKYYTP
LVSDIVGIVH LQAGNLSTYG GNQLRMLDHF QMGPNLVRGF APNGIGPRDI GQYAFYGYGG
DALGGTNYWG ASVELQMPFW FLPKEVGLKG AVYADAGSLF DYKGPTSWTL TNEVNAPGCT
PASQTSIGTC AGLNYDDTNL VRTSVGVGLI WASPFGPLRF DYAVPITKGK YDRVQEFKFG
GGTSF