Gene RPB_3590 details


Gene Information

Locus tag: RPB_3590
Symbol:
ID: 3911392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4114920
End bp: 4116275
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 71%
IMG OID: 637885492
Product: OmpA/MotB
Protein accession: YP_487196
Protein GI: 86750700
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
TIGRFAM ID:
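
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4114920, 4116275)
print(length, "bp")                     # 1356 bp
print(protein_length_aa(length), "aa")  # 451 aa
```

Both results agree with the Gene Length and Protein Length fields of this record.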


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.663745
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGTATC TCCTGACTCA TTACTGGATC TGGCTGGCCG TCGCGTTCGC GGTCGGGGTC 
GTCACGCCGC TGCTGGCGCA GAAACTCGGC TGGGCCGCGG ACCTCGAAAA CTACTGGCTG
CAGCGCATCG GTGTGGTGCT GGCGATCGTC GTGGTGGTGA TCGTGATGCA GCTCGTCTCG
GGCCGGCCGG AGCTGATGCT GGAGGCGGCC GCCGGGCTGG TCGCCGCTTA CATCGTCGGC
GGCATTGCTG GTGGTATCGC GCCGGGGCTG CTGCCGGCGC GGTTCGAGGG CTGGTGGGTC
GGGCTGTTCG CCACCGGGCT GATCTGGCTG GTGTTCAGCA TCACCACGAT GCCGAAGGTC
GAGCCGGACC TGCGCGAGCG CGTCGCCGAG GTGGTGAAGA GCGCCGGCGC CGATCCGTTG
AATTTCGATG TGTCGGGCCG CGACGTGCTG CTGCCGGACG ATCTCGGCGC GAAGCGCGCG
GTGCTGAGCG AGCAGATCGG GCAGGTCAAG GGCGTGCGGC TGGTATCGGA GGTCGAGGAG
CTGAGCGGCG CGGCGCTGGC GGCGAAGACG GTCGCCAGGC TGCAGGCGGA AGTTGCCGCG
AAGGCCGAGG CCGACGCCAA GGCCAAGGCC GAAGCCGAGG CGCGCGAGGC GGTCGAGAAG
GCAGCGGTCG CCAAGGCCGC CAAGGACACT GCGGCGAAGC AGGCGGCGGC CAAGGCCAAG
GCCGACGCCG AGGCGAAGGA AGCCGCGGCG AAGGAAGCCG CGGCGAAGGA AGCCGCCGAG
ATCGCGGCAA AGGAAGCGGC CGCCAAGGAG GCGGCGGCGA AGGAAGCTGC GGCCAAGGAG
GCAGCCGCCA AGGAGGCCGC GGCCAAGGAA GCGGCTGCGA AGTCGGCCGC CGCGGAAGAG
GCCAAGCCGG CCGCCGTCGC GCCGCCGGCC GCGCCCGCCG CCGCCGCGGC GACCCGCACC
GCCGATGCCG CCGGCACCGC GTCGATCCCG TCGGTGGAGG CCTGCCAGAG CAAGCTGTCG
ACATTGGTCG CCGCGCAGAA GATCAATTTC ATGCGCGGCA GCGCCGAGAT CGCGCAGGCC
TCATTGCCGG TGCTGAAGCA ACTCGCGGAG GTGATCGCGC ATTGCCCGGC GGCGACCATC
GAGGTCGCCG GCCACACCGA TGCCGCCGGC AAGAAGGCCG CCAACGAGGC GCTGTCGAAG
CGCCGCGCCG AGGCGGTCGC CGACAACCTC ACCAAGGCCG GGATCGGCTC GGCCAAGCTC
ACCGCCGTCG GCTACGGCGC GTCGAAGCCG CTCGCGGCCA ACGACAGTGC GGACGGTCGT
GCGAAGAATC GGCGGATCGA GTTCGTGGTG AAGTGA
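
The gene sequence above is laid out in blocks of 10 bases, so the spaces and line breaks must be removed before it is used. The following is a minimal sketch of how that cleanup and a GC-content calculation might look, for comparison with the GC content field of the record. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTTGTATC TCCTGACTCA TTACTGGATC TGGCTGGCCG TCGCGTTCGC GGTCGGGGTC"
cleaned = clean_sequence(first_line)
print(len(cleaned))  # 60 bases per full line
print(round(gc_percent(cleaned), 1))
```

Passing the complete gene sequence to `clean_sequence` and then to `gc_percent` gives a value to compare with the reported GC content.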
 
Protein sequence
MLYLLTHYWI WLAVAFAVGV VTPLLAQKLG WAADLENYWL QRIGVVLAIV VVVIVMQLVS 
GRPELMLEAA AGLVAAYIVG GIAGGIAPGL LPARFEGWWV GLFATGLIWL VFSITTMPKV
EPDLRERVAE VVKSAGADPL NFDVSGRDVL LPDDLGAKRA VLSEQIGQVK GVRLVSEVEE
LSGAALAAKT VARLQAEVAA KAEADAKAKA EAEAREAVEK AAVAKAAKDT AAKQAAAKAK
ADAEAKEAAA KEAAAKEAAE IAAKEAAAKE AAAKEAAAKE AAAKEAAAKE AAAKSAAAEE
AKPAAVAPPA APAAAAATRT ADAAGTASIP SVEACQSKLS TLVAAQKINF MRGSAEIAQA
SLPVLKQLAE VIAHCPAATI EVAGHTDAAG KKAANEALSK RRAEAVADNL TKAGIGSAKL
TAVGYGASKP LAANDSADGR AKNRRIEFVV K
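
The protein sequence above can be compared against a direct translation of the gene sequence. The sketch below builds a codon table by hand, using the amino acid assignments that translation table 11 shares with the standard genetic code. It is an illustration only: it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGTTGTATC TCCTGACTCA TTACTGGATC"))  # MLYLLTHYWI
```

The printed residues match the first block of the protein sequence in this record.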