Gene RPD_4401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4401 
Symbol 
ID4024926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4869694 
End bp4871373 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content64% 
IMG OID637964610 
ProductOmpA-like transmembrane region 
Protein accessionYP_571518 
Protein GI91978859 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3637] Opacity protein and related surface antigens 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCAAA TCATTTTTGC TGCGGCCGCT GTGACCCTGG CCGCTGGAAC CGCCGCCGCC 
GCCGATCTGC CGCGATTGCC CCCGGCTGCG CCGGTGTTGT GGAACTGGAC CGGCCTGTAT
TGGGGCGCCC ATCTCGGCGG CAGCTTCGGC GCGTCGAGCT TCAGCGATCC GGCCGGCCCC
GGCATCTATG GCGGCAACGT CCGTAGCCCG GCGGCGATGG CCGGCATTCA GCTCGGCTAC
AACTACCAGC CGAATGCGAA CTGGCTGGTC GGCGTCGAGG CCGACCTCAG CGCGCTGAAC
GGCAACGGCA CCAATAGCTG CATGGTCTCG TCCGGCCTCG TCATCTCGGC GAACTGCCGC
GTCCGCCAGG ATGCGATGGC GACCGTGACC GGCCGCGCCG GCCTTGTCAC CGGCCCCGGA
GGCCGCACGC TGCTTTACGC CAAAGGCGGC GCGGCCTTCC TCAGCGAACG AATAGATATC
ACCGTCGGAA ATCCGCTCCG CTCCTCGACC GACAATAGCG ACGGCCGCTG GGGATGGACC
GCGGGCGCCG GCATCGAGCA GGCGCTGGCG CCGGCCTGGT CGGTCAAGTT CGAATATGAC
TACGCCAATT TCGGCAGCAG CGATGTGGCG ACGCCTGCGA GCTATCGGCT GGTTCCATGG
GTCGGCTATT TCCCGACGCC GCAGGGAACC AGCAAGGTGA GTCAGGATTT GCACGCGGTG
AAAGTCGGGC TCAACCTCAA ATTCGGCGGC GATGTCGACG CGCGCTTCGA CGACTATCAT
CTGCGCGGAA GCCAGGCCGC GAACGAGATC GTCGAGCGCG GCGCAGTCGA GGTCGGCGGC
CGGGTCTGGT ACAGCTCCGG CCGTTTCCAG AAAGACCTCG GCAACACCGT CGATCAGGGT
AGCCAGAATC ATCTGATTTC GCGGCTGACC TATCAGAGCA CGGCGGCGTC GGGCGAGGCG
TTCGGCCGCG TCGATGGTCC CTACGACATG TTCCTCAAGG GCTTCGCCGG CGGCGGAACG
CTGCTGAGCG GACGCATGAA CGACGAGGAC TGGATTGCCA ATAGAGGCAT CCCGTATTCC
AACACGCTCC ACGATCCGGT CAAGGGCAGC ATCGCCTATG CGACGCTCGA CCTCGGTTAC
AATCTGCTGC GCGGACCGGA TTACAAGTTC GGCGGCTTTG TCGGCTACAA TTACTATCGC
GAGAACAAAT CGGCCTATGG CTGTGCCCAG ACGGCCGGCC CGACGGGGCA GGTCTGTGCC
GATCCGGTTC CCAACACCGT TCTCGCGATG ACGCAAAACA ATACCTGGCA TTCGCTCCGG
GTCGGCTTCA ACGGCGAAAT CGGACTCGGC CGCGGGTTGA AGCTCTCCGC CGACGCCGCC
TATCTGCCTT ATGTGAAGAC CTTCGGTGTC GATAATCACG TGATGCGTAC CGATGTCACC
GATACTGTAT CGCCGGAACA GGGAACCGGG CAGGGCGTGC AGCTCGAAGC GATCCTGTCG
TACCAGTTCA ACAATGCCTT CAGCGTCGGT GCCGGCGCAC GCTATTGGGC GATGTGGGCA
ACCACCAATG CCTACACCAA CATCTTCGGC TCGGAGTGTC CGTGCCAGAC CTTGCCGGAG
CGCACCGAGC GCTATGGAAC CTTCCTGCAG GCCGCCTACA AGTTCGACAC GCTGAACTAG
 
Protein sequence
MRQIIFAAAA VTLAAGTAAA ADLPRLPPAA PVLWNWTGLY WGAHLGGSFG ASSFSDPAGP 
GIYGGNVRSP AAMAGIQLGY NYQPNANWLV GVEADLSALN GNGTNSCMVS SGLVISANCR
VRQDAMATVT GRAGLVTGPG GRTLLYAKGG AAFLSERIDI TVGNPLRSST DNSDGRWGWT
AGAGIEQALA PAWSVKFEYD YANFGSSDVA TPASYRLVPW VGYFPTPQGT SKVSQDLHAV
KVGLNLKFGG DVDARFDDYH LRGSQAANEI VERGAVEVGG RVWYSSGRFQ KDLGNTVDQG
SQNHLISRLT YQSTAASGEA FGRVDGPYDM FLKGFAGGGT LLSGRMNDED WIANRGIPYS
NTLHDPVKGS IAYATLDLGY NLLRGPDYKF GGFVGYNYYR ENKSAYGCAQ TAGPTGQVCA
DPVPNTVLAM TQNNTWHSLR VGFNGEIGLG RGLKLSADAA YLPYVKTFGV DNHVMRTDVT
DTVSPEQGTG QGVQLEAILS YQFNNAFSVG AGARYWAMWA TTNAYTNIFG SECPCQTLPE
RTERYGTFLQ AAYKFDTLN