Gene RPD_1013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1013 
Symbol 
ID4021488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1150835 
End bp1152772 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content64% 
IMG OID637961204 
ProductOuter membrane autotransporter barrel 
Protein accessionYP_568152 
Protein GI91975493 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3637] Opacity protein and related surface antigens 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.645383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.106647 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACACG CGCGGACCCG AATTCTGGCA GGCTTCGCTT TCGCGATGGC CGCCTCCGTC 
TCCACCGGCG CCTTGGCCGA ATGTACCGGC ACAGGCAGCT TCGTTCCGGG CGCCGTGATT
CCGGGCACCA ATTTCAGTCC GAGCGCGCTC TTGCCGTTCG CCGCGGGCGG TGCGGTGAAT
TCCCTCGTCT CGGCGATCAA CACCGCCAAC ACGGCATTCC TCACCCAATC GACCGCCTTC
GTGAGCGCCC CGCGCAATCC GGCGCCGAAT CAGGAAGGCG GCGGCGTCTG GACCCGTGCG
ATCGGCGGTG AAGTCACCAC CAAATCGACC AGCACGACCT CCAACGTGTC GCTCGGGGGT
GTCGGACTGC CGGGCTCGGT GAACTGCAAC AACGAGAACA AGCTCAGCTT CGCTGGCGTC
CAGGTCGGGG CGGATACCTC GATTTTGAAC TACAACGGCT GGAATATGCA TCTCGGCTCG
ACCGTCGGCT ATCTCGGCGC CAAGTCCCGC GACAAGTCGT CGGCCGGTGC GCTGAACCCG
CTTGGCGGCA CTTTCGAAGA CACGCTGCAG GTGCCGTTTG CGGGCGTCTA TGTCGCGATC
ACCAAAGGCG GTTTCTTCGC CGACGGCCAG GTTCGCCTCG ATTATTATCA GAACTCGCTG
AGTGATCCGA TCGTCGGCGG TATCTTCAGC CAGAAGCTGG ATGCCCGTGG CCTCTCCTTC
ACCGGTAACG TCGGCTACAA CCATGCGTTG GAGAACAACT GGTTCATCGA GCCCTCGGCC
GGTATCGTGG TCTCGAAGGT CAAGGTCGAT CCGCTCAACG TGACCGGCTC GCTGGTGTTG
CCCGCGACCT TCACGCCGGG CGTCACGTTC CCCGGCCAGT TGCAGGTCGA CGATATCAAC
AGCACGCTCG GTCGCCTCAG CTTGCGCGGC GGCACCAGCA TCGCATCCGG CAACATGATC
TGGCAGCCCT TCGCCATTGC GAGCGTCTAT CATGAATTCA GCGGGGCGGT GACTTCCACG
TTCAACGGCG ATGCGGCGTT CAACGCGACC GGCATCCCGT CGGCGACCGG CACGATCTCC
AGCACCAACC TCGGCACCTA CGGCCAGTTC GGGCTCGGCG TCGCCGGCCA GCTCGTCAAC
ACCGGCCTGC TCGGCTACGT CCGTGCCGAC TATCGCACCG GTGATCATAT CGACGGCTAC
AGCCTGAACG GTGGCGTCCG CTACCAGTTC GCGCCCGACG CGATCGTCGC TGCGCCGCTC
TACACCAAGG CCGCGAAGGC TCCGGTGCTG GTCCGCTCGG CCTATAACTG GACCGGCTTC
TTCATCGGCG GCAGCTTCGG CGCACTGAAT GGCCGGACCG ACTGGACATT CCAGCCGGTC
GGCACGCGCA CCGATCCGCG TTTCGCCGGC GCGATCGGCG GCGGCCAGAT CGGTTATGAC
CATCAGTTCG GCAAGTGGGT GGTCGGCGTC GAAGGCAACC TGTTCGCGAC CAACGCCAAC
GGCGCCCGTC CCTGCCCGAA TGGCGTGTTC TTCACGTGTG AAAACAATGT GAGCTGGATG
GGCACTGCGA CCGCGAAGCT CGGCTACGCG TTCTGGGACC GCTCGCTCTG GTACGTCCGC
GGCGGCGGCG CCTTCGGCGA TCTCAAGGTC ACCACCAACT GCAACACCGG TCCGGTGGTT
CCCAATCCGG CATTCCTCGT CGTGGCGGGT TGCGGCGAAA GCGCCAGCCG CAACCGTGCC
GGCTGGACCA TTGGGTTCGG TTCGGAGTTC GCGCTGAGCA AGAACTGGAC GGTGCGCGCC
GAGACCAACT ATTTCGACAT GGGTCGCGAG CGCTACACGC TGCCGACCTC GACCATCGAC
GTCAAGGAAA ACGGTTTCAT CTCGACCGTC GGCCTCAACT ATCGCTTCGC GCCCACGGCG
CTGGTCGCAA AATACTGA
 
Protein sequence
MQHARTRILA GFAFAMAASV STGALAECTG TGSFVPGAVI PGTNFSPSAL LPFAAGGAVN 
SLVSAINTAN TAFLTQSTAF VSAPRNPAPN QEGGGVWTRA IGGEVTTKST STTSNVSLGG
VGLPGSVNCN NENKLSFAGV QVGADTSILN YNGWNMHLGS TVGYLGAKSR DKSSAGALNP
LGGTFEDTLQ VPFAGVYVAI TKGGFFADGQ VRLDYYQNSL SDPIVGGIFS QKLDARGLSF
TGNVGYNHAL ENNWFIEPSA GIVVSKVKVD PLNVTGSLVL PATFTPGVTF PGQLQVDDIN
STLGRLSLRG GTSIASGNMI WQPFAIASVY HEFSGAVTST FNGDAAFNAT GIPSATGTIS
STNLGTYGQF GLGVAGQLVN TGLLGYVRAD YRTGDHIDGY SLNGGVRYQF APDAIVAAPL
YTKAAKAPVL VRSAYNWTGF FIGGSFGALN GRTDWTFQPV GTRTDPRFAG AIGGGQIGYD
HQFGKWVVGV EGNLFATNAN GARPCPNGVF FTCENNVSWM GTATAKLGYA FWDRSLWYVR
GGGAFGDLKV TTNCNTGPVV PNPAFLVVAG CGESASRNRA GWTIGFGSEF ALSKNWTVRA
ETNYFDMGRE RYTLPTSTID VKENGFISTV GLNYRFAPTA LVAKY