Gene RPB_0901 details


Gene Information

Locus tag: RPB_0901
Symbol:
ID: 3909081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1038118
End bp: 1040034
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 66%
IMG OID: 637882794
Product: outer membrane autotransporter
Protein accession: YP_484523
Protein GI: 86748027
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3637] Opacity protein and related surface antigens
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
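
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1038118, 1040034)
print(length_bp)                  # 1917
print(protein_length(length_bp))  # 638
```

Both results agree with the Gene Length and Protein Length fields of this record.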


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAAGG CTCGGACACG AATTCTGGCT GGCTTCGCCT TCGCGATGGC GACCTCGGTT 
TCCACCGGCG CCGTTGCGGC GTGTACCGGC ACCGGCAGTT TCGTTCCGGG TCTGGTACCG
GGCTTCAACC CGTCATCGGT TCTTCCATTC GCCGCCGGTG GCGCGGTGAA CTCGCTGGTC
TCGGCGATCA ACACGGCGAA CACCGCGTTT CTCACCCAGT CGACCGCCTT CGTCAGCGCG
CCGGCGAGTC CGCGGCCGAA TCAGGAAGGC GGCGGGGTCT GGACCCGCGC GATCGGCGGT
GAGGTCACCA CCAAATCGAC CAGCACCACC ACCAACGTCG CAGTCGGAGG CGTCGGCCTG
CCCGGCACCG TGACCTGCGA CAACGAGAAC AAGCTCAGCT TCGCCGGCGT CCAGGTCGGG
GCCGATACCT CGGTCCTGAA CTACAACGGC TGGAACCTGC ATCTCGGCTC GACGGTGGGC
TATATCGGCG CCAAGTCGCG CGACAAATCG TCGGCCGGGG TGCTCAATCC GCTCGGCGGC
ACCTTCGAGG ACACGCTGCA GGTTCCGTTC GCGGGCGTCT ATGTCGCCAT GACCAAAGGC
GGCTTCTTCG CCGACGGCCA GGTCCGCCTC GACTACTACC AGAACTCGCT GAGCGACCCG
ATCGTCGGCG GCATTTTCGG CCAGAAGCTG GATGCCCGCG GCCTCTCCTT CACCGGCAAT
GTCGGCTACA ATCACGCGCT GCAGAACAAC TGGTTCATCG AGCCGTCGGC CGGCGTGGTG
GTGTCGAAGG TCAAGGTCGA TCCGCTCAAC GTCACCGGTT CGCTGGTGCT GCCCGCGAGC
TTCACCCCGG GCGTGACCTT CCCCGGCCAA TTGCAGGTCG ACGACATCAA CAGCACGCTC
GGTCGCTTCA GCCTGCGCGG CGGCACCAGC ATCGCCTCGG GCAACATGAT CTGGCAGCCC
TTCGCGATCG CCAGTGTGTA TCACGAATTC AGGGGCGCGG TGACCTCGTC GTTCAACGGC
GCGGCGGCGT CGGCGGCCAC CGGTCTTCCG TCGGCGACCG GCAACATCTC CAGTTCCAAC
CTCGGCACTT ACGGCCAGTT CGGCATCGGC GTCGCCGGTC AACTCGTCAA CACCGGCCTG
CTCGGCTACA TCCGCGCCGA CTATCGCGCC GGCGAGAACA TCGACGGCTA CAGCTTCAAC
GGCGGCGTGC GCTATCAATT CGCCCCCGAG GCGGTGGCCG CGGCTCCGCT CTACACCAAG
GCGGCGAAGG CGCCGATTCT GGTTCACTCG GCCTACAACT GGACCGGCTT CTTCGTCGGC
GGCAGCTTCG GCGTGCTGAA CGGCCGCACC GACTGGACGT TCCAGCCCGG CGGCACCACC
ACCGACCCGC GCTTCGCCGG CGCCATCGGC GGCGGCCAGA TCGGCTACGA CTATCAGTTC
GGCAAATGGG TGGTGGGCGT CGAGGGCGCG CTCAATGCGA CCAACGCCAA CGGCGCGCGG
CCGTGCCCGT CCAGCGTGTT CTTCACCTGC GAGACCAATG TCAGCTGGAT GGGAACCGCG
ACCGCCAAGC TCGGCTATGC GTTCTGGGAT CGCTCGCTGT GGTACGTCCG CGGCGGCGGT
GCCTTCGGCG ACCTGAAGGT CACCACGACC TGCAACACCG GACCGTTCAA CCCGCTCGGC
CTCGCGGGTT GCGGCGAGAA CGCCAGCCGC AGCCGCGCCG GCTGGACCAT CGGCATCGGC
TCGGAATTCG CGCTCAGCAA GAACTGGACG GTGCGGACCG AGACCAATTA TTTCGACATG
GGCCGCGAGC GCTACGTGCT GCCGACCTCG ACGATCGACG TCAAGCAGAA CGGCTTCATC
TCGACCGTCG GTGTGAACTA TCGCTTCGCG CCGACCACGC TGGTCGCGAA ATACTGA
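
The record lists a GC content of 66%. The sketch below shows one way such a figure could be recomputed from the block-formatted sequence, assuming the spaces and line breaks are display formatting only. The helper names are hypothetical, and only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only (six blocks of ten bases).
first_line = "ATGCAGAAGG CTCGGACACG AATTCTGGCT GGCTTCGCCT TCGCGATGGC GACCTCGGTT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the complete gene sequence to `clean_sequence` and `gc_percent` would give the whole-gene value to compare against the GC content field.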
 
Protein sequence
MQKARTRILA GFAFAMATSV STGAVAACTG TGSFVPGLVP GFNPSSVLPF AAGGAVNSLV 
SAINTANTAF LTQSTAFVSA PASPRPNQEG GGVWTRAIGG EVTTKSTSTT TNVAVGGVGL
PGTVTCDNEN KLSFAGVQVG ADTSVLNYNG WNLHLGSTVG YIGAKSRDKS SAGVLNPLGG
TFEDTLQVPF AGVYVAMTKG GFFADGQVRL DYYQNSLSDP IVGGIFGQKL DARGLSFTGN
VGYNHALQNN WFIEPSAGVV VSKVKVDPLN VTGSLVLPAS FTPGVTFPGQ LQVDDINSTL
GRFSLRGGTS IASGNMIWQP FAIASVYHEF RGAVTSSFNG AAASAATGLP SATGNISSSN
LGTYGQFGIG VAGQLVNTGL LGYIRADYRA GENIDGYSFN GGVRYQFAPE AVAAAPLYTK
AAKAPILVHS AYNWTGFFVG GSFGVLNGRT DWTFQPGGTT TDPRFAGAIG GGQIGYDYQF
GKWVVGVEGA LNATNANGAR PCPSSVFFTC ETNVSWMGTA TAKLGYAFWD RSLWYVRGGG
AFGDLKVTTT CNTGPFNPLG LAGCGENASR SRAGWTIGIG SEFALSKNWT VRTETNYFDM
GRERYVLPTS TIDVKQNGFI STVGVNYRFA PTTLVAKY
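
The protein sequence can be related back to the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons). `CODON_TABLE` and `translate` are hypothetical names for this illustration, not part of any library.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for codons with non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCAGAAGG CTCGGACACG AATTCTGGCT"))  # MQKARTRILA
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (AAA TAC TGA) translate to the terminal KY followed by a stop.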