Gene RPB_1166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1166 
Symbol 
ID3910101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1336848 
End bp1337879 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content73% 
IMG OID637883060 
Productextensin-like protein 
Protein accessionYP_484787 
Protein GI86748291 
COG category[S] Function unknown 
COG ID[COG3921] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.27994 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTCCGCG GCAAAGCGTC GACGCGCGCG GTGCTGATCG CGCTGGTTGT GCTCGGCGCG 
TCGTCCGCGT GGGCGCAGGG TGAGATACCG TTGCCGAAGC CGCGCCCGGC CGAGGCGCCG
CAACTGCAGG GCGAACGCGC GGCGGACCGG CCCGAGGCCG ATGCGCCGCA GGCGGAAGCG
GCGCCGCCGC CGAAGCCGGA GCCGAAGCCG GAGCCGAAGC CGCCCTCGGC ATGCCGGCTG
GCGCTGACCG ACGCGATCGC GATCGCGCCG AGCCTTGCCG ACATTGCCGG CCCCGGCAGT
TGTGGCGGAA CCGATCTGGT GAAGCTCGAA GCGGTGGTGT TGCCGGACGG CAGCCGCGTG
CCGCTGACGC CGGCGGCGAC GTTGCGCTGC CCGATGGCCA GCGCGCTCGT CGACTGGGTT
CGCAGCGACC TCGCGCCGCT CGCCGCGTCG TTGGCCACGC GTCTCGCCGC GCTCGACAAT
TACGACTCCT ACGATTGCCG CGGTCGCAAC CGGGTGCGCG GGGCCAAGCT GTCGGAGCAC
GGCCGCGCCA ATGCGATCGA TCTGCGCGGC TTCAAGCTGG CCGATGGCCG CATGCTGTCG
CTGACCGACC GCGCCGCGCC GCGCGCGGTG CGCGAGAGCG TGCGGCAGTC GGTGTGCGAC
CGCTTCGCCA CCGTGCTCGG CCCGGGCTCG GACGGCTATC ACGAGGAGCA CGTCCACCTC
GATCTCGCCG AGCGTCGCGG CGGCTACAAG ATGTGTCAAT GGGAGGTGTG GGAGCCGCTG
CCCGTCATCG CGCCGCTGCT GCCGGCGGAG CGCCCGGCCG AAGCGCCGCC GCGCGAGGTG
GCGGCCGGCG AGCCACAGGA CCGCGACCCC AACCGCTCGC CGCAGCAGGC TCAGCCTGAG
AACGCGCCGC CGCAGCAGGT TGAGCCGGAG CAGGGCGAGC AAGCCGCCAA GCCAGAGCCG
CCGCCGGCGA GCAGCAAGGC GAAGAGAAAG CCGAAGAAGT CGCGGGGTGA GCGGCAGTCG
GAACTGCGGT AG
 
Protein sequence
MFRGKASTRA VLIALVVLGA SSAWAQGEIP LPKPRPAEAP QLQGERAADR PEADAPQAEA 
APPPKPEPKP EPKPPSACRL ALTDAIAIAP SLADIAGPGS CGGTDLVKLE AVVLPDGSRV
PLTPAATLRC PMASALVDWV RSDLAPLAAS LATRLAALDN YDSYDCRGRN RVRGAKLSEH
GRANAIDLRG FKLADGRMLS LTDRAAPRAV RESVRQSVCD RFATVLGPGS DGYHEEHVHL
DLAERRGGYK MCQWEVWEPL PVIAPLLPAE RPAEAPPREV AAGEPQDRDP NRSPQQAQPE
NAPPQQVEPE QGEQAAKPEP PPASSKAKRK PKKSRGERQS ELR