Gene RPB_3551 details

Gene Information

Locus tag: RPB_3551
Symbol:
ID: 3911353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4064406
End bp: 4065521
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 65%
IMG OID: 637885453
Product: hypothetical protein
Protein accession: YP_487157
Protein GI: 86750661
COG category: [S] Function unknown
COG ID: [COG5330] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
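
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 4064406
END_BP = 4065521
GENE_LENGTH_BP = 1116
PROTEIN_LENGTH_AA = 371


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1116
print(expected_protein_length(GENE_LENGTH_BP))  # 371
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.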


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.335217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCCGA AATCGGCCCT TTCCGCCGCC ACCCTCCTCG ACGAGCTGCA ATCCACGCTG 
GCGCATGGCA CAGTGGCGCG ACGCGTCGAG ACCTTGCGCC GCGTCACCGA CCTGTATCTC
GACAGCGGCG TGGACTACAG CGACGATCAG ATCGCGGTGT TCGACGACGT CTTCAACTGT
CTGGTGGATA GTATCGAGAC CCATGCCAAG GTGCTGCTGG CCGAGCGCCT GGCGCCCGTC
AATGCCCCGC CGCGGATCAT TCATCATCTC GCCTTCGAGG ACCTGATCGA GATCGCGGCG
CCGGTGCTGT CGCAGTCCGA TCAGCTCGAC GACGCCATGC TGATCGCCAA TGCCCGAAGC
AAAGGGCAGA GCCACATGAT GGCGATCTCG ACCCGCAGAT CGCTCAGCGG CGCGGTGACC
GACGTCCTGG TCGAGCTCGG CAATCAGCAG GTGGTGCAGA GCACCGTCAA GAATCCCGGC
GCGGAATTCT CGGACAATGG CTATTCGGTA CTGGTCAAGC GCGCCGAACT GGATGACGAC
CTCGCCACCG AGCTGGGCCG GCGGGCGATC CCTCGCGCGC AATATCTCAA GCTGATCGCG
ATCGCATCGG CCTCGGTGCG GGCGAAACTC AAGGCCGCGA ACCCGAACGC GGCCTCCGAG
GTCGCGACCG CGGTGAAGCA GGCGTCGCGT CTGGCGCGCT CGGCGCCGGC GGCGATCAGC
CGCCAGACCA GCATCGCCCA TGGTCTGGTC CGGTCGCTGT ACGAAGACGG CCGCATCACC
GAAGAGCAGG TCAACACCTT CGCAAACGAA CGCAAATTCG ACGAGATCAA TCAGGCGCTC
GCATGCCTCG CCGGCACCTC GGTCGAGACC GCCGAGGCGA TGATGATCGA ATCCCGCGAC
GAGGGTCTGC TGATCCTCGC CAAGGTCTGC AAATTGTCGT GGCCGACGGT CAAGGCGATC
ATCAGGATGC GCGACGAGGC GACCGGCACG ATGTCCACCG ATCTCGACGA ATGTCGCTTC
ACCTACGAGC GGTTGCGCAT CGCGACCGCG CAGCAGGTGC TCCGCTTTCA CCGCATGCAG
CAATCCAGCG CCGCAACCAA GGCGCCGGCC GCCTGA
 
Protein sequence
MSPKSALSAA TLLDELQSTL AHGTVARRVE TLRRVTDLYL DSGVDYSDDQ IAVFDDVFNC 
LVDSIETHAK VLLAERLAPV NAPPRIIHHL AFEDLIEIAA PVLSQSDQLD DAMLIANARS
KGQSHMMAIS TRRSLSGAVT DVLVELGNQQ VVQSTVKNPG AEFSDNGYSV LVKRAELDDD
LATELGRRAI PRAQYLKLIA IASASVRAKL KAANPNAASE VATAVKQASR LARSAPAAIS
RQTSIAHGLV RSLYEDGRIT EEQVNTFANE RKFDEINQAL ACLAGTSVET AEAMMIESRD
EGLLILAKVC KLSWPTVKAI IRMRDEATGT MSTDLDECRF TYERLRIATA QQVLRFHRMQ
QSSAATKAPA A
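
The two sequence listings are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below is one possible way to do that, compute GC content, and translate a coding sequence; it is an illustration under stated assumptions rather than the method used to produce this record. It assumes the standard amino acid assignments, which translation table 11 shares with table 1, and it does not handle alternative start codons (not needed here, since the gene starts with ATG). The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
import itertools

# Codon table in the conventional TCAG ordering (standard assignments,
# which translation table 11 also uses for elongation codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listing above.
first_line = (
    "ATGAGCCCGA AATCGGCCCT TTCCGCCGCC ACCCTCCTCG ACGAGCTGCA ATCCACGCTG"
)
print(translate(first_line))  # MSPKSALSAATLLDELQSTL
```

The translated fragment matches the first twenty residues of the protein sequence listing. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.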