Gene RPB_0544 details

Gene Information

Locus tag: RPB_0544
Symbol:
ID: 3909583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 609374
End bp: 610528
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 68%
IMG OID: 637882432
Product: hypothetical protein
Protein accession: YP_484166
Protein GI: 86747670
COG category: [S] Function unknown
COG ID: [COG4246] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
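
The length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive and that the CDS ends in a stop codon that contributes no residue. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon adds no residue


length = gene_length_bp(609374, 610528)
print(length, protein_length_aa(length))  # 1155 384
```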


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.200606
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACGAC CGACCGCGCA CTCGGCTGAG CTGAAAGTGC ACAGCGCGAG CAAGGGAGCC 
GGCGCGCTCC CTCTCCCCTC TCCGCAAGCG GGAGGGGAGA CGACGCGGCG GCGCTTTCTT
GCCGGGAGTT TGGCGATCGG CGCTCTGGCC TGCTTTCCGG CGCTCGGTCC GCGGCCCGCG
CTGGCGCAGG CCACCAAGGC CGAGCTCGAC GCCTATTCGA TCCCCGCCGC GGAGCGCATC
GCGGTGCGGG CGCGTCCGAT CGACCAGTTC GACCTGCGCG ACCGCGGCGG CCGACGCTTC
GGCGCGCTGC AGTTTCGCAG CGGGCTGATT CTGACCTCGC CGTTTCGCGG CTTCGGCGGG
TTGTCGGCGC TGCGGCTCGA TCCGAAAGGC GAGCGCTTCG TGGCGATCAG CGATCGCGGC
GTCTGGTTCA CCGGCCGCAT CGTCTATGAC GGCGCCGCCA TGGCGGGCGT CGCCGACGTC
GAGGCGGCGC CGCTGCTCGG GCCCGACCGC CAGCCGCTGA CGAAGAGCAA ATGGTACGAC
AGCGAAGCGC TGGCGTTCGA CGGCGGCACC GCCTATGTCG GCTATGAGCG CGTCAATCAG
ATCGTCAAAT TCGATTTCGG CCGCGACGGC GTTCGCGCTT CGGGGCAGCC GATCGCCGTG
CCGCCGGGCT TGCGCAAGCT GCCGAACAAC AAGGGCATCG AGTCGCTGGT CGTGGTGCCG
AAGGGGCTGC CGCTCGCCGG CACGCTGATC GCGATCTCCG AGCGCGGCCT CGACGCCGGC
GGCAATGTCG TCGGCTTCCT GATTGGCGGC AAGACGCCGG GTCCATTCGC CGTTCGCCGC
TCCGATAATT TCGACGTCAG CGACGCGGTG CTGCTGCCGT CGGGCCAACT CCTGATTCTC
GAGCGCAAAT TTTCGTGGAT CGAGGGCGTG CATATCCGGA TCCGGCGGAT CGCATTGGCG
ACGCTGGTGC CCGGTGCGAC CGTGGATGGC CCGGTGTTGT TCAACGCCGA TCTCGGCCAC
GAGATCGACA ACATGGAAGG CCTCGACGCC CATCAGGACG CCGCCGGCGA CACCGTGCTG
ACGATGGTGT CGGACGACAA TTTCTCGATG CTGCAGCGGA CGCTGCTGCT GCAGTTCACC
CTCGTGGACG ACTGA
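
The GC content field could be recomputed from the gene sequence block above. The following is a minimal sketch, assuming the sequence is pasted in as text with the 10-base groups separated by spaces and line breaks. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only; paste the full block for the whole gene.
first_line = "ATGACACGAC CGACCGCGCA CTCGGCTGAG CTGAAAGTGC ACAGCGCGAG CAAGGGAGCC"
print(round(gc_percent(first_line), 1))
```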
 
Protein sequence
MTRPTAHSAE LKVHSASKGA GALPLPSPQA GGETTRRRFL AGSLAIGALA CFPALGPRPA 
LAQATKAELD AYSIPAAERI AVRARPIDQF DLRDRGGRRF GALQFRSGLI LTSPFRGFGG
LSALRLDPKG ERFVAISDRG VWFTGRIVYD GAAMAGVADV EAAPLLGPDR QPLTKSKWYD
SEALAFDGGT AYVGYERVNQ IVKFDFGRDG VRASGQPIAV PPGLRKLPNN KGIESLVVVP
KGLPLAGTLI AISERGLDAG GNVVGFLIGG KTPGPFAVRR SDNFDVSDAV LLPSGQLLIL
ERKFSWIEGV HIRIRRIALA TLVPGATVDG PVLFNADLGH EIDNMEGLDA HQDAAGDTVL
TMVSDDNFSM LQRTLLLQFT LVDD
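
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon-to-residue mapping in the usual TCAG order. Translation table 11 uses the same residue assignments as the standard table and differs in which start codons it permits. Since this gene begins with ATG, no special start-codon handling is included, which is a simplifying assumption. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        residues.append(residue)
    return "".join(residues)


print(translate("ATGACACGAC CGACCGCGCA CTCGGCTGAG"))  # MTRPTAHSAE
print(translate("CTCGTGGACG ACTGA"))                  # LVDD
```

The two calls use the first 30 bases and the last 15 bases of the gene sequence, and the results match the first ten and last four residues of the protein sequence.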