Gene RPB_4538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4538 
Symbol 
ID3912355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5131243 
End bp5132865 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content68% 
IMG OID637886442 
Producthypothetical protein 
Protein accessionYP_488132 
Protein GI86751636 
COG category[S] Function unknown 
COG ID[COG5338] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGCG CGGCCGGGCA CGCCGAGGCG CAGAGCCTGA CCCGCGACCT GTTCCGGCCG 
GAGCGCGGCG CCTTCGTCGC GCCGCAGGAC CTGCCGCTGC AACGCACGCC GCGGCTGGCC
GCCGCGCCCG ATCCCTATGC GACCGACAAT GATCCGCGCA GAGACAGAAC CACGCCGCCG
CGGCTCGGAT CGCCGAATTT CGGGCTGCAG CCCAGCCTGG GCAGCGCCGG CACCGGCTAT
GACGCGCTCG GCCGCAAACG CCAGAAGCCG AAGATCTTTC CCGGCGCGCC GCAGCCCAAG
GCGGTCGGCC CCGGCTCAAA GCCGGTGATC GCCGCCCCGC CGCCGGCCCG TCCGCTGCCG
CCGTCGCAGA ATGCCGCCAA GCCGCCGGTT CCGGCCGCCT TCACCGGAAC GCTGCCCGGC
CAGCCGACCC GGCGCCGGCT CAAGCCCGAC CTCGATCCGT TCGGCTCGGT CGGCGATTAC
GCCGGCAGCT TTCTGTTCAA GGGCGCGATC GAACTGAACG GCGGCTACGA TACCAATCCG
GCCCGCATCA CCACGCCGCG CGCCTCCGGC TTCTACAAGA TCTCGCCCGA ATTGATGGTG
ACCTCGGACT GGGAGCGCCA CGCGCTGGTT GCCGATCTGC GCGGCTCCTT CACCGGCTAT
GGCCACGACT TCATCAATCC GGCCGGCGCC GTGTCGTCGG CGCCGCGCAA TCTCGACCGC
CCCGATTTCA ACGGCCACGT CGACGGCCGG ATCGACGTCA CCCGCGATAC CAGGCTGCTC
GGGCAGGGCC GGCTGATCGT CGGCACCGAC AATCCCGGCA GCCCGAACGT GGAGGTCGGG
CTGCAGAAAT ATCCGATCTA TACCCAGACC GGCGTGTCCG GCGGCATCGA CCAGAACTTC
AACCGGCTTC AGGTCACGGC GATCGGCAGC GCGGATCGTC GCGCCTTCCA GAAGTCGGTG
TTCACCGACG GCAGCACCGA CACCAACAAC GACCGCAACT ACAATCAATA TGCCGGCACC
GGCCGCGTCA GCTACGAGAT TCTGCCGGGG CTGAAACCGT TCGTCGAAGG CCAGGCCGAT
ACGCGCACGC ACGACACCGT CACCGATCGC TACGGCTATC GGCGCGACAG CAATGGCGGC
TACGTCAAGG GCGGCACCAG CTTCGAACTG ACGCGGCTAT TGACCGGCGA AGCTTCGATC
GGCTGGGCAT CGCGCACCTA CAGCGATCCG CGGCTGCTGA AGCTCGACGG CCTGCTCACC
AGCGCCTCGC TGATCTGGAC GATGACGCCC CTGACGACGG TCAAATTCAT CGCCGACACC
AGCATCGACG AATCGCCGCT GTCCGGCGTG TCCGGCGTGC TGACGCGGAC CTACACGGCC
GAGGTCGACC ACGATTTCCG CCGCTGGCTG ACCGCGATCG GCAAGTTCAC CTACGCGACC
TACGATTACC AGGGCTCGGG CCGCAGCGAC CGCTTCACCT CGCTCGAGGG CAATCTGGTC
TACAAGCTGA ATCGCTCGCT TTGGGTCAAA GGCACGCTGC GCCGCGACCA GCTCGATTCC
AATATCGTCG GCGGCAGCTA CAATGCCACC GTCGTGATGC TCGGCGTGCG GCTGCAGAAC
TGA
 
Protein sequence
MAGAAGHAEA QSLTRDLFRP ERGAFVAPQD LPLQRTPRLA AAPDPYATDN DPRRDRTTPP 
RLGSPNFGLQ PSLGSAGTGY DALGRKRQKP KIFPGAPQPK AVGPGSKPVI AAPPPARPLP
PSQNAAKPPV PAAFTGTLPG QPTRRRLKPD LDPFGSVGDY AGSFLFKGAI ELNGGYDTNP
ARITTPRASG FYKISPELMV TSDWERHALV ADLRGSFTGY GHDFINPAGA VSSAPRNLDR
PDFNGHVDGR IDVTRDTRLL GQGRLIVGTD NPGSPNVEVG LQKYPIYTQT GVSGGIDQNF
NRLQVTAIGS ADRRAFQKSV FTDGSTDTNN DRNYNQYAGT GRVSYEILPG LKPFVEGQAD
TRTHDTVTDR YGYRRDSNGG YVKGGTSFEL TRLLTGEASI GWASRTYSDP RLLKLDGLLT
SASLIWTMTP LTTVKFIADT SIDESPLSGV SGVLTRTYTA EVDHDFRRWL TAIGKFTYAT
YDYQGSGRSD RFTSLEGNLV YKLNRSLWVK GTLRRDQLDS NIVGGSYNAT VVMLGVRLQN