Gene RPB_4246 details

Gene Information

Locus tag: RPB_4246
Symbol:
ID: 3912059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4829175
End bp: 4830803
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 66%
IMG OID: 637886151
Product: NHL repeat-containing protein
Protein accession: YP_487845
Protein GI: 86751349
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID:
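
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4829175
END_BP = 4830803


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1629
print(protein_length_aa(length))  # 542
```

Both results agree with the Gene Length (1629 bp) and Protein Length (542 aa) fields.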


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.359028
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.663745
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGACCA AACTTGCGCT CGGCGCGAGC GCGATGGCGC TCGGACTGCT GGCCGTTTCC 
GCCGCGCGCG CGGATGATTA CCACGTCACG AAATTGGTGC AGGGCTCGGC GTTTCACGGC
GTGCATGGCC TCGGCATCGA CAAGGCCGGC CGGCTGTTCG CCGGCTCGGT CGCGGGCGCA
GCGCTCTACG AGGTCGATCG CGACAAGGGC ACGGCGAAGA TCGCGGTGCC GACGCCGGAA
GGCATGGCCG ACGACATCGC GTTCGCGCCC GACGGCACCA TGGCGTGGAC CGCGTTCCTC
ACCGGCGACC TCTATGCGCG CAAGGGCGAC GGCCCGATCA AAAAGCTCGC CTCGGGTCTG
CCCGGCATCA ACTCGCTCGC CTTCCGCAAG GACGGCCGGC TGTACGCCAC GCAGGTGTTT
CTCGGCGACG CGCTGTACGA GATCGACGTC GCGGGCGAAA AGCCGCCGCG CAAGATCATG
GAGAAGATCG GCGGGCTGAA CGGCTTCGAA TTCGGTCCCG ACGACAAGCT CTACGGCCCA
TTGTGGTTCA AGGGCCAGGT CGCCGCGGTC GATGTCGACA AGGGCGAGCT CAACGTCGTC
GCCGACGGTT TCAAGATTCC GGCGGCGGCG AATTTCGACA GCAAGGGCAA TCTCTACGTG
CTCGACACCG CGCTCGGCCA GCTCGTCCGG GTCGATATCA AGACCGGCAA GAAACAGCTC
GCGGCGCAGC TGAAGCCGTC GCTCGACAAT CTGGCGATCG ACGCGCAGGA CCGCATCTTC
GTCTCCAACA TGGCCGACAA CGGCATCCAG GAGGTCGATC CGGCGACCGG CACCGCCAGG
CAGGTGATCA TCGGCAAGCT GGCATTTCCG GGCGGCATCG GCGTGGTCTC CGACGGCGGC
AAGGACACGA TCTATGTGGC GGATGTTTTC GCCTATCGCA GCGTCGATGG CGCCACCGGC
GAGGTCCGCG AACTGGCGCG GATGCATGCC GACGGCGTCA CGCTGGAATA TCCGATGAGC
GCCACCGCCA AGGGCAACGA GGTGATGCTG TCGAGCTGGT TCACCGGCAC CGTGCAGGTG
ATAGACCGCG CGAGCGGCAA GACCATCGAG ATGCTGCACG ACTTCAAGGC GCCGTATGAT
GCGATCCGGC TGGCGAACGG CAAGCTGGTG GTCGCCGAAC TCGGGACCAA ATCGCTGGTC
GAAGTCGGCG GCGAGCACGG CAAGGACCGA AAAGCCATCG CCACCGACCT TAACGGCCCG
GTCGGCCTCG CCGCGGCGCC CGACGGCGCG GTCTACGTCA CCGAGGCGTT CGCCGGACAA
GTGACCAAGA TCGATCCGGC CACCGGCGCC AAGACCGTGG TGGCGAAGGA TCTGAAGATG
CCGGAGGGGT TGGCGCTGGC GCCGTCCGGC AAGCTCGTCG TGGCAGAAGT CGGCGCCAAA
CGCGTGGTTG AGATCGATCC TGCGACCGGC AAGGTCACCG AACTCGCCGG CAACCTCCCG
ATCGGCCTCG TCCCCGCCCC CGGCCTGCCG CCGACCAACA TGCCGACCGG AATCGGCGTC
GGCGCCAGCG GCACGATCTA CGTGTCGTCG GACATCGAGA ACGCGATCTA CAAGATCGCG
AAGAAGTAG
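
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is the fraction of G and C bases over all bases and that the spaces and line breaks in the listing are formatting only; `gc_content` is a hypothetical helper, and how the record rounds its percentage is not stated.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases, ignoring whitespace used for layout."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# Illustrated on the first line of the gene sequence only; paste the
# full listing to compare against the GC content field of the record.
first_line = "ATGAGGACCA AACTTGCGCT CGGCGCGAGC GCGATGGCGC TCGGACTGCT GGCCGTTTCC"
print(f"{gc_content(first_line):.0%}")
```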
 
Protein sequence
MRTKLALGAS AMALGLLAVS AARADDYHVT KLVQGSAFHG VHGLGIDKAG RLFAGSVAGA 
ALYEVDRDKG TAKIAVPTPE GMADDIAFAP DGTMAWTAFL TGDLYARKGD GPIKKLASGL
PGINSLAFRK DGRLYATQVF LGDALYEIDV AGEKPPRKIM EKIGGLNGFE FGPDDKLYGP
LWFKGQVAAV DVDKGELNVV ADGFKIPAAA NFDSKGNLYV LDTALGQLVR VDIKTGKKQL
AAQLKPSLDN LAIDAQDRIF VSNMADNGIQ EVDPATGTAR QVIIGKLAFP GGIGVVSDGG
KDTIYVADVF AYRSVDGATG EVRELARMHA DGVTLEYPMS ATAKGNEVML SSWFTGTVQV
IDRASGKTIE MLHDFKAPYD AIRLANGKLV VAELGTKSLV EVGGEHGKDR KAIATDLNGP
VGLAAAPDGA VYVTEAFAGQ VTKIDPATGA KTVVAKDLKM PEGLALAPSG KLVVAEVGAK
RVVEIDPATG KVTELAGNLP IGLVPAPGLP PTNMPTGIGV GASGTIYVSS DIENAIYKIA
KK
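
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate complete codons until the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout whitespace
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First six codons of the gene sequence listed above.
print(translate("ATGAGGACCA AACTTGCG"))  # MRTKLA
```

The output matches the first six residues of the protein sequence, and the final codons of the gene (AAG AAG TAG) translate to the closing KK followed by a stop.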