Gene RPB_0687 details


Gene Information

Locus tag: RPB_0687
Symbol:
ID: 3908193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 771461
End bp: 772702
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 65%
IMG OID: 637882579
Product: HipA-like protein
Protein accession: YP_484309
Protein GI: 86747813
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
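
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 771461
end_bp = 772702

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1242
print(protein_length_aa)   # 413
```

Both results match the reported Gene Length (1242 bp) and Protein Length (413 aa).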


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCAGG CTGTCCTCGC GGCCGAGCGA ATCCAGTCAC TCGATATCTC ATTGAACGAC 
CTTCCCGTCG GCACCCTCGT CCGGACGCCG GGTGACTACA ACGCGTTCAA CCTCTTGCCC
GCTTACCGGG CCATGAACAA TCCGCCGGTC TTCAGCCTGT CGCTTCGCTC AGCGGATGGC
GGCCTCCGGC GAGATCCCAA GCCCATACGC AGAGCACTGC CTCCGTTCTT TGCGAACCTG
CTGCCCGAAG AGAAACTGCG CGAAGCGATG GAAAAGCACC ACGCCGCGTC CGTCAGGCCG
GGCAACGACT TCGATCTTCT GGCCGCGCTG GGCGGCGATC TGCCGGGAGC GGTCCGCGCC
CTACCGAGTG ACGGCAGCCC CGTCGTGGCC GGCCCGGAAG CCCAAGGCCG CAACACGCGG
TTCTCTCTCG CCGGTGTGCA AATGAAACTG TCGGTGATGA AGAACACCGG CAAACAAGGC
GGCATCACGC TCGCGGTCGG CGACGGGCAA GGCCAGTACA TCGCCAAATT TCCCTCGCTC
ACGCATATCG GACTCTCGGA GAACGAGTTC GCCCTGATGG CCCTGGCGGA AGCGCTGGGC
ATGGAGGTGC CCGCGCGCGA GCTCGTCGAC AAGACCGAGT TCACCGGCAT CCCCGACGAG
TTCACCACCC AGTCCACCGG CAAGGTGCTG CTCGTCCGCC GCTTCGACCG CGGCGATGGC
GACACGCGCG TGCATATCGA AGACTTCGCG CAGGTGTTCG GCCGCTACCC GTCCGAGAAG
TACAATGGCG GCGCGTATCA CAATATCGGC GCGGCGCTCA CCAGCGGGGT CTCGTTCGAT
TCGGCCATCG AGTTTGCTCG GCGCCTCGCG CTCGCCGCGA TCACCGGCAA TGGCGACATG
CATCTGAAGA ACTGGTCGCT GCTCTATCCC GGCGACGGCC GGACGCCGAT GCTGGCGCCG
GTCTACGACA TGCTCTCGAC GATTCCTTAC CTCCCTAAGG ATGGGCTCGC CCTCAGCCTC
GCCGGCGAAA AGTCGCTCCA GGCGCTCACG CCGGAGCGCT GGCGCAACTT CGCAAACCGA
AGCCGTCTTC CGGAGGGCGC CGTGTTAACC GCTGTCGCCG AGACTGCGGC CGCCGTGCGT
GACAAGTGGC TCGTTCTTCC GGAACGCGAC GTTGTGCCTG CGCAGGTCCG CGCGCGGATC
GATGCCCACA TCGATGAAAT GGTGCCGCTG CTCGACCCCT GA
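
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything from it. The following is a minimal sketch of how a GC percentage and a basic reading-frame check could be computed from such text. The function names are hypothetical, and the start codon set is a common subset assumed here for illustration rather than the full list of initiators allowed by translation table 11.

```python
# Assumption: a common subset of bacterial start codons, not the full table 11 list.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons used by translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Strip the block spacing and line breaks, and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def has_start_and_stop(raw: str) -> bool:
    """Check for a whole number of codons, a start codon, and a stop codon."""
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same spaced layout (not the gene above).
toy = "ATGGCC TGA"
print(round(gc_percent(toy), 2))   # 55.56
print(has_start_and_stop(toy))     # True
```

Passing the full gene sequence text to `gc_percent` would give a value to compare against the reported GC content of 65%.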
 
Protein sequence
MTQAVLAAER IQSLDISLND LPVGTLVRTP GDYNAFNLLP AYRAMNNPPV FSLSLRSADG 
GLRRDPKPIR RALPPFFANL LPEEKLREAM EKHHAASVRP GNDFDLLAAL GGDLPGAVRA
LPSDGSPVVA GPEAQGRNTR FSLAGVQMKL SVMKNTGKQG GITLAVGDGQ GQYIAKFPSL
THIGLSENEF ALMALAEALG MEVPARELVD KTEFTGIPDE FTTQSTGKVL LVRRFDRGDG
DTRVHIEDFA QVFGRYPSEK YNGGAYHNIG AALTSGVSFD SAIEFARRLA LAAITGNGDM
HLKNWSLLYP GDGRTPMLAP VYDMLSTIPY LPKDGLALSL AGEKSLQALT PERWRNFANR
SRLPEGAVLT AVAETAAAVR DKWLVLPERD VVPAQVRARI DAHIDEMVPL LDP