Gene RPB_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4020 
Symbol 
ID3911827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4588237 
End bp4589847 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content67% 
IMG OID637885924 
Producthypothetical protein 
Protein accessionYP_487624 
Protein GI86751128 
COG category[I] Lipid transport and metabolism 
COG ID[COG2267] Lysophospholipase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.905898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.886685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG CCGCCAGGTC GACACTCGCG CGGCTGCTCG TCGCCCTCGT CGCGGTGATC 
GCGATCGCCA CCGCCTTGTG GCAATTGCAT CGCGCCAGCG GCGACCTGAT CGTCACCCAT
GCGCGCGTCG GCCAGACGCC GGTGTCGGTG TTCCGCGAGC CGACCACCAC GCGCGCGCCG
GTGGTGGTGA TCGCGCACGG CTTCGCCGGC TCGCAACAAC TCATGCAGCC GTTCGCCCAG
ACGCTGGCGC GCAACGGTTA TATCGCGGTG ACGTTCGATT TCACCGGCCA CGGCCGCAAC
CCCGTGACGA TGGTCGGCGA CGTCGACGAG CCGACAAAAA TCACCGGCGT GCTGGTCGAC
GACCTCGCCC GGGTCACCGA CTACGCCCGC GCGCTGCCGC AAAGCGACGG CCGCGCCGCG
GTGCTCGGGC ATTCGATGGC GTCCGACATC GTCGTCGCCT ATGCGGTGGC GCATCCGGAG
ATCACCGCCA CCGTCGCGGT GTCGGTGTTC ACCCGCAAAT CGACGCCGAC CCTGCCGCAC
AATCTGCTGG TGATCGTCGG CGACTGGGAA CCGCAGATGC TGAAGGACGA GGGCCTCCGC
ATCGTCGATC AAGTCGCCGG CGGCGGGGCG GTGGCCGGGC GGAGCTATGG CAGCTTCGCC
GACGGCACCG CGCGGCGCGT GGCGTTCTCG TCCGGAGTCG AACATATCGG CGTGCTGTAC
AGCCAGGACA GCATGCGCGA ATCGCTGCAA TGGATGAACG AATCGTTCGG CCGGCAAAGC
GCGGGCTGGA TCGATCGCCG CCCGGTCTGG CTGGCGCTGC TGTTCGGCGG ATTGATCGCG
ATGGCCTGGC CGCTGTCGAA GCTGCTGCCG CAAGCGGCGC CGCTGCCGAT GGGAGCCAGT
CTGGCGTGGA AACCGTTGCT GATCGCCGCA ATCGTCCCCG CCGTGCTGAC GCCGCTGATC
CTCTGGAAGG CGCCGACCGA CTTCCTGACC ATCCTGCTCG GCGACTATCT GACGTTGCAC
TTTCTGCTGT ACGGCGCTTT GACCGCCGCG ATCCTGCTGC TGATCCGCCG GCGCGGCCGC
AAGGCGCATT CTCAGGCCCA TGCTTCCACG CATCACGCGC CCGGACTGGA GCGACTGGAG
TCGCTGCCCG ACCCCGCGCA TCCGCGCGTC GCAATCACGG CGCTGGTGAT CGCGTCGGTG
GCGGCGATCG CCTACAACAT CATCGGCTTC GGCGTGCCGC TCGACACTTA CGCATTCTCG
TTCATGCCGA TCGAACCGCG GCTGCATCTG ATCGCCGCGG TCGCCTGCGG CACGGTTCCG
TATTTTCTCA CCGCGGAATG GATGGCGCAT GGCACGGGCG CCAGACGCGG TGCCTATGCG
CTGGCGAAAT TCTGCTTCCT CGCCTCGCTC GCCGCCGCCG TCGCGCTCAA TCTGCAGAAG
CTGTTCTTCC TGATCATCAT CGTGCCGGCG ATCCTGCTGT TGTTCATCGC CTTCGGCGTG
ATCAGCAACT GGACCTACAA GGCGACCAAC CACCCCCTCC CCGGCGCGCT CGCCAATGCG
ATCCTGTTCG CCTGGGCGAT CGCGGTGACG TTCCCGATGG TGATCCGCTG A
 
Protein sequence
MTIAARSTLA RLLVALVAVI AIATALWQLH RASGDLIVTH ARVGQTPVSV FREPTTTRAP 
VVVIAHGFAG SQQLMQPFAQ TLARNGYIAV TFDFTGHGRN PVTMVGDVDE PTKITGVLVD
DLARVTDYAR ALPQSDGRAA VLGHSMASDI VVAYAVAHPE ITATVAVSVF TRKSTPTLPH
NLLVIVGDWE PQMLKDEGLR IVDQVAGGGA VAGRSYGSFA DGTARRVAFS SGVEHIGVLY
SQDSMRESLQ WMNESFGRQS AGWIDRRPVW LALLFGGLIA MAWPLSKLLP QAAPLPMGAS
LAWKPLLIAA IVPAVLTPLI LWKAPTDFLT ILLGDYLTLH FLLYGALTAA ILLLIRRRGR
KAHSQAHAST HHAPGLERLE SLPDPAHPRV AITALVIASV AAIAYNIIGF GVPLDTYAFS
FMPIEPRLHL IAAVACGTVP YFLTAEWMAH GTGARRGAYA LAKFCFLASL AAAVALNLQK
LFFLIIIVPA ILLLFIAFGV ISNWTYKATN HPLPGALANA ILFAWAIAVT FPMVIR