Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4020 |
Symbol | |
ID | 3911827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4588237 |
End bp | 4589847 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885924 |
Product | hypothetical protein |
Protein accession | YP_487624 |
Protein GI | 86751128 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2267] Lysophospholipase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.905898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.886685 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCG CCGCCAGGTC GACACTCGCG CGGCTGCTCG TCGCCCTCGT CGCGGTGATC GCGATCGCCA CCGCCTTGTG GCAATTGCAT CGCGCCAGCG GCGACCTGAT CGTCACCCAT GCGCGCGTCG GCCAGACGCC GGTGTCGGTG TTCCGCGAGC CGACCACCAC GCGCGCGCCG GTGGTGGTGA TCGCGCACGG CTTCGCCGGC TCGCAACAAC TCATGCAGCC GTTCGCCCAG ACGCTGGCGC GCAACGGTTA TATCGCGGTG ACGTTCGATT TCACCGGCCA CGGCCGCAAC CCCGTGACGA TGGTCGGCGA CGTCGACGAG CCGACAAAAA TCACCGGCGT GCTGGTCGAC GACCTCGCCC GGGTCACCGA CTACGCCCGC GCGCTGCCGC AAAGCGACGG CCGCGCCGCG GTGCTCGGGC ATTCGATGGC GTCCGACATC GTCGTCGCCT ATGCGGTGGC GCATCCGGAG ATCACCGCCA CCGTCGCGGT GTCGGTGTTC ACCCGCAAAT CGACGCCGAC CCTGCCGCAC AATCTGCTGG TGATCGTCGG CGACTGGGAA CCGCAGATGC TGAAGGACGA GGGCCTCCGC ATCGTCGATC AAGTCGCCGG CGGCGGGGCG GTGGCCGGGC GGAGCTATGG CAGCTTCGCC GACGGCACCG CGCGGCGCGT GGCGTTCTCG TCCGGAGTCG AACATATCGG CGTGCTGTAC AGCCAGGACA GCATGCGCGA ATCGCTGCAA TGGATGAACG AATCGTTCGG CCGGCAAAGC GCGGGCTGGA TCGATCGCCG CCCGGTCTGG CTGGCGCTGC TGTTCGGCGG ATTGATCGCG ATGGCCTGGC CGCTGTCGAA GCTGCTGCCG CAAGCGGCGC CGCTGCCGAT GGGAGCCAGT CTGGCGTGGA AACCGTTGCT GATCGCCGCA ATCGTCCCCG CCGTGCTGAC GCCGCTGATC CTCTGGAAGG CGCCGACCGA CTTCCTGACC ATCCTGCTCG GCGACTATCT GACGTTGCAC TTTCTGCTGT ACGGCGCTTT GACCGCCGCG ATCCTGCTGC TGATCCGCCG GCGCGGCCGC AAGGCGCATT CTCAGGCCCA TGCTTCCACG CATCACGCGC CCGGACTGGA GCGACTGGAG TCGCTGCCCG ACCCCGCGCA TCCGCGCGTC GCAATCACGG CGCTGGTGAT CGCGTCGGTG GCGGCGATCG CCTACAACAT CATCGGCTTC GGCGTGCCGC TCGACACTTA CGCATTCTCG TTCATGCCGA TCGAACCGCG GCTGCATCTG ATCGCCGCGG TCGCCTGCGG CACGGTTCCG TATTTTCTCA CCGCGGAATG GATGGCGCAT GGCACGGGCG CCAGACGCGG TGCCTATGCG CTGGCGAAAT TCTGCTTCCT CGCCTCGCTC GCCGCCGCCG TCGCGCTCAA TCTGCAGAAG CTGTTCTTCC TGATCATCAT CGTGCCGGCG ATCCTGCTGT TGTTCATCGC CTTCGGCGTG ATCAGCAACT GGACCTACAA GGCGACCAAC CACCCCCTCC CCGGCGCGCT CGCCAATGCG ATCCTGTTCG CCTGGGCGAT CGCGGTGACG TTCCCGATGG TGATCCGCTG A
|
Protein sequence | MTIAARSTLA RLLVALVAVI AIATALWQLH RASGDLIVTH ARVGQTPVSV FREPTTTRAP VVVIAHGFAG SQQLMQPFAQ TLARNGYIAV TFDFTGHGRN PVTMVGDVDE PTKITGVLVD DLARVTDYAR ALPQSDGRAA VLGHSMASDI VVAYAVAHPE ITATVAVSVF TRKSTPTLPH NLLVIVGDWE PQMLKDEGLR IVDQVAGGGA VAGRSYGSFA DGTARRVAFS SGVEHIGVLY SQDSMRESLQ WMNESFGRQS AGWIDRRPVW LALLFGGLIA MAWPLSKLLP QAAPLPMGAS LAWKPLLIAA IVPAVLTPLI LWKAPTDFLT ILLGDYLTLH FLLYGALTAA ILLLIRRRGR KAHSQAHAST HHAPGLERLE SLPDPAHPRV AITALVIASV AAIAYNIIGF GVPLDTYAFS FMPIEPRLHL IAAVACGTVP YFLTAEWMAH GTGARRGAYA LAKFCFLASL AAAVALNLQK LFFLIIIVPA ILLLFIAFGV ISNWTYKATN HPLPGALANA ILFAWAIAVT FPMVIR
|
| |