Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3333 |
Symbol | |
ID | 3911135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3813540 |
End bp | 3814565 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885236 |
Product | extracellular solute-binding protein |
Protein accession | YP_486940 |
Protein GI | 86750444 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.686225 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.192968 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACGA TATTCGCCAA GATCGCCGCG GTGCTGTCGG CGCTGCTGCT GACCACCACC TTCGCGACCG CGCAGAGCAA GGTGACGATC GCGATCGGCG GCGGGGCGTG TCTGTGCTAC CTGCCGACGG TGCTGGCCAA GCAACTCGGC GAATACGACA AGGCGGGGCT CAGCGTCGAA CTGGTCGATC TCAAGGGCGG TTCCGATGCG CTGAAGGCGG TGCTCGGCGG CAGTGCCGAC GTCGTCTCCG GCTATTTCGA CCACACCGTC AATCTCGCCG CCAAGAAGCA GGAGATGCAG TCCTTCGTGG TCTACGACCG CTATCCCGGG CTGGTCCTGG TGGTGTCGCC GGGGCATACC GCGAAGATCG CATCGGTCAA GGACCTCGCC GGCAAGAAGG TCGGCGTCAG CGCGCCGGGC TCGTCGACCG ATTTCTTCCT GAAATATCTC CTGAAGAAGA ACGGCGTCGA TCCGAACGAC GTGGCGGTGA TCGGCGTCGG CCTCGGCGCC ACCGCGGTGG CGGCGATGCA GCAGGGCCAG ATCGAGGCCG CGGTGATGCT CGATCCGGCG GTGACGATCC TGCAGGCGGC GCACGCCGAT CTGCGCATCC TCAGCGACAC GCGCACCGAA CACGACACCC GCGAAGTGTT CGGCGGCGAC TATCCGGGCG GCGCGCTGTA TTCGACGGTG GCCTGGATCA AGGCGCATCC GAAGGAGGCG CAGGGCCTGA CCAACGCCAT CCTGAACACG CTGAGCTGGA TCCACGCGCA TTCGGCCGAG GAGATCGCCG ACAAGATGCC GCCGAACATC GTCGGCAAGG ACAAGGCGCA ATATGTCGCC GCGCTGAAGA ATACGATCCC GATGTATTCG ACCACCGGGC TGATGGACCC GAAGGGCGCG GAGGCGGTTC TGGCGGTGTT CAGCACCAGC TCGCCGGATG TGGCGAAAGC CAATATCGAC GTCACCAGGA CCTACACCAA CGCCTTCGTC GAGCAGGCGG CGAAGACGTC GGGCGCGGCG AAGTAG
|
Protein sequence | MKTIFAKIAA VLSALLLTTT FATAQSKVTI AIGGGACLCY LPTVLAKQLG EYDKAGLSVE LVDLKGGSDA LKAVLGGSAD VVSGYFDHTV NLAAKKQEMQ SFVVYDRYPG LVLVVSPGHT AKIASVKDLA GKKVGVSAPG SSTDFFLKYL LKKNGVDPND VAVIGVGLGA TAVAAMQQGQ IEAAVMLDPA VTILQAAHAD LRILSDTRTE HDTREVFGGD YPGGALYSTV AWIKAHPKEA QGLTNAILNT LSWIHAHSAE EIADKMPPNI VGKDKAQYVA ALKNTIPMYS TTGLMDPKGA EAVLAVFSTS SPDVAKANID VTRTYTNAFV EQAAKTSGAA K
|
| |