Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4417 |
Symbol | |
ID | 3912232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5004058 |
End bp | 5005569 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886322 |
Product | hypothetical protein |
Protein accession | YP_488014 |
Protein GI | 86751518 |
COG category | [S] Function unknown |
COG ID | [COG3333] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGACG CCCTGGAAAA ACTCGCCTAC GGTATCTCGC TGTCGCTCGA GCCGTCGAAC CTGCTGTACG CGGCGATCGG CTCGGTGCTC GGCACGCTGG TCGGGGTGCT GCCGGGGCTC GGCCCGGTGA CGACGATCGC GGTGCTGCTG CCGCTCACCT ATCACGCCGG CTCGCCGCTC GGCGCCATCA TCATGCTGGC CTCGATCTAT TACGGCGCGA TGTATGGCGG CTCGACCACC TCGATCCTGC TCAAGGTCCC GGGCGAGGCC GCCTCGGTGA TCACCTGCAT CGACGGCTAT CAGATGGCCA AGAAGGGCCG CGCCGGCCCG GCACTGGCGA TCGCCGCGAT CGGGTCGTTC ATCGCCGGCA CCGTCGCAGT GTGCGCGCTG GCGCTGGTCG GCCCGCTGTT CGCCAAATTC GCCGTCACCT TCGGGCCACC CGAGTACTTC GCGCTGGCCC TGTTCGGGCT GTCGTTGAGC GCCACGCTGT CCGGCGGCTC GCCGGTCCGC GGCATCACCA TGGTGCTGGT CGGGCTGCTG CTCGGTCTGG TCGGCATCGA CACCATCACC GGCGTCGAGC GCTACACTTT CAACATCATG GCCATCACCG ACGGCATCGA TCTGGTGCCG ATGCTGATGG GCCTGTTCGG CGTCGCCGAG ATCCTGCACA ATCTCGAGGA GAAATCGCGC GGCTCGCTGC TGTCGACCAA GATCGGCCGG CTGTTTCCGA GCCGCCAGGA TTGGCGCGAA TCCAGTGGCC CGATCGCACG CGGCTCGGTG ATCGGGTTCT TCGTCGGGCT GATTCCCGGC GGCGGCGCCA TTCTCGCCTC GCTGATGAGC TACACGCTGG AGAAGAAGCT GTCGAAGACG CCGGAGCAGT TCGGCCACGG CGCCATCGCC GGCGTCGCCG GGCCGGAATC GGCCAACAAT TCGGCGGCGA CGGCCTCGTT CATTCCGCTG CTCACGCTCG GCCTGCCGGG CAACGCGGTG ACGGCGGTGC TGTTCGCCGG TCTGCTGATC CAGAACGTGC AACCCGGCCC ATTGATGCTG GTGAAGAATC CCGACGTGTT CTGGGGTGTC ATTGCGTCGA TGTATGTCGG CAACATCATG CTGCTGGTGC TGAACCTGCC GCTGGTCGGG CTGTGGGTGC AGTTGCTCCG CGTGCCGTCC TGGCTGCTCA GCGCCACGAT CCTGCTGATC GCGATCTTCG GCACCTACAG CCTGCGCAGC AATTTCGCCG ACGTGACCAC GCTGATGCTG TTCGGCGGCA TCGGCTATCT GCTGCGCAAG GCCAGCCTCG ACGCCGGTCC GCTGATCATG GCGTTCATCC TCGCCAACAT TCTCGACACC GCGCTGCGGC AATCGATGCT GATGGGCGAC GGCAGCCTGC TGATCGTCCT GCAGCGGCCG ATGTCGCTGA CGATCCTGCT GGTCGCCGCG GTCATCCTGA CGGCCCAGCT GTGGTCGCAT TTCGGCCGCA CGCGCCACCG CGCCGTGCCG TCGGAGGGCT GA
|
Protein sequence | MFDALEKLAY GISLSLEPSN LLYAAIGSVL GTLVGVLPGL GPVTTIAVLL PLTYHAGSPL GAIIMLASIY YGAMYGGSTT SILLKVPGEA ASVITCIDGY QMAKKGRAGP ALAIAAIGSF IAGTVAVCAL ALVGPLFAKF AVTFGPPEYF ALALFGLSLS ATLSGGSPVR GITMVLVGLL LGLVGIDTIT GVERYTFNIM AITDGIDLVP MLMGLFGVAE ILHNLEEKSR GSLLSTKIGR LFPSRQDWRE SSGPIARGSV IGFFVGLIPG GGAILASLMS YTLEKKLSKT PEQFGHGAIA GVAGPESANN SAATASFIPL LTLGLPGNAV TAVLFAGLLI QNVQPGPLML VKNPDVFWGV IASMYVGNIM LLVLNLPLVG LWVQLLRVPS WLLSATILLI AIFGTYSLRS NFADVTTLML FGGIGYLLRK ASLDAGPLIM AFILANILDT ALRQSMLMGD GSLLIVLQRP MSLTILLVAA VILTAQLWSH FGRTRHRAVP SEG
|
| |