Gene Information |
Locus tag | RPD_0877 |
Symbol | |
ID | 4021351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 987359 |
End bp | 988312 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637961067 |
Product | anaerobic benzoate catabolism transcriptional regulator |
Protein accession | YP_568016 |
Protein GI | 91975357 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0703] Shikimate kinase |
TIGRFAM ID | |
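The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon (the listed sequence ends in TAG, which is consistent with that reading).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record above.
gene_len, protein_len = cds_lengths(987359, 988312)
print(gene_len, protein_len)  # 954 317
```

Under these assumptions the result matches the Gene Length (954 bp) and Protein Length (317 aa) fields.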
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.54694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.67724 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCCGC ACGACATCAT GAGCGCCCTT TTGGAAATGA CAGAATCTGC CGAATCCGAT GCCGGCTTCC TGCTGGAGCT GGGGCAGCGC GTCCGTACGC TGCGCGGACT GCGCGGCATG TCGCGCAAGG TGCTGGCGAA AGTGTCGGGG ATTTCCGAAC GCTATATCGC GCAGCTCGAG AGCGGCAAAG GCAACGTCTC GATTATCCTG CTGCGCCGGG TGTCCGACGC GCTCGCGACG CCGCTGGAGG ATCTGCTCCC CAACAGCGAC CCGTCGATCG ACTGGCCGGT GATCCGCGAT CTGGTCCGCC GCGCCACGCC GGTGCAGATC GCACATGCCA AGGACGTGCT GTCCGGCATG GGCCGGCTCG GCGCCGGCGC GCGGCGCGCC CAGGCGCTGC ACGGCATCGC GCTGATCGGG CTGCGCGGCG CCGGCAAGTC GACGCTCGGG CGAATGCTCG CCGACAAGAT CGGCTGGTCG TTCGTCGAGC TGAACAAGGA AATCGAGCAG CAGAACGGCC TGTCGGTCGC CGAGATCATC GCGCTGTACG GCCAGGAAGG TTTCCGCCGG ATGGAGCAGG CGGCGCTGAC GCAATTGCTG GCGCGCAAGG AACTGATGGT GCTGGCGACC GGCGGCGGCA TCGTCTCCGA GGCGATCAGC TTCGATCAGA TCCTGTCGTC GTTCTACACG ATTTGGCTGA AGGCCGACCC CGAGGAGCAT ATGGCGCGGG TGCGCGGCCA GGGCGATCTG CGGCCGATGG CCGACGATCG CGCCGCGATG CAGGAGCTGC GCACCATCCT GCAGAGCCGC GAACCGCTCT ACGCCCGCGC CTCCGGCGTG CTCGACACCG CGGGGCTGAG CGTCGACGAG GCCGCGGCGA AACTGACCGC GATGGTCTCG CCGGTGCTGT GCCGCGACGC CTGCGCGTTC GGGCTGAAGA ACGCGGCGGT TTAG
|
Protein sequence | MIPHDIMSAL LEMTESAESD AGFLLELGQR VRTLRGLRGM SRKVLAKVSG ISERYIAQLE SGKGNVSIIL LRRVSDALAT PLEDLLPNSD PSIDWPVIRD LVRRATPVQI AHAKDVLSGM GRLGAGARRA QALHGIALIG LRGAGKSTLG RMLADKIGWS FVELNKEIEQ QNGLSVAEII ALYGQEGFRR MEQAALTQLL ARKELMVLAT GGGIVSEAIS FDQILSSFYT IWLKADPEEH MARVRGQGDL RPMADDRAAM QELRTILQSR EPLYARASGV LDTAGLSVDE AAAKLTAMVS PVLCRDACAF GLKNAAV
|
| |
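The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a string might be cleaned and sanity-checked follows; the helper names (`join_blocks`, `gc_fraction`, `ends_with_stop`) are hypothetical and not part of any database API. The stop codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def join_blocks(spaced: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(spaced.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Only the first two blocks of the gene sequence, for illustration.
fragment = join_blocks("ATGATTCCGC ACGACATCAT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.45
```

Applied to the full gene sequence, `gc_fraction` would be expected to land near the GC content field in the record (68%), and `ends_with_stop` should hold because the sequence is 954 bp long and ends in TAG.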