Gene Information |
Locus tag | Rpal_4759 |
Symbol | |
ID | 6412445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5125241 |
End bp | 5126239 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642714638 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_001993725 |
Protein GI | 192293120 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
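The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed 999 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(5125241, 5126239)
print(length, protein_length_aa(length))  # prints: 999 332
```

Both results match the Gene Length and Protein Length rows, which supports the inclusive-coordinate reading.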
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATTG ATCTACTCGG GGAGCCGCTC GATCGGTTTC CCATGGTGCG CGGTTCCAAT CCGACCGATT TCGAATCCGC ACTCAAGTCT GTGTTCGGCG ATGCGTCGGT CGAGGTCTCG GATCGAGACG GCTTCAGGGC GCGGCTCAAT TTCGTCCGGC TGAGCGATAT CGAGATGGCC TATAGCTGGG CGACCGTGCC GTCGCGGCTG CGGCTGCCGC CCGACGATTT CGTCGGCCTG CAGCTCGCGC TGAGCGGCAG CGCCATCACC ATGGTCGGAA ATCGGCGGGT CGCCACCAAT GCCCGGCAGT CGTGCATTTG TCCGCCCGGT CAGGGGCGCG ACTATCAGTT CGATGCCGAG TTCGAACAGC TGTTTCTCGG CGTCCGGCTC AGTGCGCTGG AACGGACGCT CGCCGGGCTG CTTGGCGGCA AGCCGAATGC GCCGCTCGAA TTCGAGCCGG TGGCCGACAA CGACTATCCG CACTCGGAAA ATCTGCGCCA GCTGACGCTG TTCTTCGGCG GCACGCTGAA CGCCACCAAG GTGTCGTTGC CGTCGCAATA TCTGGCCGAG CTCGAGCAGG CGACCGCGGT CGCCTTCCTG CACGCCTGTA AACACAATTT CAGCAGCTAT CTCGGCGTCG CTGAGAAGGA CGCCGCGTCG CGCCACGTCA AATTGGTCGA GGAGTACATC GAGGCCAATT GGAACGAGTC GCTCACGATC GAGAAGCTGG TGGAGCTGAC CGGCATGAGC GCCCGCACCG TGTTCAAGGC GTTTCAGCGC ACCCGCGGTT ATTCGCCGAT GGCCTTTGCC AAGCGGGTCC GGATGGAGCG GGTGCGGCAG CTGCTGCTGG AGGCCGGCGG CGACGCCTCG GTCGGCGCCA TCGCGGTGCA ATGCGGCTTT CCGCATCTCG GTCATTTCGC CAAGGATTAT CGCAAGACGT TCGGCGAGAA TCCGTCCGAT ACGCTGGCAA GGGGACGCCG TTTCCGCGGC GTTCGATAG
|
Protein sequence | MKIDLLGEPL DRFPMVRGSN PTDFESALKS VFGDASVEVS DRDGFRARLN FVRLSDIEMA YSWATVPSRL RLPPDDFVGL QLALSGSAIT MVGNRRVATN ARQSCICPPG QGRDYQFDAE FEQLFLGVRL SALERTLAGL LGGKPNAPLE FEPVADNDYP HSENLRQLTL FFGGTLNATK VSLPSQYLAE LEQATAVAFL HACKHNFSSY LGVAEKDAAS RHVKLVEEYI EANWNESLTI EKLVELTGMS ARTVFKAFQR TRGYSPMAFA KRVRMERVRQ LLLEAGGDAS VGAIAVQCGF PHLGHFAKDY RKTFGENPSD TLARGRRFRG VR
|
| |
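The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), so no reverse complement is applied. Translation table 11 assigns the same amino acids to codons as the standard table and differs in the start codons it permits; this sketch does not handle alternative start codons, which is sufficient here because the gene starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon table in NCBI layout: bases ordered T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAAGATTG ATCTACTCGG GGAGCCGCTC"))  # prints: MKIDLLGEPL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content rows to be checked in the same way.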