Gene Information |
Locus tag | RPB_1140 |
Symbol | |
ID | 3909228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1310770 |
End bp | 1311738 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637883034 |
Product | AraC family transcriptional regulator |
Protein accession | YP_484761 |
Protein GI | 86748265 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
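The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive (an assumption that matches the stated 969 bp) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp < 6 or cds_length_bp % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3 and hold a stop codon")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (RPB_1140).
length = gene_length_bp(1310770, 1311738)
print(length, protein_length_aa(length))  # prints "969 322"
```

Strand does not affect the length; for a minus-strand gene such as this one it only determines that the coding sequence is the reverse complement of the replicon interval.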
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0567003 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCAAAGA CGTTGCCTGC GCCGGCCTCG CGCCCGTCCC GGGTCGGCTT CCTGCTGATC GACGGCTTCG CGCTGATGTC GTTCGCTTCT GCCGTCGAGC CGCTGCGCGC CGCCAACGCC ATCGCCGGGC GCACGCTGTA TCAATGGTTT CACGTCTCGA TCGACGGCCG TCCGATCGGC GCCTCCAGCG GGCTGTCGAT CCAGCCGGAC TGCGCGATCG ACGGCGCGCA GGAGTTCGAC ATCGTGCTGG TCTGTGCCGG CGGCAATCCG ACCAAATTCT CCGATCGGCG GACGATGAGC TGGCTGCGCG CGCAGGCGCG GCGCGGCGTG GCGATCGGCG GCATCTCCGG CGGTCCTTAT CTTCTCGCCC GCGCCAAGGT GCTCGACGGC TATCGCTGCA CGATCCATTG GGAGCACGCG CCGGCCTTCG CCGAGGCGTT TCCGCATCTC GACCTGACGC GCAATTTGTT CGAGATCGAC CGCGAGCGCC TGACCTGCGG CGGCGGCGTC GCCGGGCTCG ACATGATGCA GGCGCTGATC CGCCGCGACC ACGGCCCGGA GCTGGCCGCC AAGGTCAGCG ACTGGTTTCT GCAGACCAAT GTCCGGCTCG GCGATTCCAG CCAGCGGCCG AATGCGCGCG AGCGCGGGCG GCTCGGTCAT CCCGCGCTGC AGGCCGCGAT CGAGCTGATG GAGCGCCGGC TGCGCGAGCC GGCGAGCCGG ACCGAGATCG CCCGCGCCGC CGGCGTCTCG CTGCGCCAGC TCGAGCGATT ATTCACGACG CATCTCAAGA CGACGATCGA GCGACGCTAC CTGATGATCC GGCTGCAGCG CGCGCGGACT CTGCTGCGGC AGACGTCGCT CCCCGTCACC CAGATCGGCG CGGATTGCGG CTTCGTCAGC CTCGCGCATT TCTCGCGCGT CTATAGGCAG CGCTTCGCGC GCACGCCCTC GGCCGACCGC AGGCTCTGA
Protein sequence | MPKTLPAPAS RPSRVGFLLI DGFALMSFAS AVEPLRAANA IAGRTLYQWF HVSIDGRPIG ASSGLSIQPD CAIDGAQEFD IVLVCAGGNP TKFSDRRTMS WLRAQARRGV AIGGISGGPY LLARAKVLDG YRCTIHWEHA PAFAEAFPHL DLTRNLFEID RERLTCGGGV AGLDMMQALI RRDHGPELAA KVSDWFLQTN VRLGDSSQRP NARERGRLGH PALQAAIELM ERRLREPASR TEIARAAGVS LRQLERLFTT HLKTTIERRY LMIRLQRART LLRQTSLPVT QIGADCGFVS LAHFSRVYRQ RFARTPSADR RL
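The gene sequence begins with TTG while the protein begins with M. Under translation table 11 an alternative start codon is read as methionine at the initiation position, whereas the same codon inside the reading frame keeps its usual meaning. The sketch below illustrates this with a hand-built codon table; the start-codon set and the helper name translate_cds are assumptions made for illustration, not taken from the record.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not an accepted start codon")
    protein = ["M"]  # any accepted start codon initiates with methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCCAAAGA CGTTGCCTGC GCCGGCCTCG"))  # prints "MPKTLPAPAS"
```

In this fragment the first TTG yields M and the fifth codon, also TTG, yields L, matching the first ten residues of the protein sequence.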