Gene Information |
Locus tag | RPB_2694 |
Symbol | |
ID | 3910487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3077995 |
End bp | 3078993 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637884594 |
Product | AraC family transcriptional regulator |
Protein accession | YP_486307 |
Protein GI | 86749811 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
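
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1   # inclusive range
    codons = gene_length // 3             # whole codons in the CDS
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Values copied from the record for RPB_2694.
gene_len, protein_len = cds_lengths(3077995, 3078993)
print(gene_len, protein_len)  # 999 332
```

The result matches the listed Gene Length (999 bp) and Protein Length (332 aa).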
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.899645 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCCGG AACGACTGTC GAAAGTCGTC TGCGCCGACG ATCTCGAGCG TTTCGCCGGC GCATCCGGCA GCGAGTTGCG GCTCGTCACA CCCGGCGTCG GCGGGCGCGA TCCGATCCTC GTCGGCGAAT TCCAGCGCGT TCAGTTGCGC TCCGGGCTGG CGCTGCACAC CTCGAACACC CGCGAAGTCC ATGATCTCCA CACCGAGGCC GTGCAGCATC CAGGCCTGAC GATCGCGCTG TTTCTGAGAG GACACATCGA CGCCTGGTTC GGCGGGCGCC TGATCGAGAT GGGGCCGCGG ACCGAGGACG CGCGCGATAT CGAAGCCATC GTGGTCGCGC GCTCCGAGTC CGACAGCTTC GTCCGACGAT CGGTGAAGGG CGCCCGGATC CGCAAGCTCA ACGTCACGAT CACGCCGGAA TGGCTCGACC AGCAGGCCCT GCTGAGTTCA CCCGAATGCG CGGCGATCCT GCGCTTCTCC CGCACACATC TGGCCACCTT GCGCTGGACG GCGTCGCCGC GTCTGATCAC GCTCGCCGAA CAGATCCTCG GCCCGCCGCT GTTCGCCGCG CCGCTGCAGA AGCTGTACTA CGAGTCGCGC GCGATCGACG TGGTGTCGGA GGCACTGCTG GCGATCTCGG ATGCGCCGGC ACAATCTGGA AGCGCGACGA TCCATCCGAC TCATCACCGC AGCGTGCGGC GCGCCTGTGA CTTCATCGAC GCCAATCTCG ATCACGAACT GATGCTGCCG TCGATCGCGG CGGCCGCGGG CCTCAATTCC GGCAGCCTGC AGCGCGCCTT TCGCCTGCTG TACGGCGTGA CCGTCTTCGA ATACGTCCGC AGCCGGAAAC TCGACCGCGC CAGGGCGGCG CTCGAACGCG ACGGCATCTC CGTGGGCGAG GCCGCCTACC TCGCCGGATA CAAGACACCC GGCAATTTTT CCACGGCGTT CCGCCGCCGC TTCGGCGTCA CGCCCCGGCA AATCCGCGCG ACGTCCTGA
|
Protein sequence | MLPERLSKVV CADDLERFAG ASGSELRLVT PGVGGRDPIL VGEFQRVQLR SGLALHTSNT REVHDLHTEA VQHPGLTIAL FLRGHIDAWF GGRLIEMGPR TEDARDIEAI VVARSESDSF VRRSVKGARI RKLNVTITPE WLDQQALLSS PECAAILRFS RTHLATLRWT ASPRLITLAE QILGPPLFAA PLQKLYYESR AIDVVSEALL AISDAPAQSG SATIHPTHHR SVRRACDFID ANLDHELMLP SIAAAAGLNS GSLQRAFRLL YGVTVFEYVR SRKLDRARAA LERDGISVGE AAYLAGYKTP GNFSTAFRRR FGVTPRQIRA TS
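
The relationship between the two sequence fields can be sketched as follows. The gene sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons, so the sketch uses the standard codon assignments. The helper names `translate` and `gc_fraction` are hypothetical, and the spaces that group the sequence in blocks of ten are simply removed.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above, with the display spacing kept.
print(translate("ATGCTTCCGG AACGACTGTC GAAAGTCGTC"))  # MLPERLSKVV
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_fraction` on the same input gives the value summarized in the GC content field.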