Gene Information |
Locus tag | RPB_3045 |
Symbol | |
ID | 3910845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3469155 |
End bp | 3470147 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637884952 |
Product | AraC family transcriptional regulator |
Protein accession | YP_486658 |
Protein GI | 86750162 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
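The coordinate and length fields above are related by simple arithmetic, which a short check can make explicit. The sketch below assumes that the start and end coordinates are both inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3469155
END_BP = 3470147


def gene_length(start_bp, end_bp):
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp):
    """Codon count minus one, assuming an unspliced CDS with one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under these assumptions the computed values match the Gene Length (993 bp) and Protein Length (330 aa) fields.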
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACG CTTCCGCGAG CGCCATCGTC CCCGATTACC CGCTGAATCG GACCCCGGTC GTCCACACCA CAGACATCGC GGAAATGCGC CACGCACTGA TCAATTTCTA TGGTGCACGG GCGTTCTCGG CGGAGGCCGA GGGGTTCGAG GGGTTCGGCA GCTTCGTCAA GCTCGACGTG ACGGCGTTCG GCTTTTGCCG CTACGCCTCG CAGGCTGTCG CGGAGTTTTC CGAAACCGAT TTCGCGCGGC TGCAGATCGC GTTATGCGGC ACCGGCCGGA CCACCTCGGG CGGCTCGAGT GTCGAGGTCG ACCCTTCGCG CTGGTGTGTT TCGTCGCCGG GGGCGCCGAC GGTGCTGGAG TTCGGCGCCG ACTACGAGCA GCTCATCATC CGGTTCTCCA ATGAAAAGCT GATGGCCACG CTGGAGGCGA TGCTGGGTGT CAAGCCGCGC GGCCGGCTGG TGTTCGCGCC ATCGGTGCGG ATCGACGATT GCGGCGCGCG TGCGCTGCGC GATCTCGGCC TGTTTCTCGC CCGGCATGTC GATCCGGCGC AGGCCCCGCT GCCGCCGCTG ATGCTGCGCG AACTCGAACA TACGCTGATG GTGTCGCTGC TCAGCGTGGC GCGGCACAAT TTCAGCGACC AGCTCGACCG CGATGCGCCG GACTGCGCGC CGGACTATGT TCGCGTCGCG GAGGAGTTCA TCGCGGCGTC TTGGAACAGG GCGATCACCA TCAACGATCT CGCCGCCGTG ACCAATGTCG GGGTCCGCAG CCTGTTCAAA TCGTTTCAAA AGCACCGTGG CTATTCGCCG ATGGCGTTCG CCAAGACGGT GCGGCTCAAC AAGGCCCGCG AAATGCTGCT GCAGGGCGAT CCGTCGCGCT CGGTCACCTC GGTCGCGTTC GCCTGCGGCT TCAGCAATCT CGGCCATTTC GCCCACGACT ATCGCCAGAA ACACGGAGAA TTGCCGTCGG AAACGTTGGC CCGGGCGCGC TGA
|
Protein sequence | MSDASASAIV PDYPLNRTPV VHTTDIAEMR HALINFYGAR AFSAEAEGFE GFGSFVKLDV TAFGFCRYAS QAVAEFSETD FARLQIALCG TGRTTSGGSS VEVDPSRWCV SSPGAPTVLE FGADYEQLII RFSNEKLMAT LEAMLGVKPR GRLVFAPSVR IDDCGARALR DLGLFLARHV DPAQAPLPPL MLRELEHTLM VSLLSVARHN FSDQLDRDAP DCAPDYVRVA EEFIAASWNR AITINDLAAV TNVGVRSLFK SFQKHRGYSP MAFAKTVRLN KAREMLLQGD PSRSVTSVAF ACGFSNLGHF AHDYRQKHGE LPSETLARAR
|
| |
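The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a minimal illustration, not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments as table 1 and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above, with a TGA stop
# appended here purely for illustration.
example = "ATGTCCGACG CTTCCGCGAG CTGA"
print(translate(example))
```

Passing the full gene sequence string to `translate` and `gc_content` would allow a comparison against the Protein sequence and GC content fields.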