Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0570 |
Symbol | |
ID | 6408220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 619039 |
End bp | 620016 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642710483 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_001989605 |
Protein GI | 192289000 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTCG AGCTTCTCAA CGAACCACTC TCCAGGTACC CCGCGTTCGA CACCGCCGAT CCGGAAGAAC TGCGACAGGC CTTGAGCTCC GTTTACGGCG CCAAGCTGGT GGAAGGTCCC ATCGCCGCTG ATTTCCATGC GCGCAGCAAT TTCGTCCAGC TCGATGACAT CGCGCTCGCC TTCGGTGTCG GCACGGGATC AATCGCGGTC GAATTCCCCG AGGGGGAATG TGCGCGGCTG CAGATCGCGC TCTCCGGGCA GTTCACCACC CGCAGCGGCG GCACCAGCAC GGCGATCAAC GCACGGCAGG CCGGCATCAT CTCGCCGGGC CGCAATGCCC GCACCGAGTA TGGGCAGAAC TACAGCTTCA CCCTGCTGCG CATAAGCACC TCGGCGCTGG AGCGGAAGCT GACCACTCTG CTCGGCTGCA AGCCGAAGGC TTCGCTCGAA TTCGAGCCGG CGGCCAACAA CGAAGCTCCG CAGGTGGTCA ACTTGCGCCG GATGCTGTGC TTCCTGGCCA ATCAGCTGAA TTGCAGCCCG CTGCCGCCGG TGGTTCTGGC CGAGCTGCAG GAAGCGATCA CCCTGCTGTT CCTCAGCGCC TTCCGGCACA ATTACAGCCG GCAGCTCGAA CGGGAGTCGC ACGGGATCGC GCCGAAGCAC GTCCGCCAGG TCGAGGAATA TATCGAAGCC AACTGGATGC GGCCGATCAC CATCGAGAAG CTGACGGCTC TGACCGGGAT CAGTTCACGC GGCATCTTCA AGGCGTTCCA GCGCAGCCGT GGCTACTCGC CGATGGCCTT CGCCAAGCGG GTGCGGCTGC AGCACGCCCA TAACCTGCTG TCGGACGGCG CGACGCCCAC CACGGTCACG GCTGCAGCGC TGTCCTGCGG CTTTTCCAAT CTCGGGCATT TCGCCCGCGA CTATCGCGAC ATGTTCGGTG AAAAACCCTC GGAAACGCTG CAACGCGCGC GACCCTAA
|
Protein sequence | MTVELLNEPL SRYPAFDTAD PEELRQALSS VYGAKLVEGP IAADFHARSN FVQLDDIALA FGVGTGSIAV EFPEGECARL QIALSGQFTT RSGGTSTAIN ARQAGIISPG RNARTEYGQN YSFTLLRIST SALERKLTTL LGCKPKASLE FEPAANNEAP QVVNLRRMLC FLANQLNCSP LPPVVLAELQ EAITLLFLSA FRHNYSRQLE RESHGIAPKH VRQVEEYIEA NWMRPITIEK LTALTGISSR GIFKAFQRSR GYSPMAFAKR VRLQHAHNLL SDGATPTTVT AAALSCGFSN LGHFARDYRD MFGEKPSETL QRARP
|
| |