Gene Information |
Locus tag | RPB_1134 |
Symbol | |
ID | 3909222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1304221 |
End bp | 1305330 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883028 |
Product | AraC family transcriptional regulator |
Protein accession | YP_484755 |
Protein GI | 86748259 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.955983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.648616 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTTC GGCGGGTGGT GCGGTTTTTG CTTCGTGATC CCGGTGTTTC CTTTGGATCG GTGGACATGT CTGCTTCGGG AACCATCGCC TGCGGCCCTG TGCAGCAACT TCTGGTAGCG CTCGCTCGGT CGAGGCCGGA ATCCGCGGGA ATCAATGCCC GCGTGGGGAT TTCACGCGCG ACACTCGACG ATCCGTCCGG GATGCTGCCG CTGGCGGCGT TCACGTCGAT GCTCGAGGCG GCTGCGCATG AAAGCGGCAA TCGCACCCTT GGGATCGAGC TCGGCCGCGA CTTCAAGCTC GCGGCGCTGG GGCCGATCAG CGATCTGATG CGGACTGCCC AGACCGTGGG CGACGCGCTG GAGAGCTTCA GTGGCTTCTT CGCCAGCATC CAGACCAGCA CGCGGACGAC GCTGTCGGTC AGCGACGGCA TTGCGCGGCT GTCCTATGCG ATCGAGGATC CGGCGATCCG GTTTCGCGAG CAGGACGCCG GCTTCTCGCT GGCGATCGAA TATTCGATGC TGGCCGGATT TCTCGGTCCG GCGTGGCGGG CGAGCGGCGT CGAATTCGAG CACGCGGCCG GGGATGATCT GCCGTTCTAT CAGCAGCATT TCGACTGCCC ACTGCGGTTC GGACGGCGCG AAAACGCGTT GCTGTTCCAG GCGCGGTGCC TCGACGTGCC GCTGCAGCAG GCGGACCGCA ACCTGCACGC GCGGCTCCGC GCCGATCTCG CGGAGGTGAT CCAGCGGCGG GCGACGCGGC TCGATCTGGT CCGCGGCATC GAGGCGTGGA TCGCGGCCTC GCTGTGCCGG TCGGTCGCGA CCGATATCGA GGTCGTCGCC TGTGATTTCG GCATGAGCAC GCGGTCGTTC CAGCGCAGGC TCGCCGACCA CGGCGTCAAC TATCTCGACA TCCGCAACCG GGTCCGCTCG CATATCGCCA AATGCATGCT GGCCGAGACC GGCGCTCCCG TGACGTCGAT CGCGCTGCAA CTCGGCTACA GCGAGACCAG CGCGTTCTCG CGCGGGTTCA AGAGCCAGGT AGGCGAGACC CCGGTCGAGT TTCGCAAGCG TCGGCGTGGT ATTGACCCTG CCGCGGCCGC TGCGGCGTGA
|
Protein sequence | MRLRRVVRFL LRDPGVSFGS VDMSASGTIA CGPVQQLLVA LARSRPESAG INARVGISRA TLDDPSGMLP LAAFTSMLEA AAHESGNRTL GIELGRDFKL AALGPISDLM RTAQTVGDAL ESFSGFFASI QTSTRTTLSV SDGIARLSYA IEDPAIRFRE QDAGFSLAIE YSMLAGFLGP AWRASGVEFE HAAGDDLPFY QQHFDCPLRF GRRENALLFQ ARCLDVPLQQ ADRNLHARLR ADLAEVIQRR ATRLDLVRGI EAWIAASLCR SVATDIEVVA CDFGMSTRSF QRRLADHGVN YLDIRNRVRS HIAKCMLAET GAPVTSIALQ LGYSETSAFS RGFKSQVGET PVEFRKRRRG IDPAAAAAA
|
| |
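The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both assumptions are consistent with the numbers listed above but are not stated by the record itself. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1304221, 1305330)
print(length)                  # 1110
print(protein_length(length))  # 369
```

Passing the full Gene sequence field (spaces included) to `gc_fraction` gives a value that can be compared with the GC content field.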