Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4200 |
Symbol | |
ID | 3912008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4772482 |
End bp | 4773927 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886104 |
Product | XRE family transcriptional regulator |
Protein accession | YP_487803 |
Protein GI | 86751307 |
COG category | [R] General function prediction only |
COG ID | [COG3800] Predicted transcriptional regulator |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.306631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGGTG AATCCGGGAA AAAACTGTTC GTCGGCCCCC GGTTCCGGCG AATCCGGCAG CAACTCGGCC TGTCGCAGAC CCAGATTGCC GAAGGGCTCG GGATTTCGCC GAGCTATATC AATCTGATCG AACGGAACCA GCGCCCCGTG ACCGCCCAGA TTCTGCTCAG GCTGGCGGAG ACCTACGATC TCGATCTGCG TGACCTCGCG ACCGCCGACG AGGACCGTTT TTTCGCCGAA CTCAACGAGA TCTTCTCCGA CCCGCTGTTC CGCCAGATCG ACCTGCCGAA GCAGGAACTG CGCGACCTCG CCGAGCTGTG CCCCGGCGTC ACCCACTCGC TGCAGCGATT GTACGCCGCC TACACCGAAG CGCGGCGCGG CGAGACGCTG GTCGCGGCGC AGATGGCCGA TCGCGACGAG GGCACCAGGT TCGAGGCCAA CCCGATCGAG CGCGTCCGCG ACCTGATCGA GGCCAACCGC AACTATTTCC CGGAGCTCGA GCAGGCCGCC GAGGCGGTGC GCGACGAACT GAATGTCGGC TCGCAGGAGG TCTATGGCGC GCTCGCCGAC CGGCTGCGCG AGCGGCATTC GATCACCACC CGGATCATGC CGGTCGACGT GATGCGCGAG ACGCTGCGCC GGTTCGACCG CCACCGCCGG CAATTGCTGA TCTCCGAACT GGTCGACTCG CAGGGCCGCG CCTTCCAGGC CGCGTTCCAG ACCGGCCTCA CCGAATATGG CAGCGTGATC GACGGCATCG TCAACCGCGC CGGCGCCCTC GACGAGCCGG CGCGGCGACT CTACCGGATC ACGCTCGGCA ATTACTTCGC CGCCGCGCTG ATGATGCCCT ACGCCGCTTT CCATGCCGCC GCCGAACAGC TCAGCTACGA TGTCAACGTG CTGGCGCAGC GCTTCAACGC CGGCTTCGAG CAGGTCTGCC ATCGCCTCAC CACGCTGCAA CGGCCGACCG CGCGCGGCGT GCCGTTCTTC CTGCTGCGGG TCGACAACGC CGGCAACGTC TCCAAGCGGT TCTCCTCCGG CACCTTCCCG TTCTCGAAAT TCGGCGGAAC CTGCCCGTTG TGGAACGTGC ACTCGACCTT CGATACGCCA GACCGGCTGC TGAAACAGGT GATCGAACTG CCCGACGGCA GCCGCTATTT CTCGATCGCC CAGATGGTGC GCCGGCCGGT GGCGCCGCAC CCGCAGCCGC AGCCGCGCTT CGCCATCGGG CTCGGCTGCG AAATCCGCCA CGCGTCGAAG CTGACCTACG CCGCCGGCAT GGACCTGGAG AAAGCCGAAG GCACGCCGAT CGGCGTCAAC TGCCGCCTCT GCGAACGCGA AAACTGCAGC CAGCGCGCCG AGCCGCCGAT CACCCGGACG CTGATCCTGG ACGAGAACAC GCGGCGGGCG TCGAGCTTTG CGTTCAGCAA TGCAAGGGAG TTGTGA
|
Protein sequence | MAGESGKKLF VGPRFRRIRQ QLGLSQTQIA EGLGISPSYI NLIERNQRPV TAQILLRLAE TYDLDLRDLA TADEDRFFAE LNEIFSDPLF RQIDLPKQEL RDLAELCPGV THSLQRLYAA YTEARRGETL VAAQMADRDE GTRFEANPIE RVRDLIEANR NYFPELEQAA EAVRDELNVG SQEVYGALAD RLRERHSITT RIMPVDVMRE TLRRFDRHRR QLLISELVDS QGRAFQAAFQ TGLTEYGSVI DGIVNRAGAL DEPARRLYRI TLGNYFAAAL MMPYAAFHAA AEQLSYDVNV LAQRFNAGFE QVCHRLTTLQ RPTARGVPFF LLRVDNAGNV SKRFSSGTFP FSKFGGTCPL WNVHSTFDTP DRLLKQVIEL PDGSRYFSIA QMVRRPVAPH PQPQPRFAIG LGCEIRHASK LTYAAGMDLE KAEGTPIGVN CRLCERENCS QRAEPPITRT LILDENTRRA SSFAFSNARE L
|
| |