Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0714 |
Symbol | |
ID | 3908220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 801110 |
End bp | 802090 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882606 |
Product | AraC family transcriptional regulator |
Protein accession | YP_484336 |
Protein GI | 86747840 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.571232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAC TCCTCAGCGA GCCACTCAGC CGTTTCCGTG CGATGGACAC GTCCGATCCG GACGAACTCG CGCACGCGCT CTCCACCGTC TACGGCGCCC GCAACTTTCA GTGCGGTTCG GCCGGCACGT TCCGCGTGCG GGGCAATTTC GTGCAATTGC AGGATATCGC GCTCGGCTTC ACCTGCGGTC GCGCGCCGCT GAGCGTGGAC TTCCCCGAGG CCGACTTTGC GCGACTACAG ATCGCGCTGA CCGGGCAATC GAGTACCCGC AGCGCCGGCG TCACGACGGC GATCGATCCG CGGCAGGCCT GCGTCTCGTC GCCCGGACGC ACCGCTCACA CCGAGTTCGG CCCGTTCTAC GAGCATTTGC TGCTTCGCGT GCAAAGCAGC GCGATCGACC GCAAGCTCAC GGCGCTGCTC GGCACCAAGC CGAAGCGCGC GATCGAGTTC GAGCCCGCCG CGAGCAACGA CCTGCTGCAG GCGGCCAATC TTCGCCGCCT GATCGGGTTC GTGAACAGTC AGATCAATTC GGCGAGTGCC CCGCTACCCG AGCTGGTGAT GGCGGAGTTG CAGGAAGCCA TCACCCTGCT GTTCCTGACC GCCTATCCGC ACAACTTCAC CAAATTTCTG GAAGCCGACT CGCGCAGTGC GGCGCCCGGG CACGTGCGCA GGATCGAGGA ATACATCGAG GCCAACTGGG CCGAACCGAT CACCATCGAA ACGCTGACGG ACCTGACGGG AATCAGCGCG CGCGGCATCT TCAAGGCCTT CCAGCGCAGC CGCGGCTACT CGCCGATGGC GTTCGCCAAG CAGGTCCGGC TTCGGCAGGC ACGCACCATG CTGCAGCAGG GACGCGCGCT GACCTCGGTC ACGGCCGCCG CCTTCGCCTG CGGCTTCTCC AATCTCGGCC ACTTCGCCAA GGACTATCGG ACTGCGTTCG GCGAACGGCC GTCGGAAACG CTGGTCCGAT CGAGCAGATA G
|
Protein sequence | MTELLSEPLS RFRAMDTSDP DELAHALSTV YGARNFQCGS AGTFRVRGNF VQLQDIALGF TCGRAPLSVD FPEADFARLQ IALTGQSSTR SAGVTTAIDP RQACVSSPGR TAHTEFGPFY EHLLLRVQSS AIDRKLTALL GTKPKRAIEF EPAASNDLLQ AANLRRLIGF VNSQINSASA PLPELVMAEL QEAITLLFLT AYPHNFTKFL EADSRSAAPG HVRRIEEYIE ANWAEPITIE TLTDLTGISA RGIFKAFQRS RGYSPMAFAK QVRLRQARTM LQQGRALTSV TAAAFACGFS NLGHFAKDYR TAFGERPSET LVRSSR
|
| |