Gene Information |
Locus tag | RPC_2016 |
Symbol | |
ID | 3973879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2197698 |
End bp | 2198825 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637925125 |
Product | AraC family transcriptional regulator |
Protein accession | YP_531890 |
Protein GI | 90423520 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
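The coordinate and length fields in this record agree with one another if the start and end positions are read as 1-based and inclusive. The short Python sketch below shows that arithmetic. The function names are hypothetical and introduced only for illustration, and the inclusive convention is an assumption inferred from the values in this record rather than something the record states.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of an unspliced CDS: codon count minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record for locus RPC_2016.
length = gene_length_bp(2197698, 2198825)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both results match the Gene Length and Protein Length fields. The second function only applies because the record marks the gene as unspliced and not a pseudogene.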
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.875344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.643509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTACGGGC CAACCTCACA GGAGAAAAGA GTCGCGGAAA CGCGGATGCC GAACCTTGAC GTAATTCGAC TGCAAGCTTC CAGGCGATCA TGGGTCTCGA TCGGAGGTTC GGACCCAAAC ACCCCCATGG CGCTCAAACT CATAGAACCA AGATTTTCGG AAGTTGACGA TGATCAAATG AGGTTTGAGA AATGGCAGCA GCTCACGTCA AGCCTGTTCG TGGCGAGCAA GAAGAGCCCG ATGGTCGGCT CTCGCGATCT TTCCTTTCGT TGCTACAATC TGGATCGGTT GCTGTTTTTC GACCACAGCG CCGGCGCATA TGGCGCCGAG CGGACGCCGT CTCAGATCGT GACGCAGGGA GTCGATCATA TCCTTCTCGG CTTGCAAACC GTCGGAACGA CGCTGCTGAT GCGCTCGGAC GGCGTTGCAA TGGCCGGGGT TGGCGACCTT GTGGTCCTCG ATCTCTCCCA ACGATTCCAA TTCGCAACCG AAGGGATGTC GGCGATCCAT ATTTGCCTGC CGCGAAGGCG ATTCGAGAAC CATGCAAGAA AAATGGGTGC TCGGCACATG CAAATCCTTC GCTCCGAGGG TGAACCGCTT CTGAAGTTGA TGGCGGATCA TCTGCTGAAC ATGCGAACAT GCCTGCATCA CGCCGTTGCT GAGCAACTGC ATCTTCTGAC CTCGGCGGCG ATTGCGATTT GCAACGCGGC GTTCACGCCA CCTGAAGACA GTTCTTACAA TGAACCGGCC GTTGCCGCGA TCGAAGTCCG CCAATTCATC GAGGAGAATA TTCAGCACCA GGATCTCGGA ATCGAATTGC TCTGCGCGCG GTTTGGCCTC TCCCGGACCC CACTCTATAA GCTATTTGAG GTTGACGGCG GGATCGTGAG TTACATTAGA AGCCGGCGGC TCGCTCGAGC CATGCTGATG CTTTCCGGAG TCGAAGGGCG ATCGCACCAG CGCGTGTCGT CGGTCGCCTA TGCCTGCGGA TACCAATCGG CGAAGATGTT CAGCCGCGCT TTCCATCGCC GGTATGGCGT CAATCCGCGC GAGGTGAATA GAACGTACCA GACGGTGGCG ATCCAGGAAA AGGGTGCTCT TTTGGCGTCC TGGATACAGA ACCTATGA
|
Protein sequence | MYGPTSQEKR VAETRMPNLD VIRLQASRRS WVSIGGSDPN TPMALKLIEP RFSEVDDDQM RFEKWQQLTS SLFVASKKSP MVGSRDLSFR CYNLDRLLFF DHSAGAYGAE RTPSQIVTQG VDHILLGLQT VGTTLLMRSD GVAMAGVGDL VVLDLSQRFQ FATEGMSAIH ICLPRRRFEN HARKMGARHM QILRSEGEPL LKLMADHLLN MRTCLHHAVA EQLHLLTSAA IAICNAAFTP PEDSSYNEPA VAAIEVRQFI EENIQHQDLG IELLCARFGL SRTPLYKLFE VDGGIVSYIR SRRLARAMLM LSGVEGRSHQ RVSSVAYACG YQSAKMFSRA FHRRYGVNPR EVNRTYQTVA IQEKGALLAS WIQNL
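The gene sequence begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11, GTG is an accepted alternative start codon, and an initiator codon is read as methionine. The sketch below is a minimal pure-Python illustration of that rule, not the database's own method. The name `translate_cds` and the reduced `START_CODONS` set are assumptions made for this example.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same assignments as the standard code and
# differs from it in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are listed here;
# table 11 also permits rarer ones that this sketch does not accept.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"unexpected start codon: {codons[0]}")
    if CODON_TABLE[codons[-1]] != "*":
        raise ValueError("CDS does not end with a stop codon")
    # The initiator codon is read as Met, whatever it encodes internally.
    protein = ["M"]
    for codon in codons[1:-1]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            raise ValueError("internal stop codon")
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above, with a TGA stop codon appended
# purely for illustration (the real stop is at the end of the full sequence).
print(translate_cds("GTGTACGGGC CAACCTCACA GGAGAAAAGA TGA"))  # MYGPTSQEKR
```

Passing the full gene sequence from this record, spaces included, should reproduce the protein sequence listed above. In practice, a maintained library such as Biopython would be the more robust choice for this task.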