Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3438 |
Symbol | |
ID | 3911240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3941868 |
End bp | 3942926 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885341 |
Product | AraC family transcriptional regulator |
Protein accession | YP_487045 |
Protein GI | 86750549 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0480484 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCCC TTCCCCGACT GTTCTACGTG ACTTCCAGCC TGGACGAGGC GACCGCCCTG GCGACCTGGA GTGCGGTGAT CTCGCCGCTG TTCGAACCGC GCCCCTGCGG CCCGAGCAAG AAGACACCGA CCGGCTCGGC CTATGGCATC ATCATTGGCG ACCTGATCAT CGCCAAGGTC GCCTTCAACG CGCAGGACTT CGTCCGCGAC GAGCCACGCA TCGCGGCGAC GCCGGATCAC CTGCTGCTGC ATCTCTACGT AACCGGCGGG TTCAACGGTG TGGTCACCCG GCAGCAGACG GCGATCGGCC CCGGCAAGGT CGCGCTGATC GATCTGGCCC ATCCGATCGC CACGCGCGCT TTCGCCTCCA GCACGGTGTG CCTGATCGTT CCGCGCAAGC TGCTCGGCGG CCTGCCGCTC GACACGCTGA AGCCGAGGCT CGATCCGCTC CGGAACGATC TGCTCGCGGC GCATCTGCGA TCGCTTCAGG AACGCAGCGC GCAATTGACC GAGACGGACG TGGCCGACAC GGTGGCCGAC ACCGTGGGTT TTCTGAGACG GCTGCTCGCC CCCGCCCAGG ATGAATCGCC AGCCGCCGAG CAGCGAACCG ACGAGACCAT CCTGGCGCTT CTGGAAGCGC TGATCCGCGA CAATCTCGCT TCGCCCGATC TGTCGCCGGA TTGGCTGGCA CAGCGACTGG ATGTCTCGCG CGCGTCGCTG TATCGGCTGT TTGCCGACCG CGGCGGCATC ATGCGCTACG TCCAGGAACG GCGGCTGCTC GCGGTCCAGG CGGCGCTGAG CGATCCGATC GAAACGCGCC GCTTGTCCCG CCTGGCGTCC GATCTCGGCT TCAAGAGCGA GGCGCATTTC AGCCGGAGCT TTCGCGCCCG CTTCGGCGTC ACCGCCAGCG CCTTTCGCAA GGCGCAACTC GACGCCTCCG CGGCGATCCA GCTCACCAGC CCGGCGGTGG TGCAACAATG GTGGACGGCG GTCGCTCGGA GCCCGCCGGC CCGCGGCCTG GCCCCGGCCG ACGAGCGGGG CGCGGTCCTT CCGCTGTAG
|
Protein sequence | MASLPRLFYV TSSLDEATAL ATWSAVISPL FEPRPCGPSK KTPTGSAYGI IIGDLIIAKV AFNAQDFVRD EPRIAATPDH LLLHLYVTGG FNGVVTRQQT AIGPGKVALI DLAHPIATRA FASSTVCLIV PRKLLGGLPL DTLKPRLDPL RNDLLAAHLR SLQERSAQLT ETDVADTVAD TVGFLRRLLA PAQDESPAAE QRTDETILAL LEALIRDNLA SPDLSPDWLA QRLDVSRASL YRLFADRGGI MRYVQERRLL AVQAALSDPI ETRRLSRLAS DLGFKSEAHF SRSFRARFGV TASAFRKAQL DASAAIQLTS PAVVQQWWTA VARSPPARGL APADERGAVL PL
|
| |