Gene RPB_3438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3438 
Symbol 
ID3911240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3941868 
End bp3942926 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content68% 
IMG OID637885341 
ProductAraC family transcriptional regulator 
Protein accessionYP_487045 
Protein GI86750549 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0480484 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCCC TTCCCCGACT GTTCTACGTG ACTTCCAGCC TGGACGAGGC GACCGCCCTG 
GCGACCTGGA GTGCGGTGAT CTCGCCGCTG TTCGAACCGC GCCCCTGCGG CCCGAGCAAG
AAGACACCGA CCGGCTCGGC CTATGGCATC ATCATTGGCG ACCTGATCAT CGCCAAGGTC
GCCTTCAACG CGCAGGACTT CGTCCGCGAC GAGCCACGCA TCGCGGCGAC GCCGGATCAC
CTGCTGCTGC ATCTCTACGT AACCGGCGGG TTCAACGGTG TGGTCACCCG GCAGCAGACG
GCGATCGGCC CCGGCAAGGT CGCGCTGATC GATCTGGCCC ATCCGATCGC CACGCGCGCT
TTCGCCTCCA GCACGGTGTG CCTGATCGTT CCGCGCAAGC TGCTCGGCGG CCTGCCGCTC
GACACGCTGA AGCCGAGGCT CGATCCGCTC CGGAACGATC TGCTCGCGGC GCATCTGCGA
TCGCTTCAGG AACGCAGCGC GCAATTGACC GAGACGGACG TGGCCGACAC GGTGGCCGAC
ACCGTGGGTT TTCTGAGACG GCTGCTCGCC CCCGCCCAGG ATGAATCGCC AGCCGCCGAG
CAGCGAACCG ACGAGACCAT CCTGGCGCTT CTGGAAGCGC TGATCCGCGA CAATCTCGCT
TCGCCCGATC TGTCGCCGGA TTGGCTGGCA CAGCGACTGG ATGTCTCGCG CGCGTCGCTG
TATCGGCTGT TTGCCGACCG CGGCGGCATC ATGCGCTACG TCCAGGAACG GCGGCTGCTC
GCGGTCCAGG CGGCGCTGAG CGATCCGATC GAAACGCGCC GCTTGTCCCG CCTGGCGTCC
GATCTCGGCT TCAAGAGCGA GGCGCATTTC AGCCGGAGCT TTCGCGCCCG CTTCGGCGTC
ACCGCCAGCG CCTTTCGCAA GGCGCAACTC GACGCCTCCG CGGCGATCCA GCTCACCAGC
CCGGCGGTGG TGCAACAATG GTGGACGGCG GTCGCTCGGA GCCCGCCGGC CCGCGGCCTG
GCCCCGGCCG ACGAGCGGGG CGCGGTCCTT CCGCTGTAG
 
Protein sequence
MASLPRLFYV TSSLDEATAL ATWSAVISPL FEPRPCGPSK KTPTGSAYGI IIGDLIIAKV 
AFNAQDFVRD EPRIAATPDH LLLHLYVTGG FNGVVTRQQT AIGPGKVALI DLAHPIATRA
FASSTVCLIV PRKLLGGLPL DTLKPRLDPL RNDLLAAHLR SLQERSAQLT ETDVADTVAD
TVGFLRRLLA PAQDESPAAE QRTDETILAL LEALIRDNLA SPDLSPDWLA QRLDVSRASL
YRLFADRGGI MRYVQERRLL AVQAALSDPI ETRRLSRLAS DLGFKSEAHF SRSFRARFGV
TASAFRKAQL DASAAIQLTS PAVVQQWWTA VARSPPARGL APADERGAVL PL