Gene RPB_1134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1134 
Symbol 
ID3909222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1304221 
End bp1305330 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content67% 
IMG OID637883028 
ProductAraC family transcriptional regulator 
Protein accessionYP_484755 
Protein GI86748259 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.955983 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.648616 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTTC GGCGGGTGGT GCGGTTTTTG CTTCGTGATC CCGGTGTTTC CTTTGGATCG 
GTGGACATGT CTGCTTCGGG AACCATCGCC TGCGGCCCTG TGCAGCAACT TCTGGTAGCG
CTCGCTCGGT CGAGGCCGGA ATCCGCGGGA ATCAATGCCC GCGTGGGGAT TTCACGCGCG
ACACTCGACG ATCCGTCCGG GATGCTGCCG CTGGCGGCGT TCACGTCGAT GCTCGAGGCG
GCTGCGCATG AAAGCGGCAA TCGCACCCTT GGGATCGAGC TCGGCCGCGA CTTCAAGCTC
GCGGCGCTGG GGCCGATCAG CGATCTGATG CGGACTGCCC AGACCGTGGG CGACGCGCTG
GAGAGCTTCA GTGGCTTCTT CGCCAGCATC CAGACCAGCA CGCGGACGAC GCTGTCGGTC
AGCGACGGCA TTGCGCGGCT GTCCTATGCG ATCGAGGATC CGGCGATCCG GTTTCGCGAG
CAGGACGCCG GCTTCTCGCT GGCGATCGAA TATTCGATGC TGGCCGGATT TCTCGGTCCG
GCGTGGCGGG CGAGCGGCGT CGAATTCGAG CACGCGGCCG GGGATGATCT GCCGTTCTAT
CAGCAGCATT TCGACTGCCC ACTGCGGTTC GGACGGCGCG AAAACGCGTT GCTGTTCCAG
GCGCGGTGCC TCGACGTGCC GCTGCAGCAG GCGGACCGCA ACCTGCACGC GCGGCTCCGC
GCCGATCTCG CGGAGGTGAT CCAGCGGCGG GCGACGCGGC TCGATCTGGT CCGCGGCATC
GAGGCGTGGA TCGCGGCCTC GCTGTGCCGG TCGGTCGCGA CCGATATCGA GGTCGTCGCC
TGTGATTTCG GCATGAGCAC GCGGTCGTTC CAGCGCAGGC TCGCCGACCA CGGCGTCAAC
TATCTCGACA TCCGCAACCG GGTCCGCTCG CATATCGCCA AATGCATGCT GGCCGAGACC
GGCGCTCCCG TGACGTCGAT CGCGCTGCAA CTCGGCTACA GCGAGACCAG CGCGTTCTCG
CGCGGGTTCA AGAGCCAGGT AGGCGAGACC CCGGTCGAGT TTCGCAAGCG TCGGCGTGGT
ATTGACCCTG CCGCGGCCGC TGCGGCGTGA
 
Protein sequence
MRLRRVVRFL LRDPGVSFGS VDMSASGTIA CGPVQQLLVA LARSRPESAG INARVGISRA 
TLDDPSGMLP LAAFTSMLEA AAHESGNRTL GIELGRDFKL AALGPISDLM RTAQTVGDAL
ESFSGFFASI QTSTRTTLSV SDGIARLSYA IEDPAIRFRE QDAGFSLAIE YSMLAGFLGP
AWRASGVEFE HAAGDDLPFY QQHFDCPLRF GRRENALLFQ ARCLDVPLQQ ADRNLHARLR
ADLAEVIQRR ATRLDLVRGI EAWIAASLCR SVATDIEVVA CDFGMSTRSF QRRLADHGVN
YLDIRNRVRS HIAKCMLAET GAPVTSIALQ LGYSETSAFS RGFKSQVGET PVEFRKRRRG
IDPAAAAAA