Gene RPB_3986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3986 
Symbol 
ID3911793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4553594 
End bp4554955 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content63% 
IMG OID637885890 
ProductFis family transcriptional regulator 
Protein accessionYP_487590 
Protein GI86751094 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR02040] transcriptional regulator PpsR 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.994596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.222002 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCAA AACCCGCCCA ACCCGACATC ACCCTCCTTC TGGATATGGA TGGCGTGATC 
CGCGACGCTT CCCTTTCACC GGCTCTGGCC GGGGAAAGTG TTCAGGGTTG GCTCGGCCAA
GCCTGGACCG ATGTTGCCGG CGATGATGGT GGCGACAAGG TCCGGCGCAT GGTCGAGGAT
GCCCGGACCA GCGGAATCTC TGCATTTCGG CAGGTAAATC AACGATTTCC GTCCGGCGCC
GAGCTCCCGA TCGAATTCAC CACCATGCTA CTCGGTGATC GCACCGGACT GCTCGCGGTG
GGCAAGAACC TGCAGGCGGT GACCGAACTG CATGCGCGAC TGATTGCGGC GCAACAAACG
ATCGAGCGCG ACTACTGGCG GCTGCGTGAA ATGGAAACGC GTTATCGGCT GGTATTCGAC
GCTGCAACCG AAGCGGTGAT GATCGTCTCG GCGCACGATC TCCGGATCGT CGAGGCCAAT
CGGGCCGCGG TGCAGGCCCT CAGCGGGGTC GACCGTGACA ACGAGGATGT CACCGGCCGC
GAGATCTTGA ACGAGGTTGC GCAGCTCGAC CGCGATGCGG TTCGCGAAAT GCTGGCGCGG
GTCCGTGATC GCGGCAAAGC GCTGAGCATC CTCGTGCATA TCGGCCGCGA TGCGCGGCCA
TGGATGTTGC GTGGCTCGCT GGTGTCGTCG GAACAAGGGC AGGTGTTCCT GCTGCAATTC
TCACCGGTCG CCACGACTTC GCGGGACGAC GAACGCGCCG AGCCGACCGT GCTCAAGACG
CTGATCGACC GGGTGCCGGA CGGCTTCGTC GCGCTCGACG CCGGCGGCGT CATCCGCCAC
GCCAATCAGG CCTTTCTGGA CCTCGTCCAG ATCGGCTCGA AGGGATCCGC GATCGGCGAA
TCGCTCGGTC GTTGGCTCAA TCAGCCCGGT GCAGACCTCG CGGCGCTGAT GTCGCATCTG
CAGCGCTACA AGACCGTGCG GCTGTTTCAG ACCTCCATTC GCGGCGAACT CGGGAGCGAG
ACGGATGTCG AGATCTCGGC GGTCGACGGC GACGACCACA ACTACCTCGG TGTACTGATC
AGGAATGTGT CGCGGCGTCT CGGCGGCGGC GAAAGCGATG CACTACGCTC GGCGCTCGGC
CCGATCAGCA AGCAGCTCGG CCGCTCTTCG CTGAGAAAGC TGGTCAAGAA TACCGTAGGC
ATCGTCGAAC GCCACTATGT CAAGGAAGCG CTGGAACTCA CCAAGGGCAA CCGCACGGCC
ACTGCCGAAC TGCTGGGGTT GAGCCGGCAA AGTCTCTATG CCAAGCTCGC GCGCTATGGG
CTCGACGACA AGGGTGCCGT CTCCCAAAAC GCCGAGGACT GA
 
Protein sequence
MSAKPAQPDI TLLLDMDGVI RDASLSPALA GESVQGWLGQ AWTDVAGDDG GDKVRRMVED 
ARTSGISAFR QVNQRFPSGA ELPIEFTTML LGDRTGLLAV GKNLQAVTEL HARLIAAQQT
IERDYWRLRE METRYRLVFD AATEAVMIVS AHDLRIVEAN RAAVQALSGV DRDNEDVTGR
EILNEVAQLD RDAVREMLAR VRDRGKALSI LVHIGRDARP WMLRGSLVSS EQGQVFLLQF
SPVATTSRDD ERAEPTVLKT LIDRVPDGFV ALDAGGVIRH ANQAFLDLVQ IGSKGSAIGE
SLGRWLNQPG ADLAALMSHL QRYKTVRLFQ TSIRGELGSE TDVEISAVDG DDHNYLGVLI
RNVSRRLGGG ESDALRSALG PISKQLGRSS LRKLVKNTVG IVERHYVKEA LELTKGNRTA
TAELLGLSRQ SLYAKLARYG LDDKGAVSQN AED