Gene Rpal_4667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4667 
Symbol 
ID6412353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5026595 
End bp5028082 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content66% 
IMG OID642714546 
Producttwo component, sigma54 specific, transcriptional regulator, Fis family 
Protein accessionYP_001993633 
Protein GI192293028 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.387828 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGAGC GTATTCTGAT CGCCGACGAC GATGCAGTGC AGCGTCGGCT GGTCGAGAAC 
ATGGTGCAGA AGTGCGGCTA TGAGGCGGTT TCGGTCGATA GCGGTGACGC TGCGGTGGAG
GCCCTGACCG CGCCCGATGC GCCTGCGATC GACGCCGTGG TGCTCGACCT GGTGATGCCC
GGACTCGATG GCCTCGGCGT GCTGTCGAAG ATTCGCGCCA GCGGACTCGA CGTGCCGGTG
ATCGTGCAGA CCGCACATGG CGGCATCGAC AATGTGGTGT CGGCGATGCG TGCCGGTGCG
CACGATTTCG TCGTTAAGCC GGTCGGCATC GAGCGCCTGC AGGTGTCCTT GCGCAACGCA
CTGAACGCCA GCGCGATGAA GGGCGAGCTG CAGCGCATCC GCCATGCCCG CGAAGGCCGG
CTGACATTTT CCGACATCAT CACCCGCAGC GAGGCGATGG CGCCGGTGCT GCGCGCCGCC
GAGAAGGCTG CAGGCTCCGC GATCCCGGTG CTGATCGAAG GTGAATCCGG CGTCGGCAAG
GAGCTGTTCG CGCGCGCCAT CCACGGCTCC AGCGACCGCC GTTCAAAACC ATTCGTGGCC
GTGAACTGCG GCGCGATTCC CGACAATCTC GTCGAGTCGA TTCTGTTCGG CCACGAGAAG
GGTGCGTTCA CCGGCGCCAC CGAGCGCCAC GACGGCAAGT TCGTCGAAGC CTCCGGCGGC
ACGCTGTTTC TCGACGAGGT CAGCGAGCTG CCGCTGGCTG CGCAGGTCAA GCTGCTGCGC
GCGCTGCAGG AAGGCGCGGT CGAAGCGGTC GGCGGACGCC GGCCGGTCAA GGTCGATGTC
CGCATCATCT CGGCCACCAA CCGCCGGTTG CTCGACCGGG TGAAGGCGGG CCAATTCCGC
GAAGATCTGT TCTACCGGCT GCACGTGCTG CCGCTGACGA TTCCGCCGCT GCGCACCCGC
CGCGAAGACA TTCCGCCGCT GCTGCGGCAC TTCCTGATGC GGTTCTGCGC CGAAGAGAAG
CGCAGCATCG GCGGCATCAC CGGCGAGGCA ATGGCACGGC TGGCGCAACT CGACTGGCCG
GGCAATATCC GTCAGCTCGA AAATGCGGTG TATCGCGCCG TGGTAATGAG CGACGGCGAT
CAGCTCGGCC TTGCCGACTT CCCGCTGGCG ATCGCGCCGT CGGTCGTCCC TGCAGAAGAT
ACAACCGGCG AGCCGCTGGT GATCGAACGC AGCGAGCCAC AATTTGTCGC CGCAAGCGAA
GTGCCGATCG CGCCGCTGCC GAGCGTCGGC AATCTGTCGA TGCTGACAGC GGACGACGAA
GTGCGCCCCC TCGACGAAAT GGAGCGGGAG ATCATCCGGT TTGCGATCTC GCATTATCGC
GGGCAGATGT CGGAAGTGGC GCGGCGGCTG AAGATCGGCC GCTCGACGCT GTATCGCAAA
CTCGACGAGA TCGAAGCCGA CCGCGCCGCG CAGGCCGAGG CGCGATAA
 
Protein sequence
MVERILIADD DAVQRRLVEN MVQKCGYEAV SVDSGDAAVE ALTAPDAPAI DAVVLDLVMP 
GLDGLGVLSK IRASGLDVPV IVQTAHGGID NVVSAMRAGA HDFVVKPVGI ERLQVSLRNA
LNASAMKGEL QRIRHAREGR LTFSDIITRS EAMAPVLRAA EKAAGSAIPV LIEGESGVGK
ELFARAIHGS SDRRSKPFVA VNCGAIPDNL VESILFGHEK GAFTGATERH DGKFVEASGG
TLFLDEVSEL PLAAQVKLLR ALQEGAVEAV GGRRPVKVDV RIISATNRRL LDRVKAGQFR
EDLFYRLHVL PLTIPPLRTR REDIPPLLRH FLMRFCAEEK RSIGGITGEA MARLAQLDWP
GNIRQLENAV YRAVVMSDGD QLGLADFPLA IAPSVVPAED TTGEPLVIER SEPQFVAASE
VPIAPLPSVG NLSMLTADDE VRPLDEMERE IIRFAISHYR GQMSEVARRL KIGRSTLYRK
LDEIEADRAA QAEAR