Gene RPC_3514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3514 
Symbol 
ID3973880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3905310 
End bp3906725 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content62% 
IMG OID637926626 
ProductXRE family transcriptional regulator 
Protein accessionYP_533373 
Protein GI90425003 
COG category[R] General function prediction only 
COG ID[COG3800] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.15562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA CCTTTCTCGG AGTCCGGTTG AAACGGCTGC GTGAGGAGCA TCGCCTGACC 
CAGGCCGCGC TCGCCGCCAA GCTTGGTATT TCGCTGAGTT ACCTCAACCA GTTGGAGAAT
AATCAGCGAC CGCTCACGCT TCCGGTGGCG CTGGCGTTGA ACAAGACGTT CGGACTCGAC
ATCCAGTCTT TTTCAGAGGA CGACGAAGCG CGTTTGATTG CCGATTTGCG CGAGGCGGTT
GCCGATCCCG CCTTGGGCGA AACCATCGCC ACCTCCGATC TGCGCGAACT GGCACTCAAT
ATGCCGGCGA TCGGTCGGAC GCTGGTTGCG CTGTCGCAGA GATATCGCCA AGCCATCGAG
CAGTCCGCGG CGCTCTCCGC CCGGCTCGGG GAGGATCGCC AGGCCGCTTC GGCGATTCTG
CCTTCGACCC CGTTCGAGGA GGTCCGCGAT TTCTTCTATG CCCAGCATAA CTACATTCCC
GAACTCGACG AAGCCGCCGA GGCAATCGCC TTGCGAATGA ACCTGCCGCC CGGACGGATG
GCGCCCGCTC TGTCGTCTTT TCTCGAAGAT CGCGGCATGA CGATATTGAT CGGCGGCTCC
ACCGAATCCG GTCTGCAGCG GGAGTTTGAT CGGCAAACCC GCACCGTGCG GCTGTCGTCG
AGCCTTCACC CCGGACAGCA AGCTTTCCAA TTGGCAACCC ACATCGCTTT TCTCGATTTC
GACGACGCCA TCCGTTCAAT CGTCAGCAAC GCAGCATTCA CCAGCGACGA ATCGCGTGGC
CTCGCGCGGA TCGGGCTCGC GCATTATTTC GCGGGCGCGC TGGTGTTGCC CTACTCGGCG
TTCCTCCAAG AGGCGCAGCG CCGCCGTTAC GATATCGAAT TGCTCGGCCA CACTTTCGGC
GTCGGATTTG AAACCGCGTG CCATCGCTTG AGTACCTTGC AGCGCCACAA TGCCCGGGGC
GTCCCGTTCT TCTTCATTCG CGTCGATCGG GCCGGCAATA TTTCGAAACG TCAATCCGCG
ACCGACTTTC ATTTTTCGCG GGTCGGCGGC ACCTGTCCGC TGTGGAACGT CTATGAAGCC
TTCGCCTGTC CTGGACGAAT CCTCACCCAG TTGGCGCGAA TGCCGGACGG GCGAACCTAC
CTTTGGATCG CCCGCACGGT GTCGCATAGC CAGGGCGGCT ATCGGGCGCC CGGGAAAACC
TTCGCGGTGG CACTCGGCTG CGACGTCCGC CATGCCGGTA GCGTCGTCTA TTCGGAGGGA
CTCGACATCG ATCCGGCAAT CGCGACGCCG ATCGGCATGG GCTGCAAGGT CTGCGAACGG
CCGAATTGTC CCCAGCGGGC CTTCCCGCCG ATTGGTCATG CGCTCAACGT CGATGAGACG
CGCGCGCATT TTGCCCCCTA CGCGACCTCG TCTTGA
 
Protein sequence
MKKTFLGVRL KRLREEHRLT QAALAAKLGI SLSYLNQLEN NQRPLTLPVA LALNKTFGLD 
IQSFSEDDEA RLIADLREAV ADPALGETIA TSDLRELALN MPAIGRTLVA LSQRYRQAIE
QSAALSARLG EDRQAASAIL PSTPFEEVRD FFYAQHNYIP ELDEAAEAIA LRMNLPPGRM
APALSSFLED RGMTILIGGS TESGLQREFD RQTRTVRLSS SLHPGQQAFQ LATHIAFLDF
DDAIRSIVSN AAFTSDESRG LARIGLAHYF AGALVLPYSA FLQEAQRRRY DIELLGHTFG
VGFETACHRL STLQRHNARG VPFFFIRVDR AGNISKRQSA TDFHFSRVGG TCPLWNVYEA
FACPGRILTQ LARMPDGRTY LWIARTVSHS QGGYRAPGKT FAVALGCDVR HAGSVVYSEG
LDIDPAIATP IGMGCKVCER PNCPQRAFPP IGHALNVDET RAHFAPYATS S