Gene RPC_2016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2016 
Symbol 
ID3973879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2197698 
End bp2198825 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content56% 
IMG OID637925125 
ProductAraC family transcriptional regulator 
Protein accessionYP_531890 
Protein GI90423520 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.875344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.643509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACGGGC CAACCTCACA GGAGAAAAGA GTCGCGGAAA CGCGGATGCC GAACCTTGAC 
GTAATTCGAC TGCAAGCTTC CAGGCGATCA TGGGTCTCGA TCGGAGGTTC GGACCCAAAC
ACCCCCATGG CGCTCAAACT CATAGAACCA AGATTTTCGG AAGTTGACGA TGATCAAATG
AGGTTTGAGA AATGGCAGCA GCTCACGTCA AGCCTGTTCG TGGCGAGCAA GAAGAGCCCG
ATGGTCGGCT CTCGCGATCT TTCCTTTCGT TGCTACAATC TGGATCGGTT GCTGTTTTTC
GACCACAGCG CCGGCGCATA TGGCGCCGAG CGGACGCCGT CTCAGATCGT GACGCAGGGA
GTCGATCATA TCCTTCTCGG CTTGCAAACC GTCGGAACGA CGCTGCTGAT GCGCTCGGAC
GGCGTTGCAA TGGCCGGGGT TGGCGACCTT GTGGTCCTCG ATCTCTCCCA ACGATTCCAA
TTCGCAACCG AAGGGATGTC GGCGATCCAT ATTTGCCTGC CGCGAAGGCG ATTCGAGAAC
CATGCAAGAA AAATGGGTGC TCGGCACATG CAAATCCTTC GCTCCGAGGG TGAACCGCTT
CTGAAGTTGA TGGCGGATCA TCTGCTGAAC ATGCGAACAT GCCTGCATCA CGCCGTTGCT
GAGCAACTGC ATCTTCTGAC CTCGGCGGCG ATTGCGATTT GCAACGCGGC GTTCACGCCA
CCTGAAGACA GTTCTTACAA TGAACCGGCC GTTGCCGCGA TCGAAGTCCG CCAATTCATC
GAGGAGAATA TTCAGCACCA GGATCTCGGA ATCGAATTGC TCTGCGCGCG GTTTGGCCTC
TCCCGGACCC CACTCTATAA GCTATTTGAG GTTGACGGCG GGATCGTGAG TTACATTAGA
AGCCGGCGGC TCGCTCGAGC CATGCTGATG CTTTCCGGAG TCGAAGGGCG ATCGCACCAG
CGCGTGTCGT CGGTCGCCTA TGCCTGCGGA TACCAATCGG CGAAGATGTT CAGCCGCGCT
TTCCATCGCC GGTATGGCGT CAATCCGCGC GAGGTGAATA GAACGTACCA GACGGTGGCG
ATCCAGGAAA AGGGTGCTCT TTTGGCGTCC TGGATACAGA ACCTATGA
 
Protein sequence
MYGPTSQEKR VAETRMPNLD VIRLQASRRS WVSIGGSDPN TPMALKLIEP RFSEVDDDQM 
RFEKWQQLTS SLFVASKKSP MVGSRDLSFR CYNLDRLLFF DHSAGAYGAE RTPSQIVTQG
VDHILLGLQT VGTTLLMRSD GVAMAGVGDL VVLDLSQRFQ FATEGMSAIH ICLPRRRFEN
HARKMGARHM QILRSEGEPL LKLMADHLLN MRTCLHHAVA EQLHLLTSAA IAICNAAFTP
PEDSSYNEPA VAAIEVRQFI EENIQHQDLG IELLCARFGL SRTPLYKLFE VDGGIVSYIR
SRRLARAMLM LSGVEGRSHQ RVSSVAYACG YQSAKMFSRA FHRRYGVNPR EVNRTYQTVA
IQEKGALLAS WIQNL