Gene RPC_4617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4617 
Symbol 
ID3972108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5155970 
End bp5157412 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content58% 
IMG OID637927728 
ProductGntR family transcriptional regulator 
Protein accessionYP_534458 
Protein GI90426088 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGACC TTCTATCCGG TTTGATCAAA TTGTCGCCGG ACAGCGACGA GACATTGTTG 
CGACAGCTCA CGCAACAATT GAGATCACTC ATCACGAGCG GACGTTTGGC ACCAGCTCAG
CGCTTGCCTT CAAGTCGCGA TCTGGCTCAA TCACTGAGTG TCGGCCGCAA CACAGTCTCG
TTTGCGATAG AGCAGCTTGC AGCCGAGGGT TATTTGACGA CTTCGCCTGG CCGACGGGCA
GTCGTGTCAA TTGGAGCCTC GCTTGATGCA AGGAAGCCGC AGCGAATAAA GACCGATCAG
CGTACCGTGG AACTCAGGGT GTCGCCTTGG GCACGCAGTC TCCATAACGC GAGTTGGCCG
CCAATCTATA GGGGGCGACC GCGCGCGTTT CAGCCGGGAT TGGCTGACGA GCGGGAGTTT
CCTCACGATA TCTGGGCCCG CTGTCTGCGG CGTGCAGCCC GTAGCGCCCG TGTACGCGGA
GACAGTTCCC ACAACAGCGC CGCACTGCGG AAAGCATTGC AGCAGCATCT GGCGGAGCAT
AGGGGCGTCA AGGCGACACC TGACCAAATC ATGATTGTGC CTTCCGCACA GGCGGGGATT
GCGTTGATCG CCAAAGTGAT GATCGGTACG GGCGATCTCG CCTGGATTGA AAGTCCAGGG
TATGGCGGGG CGTTCGCCGC ATTGCAAAGC GCCGGCGCTA TCGTCGCTGG AGTGCCGCTC
GATGAGTTTG GTATGGCTTT GGGCGAACGC AAGGACATTC CTCGCCTGAT TTTTGTAACC
CCGTCCCATC AGTACCCGAC GGGACGGCTG ATGCCTGCCG GTCGTCGGCT GGAGTTGCTC
CGATTCGCAG CATCCGTTGG GGCTTCGATC GTCGAGGATG ACTACGACAG CGAATTTCAT
TATGAGGCGC GCCCAGTGTC TGCATTACAG GGGATGGGTC TTTCGACAAC AGTCCTTTAC
GTCGGCACGT TTTCGAAATC GATGTTTGCG GATATTCGGA TTGGCTACAT CATCGTTCCC
GAAAGACTCA TCGAGATATT CGAATTAGCG CAGCGCCATA TGGGATTGTC GGCAGCGATA
CCGATGCAGG ACGCGTTGGC CGAGTTCATT CGTAATGGAA TGTACCTCGC TCACGTCCGA
AAAATGACCC GCCTCTACAA GGAACGGCGC GACCAATTGC TGAAGGCCCT CGCATCCGAG
ACGGGCGATC GGCTGTCGAT CGAGATCCCA GCGGGGGGGA TGCAACTCTT GGCTCGATGT
GGTGCTTTTG CGAGAGACGA TCAGCTATCG GCGCGGCTTC TTGAAGCGGG AGTCGTCGCC
CGCCCCTTGT CGAGCATGGT CTTTCACAAG ACTAAGGCGC AGGGGCTCTT TCTCGGCTTC
GCCGCGTGGA ATGACGTCGA GATCGAGCGG GCTGCTCGCA TCCTTGGCAG GATTGTCCGT
TGA
 
Protein sequence
MGDLLSGLIK LSPDSDETLL RQLTQQLRSL ITSGRLAPAQ RLPSSRDLAQ SLSVGRNTVS 
FAIEQLAAEG YLTTSPGRRA VVSIGASLDA RKPQRIKTDQ RTVELRVSPW ARSLHNASWP
PIYRGRPRAF QPGLADEREF PHDIWARCLR RAARSARVRG DSSHNSAALR KALQQHLAEH
RGVKATPDQI MIVPSAQAGI ALIAKVMIGT GDLAWIESPG YGGAFAALQS AGAIVAGVPL
DEFGMALGER KDIPRLIFVT PSHQYPTGRL MPAGRRLELL RFAASVGASI VEDDYDSEFH
YEARPVSALQ GMGLSTTVLY VGTFSKSMFA DIRIGYIIVP ERLIEIFELA QRHMGLSAAI
PMQDALAEFI RNGMYLAHVR KMTRLYKERR DQLLKALASE TGDRLSIEIP AGGMQLLARC
GAFARDDQLS ARLLEAGVVA RPLSSMVFHK TKAQGLFLGF AAWNDVEIER AARILGRIVR