Gene RPC_2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2019 
Symbol 
ID3973919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2201967 
End bp2203427 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content59% 
IMG OID637925128 
ProductGntR family transcriptional regulator 
Protein accessionYP_531893 
Protein GI90423523 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.437741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCACCTTT TGAGATATGG TACGCAATAC AAAATGGCTG TCAATTTGAA TAATGACACC 
GTTTCAATCC TTCATAGCGC GTTGCGCGAA GGCATGGGAC CGAAGTACCG GCGGCTGGCG
CGACGGCTCG AGGCACTAAT CCTGAGCCGC GGGTTACGGA CCGGCGAAAA GTTACCGGCT
CACCGCGACC TGGCTGCGCT CCTGGGCGTG ACGGCAGGAA CGATTAGTCG GGCTTACGGC
GAGTTGCGGA AGATCGGGCT GATTTCGTCC CGCGTGGGCG ACGGCACCTT CGTGCTGGAA
TTGGCAAAGA AGAAGCGCAA GGAAACCGAC TTCAAGCCTT ACGGGGGCGG CGAACCGGGT
CGCTACGACC TCAGTCGGAA TACGGCGATC CCCGGCCGCA TGCTAACATC GGTCAGCGAA
ACGTTGCGGC GTCTGGCGCT GCAGCCGGAG GCTCTCGAAG AACTGCTGCA ATATGGGCCG
GAGCTTGGAT TGAATCGCCA CCGCGCGGCG GGAGCCCGGT GGCTCAGCAA CCATCATTCC
GACGCCAACG CCGAACAGAT CGTCTGCGTC AACGGCGGCC AGCACGGCTT GTTCTGCGTG
CTCATGGCGC TGCTCGAACG TGGTGACACC CTGGTCTCCG AACAGTTCAC CTATCCGGGA
TTGATATCGG CCTGCCGAAT TCTTGGCATC AATCTGGTCG GCCTCAAGAT GGATGACGAA
GGGCTGATTC CGAAATCGCT CGATGCAGTC TGCCGGACTG CAACGGTCAG AGCGCTGTTC
TGCACGCCGA CGCTGCAGAA TCCCACCACG GCGGTGCTTG GCCTGGAACG GCGCGCCGAG
ATCGCCCGGC TTTGTCGCGC GCATAATCTA TTGGTCATCG AAGATGACGC GCACGGCGTC
TTGGTCAAGG ATCGTCACCC GCATATCGGA CATTTCGTAC CGGAACGAAG CATTCTGATT
TCAAGCCTGT CAAAAGCGAT CGCGGCCGGC CTGAGGGTCG GCTACGTCCA TGCGCCGTTG
CCATTGGTCG GACGCATCGG CACCATGGTG CGAACCAATT GCTGGATGGC GAACCCTCTG
GCCTTCGAGA TGGTCAGTCT CTGGATCGAG GACGGCAGCG CGCTGCGCTT CCTCGAAGAC
CAGATCGAGG AAATCGTTCG ACGCAAAACC CTCGTTCAAC CGCTTCTGGA CGGCTTCGTC
GTCAAAACCC ATCCAAGGAG TCCGCACTTC TGGATCGAAG TTCCAAGCCC TTGGCGCGCC
TCAGAGATTG CAAGCGAGTT AAGGCAGAAG AACTGTCTCG TTGCGCCTGC GGAGGCATTC
GCCGTGGACC GTGACCGCAC CGTCCAATTT CTGCGGGCTA GCGTTAGCAG CGCTGAAAAG
ACCGACGCTG CCATCAGTGA AGGATTTCGC ATTCTCTCCG CCGTATTGAG AAATCCTTCA
ACAACTACCG CGATCCACTA G
 
Protein sequence
MHLLRYGTQY KMAVNLNNDT VSILHSALRE GMGPKYRRLA RRLEALILSR GLRTGEKLPA 
HRDLAALLGV TAGTISRAYG ELRKIGLISS RVGDGTFVLE LAKKKRKETD FKPYGGGEPG
RYDLSRNTAI PGRMLTSVSE TLRRLALQPE ALEELLQYGP ELGLNRHRAA GARWLSNHHS
DANAEQIVCV NGGQHGLFCV LMALLERGDT LVSEQFTYPG LISACRILGI NLVGLKMDDE
GLIPKSLDAV CRTATVRALF CTPTLQNPTT AVLGLERRAE IARLCRAHNL LVIEDDAHGV
LVKDRHPHIG HFVPERSILI SSLSKAIAAG LRVGYVHAPL PLVGRIGTMV RTNCWMANPL
AFEMVSLWIE DGSALRFLED QIEEIVRRKT LVQPLLDGFV VKTHPRSPHF WIEVPSPWRA
SEIASELRQK NCLVAPAEAF AVDRDRTVQF LRASVSSAEK TDAAISEGFR ILSAVLRNPS
TTTAIH